CN102339292A - Distributed searching method and system - Google Patents

Distributed searching method and system

Info

Publication number
CN102339292A
Authority
CN
China
Prior art keywords
index
keyword
content source
search
search platform
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2010102378153A
Other languages
Chinese (zh)
Inventor
王爱宝
张涛
杨德利
李屹
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Telecom Corp Ltd
Original Assignee
China Telecom Corp Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Telecom Corp Ltd filed Critical China Telecom Corp Ltd
Priority to CN2010102378153A priority Critical patent/CN102339292A/en
Publication of CN102339292A publication Critical patent/CN102339292A/en
Pending legal-status Critical Current

Abstract

The invention discloses a distributed search method and system. In the method, a website builds an index from keywords to the URLs (Uniform Resource Locators) of the content sources that contain those keywords, and returns the index to a search platform. Because the index is built at the content source, the indexing work is moved to the content source, and the content source returns only the index relation between keywords and content source URLs rather than its complete information. This greatly improves the efficiency of the search engine and reduces excessive interference with the searched websites.

Description

Distributed search method and system
Technical field
The present invention relates to the field of information retrieval and, more specifically, to a distributed search method and system.
Background art
In recent years, as social networking services (SNS), blogs, and similar websites have become popular, the public has grown increasingly interested in the information on such sites. At the same time, for publicity and profit, these websites are very willing to make their information available promptly. A trusted cooperative relationship has thus formed between search service providers (for example, Google and Baidu) and SNS and blog websites, which together provide users with valuable and timely information.
However, most current search technology uses a crawler to fetch information from the Internet, classifies the information and builds the index on the search platform, and finally stores the result in a database for users to query. This approach has the following problems:
(1) Basic work such as classifying the fetched information, building the index, and storing the result is all carried out on the search platform, which greatly reduces the efficiency of the search engine;
(2) The crawler must send the fetched information back to the search engine so that the engine can classify and filter it, keeping useful information and discarding junk. This process requires full-text retrieval, which likewise reduces the efficiency of the search engine.
Summary of the invention
One technical problem to be solved by the present invention is to provide a distributed search method that can significantly improve the efficiency of the search engine.
The invention provides a distributed search method in which a website builds an index from a keyword to the Uniform Resource Locator (URL) of each content source that contains the keyword, and the index is returned to a search platform.
According to one embodiment of the method, the website also obtains the keyword from the search platform periodically or aperiodically.
According to another embodiment of the method, it is also determined whether a content source in the website has been updated; if it has, preparation is made to build the index.
According to a further embodiment of the method, the search platform also sorts the returned index using a ranking rule and stores the sorted result in a database for retrieval.
According to yet another embodiment of the method, the index is returned to the search platform in one of the following ways: a crawler actively fetches it, or the website actively reports it.
In the distributed search method of the invention, the content source builds the index itself. On the one hand, the indexing work is moved to the content source. On the other hand, the content source no longer needs to return its complete information to the search platform and only returns the index relation between keywords and content source URLs. This greatly improves the efficiency of the search engine and reduces excessive interference with the searched websites.
Another technical problem to be solved by the present invention is to provide a distributed search system that can significantly improve the efficiency of the search engine.
The invention provides a distributed search system comprising an index building device, configured to build an index from a keyword to the URL of each content source that contains the keyword, and an index sending device, connected to the index building device and configured to return the index to a search platform.
According to one embodiment of the system, the system further comprises a keyword acquisition device, connected to the index building device and configured to obtain the keyword from the search platform periodically or aperiodically.
According to another embodiment of the system, the system further comprises a judging device, connected to the index building device and configured to determine whether a content source has been updated and, if it has, to prepare to build the index.
According to a further embodiment of the system, the system further comprises the search platform, connected to the index sending device and configured to sort the returned index using a ranking rule and to store the sorted result in a database for retrieval.
According to yet another embodiment of the system, the index sending device returns the index to the search platform in one of the following ways: a crawler actively fetches it, or the website actively reports it.
In the distributed search system of the invention, the content source builds the index itself. On the one hand, the indexing work is moved to the content source. On the other hand, the content source no longer needs to return its complete information to the search platform and only returns the index relation between keywords and content source URLs. This greatly improves the efficiency of the search engine and reduces excessive interference with the searched websites.
Description of the drawings
The accompanying drawings described here provide a further understanding of the invention and form part of this application. In the drawings:
Fig. 1 is a schematic flowchart of the first embodiment of the method of the invention.
Fig. 2 is a schematic flowchart of the second embodiment of the method of the invention.
Fig. 3 is a schematic flowchart of the third embodiment of the method of the invention.
Fig. 4 is a schematic flowchart of the fourth embodiment of the method of the invention.
Fig. 5 is a schematic flowchart of the fifth embodiment of the method of the invention.
Fig. 6 is a schematic flowchart of the seventh embodiment of the method of the invention.
Fig. 7 is a schematic structural diagram of the first embodiment of the system of the invention.
Fig. 8 is a schematic structural diagram of the second embodiment of the system of the invention.
Fig. 9 is a schematic structural diagram of the third embodiment of the system of the invention.
Fig. 10 is a schematic structural diagram of the fourth embodiment of the system of the invention.
Fig. 11 is a schematic structural diagram of the fifth embodiment of the system of the invention.
Detailed description
The invention is described more fully below with reference to the accompanying drawings, which illustrate exemplary embodiments. The exemplary embodiments and their descriptions serve to explain the invention and do not unduly limit it.
To improve the efficiency of the search engine, the invention proposes, for information on trusted websites, a distributed search method and system in which the index is built at the content source. The indexing work is transferred to the trusted content source: the content source builds the index using keywords obtained from the search platform and then returns the index from keywords to content source URLs to the search platform, which establishes a distributed search architecture.
Fig. 1 is a schematic flowchart of the first embodiment of the method of the invention.
As shown in Fig. 1, this embodiment may include the following steps:
S102: the website builds an index from a keyword to the URL of each content source that contains the keyword, where the keyword may come from the keyword dictionary of the search platform;
S104: the index is returned to the search platform.
Optionally, the index may also include the title of the web page, the time the page was last updated, the type of the page, the length of the main content, and the main content of the page.
In this embodiment the content source builds the index itself. On the one hand, the indexing work is moved to the content source. On the other hand, the content source no longer needs to return its complete information to the search platform and only returns the index relation between keywords and content source URLs, which greatly improves the efficiency of the search engine. In addition, traditional search generally uses a crawler to fetch the full text of the searched website, regardless of whether the fetched content is information the search platform cares about. The invention only requires the trusted website to provide, as the agreed protocol requires, the index from qualifying keywords to content source URLs, so only part of the content is accessed, and only conditionally. This reduces excessive interference with the searched website.
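As an illustration only, step S102 could be sketched in Python as follows. The Page class, its field names, the case-insensitive substring match, and the example.com URLs are all assumptions made for this sketch and are not part of the disclosure; the optional fields follow the list given above, with the main content itself left out.

```python
from dataclasses import dataclass


@dataclass
class Page:
    """One content source on the website (field names are hypothetical)."""
    url: str
    title: str
    body: str
    last_updated: str = ""
    page_type: str = "html"


def build_index(pages, keywords):
    """Map each keyword to index entries for the pages that contain it."""
    index = {}
    for keyword in keywords:
        needle = keyword.lower()
        for page in pages:
            # Assumption: a plain case-insensitive substring match.
            if needle in page.title.lower() or needle in page.body.lower():
                index.setdefault(keyword, []).append({
                    "url": page.url,
                    "title": page.title,
                    "last_updated": page.last_updated,
                    "type": page.page_type,
                    "content_length": len(page.body),
                })
    return index


pages = [
    Page("http://example.com/a", "Cloud storage", "Notes on cloud storage pricing."),
    Page("http://example.com/b", "Gardening", "How to grow tomatoes."),
]
index = build_index(pages, ["cloud", "tomatoes", "search"])
urls = {keyword: [entry["url"] for entry in entries] for keyword, entries in index.items()}
print(urls)
```

A keyword that matches no page, such as "search" here, simply produces no entry, so only qualifying keyword-to-URL relations would be returned in S104.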
Fig. 2 is a schematic flowchart of the second embodiment of the method of the invention.
As shown in Fig. 2, this embodiment may include the following steps:
S202: the website obtains keywords from the search platform periodically or aperiodically. The website is a trusted website that has established a mutual trust relationship with the search platform; such a website can passively accept search requests from the search platform and can also actively upload relevant information to the search platform;
S204: the website builds an index from a keyword to the URL of each content source that contains the keyword;
S206: the index is returned to the search platform.
In this embodiment, content sources are matched against the keywords provided by the search platform and an index from keywords to content source URLs is built, which meets the search platform's need to have trusted websites build the keyword-to-content-source index.
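One possible reading of "periodically or aperiodically" in S202 is a site-side keyword cache that refetches on a fixed interval and can also be refreshed on demand. The sketch below is a hedged illustration: the KeywordCache class, the injected fetch callable, and the 60-second interval are hypothetical, and a fake clock stands in for real time so the behaviour is easy to follow.

```python
import time


class KeywordCache:
    """Site-side cache of the search platform's keyword dictionary."""

    def __init__(self, fetch, interval_seconds, clock=time.monotonic):
        self._fetch = fetch                # callable returning an iterable of keywords
        self._interval = interval_seconds
        self._clock = clock
        self._keywords = frozenset()
        self._fetched_at = None

    def get(self, force=False):
        """Return cached keywords, refetching when stale or when forced."""
        now = self._clock()
        stale = self._fetched_at is None or now - self._fetched_at >= self._interval
        if force or stale:
            self._keywords = frozenset(self._fetch())
            self._fetched_at = now
        return self._keywords


# Stand-in for the platform: each call returns the next dictionary version.
versions = iter([["cloud"], ["cloud", "search"], ["cloud", "search", "index"]])
fake_now = [0.0]
cache = KeywordCache(lambda: next(versions), interval_seconds=60,
                     clock=lambda: fake_now[0])

first = cache.get()              # first call always fetches
fake_now[0] = 30.0
second = cache.get()             # still fresh, served from the cache
fake_now[0] = 61.0
third = cache.get()              # interval elapsed, periodic refetch
fourth = cache.get(force=True)   # aperiodic, on-demand refetch
```

Injecting the fetch callable keeps the sketch independent of any transport; how the website actually contacts the search platform is not specified in the description.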
Fig. 3 is a schematic flowchart of the third embodiment of the method of the invention.
As shown in Fig. 3, this embodiment may include the following steps:
S302: determine whether a content source in the website has been updated (content added or deleted); if so, prepare to build the index. For example, a hash table can store, for each URL, a fingerprint of the corresponding page content computed with the MD5 algorithm, that is, <url, md5(content)>. Whether the page content has changed can then be determined by checking whether md5(content) has changed. If it has, a list holds a sequence of pairs <(index1, length1), (index2, length2), ...>, where each index is the position of a change and each length is the length of the changed content, so that the updated content can be extracted;
S304: the website builds an index from a keyword to the URL of each content source that contains the keyword;
S306: the index is returned to the search platform.
This embodiment builds the index only when a content source has been updated, which greatly reduces the website's indexing workload.
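The fingerprint check of S302 could be sketched as follows. This is a minimal sketch under stated assumptions: the description only specifies the <url, md5(content)> table and the (index, length) pairs, so keeping the previous page text and computing the pairs with difflib.SequenceMatcher from the Python standard library are choices made for this illustration, as are the ChangeDetector and changed_spans names.

```python
import hashlib
from difflib import SequenceMatcher


def fingerprint(content):
    """MD5 fingerprint of the page content."""
    return hashlib.md5(content.encode("utf-8")).hexdigest()


def changed_spans(old, new):
    """Return (index, length) pairs locating added or replaced text in `new`."""
    spans = []
    matcher = SequenceMatcher(None, old, new, autojunk=False)
    for tag, _i1, _i2, j1, j2 in matcher.get_opcodes():
        if tag in ("insert", "replace"):
            spans.append((j1, j2 - j1))
    return spans


class ChangeDetector:
    def __init__(self):
        self._fingerprints = {}   # url -> md5(content), as in the description
        self._contents = {}       # url -> last seen text (assumption, needed for spans)

    def check(self, url, content):
        """Return None if the page is unchanged, else its list of changed spans."""
        digest = fingerprint(content)
        if self._fingerprints.get(url) == digest:
            return None
        spans = changed_spans(self._contents.get(url, ""), content)
        self._fingerprints[url] = digest
        self._contents[url] = content
        return spans


detector = ChangeDetector()
url = "http://example.com/post"
first = detector.check(url, "hello world")        # first sighting: whole page is new
unchanged = detector.check(url, "hello world")    # fingerprint matches
new_text = "hello brave world"
spans = detector.check(url, new_text)
added = "".join(new_text[i:i + n] for i, n in spans)
deleted = detector.check(url, "hello world")      # pure deletion
```

A pure deletion changes the fingerprint but adds no text, so it yields an empty list rather than None; the caller can still tell that the page changed and that there is nothing new to index.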
Fig. 4 is a schematic flowchart of the fourth embodiment of the method of the invention.
As shown in Fig. 4, this embodiment may include the following steps:
S402: the website builds an index from a keyword to the URL of each content source that contains the keyword;
S404: the index is returned to the search platform;
S406: the search platform sorts the returned index using a ranking rule and stores the sorted result in a database for retrieval. The ranking rule may be the degree to which the keyword matches the URL. For example, the frequency with which the keyword occurs in the content source can represent this degree of match.
In this embodiment, sorting the returned index on the search platform effectively improves the efficiency of subsequent retrieval.
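The frequency-based ranking rule of S406 could look like the sketch below. Because the search platform does not receive the full text, the sketch assumes that the website counts the occurrences and reports a "frequency" field with each index entry; that field, the function names, and the tie-break by URL are assumptions, not details given in the description.

```python
def keyword_frequency(keyword, text):
    """Site side: count non-overlapping, case-insensitive occurrences."""
    return text.lower().count(keyword.lower())


def rank_entries(entries):
    """Platform side: highest frequency first, ties broken by URL."""
    return sorted(entries, key=lambda entry: (-entry["frequency"], entry["url"]))


returned_index = {
    "cloud": [
        {"url": "http://example.com/a", "frequency": 2},
        {"url": "http://example.com/b", "frequency": 5},
        {"url": "http://example.com/c", "frequency": 2},
    ],
}
# The plain dict stands in for the database that serves later retrieval.
database = {keyword: rank_entries(entries) for keyword, entries in returned_index.items()}
ranked_urls = [entry["url"] for entry in database["cloud"]]
count = keyword_frequency("cloud", "Cloud cloud CLOUDY")
```

Sorting once at storage time means a later query for a keyword can read the entries in order without any further scoring.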
Fig. 5 is a schematic flowchart of the fifth embodiment of the method of the invention.
As shown in Fig. 5, this embodiment may include the following steps:
S502: the website obtains keywords from the search platform periodically or aperiodically;
S504: determine whether a content source in the website has been updated; if it has, prepare to build the index;
S506: the website builds an index from a keyword to the URL of each content source that contains the keyword;
S508: the index is returned to the search platform;
S510: the search platform sorts the returned index using a ranking rule and stores the sorted result in a database for retrieval.
In the sixth embodiment of the method of the invention, the index can be returned to the search platform in one of the following ways: a crawler actively fetches it, or the website actively reports it. Whichever way is used, the information that finally reaches the search platform is the index from keywords to content source URLs, not the full text of the content source. The information returned to the search platform therefore satisfies the needs of retrieval without the full text of the content source.
This embodiment can obtain the index from the trusted website in more than one way, which makes obtaining the index more flexible.
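The two delivery modes of the sixth embodiment can be illustrated with a small in-process sketch. The Site and Platform classes, their method names, and the JSON serialization are hypothetical stand-ins; the description does not define a wire format or transport.

```python
import json


class Platform:
    """Stand-in for the intake side of the search platform."""

    def __init__(self):
        self.received = []

    def receive(self, payload):
        self.received.append(json.loads(payload))

    def crawl(self, site):
        """Pull mode: the platform's crawler asks the site for its index."""
        self.receive(site.serialize_index())


class Site:
    """Stand-in for a trusted website that has already built its index."""

    def __init__(self, index):
        self._index = index

    def serialize_index(self):
        """Only keyword-to-URL relations leave the site, never page bodies."""
        return json.dumps(self._index, sort_keys=True)

    def report(self, platform):
        """Push mode: the site uploads its index on its own initiative."""
        platform.receive(self.serialize_index())


index = {"cloud": ["http://example.com/a"]}
site = Site(index)
pulled, pushed = Platform(), Platform()
pulled.crawl(site)     # crawler-initiated
site.report(pushed)    # website-initiated
```

Both paths go through the same serialize_index call, which reflects the point made above: regardless of who initiates the transfer, the platform ends up with the same keyword-to-URL index.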
Fig. 6 is a schematic flowchart of the seventh embodiment of the method of the invention.
As shown in Fig. 6, the distributed search architecture in which the index is built at the content source mainly comprises four parts (information analysis, index building, information processing, and storage) and three databases (the keyword dictionary, the ranking rule library, and the information store). After information analysis such as organizing and classifying its information, the content source uses the keyword dictionary of the search platform to build an index table from keywords to content source URLs. On the search platform, the index table undergoes information processing such as sorting and is then stored in the information store for user retrieval.
The function of each part is described in detail below:
(1) Information analysis
When the content source finds that content has been updated, it analyzes and compares the content and extracts the newly added part.
(2) Index building
This part of the work is carried out on the trusted website. The trusted website obtains keywords from the search platform and, for the content sources that have passed information analysis, builds the index from keywords to content source URLs.
(3) Information processing
The search platform can obtain trusted content sources in two ways: a crawler actively fetches them, or the trusted website actively reports them to the search platform. Whichever way is used, the information that finally reaches the search platform is the index from keywords to content source URLs, not the full text of the content source.
The index from keywords to content source URLs is sorted according to the degree to which each keyword matches the URL.
(4) Storage
The sorted index is stored in the information store for retrieval.
This embodiment effectively improves the efficiency of the search engine and reduces excessive interference with the searched website. The index is built on the searched website: the searched website obtains keywords from the search platform, matches them against its own content sources, forms the index from keywords to content source URLs, and returns it to the search platform. The information returned to the search platform therefore satisfies the needs of retrieval without the full text of the content source. This embodiment can be widely applied to building information search systems for trusted websites.
Fig. 7 is a schematic structural diagram of the first embodiment of the system of the invention.
As shown in Fig. 7, the system of this embodiment comprises:
an index building device 11, configured to build an index from a keyword to the URL of each content source that contains the keyword, where the keyword may come from the keyword dictionary of the search platform;
an index sending device 12, connected to the index building device 11 and configured to return the index to the search platform.
Optionally, the index may also include the title of the web page, the time the page was last updated, the type of the page, the length of the main content, and the main content of the page.
In this embodiment, the content source obtains the keywords, completes the indexing task at the content source, and then returns the relation between keywords and content source URLs to the search platform, which realizes a distributed search. Such a search can reduce the load on both the search engine and the searched website and improve search efficiency. In addition, the information acquisition method proposed in this embodiment does not obtain the full text of the content source but the relation between keywords and content source URLs, and the search platform does not need to build the index again: after simple information processing, the index can be stored for user retrieval. This improves the efficiency of the search engine and also avoids excessive interference with the searched website.
Fig. 8 is a schematic structural diagram of the second embodiment of the system of the invention.
As shown in Fig. 8, compared with the embodiment in Fig. 7, the system of this embodiment further comprises:
a keyword acquisition device 21, connected to the index building device 11 and configured to obtain keywords from the search platform periodically or aperiodically.
In this embodiment, content sources are matched against the keywords provided by the search platform and an index from keywords to content source URLs is built, which meets the search platform's need to have trusted websites build the keyword-to-content-source index.
Fig. 9 is a schematic structural diagram of the third embodiment of the system of the invention.
As shown in Fig. 9, compared with the embodiment in Fig. 7, the system of this embodiment further comprises:
a judging device 31, connected to the index building device 11 and configured to determine whether a content source has been updated and, if it has, to prepare to build the index. For example, a hash table can store, for each URL, a fingerprint of the corresponding page content computed with the MD5 algorithm, that is, <url, md5(content)>. Whether the page content has changed can then be determined by checking whether md5(content) has changed. If it has, a list holds a sequence of pairs <(index1, length1), (index2, length2), ...>, where each index is the position of a change and each length is the length of the changed content, so that the updated content can be extracted.
This embodiment builds the index only when a content source has been updated, which greatly reduces the website's indexing workload.
Fig. 10 is a schematic structural diagram of the fourth embodiment of the system of the invention.
As shown in Fig. 10, compared with the embodiment in Fig. 7, the system of this embodiment further comprises:
a search platform 41, connected to the index sending device 12 and configured to sort the returned index using a ranking rule and to store the sorted result in a database for retrieval. The ranking rule may be the degree to which the keyword matches the URL. For example, the frequency with which the keyword occurs in the content source can represent this degree of match.
In this embodiment, sorting the returned index effectively improves the efficiency of subsequent retrieval on the search platform.
Fig. 11 is a schematic structural diagram of the fifth embodiment of the system of the invention.
As shown in Fig. 11, the system of this embodiment comprises:
an index building device 11, configured to build an index from a keyword to the URL of each content source that contains the keyword;
an index sending device 12, connected to the index building device 11 and configured to return the index to the search platform;
a keyword acquisition device 21, connected to the index building device 11 and configured to obtain keywords from the search platform periodically or aperiodically;
a judging device 31, connected to the index building device 11 and configured to determine whether a content source has been updated and, if it has, to prepare to build the index;
a search platform 41, connected to the index sending device 12 and configured to sort the returned index using a ranking rule and to store the sorted result in a database for retrieval.
In the sixth embodiment of the system of the invention, the index sending device can return the index to the search platform in one of the following ways: a crawler actively fetches it, or the website actively reports it.
The invention is described in further detail below, taking the China Telecom content monitoring system as an example.
The China Telecom content monitoring system monitors information on China Telecom's many portal websites and on user-generated content websites such as SNS and microblog sites. The system needs to analyze the content of the relevant websites and issue corresponding supervision instructions.
The keyword dictionary of the content monitoring system holds a large number of harmful-information keywords and public-opinion-related terms. Each portal website or user-generated content website obtains these keywords periodically or aperiodically and matches them against its own content sources. The matching operation is carried out whenever the website's content is updated. When matching content exists, the monitored website returns the keyword and the URL of the corresponding content to the content monitoring system, periodically or aperiodically, and the content monitoring system issues different supervision instructions according to how harmful the content source is.
The description of the invention is given for the purposes of illustration and description and is not intended to be exhaustive or to limit the invention to the form disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art. The embodiments were chosen and described to better explain the principles and practical application of the invention, and to enable those of ordinary skill in the art to understand the invention and design various embodiments with various modifications suited to particular uses.
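The monitoring flow above (match on the website, report keyword and URL, receive an instruction graded by harm) could be sketched as follows. Every concrete value here is invented for illustration: the keywords, the three severity levels, the instruction texts, and the function names are hypothetical and do not describe the actual China Telecom system.

```python
# Hypothetical keyword dictionary: keyword -> harm level.
SEVERITY = {"scam": 3, "rumor": 2, "outage": 1}
# Hypothetical supervision instructions per harm level.
INSTRUCTIONS = {3: "remove content", 2: "manual review", 1: "keep watching"}


def match_report(url, text, keywords):
    """Website side: report the matched keywords and the URL, not the text."""
    lowered = text.lower()
    found = sorted(k for k in keywords if k.lower() in lowered)
    return {"url": url, "keywords": found} if found else None


def supervision_instruction(report, severity=SEVERITY, instructions=INSTRUCTIONS):
    """Monitoring side: grade a report by its most harmful keyword."""
    level = max(severity[k] for k in report["keywords"])
    return instructions[level]


report = match_report("http://example.com/post/1",
                      "Rumor of a network outage spreads", SEVERITY)
instruction = supervision_instruction(report)
clean = match_report("http://example.com/post/2", "Weekend hiking photos", SEVERITY)
```

Content that matches nothing produces no report at all, so the monitoring system only ever sees keyword and URL pairs for content that needs attention.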

Claims (10)

1. A distributed search method, characterized in that the method comprises:
a website building an index from a keyword to the URL of each content source that contains the keyword; and
returning the index to a search platform.
2. The method according to claim 1, characterized in that the method further comprises:
the website obtaining the keyword from the search platform periodically or aperiodically.
3. The method according to claim 1, characterized in that the method further comprises:
determining whether a content source in the website has been updated and, if it has, preparing to build the index.
4. The method according to claim 1, characterized in that the method further comprises:
the search platform sorting the returned index using a ranking rule and storing the sorted result in a database for retrieval.
5. The method according to claim 1, characterized in that the index is returned to the search platform in one of the following ways:
a crawler actively fetches it; or
the website actively reports it.
6. A distributed search system, characterized in that the system comprises:
an index building device, configured to build an index from a keyword to the URL of each content source that contains the keyword; and
an index sending device, connected to the index building device and configured to return the index to a search platform.
7. The system according to claim 6, characterized in that the system further comprises:
a keyword acquisition device, connected to the index building device and configured to obtain the keyword from the search platform periodically or aperiodically.
8. The system according to claim 6, characterized in that the system further comprises:
a judging device, connected to the index building device and configured to determine whether a content source has been updated and, if it has, to prepare to build the index.
9. The system according to claim 6, characterized in that the system further comprises:
the search platform, connected to the index sending device and configured to sort the returned index using a ranking rule and to store the sorted result in a database for retrieval.
10. The system according to claim 6, characterized in that the index sending device returns the index to the search platform in one of the following ways:
a crawler actively fetches it; or
the website actively reports it.
CN2010102378153A 2010-07-27 2010-07-27 Distributed searching method and system Pending CN102339292A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2010102378153A CN102339292A (en) 2010-07-27 2010-07-27 Distributed searching method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2010102378153A CN102339292A (en) 2010-07-27 2010-07-27 Distributed searching method and system

Publications (1)

Publication Number Publication Date
CN102339292A true CN102339292A (en) 2012-02-01

Family

ID=45515029

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2010102378153A Pending CN102339292A (en) 2010-07-27 2010-07-27 Distributed searching method and system

Country Status (1)

Country Link
CN (1) CN102339292A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103617174A (en) * 2013-11-04 2014-03-05 同济大学 Distributed searching method based on cloud computing
CN105630830A (en) * 2014-11-05 2016-06-01 腾讯科技(深圳)有限公司 Method and device for establishing information relationship list
CN109286823A (en) * 2018-09-28 2019-01-29 传线网络科技(上海)有限公司 The acquisition methods and device of multimedia content
CN110516135A (en) * 2019-08-29 2019-11-29 杭州时趣信息技术有限公司 A kind of crawler system and method

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6253198B1 (en) * 1999-05-11 2001-06-26 Search Mechanics, Inc. Process for maintaining ongoing registration for pages on a given search engine
US6490575B1 (en) * 1999-12-06 2002-12-03 International Business Machines Corporation Distributed network search engine
CN1595401A (en) * 2004-07-05 2005-03-16 朱龙安 A professional searching engine data gathering method
US20080104100A1 (en) * 2006-10-26 2008-05-01 Microsoft Corporation On-site search engine for the World Wide Web

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
何明贵 等: "基于增量的网页快照及其可视化", 《现代图书情报技术》, no. 178, 31 May 2009 (2009-05-31), pages 73 - 74 *
梁斌: "《走进搜索引擎》", 31 October 2007, article "走进搜索引擎" *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103617174A (en) * 2013-11-04 2014-03-05 同济大学 Distributed searching method based on cloud computing
CN105630830A (en) * 2014-11-05 2016-06-01 腾讯科技(深圳)有限公司 Method and device for establishing information relationship list
CN109286823A (en) * 2018-09-28 2019-01-29 传线网络科技(上海)有限公司 The acquisition methods and device of multimedia content
CN109286823B (en) * 2018-09-28 2021-03-19 阿里巴巴(中国)有限公司 Multimedia content acquisition method and device
CN110516135A (en) * 2019-08-29 2019-11-29 杭州时趣信息技术有限公司 A kind of crawler system and method

Similar Documents

Publication Publication Date Title
CN102023989B (en) Information retrieval method and system thereof
CN101169780A (en) Semantic ontology retrieval system and method
CN101916294B (en) Method for realizing exact search by utilizing semantic analysis
WO2008098502A1 (en) Method and device for creating index as well as method and system for retrieving
CN108052632B (en) Network information acquisition method and system and enterprise information search system
CN104298771A (en) Massive web log data query and analysis method
CN101477554A (en) User interest based personalized meta search engine and search result processing method
CN102236710A (en) Method and equipment for displaying news information in query result
CN103617174A (en) Distributed searching method based on cloud computing
CN103942268A (en) Method and device for combining search and application and application interface
CN102375813A (en) Duplicate detection system and method for search engines
CN101101599A (en) Method for extracting advertisement main information from web page
CN104834736A (en) Method and device for establishing index database and retrieval method, device and system
CN102722499A (en) Search engine and implementation method thereof
CN103559258A (en) Webpage ranking method based on cloud computation
CN111859065A (en) Big data-based public opinion listening system
CN103970800A (en) Method and system for extracting and processing webpage related keywords
CN102339292A (en) Distributed searching method and system
CN104636386A (en) Information monitoring method and device
CN105824956A (en) Inverted index model based on link list structure and construction method of inverted index model
KR101556714B1 (en) Method, system and computer readable recording medium for providing search results
US8706705B1 (en) System and method for associating data relating to features of a data entity
CN100357942C (en) Mobile internet intelligent information retrieval engine based on key-word retrieval
Cheng et al. Efficient focused crawling strategy using combination of link structure and content similarity
CN102419746A (en) Three-dimensional search system and three-dimensional search method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20120201