EP1622047A3 - Efficiently ranking web pages via matrix index manipulation and improved caching - Google Patents

Efficiently ranking web pages via matrix index manipulation and improved caching Download PDF

Info

Publication number
EP1622047A3
EP1622047A3 EP05107029A EP05107029A EP1622047A3 EP 1622047 A3 EP1622047 A3 EP 1622047A3 EP 05107029 A EP05107029 A EP 05107029A EP 05107029 A EP05107029 A EP 05107029A EP 1622047 A3 EP1622047 A3 EP 1622047A3
Authority
EP
European Patent Office
Prior art keywords
efficiently
web pages
matrix
eigenvector
matrix index
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP05107029A
Other languages
German (de)
French (fr)
Other versions
EP1622047A2 (en
Inventor
Frank David Mcsherry
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Corp
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Publication of EP1622047A2 publication Critical patent/EP1622047A2/en
Publication of EP1622047A3 publication Critical patent/EP1622047A3/en
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/289Object oriented databases
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99933Query processing, i.e. searching
    • Y10S707/99935Query augmenting and refining, e.g. inexact access
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99937Sorting
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99941Database schema or data structure
    • Y10S707/99943Generating database or data structure, e.g. via user interface

Abstract

Methods and systems are described for computing page rankings more efficiently. Using an interconnectivity matrix describing the interconnection of web pages, a new matrix is computed. The new matrix is used to compute the average of values associated with each web page's neighboring web pages. The secondary eigenvector of this new matrix is computed, and indices for web pages are relabeled according to the eigenvector. The data structure storing the interconnectivity information is preferably also physically sorted according to the eigenvector. By reorganizing the matrix used in the web page ranking computations, caching is performed more efficiently, resulting in faster page ranking techniques. Methods for efficiently allocating the distribution of resources are also described.
EP05107029A 2004-07-30 2005-07-29 Efficiently ranking web pages via matrix index manipulation and improved caching Withdrawn EP1622047A3 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US10/903,345 US7379947B2 (en) 2004-07-30 2004-07-30 Efficiently ranking web pages via matrix index manipulation and improved caching

Publications (2)

Publication Number Publication Date
EP1622047A2 EP1622047A2 (en) 2006-02-01
EP1622047A3 true EP1622047A3 (en) 2006-03-29

Family

ID=34940328

Family Applications (1)

Application Number Title Priority Date Filing Date
EP05107029A Withdrawn EP1622047A3 (en) 2004-07-30 2005-07-29 Efficiently ranking web pages via matrix index manipulation and improved caching

Country Status (5)

Country Link
US (1) US7379947B2 (en)
EP (1) EP1622047A3 (en)
JP (1) JP2006065855A (en)
KR (1) KR20060048940A (en)
CN (1) CN1737803A (en)

Families Citing this family (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100913256B1 (en) * 2005-04-14 2009-08-24 에스케이커뮤니케이션즈 주식회사 Method for evaluating a object by the relation among links in the information network having a multi link
US20070266025A1 (en) * 2006-05-12 2007-11-15 Microsoft Corporation Implicit tokenized result ranking
US8301728B2 (en) * 2006-07-21 2012-10-30 Yahoo! Inc. Technique for providing a reliable trust indicator to a webpage
KR20090050086A (en) * 2006-09-11 2009-05-19 인터내셔널 비지네스 머신즈 코포레이션 Method for recommending short-cuts in order to ease and fasten navigating within navigation topology
US8510349B1 (en) * 2006-12-06 2013-08-13 Zillow, Inc. Multi-faceted search
US7877384B2 (en) * 2007-03-01 2011-01-25 Microsoft Corporation Scoring relevance of a document based on image text
US7840522B2 (en) * 2007-03-07 2010-11-23 Microsoft Corporation Supervised rank aggregation based on rankings
US20080228719A1 (en) * 2007-03-13 2008-09-18 Fatdoor, Inc. People and business search result optimization
US7788254B2 (en) * 2007-05-04 2010-08-31 Microsoft Corporation Web page analysis using multiple graphs
US8024285B2 (en) * 2007-12-27 2011-09-20 Microsoft Corporation Determining quality of tier assignments
US7818303B2 (en) * 2008-01-29 2010-10-19 Microsoft Corporation Web graph compression through scalable pattern mining
US20100036809A1 (en) * 2008-08-06 2010-02-11 Yahoo! Inc. Tracking market-share trends based on user activity
US8250069B2 (en) 2008-10-14 2012-08-21 University Of Washington Green's function formulations for pagerank algorithm using helmholtz wave equation representations of internet interactions
US8311792B1 (en) * 2009-12-23 2012-11-13 Intuit Inc. System and method for ranking a posting
US20120203592A1 (en) * 2011-02-08 2012-08-09 Balaji Ravindran Methods, apparatus, and articles of manufacture to determine search engine market share
US9405923B2 (en) * 2011-09-29 2016-08-02 Red Hat, Inc. Establishing isolation between content hosting services executing on common support server
US8799296B2 (en) * 2012-02-23 2014-08-05 Borislav Agapiev Eigenvalue ranking of social offerings using social network information
JP5964781B2 (en) * 2013-05-20 2016-08-03 日本電信電話株式会社 SEARCH DEVICE, SEARCH METHOD, AND SEARCH PROGRAM
CN103399859A (en) * 2013-07-02 2013-11-20 北京奇虎科技有限公司 Web page authority value calculating method, device and system
CN106033465B (en) * 2015-03-19 2019-09-17 阿里巴巴集团控股有限公司 A kind of information-pushing method and device
US9898544B2 (en) * 2016-02-18 2018-02-20 Eshaa Alkhalifa Guided web navigation tool
US11023421B1 (en) * 2017-12-07 2021-06-01 Liberty Mutual Insurance Company Method, apparatus, and computer program product for compression of a graph
CN111538714B (en) * 2020-04-24 2023-10-17 咪咕文化科技有限公司 Instruction execution method and device, electronic equipment and storage medium
CN111796796B (en) * 2020-06-12 2022-11-11 杭州云象网络技术有限公司 FPGA storage method, calculation method, module and FPGA board based on sparse matrix multiplication

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6285999B1 (en) * 1997-01-10 2001-09-04 The Board Of Trustees Of The Leland Stanford Junior University Method for node ranking in a linked database
WO2002037796A2 (en) * 2000-10-31 2002-05-10 Daniel Victor Pavin Method and system for determining the popularity of a subject
EP1338966A2 (en) * 2002-02-20 2003-08-27 Microsoft Corporation Social mapping of contacts from computer communication information
US20040111412A1 (en) * 2000-10-25 2004-06-10 Altavista Company Method and apparatus for ranking web page search results
US20040122855A1 (en) * 2002-12-19 2004-06-24 Joann Ruvolo Method of creating and displaying relationship chains between users of a computerized network

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2691559B1 (en) 1992-05-25 1997-01-03 Cegelec REPLICATIVE OBJECT SOFTWARE SYSTEM USING DYNAMIC MESSAGING, IN PARTICULAR FOR REDUNDANT ARCHITECTURE CONTROL / CONTROL INSTALLATION.
US6198390B1 (en) 1994-10-27 2001-03-06 Dan Schlager Self-locating remote monitoring systems
JPH0927811A (en) 1995-07-13 1997-01-28 Fujitsu Ltd Congestion supervisory controller
AU3567997A (en) 1996-07-05 1998-02-02 Paulo Correa Controller-based radio frequency amplifier module and method
CA2219557C (en) 1997-10-29 2002-12-10 Ibm Canada Limited-Ibm Canada Limitee Run-time instrumentation for object oriented programmed applications
US6122628A (en) * 1997-10-31 2000-09-19 International Business Machines Corporation Multidimensional data clustering and dimension reduction for indexing and searching
US6324619B1 (en) 1998-03-27 2001-11-27 Sony Corporation Of Japan Process and system for managing run-time adaptation for general purpose distributed adaptive applications
US6216237B1 (en) 1998-06-19 2001-04-10 Lucent Technologies Inc. Distributed indirect software instrumentation
US6473791B1 (en) 1998-08-17 2002-10-29 Microsoft Corporation Object load balancing
US6295643B1 (en) 1998-12-10 2001-09-25 International Business Machines Corporation Method and apparatus for improving java virtual machine performance using persistent execution information
US6631409B1 (en) 1998-12-23 2003-10-07 Worldcom, Inc. Method and apparatus for monitoring a communications system
US6466980B1 (en) 1999-06-17 2002-10-15 International Business Machines Corporation System and method for capacity shaping in an internet environment
US7188106B2 (en) * 2001-05-01 2007-03-06 International Business Machines Corporation System and method for aggregating ranking results from various sources to improve the results of web searching

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6285999B1 (en) * 1997-01-10 2001-09-04 The Board Of Trustees Of The Leland Stanford Junior University Method for node ranking in a linked database
US20040111412A1 (en) * 2000-10-25 2004-06-10 Altavista Company Method and apparatus for ranking web page search results
WO2002037796A2 (en) * 2000-10-31 2002-05-10 Daniel Victor Pavin Method and system for determining the popularity of a subject
EP1338966A2 (en) * 2002-02-20 2003-08-27 Microsoft Corporation Social mapping of contacts from computer communication information
US20040122855A1 (en) * 2002-12-19 2004-06-24 Joann Ruvolo Method of creating and displaying relationship chains between users of a computerized network

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
DAS T K ET AL: "NetEffect: a network architecture for large-scale multi-user virtual worlds", ACM SYMPOSIUM ON VIRTUAL REALITY SOFTWARE AND TECHNOLOGY, PROCEEDINGS, 1997, pages 157 - 163, XP002365917, Retrieved from the Internet <URL:http://portal.acm.org/citation.cfm?doid=261135.261164> [retrieved on 20060203] *
KAMVAR S ET AL: "Extrapolation Methods for Accelerating PageRank Computations", PROCEEDINGS OF THE TWELFTH INTERNATIONAL WORLD WIDE WEB CONFERENCE, 20 May 2003 (2003-05-20), XP002343504, Retrieved from the Internet <URL:http://dbpubs.stanford.edu:8090/pub/showDoc.Fulltext?lang=en&doc=2003-16&format=pdf&compression=&name=2003-16.pdf> [retrieved on 20050905] *
PAGE L ET AL: "The PageRank citation ranking: bringing order to the web", STANFORD DIGITAL LIBRARIES WORKING PAPER, 1998, pages 1 - 17, XP002951571 *
WANG Z: "Improved Link-Based Algorithms for Ranking Web Pages", June 2003 (2003-06-01), XP002343499, Retrieved from the Internet <URL:http://www.cs.nyu.edu/csweb/Research/TechReports/TR2003-846/TR2003-846.pdf> [retrieved on 20050905] *
ZACK M H: "RESEARCHING ORGANIZATIONAL SYSTEMS USING SOCIAL NETWORK ANALYSIS", PROCEEDINGS OF THE HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES, October 2000 (2000-10-01), pages 1 - 7, XP002205850 *

Also Published As

Publication number Publication date
JP2006065855A (en) 2006-03-09
KR20060048940A (en) 2006-05-18
EP1622047A2 (en) 2006-02-01
US20060026191A1 (en) 2006-02-02
US7379947B2 (en) 2008-05-27
CN1737803A (en) 2006-02-22

Similar Documents

Publication Publication Date Title
EP1622047A3 (en) Efficiently ranking web pages via matrix index manipulation and improved caching
Tendler et al. Evolutionary tradeoffs, Pareto optimality and the morphology of ammonite shells
Camm et al. A note on optimal algorithms for reserve site selection
Mishler Getting rid of species
Azzalini et al. Clustering via nonparametric density estimation
Deng et al. A comparative study on feature weight in text categorization
Eeley et al. Species richness, species range size and ecological specialisation among African primates: geographical patterns and conservation implications
EP1986091A3 (en) Management device and management method
Milewski et al. Proposed standard weight (Ws) equation and length-categorization standards for stream-dwelling brown trout (Salmo trutta)
Casey A comparison of mechanical and energetic estimates of flight cost for hovering sphinx moths
CN106251172A (en) A kind of user based on Information Entropy is worth score calculation method and system
WO2004094995A3 (en) Hierarchical evaluation of cells
Ziegler et al. Macro-and microstructural diversity of sea urchin teeth revealed by large-scale micro-computed tomography survey
CN103279505A (en) Mass data processing method based on semantic meaning
CN106453611A (en) A method and apparatus for load balancing at a plurality of storage nodes
Hyatt et al. Proposed standard-weight (Ws) equations for kokanee, golden trout and bull trout
CN111985770A (en) Ecological space priority identification method based on multiple ecosystem service capability indexes
CN106201903B (en) A kind of three-dimensional flash memory erasing control method and its system
EP1492026A3 (en) System and method for expressing and calculating a relationship between OLAP measures
EP1760646A3 (en) System and method for managing and regulating object allocations
Richards et al. Reinstatement of Indian Ocean Porolithon coarctatum and P. gardineri based on sequencing type specimens, and P. epiphyticum sp. nov.(Corallinales, Rhodophyta), with comments on subfamilies Hydrolithoideae and Metagoniolithoideae
CN106648906A (en) Container cloud resource intelligent recovery and enabling system and method
Threlfall et al. Collection of population-based cancer staging information in Western Australia–a feasibility study
CN105553723B (en) A kind of Virtual Cluster laying method of network flow perception
Gofas et al. Nomenclature of the smaller Mediterranean Cerithium species

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR

AX Request for extension of the european patent

Extension state: AL BA HR MK YU

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

AK Designated contracting states

Kind code of ref document: A3

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR

AX Request for extension of the european patent

Extension state: AL BA HR MK YU

17P Request for examination filed

Effective date: 20060928

17Q First examination report despatched

Effective date: 20061023

AKX Designation fees paid

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20110201