Home | Business News | Browse by Publication | J | Journal of the Tennessee Academy of Science

Natural networks and the Google[TM] search engine.

Publication: Journal of the Tennessee Academy of Science
Publication Date: 01-JAN-07
Format: Online
Delivery: Immediate Online Access
Full Article Title: Natural networks and the Google[TM] search engine.(Report)

Article Excerpt
ABSTRACT -- The Google internet search engine is well known for its success. In addition to it being of intrinsic interest, the underlying algorithm can be applied to any matrix Markov process and so has broad interdisciplinary applications. This article is directed to readers across the sciences and technology, and is intended to give a generally accessible introduction to the mathematical basis of the Google search algorithm.

**********

Early search engines for the internet gave out long lists of unordered "title matches". In 1998, Brin and Page published their results (Brin and Page, 1998) on PageRank, a new approach to ranking websites, which in part led to the success of Google[TM]. The approach appeals directly to the natural graph structure of the internet. In an efficient way, it uses the citation (link) pattern of the network to rank sites. The result, therefore, does not give some absolute rank of a document, but measures how a document fares in the "citation competition" relative to other documents in the same network. It is commonly supposed that once a month or so Google runs a crawl and selection from the internet to allow for an update of their database and website rankings.

The success of the Google search engine now is common knowledge; and mathematical analysis of the Page Rank algorithm is based on well-known (but non-trivial) results that reach back to the 20th century. The mathematician may recall the fixed-point theorems of Brower, Hahn-Banach (Istratescu, 1981) and the matrix theorem of Perron-Frobenius (Berman and Plemmons, 1994; Meyer, 2000). There is a body of literature on search methods. An excellent survey article that includes a discussion of Page Rank (as well as other eigenvector methods for web information retrieval) is (Langville and Meyer, 2005). A further bibliographical source can be found in (Kamvar et al., 2004; Sankaralingam et al., 2003). Specialized articles take the defining equation for PageRank as a starting point, and go on to consider: Aspects of how to use the defining equation (Craven ace. 2004; Rogers, 2002); Theoretical results on convergence rates (Kamvar et al., 2004); and Illustrations of non-trivial implementation of algorithms, and their refinements (Sankaralingam et al., 2003). There is a very good article for mathematics students (Bryan and Leise, 2006). The derivation of the equation, though, does not follow the Brin and Page approach as such, but is based on the interesting linear "continuity equation" given in (Kleinberg, 1999). Ultimately, the two approaches are mathematically equivalent. The Bryan and Leise article also provides proofs regarding linear algebraic quantities involved in the PageRank formula.

In addition to being of intrinsic interest, the algorithm has broad interdisciplinary significance. Indeed, the Google algorithm can be applied to any matrix Markov process. This short article therefore is expository in nature, and is intended for a general audience in science and technology. Having a modest familiarity with matrix multiplication would be helpful background. For the purposes of illustration, we use only 2 D or 3 D examples. All results extend to the general case.

CITATION RANK OF A WEBSITE

Suppose we have a collection of only three documents A, B and C. Of course, for actual internet searches, a typical...

View this article FREE - Now for a Limited Time, try Goliath Business News
Free for 3 Days!



More articles from Journal of the Tennessee Academy of Science
Florida Ethnobotany.(Book review), January 01, 2007
A People's History of Science: Miners, Midwives, and "Low Mechanicks."..., January 01, 2007

Looking for additional articles?
Search our database of over 3 million articles.

Looking for more in-depth information on this industry?
Search our complete database of Industry & Market reports by text, subject, publication name or publication date.

About Goliath
Whether you're looking for sales prospects, competitive information, company analysis or best practices in managing your organization, Goliath can help you meet your business needs.

Our extensive business information databases empower business professionals with both the breadth and depth of credible, authoritative information they need to support their business goals. Whether it be strategic planning, sales prospecting, company research or defining management best practices - Goliath is your leading source for accurate information.