A Technical Anatomy of Google
Date: April 22, 2008
Research paper by founders Sergey Brin and Lawrence Page
Source: www.51web.biz
The Anatomy of a Large-Scale Hypertextual Web Search Engine
Sergey Brin and Lawrence Page
{sergey, page}@cs.stanford.edu
Computer Science Department, Stanford University, Stanford, CA 94305
Abstract
In this paper, we present Google, a prototype of a large-scale search engine which makes heavy use of the structure present in hypertext. Google is designed to crawl and index the Web efficiently and produce much more satisfying search results than existing systems. The prototype with a full text and hyperlink database of at least 24 million pages is available at http://google.stanford.edu/
To engineer a search engine is a challenging task. Search engines index tens to hundreds of millions of web pages involving a comparable number of distinct terms. They answer tens of millions of queries every day. Despite the importance of large-scale search engines on the web, very little academic research has been done on them. Furthermore, due to rapid advance in technology and web proliferation, creating a web search engine today is very different from three years ago. This paper provides an in-depth description of our large-scale web search engine -- the first such detailed public description we know of to date.
Apart from the problems of scaling traditional search techniques to data of this magnitude, there are new technical challenges involved with using the additional information present in hypertext to produce better search results. This paper addresses this question of how to build a practical large-scale system which can exploit the additional information present in hypertext. Also we look at the problem of how to effectively deal with uncontrolled hypertext collections where anyone can publish anything they want.
