Avenue U

posts(42) comments(0) trackbacks(0)
  • BlogJava
  • 联系
  • RSS 2.0 Feed 聚合
  • 管理

常用链接

  • 我的随笔
  • 我的评论
  • 我的参与

留言簿

  • 给我留言
  • 查看公开留言
  • 查看私人留言

随笔分类

  • C++(1)
  • Core Java(2)
  • My Master-degree Project(33)
  • SSH(4)
  • struts2(1)

随笔档案

  • 2009年7月 (1)
  • 2009年6月 (41)

Core Java

最新随笔

  • 1. String Stream in C++
  • 2. Validators in Struts2
  • 3. An Interceptor Example in Strut2-Spring-Hibernate Application
  • 4. 3 Validators in Struts2-Spring-Hibernate
  • 5. Strut2-Spring-Hibernate under Lomboz Eclipse3.3
  • 6. Run Spring by Maven2 in Vista
  • 7. Appendix B
  • 8. 5 Conclusion
  • 9. 4.7 Sentence Rank on Yahoo News Page
  • 10. 4.6 Sentence Rankv

搜索

  •  

最新评论

阅读排行榜

评论排行榜

View Post

Catalog

List of Tables vii

List of Figures viii

Abstract xi

1.    Introduction. 1

2.    Related Work. 3

2.1      Seung-Taek Park’s Study on Lexical Signature. 3

2.2      Martin Klein and Michael Nelson’s study on Lexical Signature. 4

2.3      Robust Hyperlinks 6

2.4      Michal Cutler’s Study on HTML Structure. 6

2.5      Graph-Based ranking algorithm.. 9

2.5.1       Word-Rank. 12

2.5.2       Word-Rank on Web Pages 13

2.5.3       Sentence-Rank. 14

2.5.4       Sentence-Rank on Web Pages 16

3     Experiments Design and Setup. 18

3.1      Experiments Steps 18

3.2      Search Engine Selection. 20

3.2.1       Google Ajex: 22

3.2.2       Google Base Data API 22

3.2.3       Extract Google Results by Brutal Force. 23

3.2.4       Yahoo web search API and news search API. 23

3.3      Data Set 24

3.3.1       Page Quality. 24

3.3.2       HTML Parsing and Text Extraction. 30

3.3.3       Query Length. 31

3.4      Result Page Comparison. 31

3.5      Deep Web Search Engine. 34

4     Experimental Result and Analysis 35

4.1      The basics 35

4.2      Title. 42

4.3      Google search tips: meta keys and meta description. 43

4.4      Word Rank. 46

4.5      Random pick sentence. 52

4.6      Sentence Rank. 53

4.7      Sentence Rank on Yahoo News Page. 57

5     Conclusion. 59

5.1      Summaries 59

5.2      Limitations 61

5.2.1       HTML Parsing and Text Extraction. 61

5.2.2       Solution. 63

References 64

Appendix A.. 66

Appendix B. 66

Appendix C. 71

Appendix D.. 73

posted on 2009-06-15 04:11 JosephQuinn 阅读(172) 评论(0)  编辑  收藏 所属分类: My Master-degree Project

新用户注册  刷新评论列表  

只有注册用户登录后才能发表评论。


网站导航:
博客园   IT新闻   Chat2DB   C++博客   博问   管理
相关文章:
  • Appendix B
  • 5 Conclusion
  • 4.7 Sentence Rank on Yahoo News Page
  • 4.6 Sentence Rankv
  • 4.5 Random pick sentence
  • 4.4 Word Rank
  • 4.3 Google search tips: meta keys and meta description
  • 4.2 Title
  • 4.1 The basics
  • 3.5 Deep Web Search Engine
 
 
Powered by:
BlogJava
Copyright © JosephQuinn