Avenue U

posts(42) comments(0) trackbacks(0)
  • BlogJava
  • 联系
  • RSS 2.0 Feed 聚合
  • 管理

常用链接

  • 我的随笔
  • 我的评论
  • 我的参与

留言簿

  • 给我留言
  • 查看公开留言
  • 查看私人留言

随笔分类

  • C++(1)
  • Core Java(2)
  • My Master-degree Project(33)
  • SSH(4)
  • struts2(1)

随笔档案

  • 2009年7月 (1)
  • 2009年6月 (41)

Core Java

最新随笔

  • 1. String Stream in C++
  • 2. Validators in Struts2
  • 3. An Interceptor Example in Strut2-Spring-Hibernate Application
  • 4. 3 Validators in Struts2-Spring-Hibernate
  • 5. Strut2-Spring-Hibernate under Lomboz Eclipse3.3
  • 6. Run Spring by Maven2 in Vista
  • 7. Appendix B
  • 8. 5 Conclusion
  • 9. 4.7 Sentence Rank on Yahoo News Page
  • 10. 4.6 Sentence Rankv

搜索

  •  

最新评论

阅读排行榜

评论排行榜

View Post

4.7 Sentence Rank on Yahoo News Page

Due to the excellent performance by sentence rank, a further experiment is conducted: applying sentence rank on real news web pages. In this section, due to the length of report, only implement undirected graph and 10 terms per query, the following success retrieve rate shows a high percentage value when the cosine similarity on 2 web pages is applied by using 4.1and 4.2. 10 terms a query means only take first 10 words in the selected sentence including stop words which is consistent with section 4.6. Unlike locating the exact address of a web page itself, this comparison leads to find similar topic document by comparing 2 different URL web pages, the details are all introduced in section 3.4.

  4.1

 4.2

Meanwhile, there are 3 search engines employed in this section: Yahoo News Search, Yahoo Web Search and Google Web Search. Unlike from section 4.6 to section 4.1 which only count URL string match as success retrieval, section 4.7 take document similarity into consideration, and if equation 4.2’s value is bigger than 0.9, which is also permitted in S. T Park and Xiaojun Wang’s research, a success retrieval is considered effective. There are 183 pages in this section which are all from May 4, 2009, Yahoo News, and all related URL addresses are listed in Appendix D.

 

 

Success Counts

Success Rate

Yahoo News Search

171

93.44%

Yahoo Web Search

178

97.27%

Google Web Search

177

96.72%

Table4.29

 

(a)                                                                                        (b)

Figure4.32

As Figure4.32’s (b) shows, the success rate is above 90% which satisfies the project’s initial requirements by applying a single text retrieval method.

posted on 2009-06-18 12:16 JosephQuinn 阅读(400) 评论(0)  编辑  收藏 所属分类: My Master-degree Project

新用户注册  刷新评论列表  

只有注册用户登录后才能发表评论。


网站导航:
博客园   IT新闻   Chat2DB   C++博客   博问   管理
相关文章:
  • Appendix B
  • 5 Conclusion
  • 4.7 Sentence Rank on Yahoo News Page
  • 4.6 Sentence Rankv
  • 4.5 Random pick sentence
  • 4.4 Word Rank
  • 4.3 Google search tips: meta keys and meta description
  • 4.2 Title
  • 4.1 The basics
  • 3.5 Deep Web Search Engine
 
 
Powered by:
BlogJava
Copyright © JosephQuinn