﻿<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:trackback="http://madskills.com/public/xml/rss/module/trackback/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/"><channel><title>BlogJava-paulwong-随笔分类-LOG ANALYST BIG DATA SYSTEM</title><link>http://www.blogjava.net/paulwong/category/53880.html</link><description /><language>zh-cn</language><lastBuildDate>Tue, 17 Feb 2015 16:06:19 GMT</lastBuildDate><pubDate>Tue, 17 Feb 2015 16:06:19 GMT</pubDate><ttl>60</ttl><item><title>开源分布式搜索平台ELK+Redis+Syslog-ng实现日志实时搜索</title><link>http://www.blogjava.net/paulwong/archive/2015/02/17/422972.html</link><dc:creator>paulwong</dc:creator><author>paulwong</author><pubDate>Tue, 17 Feb 2015 08:18:00 GMT</pubDate><guid>http://www.blogjava.net/paulwong/archive/2015/02/17/422972.html</guid><wfw:comment>http://www.blogjava.net/paulwong/comments/422972.html</wfw:comment><comments>http://www.blogjava.net/paulwong/archive/2015/02/17/422972.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://www.blogjava.net/paulwong/comments/commentRss/422972.html</wfw:commentRss><trackback:ping>http://www.blogjava.net/paulwong/services/trackbacks/422972.html</trackback:ping><description><![CDATA[&nbsp;&nbsp;&nbsp;&nbsp; 摘要: logstash + elasticsearch + Kibana+Redis+Syslog-ngElasticSearch是一个基于Lucene构建的开源，分布式，RESTful搜索引擎。设计用于云计算中，能够达到实时搜索，稳定，可靠，快速，安装使用方便。支持通过HTTP使用JSON进行数据索引。logstash是一个应用程序日志、事件的传输、处理、管理和搜索的平台。你可以用它来统一对应用程序日...&nbsp;&nbsp;<a href='http://www.blogjava.net/paulwong/archive/2015/02/17/422972.html'>阅读全文</a><img src ="http://www.blogjava.net/paulwong/aggbug/422972.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://www.blogjava.net/paulwong/" target="_blank">paulwong</a> 2015-02-17 16:18 <a href="http://www.blogjava.net/paulwong/archive/2015/02/17/422972.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>用Kibana和logstash快速搭建实时日志查询、收集与分析系统</title><link>http://www.blogjava.net/paulwong/archive/2014/09/30/418428.html</link><dc:creator>paulwong</dc:creator><author>paulwong</author><pubDate>Tue, 30 Sep 2014 05:14:00 GMT</pubDate><guid>http://www.blogjava.net/paulwong/archive/2014/09/30/418428.html</guid><wfw:comment>http://www.blogjava.net/paulwong/comments/418428.html</wfw:comment><comments>http://www.blogjava.net/paulwong/archive/2014/09/30/418428.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://www.blogjava.net/paulwong/comments/commentRss/418428.html</wfw:commentRss><trackback:ping>http://www.blogjava.net/paulwong/services/trackbacks/418428.html</trackback:ping><description><![CDATA[&nbsp;&nbsp;&nbsp;&nbsp; 摘要: &nbsp; Logstash是一个完全开源的工具，他可以对你的日志进行收集、分析，并将其存储供以后使用（如，搜索），您可以使用它。说到搜索，logstash带有一个web界面，搜索和展示所有日志。kibana 也是一个开源和免费的工具，他可以帮助您汇总、分析和搜索重要数据日志并提供友好的web界面。他可以为 Logstash 和 ElasticSearch 提供的日志分析的 Web 界面说到这里...&nbsp;&nbsp;<a href='http://www.blogjava.net/paulwong/archive/2014/09/30/418428.html'>阅读全文</a><img src ="http://www.blogjava.net/paulwong/aggbug/418428.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://www.blogjava.net/paulwong/" target="_blank">paulwong</a> 2014-09-30 13:14 <a href="http://www.blogjava.net/paulwong/archive/2014/09/30/418428.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>logstash最佳实践</title><link>http://www.blogjava.net/paulwong/archive/2014/09/30/418423.html</link><dc:creator>paulwong</dc:creator><author>paulwong</author><pubDate>Tue, 30 Sep 2014 03:24:00 GMT</pubDate><guid>http://www.blogjava.net/paulwong/archive/2014/09/30/418423.html</guid><wfw:comment>http://www.blogjava.net/paulwong/comments/418423.html</wfw:comment><comments>http://www.blogjava.net/paulwong/archive/2014/09/30/418423.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://www.blogjava.net/paulwong/comments/commentRss/418423.html</wfw:commentRss><trackback:ping>http://www.blogjava.net/paulwong/services/trackbacks/418423.html</trackback:ping><description><![CDATA[0. 简介<br /><br /><br />1. 基础知识<br />1.1. 介绍<br />1.2. 安装<br />1.3. Hello World<br />1.4. 配置语法<br /><br /><br />2. 输入插件(Input)<br />2.1. 标准输入(Stdin)<br />2.2. 读取文件(File)<br />2.3. 读取网络数据(TCP)<br />2.4. 读取 Syslog 数据<br />2.5. 读取 Redis 数据<br /><br /><br />3. 编码插件(Codec)<br />3.1. 采用 JSON 编码<br />3.2. 合并多行数据(Multiline)<br /><br /><br />4. 过滤器插件(Filter)<br />4.1. Grok 正则捕获<br />4.2. 时间处理(Date)<br />4.3. 数据修改(Mutate)<br />4.4. GeoIP 查询归类<br />4.5. UserAgent 匹配归类<br />4.6. Key-Value 切分<br />4.7. 随心所欲的 Ruby 处理<br />4.8. 数值统计(Metrics)<br /><br /><br />5. 输出插件(Output)<br />5.1. 标准输出(Stdout)<br />5.2. 保存成文件(File)<br />5.3. 保存进 Elasticsearch<br />5.4. 输出到 Redis<br />5.5. 输出到 Statsd<br />5.6. 报警到 Nagios<br />5.7. 发送邮件(Email)<br />5.8. 调用命令执行(Exec)<br /><br /><br />6. 尚未进入官方库的常用插件<br />6.1. Kafka<br />6.2. HDFS<br />6.3. Scribe<br /><br /><br />7. 深入了解<br />7.1. 自己写一个插件<br />7.2. 为什么用 JRuby? 能用 MRI 运<br />7.3. 其他类似项目<img src ="http://www.blogjava.net/paulwong/aggbug/418423.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://www.blogjava.net/paulwong/" target="_blank">paulwong</a> 2014-09-30 11:24 <a href="http://www.blogjava.net/paulwong/archive/2014/09/30/418423.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>Logstash logo开源日志管理 Logstash</title><link>http://www.blogjava.net/paulwong/archive/2014/08/20/417134.html</link><dc:creator>paulwong</dc:creator><author>paulwong</author><pubDate>Wed, 20 Aug 2014 01:22:00 GMT</pubDate><guid>http://www.blogjava.net/paulwong/archive/2014/08/20/417134.html</guid><wfw:comment>http://www.blogjava.net/paulwong/comments/417134.html</wfw:comment><comments>http://www.blogjava.net/paulwong/archive/2014/08/20/417134.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://www.blogjava.net/paulwong/comments/commentRss/417134.html</wfw:commentRss><trackback:ping>http://www.blogjava.net/paulwong/services/trackbacks/417134.html</trackback:ping><description><![CDATA[logstash日志处理采用队列ZMQ，压力会很好的被缓冲，针对高并发的大数据量的日志处理是没有问题的，日志利用ES存放，就是个基于lucene的全文检索数据库，也不存在数据量的问题。<br />
<br />logstash 是一个应用程序日志、事件的传输、处理、管理和搜索的平台。<br />你可以用它来统一对应用程序日志进行收集管理，提供 Web 接口用于查询和统计。<br /><br />
<img src="http://static.oschina.net/uploads/img/201203/15125939_WkBe.png" alt="logstash screenshot" style="margin: 0px; padding: 0px; border: 0px; max-width: 700px;" /><p>&nbsp;</p>
<div><a href="http://logstash.net/docs/1.4.2/tutorials/getting-started-with-logstash" target="_blank">http://logstash.net/docs/1.4.2/tutorials/getting-started-with-logstash</a></div><img src ="http://www.blogjava.net/paulwong/aggbug/417134.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://www.blogjava.net/paulwong/" target="_blank">paulwong</a> 2014-08-20 09:22 <a href="http://www.blogjava.net/paulwong/archive/2014/08/20/417134.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>NoSql存储日志数据之Spring+Logback+Hbase深度集成</title><link>http://www.blogjava.net/paulwong/archive/2014/07/05/415490.html</link><dc:creator>paulwong</dc:creator><author>paulwong</author><pubDate>Sat, 05 Jul 2014 15:14:00 GMT</pubDate><guid>http://www.blogjava.net/paulwong/archive/2014/07/05/415490.html</guid><wfw:comment>http://www.blogjava.net/paulwong/comments/415490.html</wfw:comment><comments>http://www.blogjava.net/paulwong/archive/2014/07/05/415490.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://www.blogjava.net/paulwong/comments/commentRss/415490.html</wfw:commentRss><trackback:ping>http://www.blogjava.net/paulwong/services/trackbacks/415490.html</trackback:ping><description><![CDATA[<br /><a href="http://www.cnblogs.com/xguo/p/3298956.html" target="_blank">http://www.cnblogs.com/xguo/p/3298956.html</a><img src ="http://www.blogjava.net/paulwong/aggbug/415490.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://www.blogjava.net/paulwong/" target="_blank">paulwong</a> 2014-07-05 23:14 <a href="http://www.blogjava.net/paulwong/archive/2014/07/05/415490.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>Will be reviewing a new Apache Nutch book by Packt</title><link>http://www.blogjava.net/paulwong/archive/2014/01/28/409411.html</link><dc:creator>paulwong</dc:creator><author>paulwong</author><pubDate>Tue, 28 Jan 2014 12:00:00 GMT</pubDate><guid>http://www.blogjava.net/paulwong/archive/2014/01/28/409411.html</guid><wfw:comment>http://www.blogjava.net/paulwong/comments/409411.html</wfw:comment><comments>http://www.blogjava.net/paulwong/archive/2014/01/28/409411.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://www.blogjava.net/paulwong/comments/commentRss/409411.html</wfw:commentRss><trackback:ping>http://www.blogjava.net/paulwong/services/trackbacks/409411.html</trackback:ping><description><![CDATA[Will be reviewing a new Apache Nutch book by Packt:&nbsp;<a href="http://www.packtpub.com/web-crawling-and-data-mining-with-apache-nutch/book" target="_blank">http://www.packtpub.com/web-crawling-and-data-mining-with-apache-nutch/book</a>
<img src ="http://www.blogjava.net/paulwong/aggbug/409411.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://www.blogjava.net/paulwong/" target="_blank">paulwong</a> 2014-01-28 20:00 <a href="http://www.blogjava.net/paulwong/archive/2014/01/28/409411.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>ELASTICSEARCH资源</title><link>http://www.blogjava.net/paulwong/archive/2013/09/12/404004.html</link><dc:creator>paulwong</dc:creator><author>paulwong</author><pubDate>Thu, 12 Sep 2013 09:51:00 GMT</pubDate><guid>http://www.blogjava.net/paulwong/archive/2013/09/12/404004.html</guid><wfw:comment>http://www.blogjava.net/paulwong/comments/404004.html</wfw:comment><comments>http://www.blogjava.net/paulwong/archive/2013/09/12/404004.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://www.blogjava.net/paulwong/comments/commentRss/404004.html</wfw:commentRss><trackback:ping>http://www.blogjava.net/paulwong/services/trackbacks/404004.html</trackback:ping><description><![CDATA[加索引：<br />
<a href="http://www.elasticsearch.org/guide/reference/api/index_/" target="_blank">http://www.elasticsearch.org/guide/reference/api/index_/<br />
<br />
<br />
</a>查询：<br />
<a href="http://www.elasticsearch.org/blog/your-data-your-search/" target="_blank">http://www.elasticsearch.org/blog/your-data-your-search/</a><br />
<br />
<br />
JAVA API，注意端口是9300，不是9200<br />
<a href="http://stackoverflow.com/questions/16670219/why-cant-i-connect-to-elasticsearch-through-java-api" target="_blank">http://stackoverflow.com/questions/16670219/why-cant-i-connect-to-elasticsearch-through-java-api<br />
</a><a href="http://www.elasticsearch.org/guide/reference/java-api/client/">http://www.elasticsearch.org/guide/reference/java-api/client/</a>
<br /><br /><br />书籍<br /><a href="http://fuxiaopang.gitbooks.io/learnelasticsearch/getting_started/README.html" target="_blank">http://fuxiaopang.gitbooks.io/learnelasticsearch/getting_started/README.html</a><br /><br /><br /><img src ="http://www.blogjava.net/paulwong/aggbug/404004.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://www.blogjava.net/paulwong/" target="_blank">paulwong</a> 2013-09-12 17:51 <a href="http://www.blogjava.net/paulwong/archive/2013/09/12/404004.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>KAFKA资源</title><link>http://www.blogjava.net/paulwong/archive/2013/09/11/403955.html</link><dc:creator>paulwong</dc:creator><author>paulwong</author><pubDate>Wed, 11 Sep 2013 07:22:00 GMT</pubDate><guid>http://www.blogjava.net/paulwong/archive/2013/09/11/403955.html</guid><wfw:comment>http://www.blogjava.net/paulwong/comments/403955.html</wfw:comment><comments>http://www.blogjava.net/paulwong/archive/2013/09/11/403955.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://www.blogjava.net/paulwong/comments/commentRss/403955.html</wfw:commentRss><trackback:ping>http://www.blogjava.net/paulwong/services/trackbacks/403955.html</trackback:ping><description><![CDATA[Running a Multi-Broker Apache Kafka 0.8 Cluster on a Single Node<br />
<a href="http://www.michael-noll.com/blog/2013/03/13/running-a-multi-broker-apache-kafka-cluster-on-a-single-node/" target="_blank">http://www.michael-noll.com/blog/2013/03/13/running-a-multi-broker-apache-kafka-cluster-on-a-single-node/<br />
<br />
<br />
</a>
Kafka部署与代码实例<br />
<a href="http://shift-alt-ctrl.iteye.com/blog/1930791" target="_blank">http://shift-alt-ctrl.iteye.com/blog/1930791</a>&nbsp;<br /><br />Flume-ng+Kafka+storm的学习笔记<br /><a href="http://blog.csdn.net/zxcvg/article/details/18600335" target="_blank">http://blog.csdn.net/zxcvg/article/details/18600335</a><br /><br /><br />Storm应用系列之&#8212;&#8212;集成Kafka(0.8版的KAFKA)<br /><a href="http://blog.csdn.net/xeseo/article/details/18615761" target="_blank">http://blog.csdn.net/xeseo/article/details/18615761</a><img src ="http://www.blogjava.net/paulwong/aggbug/403955.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://www.blogjava.net/paulwong/" target="_blank">paulwong</a> 2013-09-11 15:22 <a href="http://www.blogjava.net/paulwong/archive/2013/09/11/403955.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>STORM启动与部署TOPOLOGY</title><link>http://www.blogjava.net/paulwong/archive/2013/09/11/403942.html</link><dc:creator>paulwong</dc:creator><author>paulwong</author><pubDate>Wed, 11 Sep 2013 03:00:00 GMT</pubDate><guid>http://www.blogjava.net/paulwong/archive/2013/09/11/403942.html</guid><wfw:comment>http://www.blogjava.net/paulwong/comments/403942.html</wfw:comment><comments>http://www.blogjava.net/paulwong/archive/2013/09/11/403942.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://www.blogjava.net/paulwong/comments/commentRss/403942.html</wfw:commentRss><trackback:ping>http://www.blogjava.net/paulwong/services/trackbacks/403942.html</trackback:ping><description><![CDATA[<ol><li>启动ZOOPKEEPER<div style="padding: 4px 5px 4px 4px; border: 1px solid rgb(204, 204, 204); width: 98%; font-size: 13px; -ms-word-break: break-all; background-color: rgb(238, 238, 238);"><!--<br /><br />Code highlighting produced by Actipro CodeHighlighter (freeware)<br />http://www.CodeHighlighter.com/<br /><br />--><img align="top" alt="" src="http://www.blogjava.net/images/OutliningIndicators/None.gif" /><span style="color: rgb(0, 0, 0);">zkServer.sh&nbsp;start</span></div></li><li>启动NIMBUS<div style="padding: 4px 5px 4px 4px; border: 1px solid rgb(204, 204, 204); width: 98%; font-size: 13px; -ms-word-break: break-all; background-color: rgb(238, 238, 238);"><!--<br /><br />Code highlighting produced by Actipro CodeHighlighter (freeware)<br />http://www.CodeHighlighter.com/<br /><br />--><img align="top" alt="" src="http://www.blogjava.net/images/OutliningIndicators/None.gif" /><span style="color: rgb(0, 0, 0);">storm&nbsp;nimbus&nbsp;&amp;</span></div></li><li>启动SUPERVISOR<div style="padding: 4px 5px 4px 4px; border: 1px solid rgb(204, 204, 204); width: 98%; font-size: 13px; -ms-word-break: break-all; background-color: rgb(238, 238, 238);"><!--<br /><br />Code highlighting produced by Actipro CodeHighlighter (freeware)<br />http://www.CodeHighlighter.com/<br /><br />--><img align="top" alt="" src="http://www.blogjava.net/images/OutliningIndicators/None.gif" /><span style="color: rgb(0, 0, 0);">storm&nbsp;supervisor&nbsp;&amp;</span></div></li><li>启动UI<div style="padding: 4px 5px 4px 4px; border: 1px solid rgb(204, 204, 204); width: 98%; font-size: 13px; -ms-word-break: break-all; background-color: rgb(238, 238, 238);"><!--<br /><br />Code highlighting produced by Actipro CodeHighlighter (freeware)<br />http://www.CodeHighlighter.com/<br /><br />--><img align="top" alt="" src="http://www.blogjava.net/images/OutliningIndicators/None.gif" /><span style="color: rgb(0, 0, 0);">storm ui &amp;</span></div></li><li>部署TOPOLOGY<div style="padding: 4px 5px 4px 4px; border: 1px solid rgb(204, 204, 204); width: 98%; font-size: 13px; -ms-word-break: break-all; background-color: rgb(238, 238, 238);"><!--<br /><br />Code highlighting produced by Actipro CodeHighlighter (freeware)<br />http://www.CodeHighlighter.com/<br /><br />--><img align="top" alt="" src="http://www.blogjava.net/images/OutliningIndicators/None.gif" /><span style="color: rgb(0, 0, 0);">storm jar /opt/hadoop/loganalyst/storm-dependend/data/teststorm-1.0.jar teststorm.TopologyMain /opt/hadoop/loganalyst/storm-dependend/data/words.txt</span></div></li><li>删除TOPOLOGY<div style="padding: 4px 5px 4px 4px; border: 1px solid rgb(204, 204, 204); width: 98%; font-size: 13px; -ms-word-break: break-all; background-color: rgb(238, 238, 238);"><!--<br /><br />Code highlighting produced by Actipro CodeHighlighter (freeware)<br />http://www.CodeHighlighter.com/<br /><br />--><img align="top" alt="" src="http://www.blogjava.net/images/OutliningIndicators/None.gif" /><span style="color: rgb(0, 0, 0);">storm&nbsp;kill&nbsp;{toponame}</span></div></li><li>激活TOPOLOGY<div style="padding: 4px 5px 4px 4px; border: 1px solid rgb(204, 204, 204); width: 98%; font-size: 13px; -ms-word-break: break-all; background-color: rgb(238, 238, 238);"><!--<br /><br />Code highlighting produced by Actipro CodeHighlighter (freeware)<br />http://www.CodeHighlighter.com/<br /><br />--><img align="top" src="http://www.blogjava.net/images/OutliningIndicators/None.gif"  alt="" /><span style="color: rgb(0, 0, 0);">storm&nbsp;active&nbsp;{toponame}</span></div></li><li>不激活TOPOLOGY<div style="padding: 4px 5px 4px 4px; border: 1px solid rgb(204, 204, 204); width: 98%; font-size: 13px; -ms-word-break: break-all; background-color: rgb(238, 238, 238);"><!--<br /><br />Code highlighting produced by Actipro CodeHighlighter (freeware)<br />http://www.CodeHighlighter.com/<br /><br />--><img align="top" src="http://www.blogjava.net/images/OutliningIndicators/None.gif"  alt="" /><span style="color: rgb(0, 0, 0);">storm&nbsp;deactive&nbsp;{toponame}</span></div></li><li>列出所有TOPOLOGY<div style="padding: 4px 5px 4px 4px; border: 1px solid rgb(204, 204, 204); width: 98%; font-size: 13px; -ms-word-break: break-all; background-color: rgb(238, 238, 238);"><!--<br /><br />Code highlighting produced by Actipro CodeHighlighter (freeware)<br />http://www.CodeHighlighter.com/<br /><br />--><img align="top" src="http://www.blogjava.net/images/OutliningIndicators/None.gif"  alt="" /><span style="color: rgb(0, 0, 0);">storm&nbsp;list</span></div><br /><br /><br /></li></ol> 
 <img src ="http://www.blogjava.net/paulwong/aggbug/403942.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://www.blogjava.net/paulwong/" target="_blank">paulwong</a> 2013-09-11 11:00 <a href="http://www.blogjava.net/paulwong/archive/2013/09/11/403942.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>STORM资源</title><link>http://www.blogjava.net/paulwong/archive/2013/09/08/403826.html</link><dc:creator>paulwong</dc:creator><author>paulwong</author><pubDate>Sun, 08 Sep 2013 11:59:00 GMT</pubDate><guid>http://www.blogjava.net/paulwong/archive/2013/09/08/403826.html</guid><wfw:comment>http://www.blogjava.net/paulwong/comments/403826.html</wfw:comment><comments>http://www.blogjava.net/paulwong/archive/2013/09/08/403826.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://www.blogjava.net/paulwong/comments/commentRss/403826.html</wfw:commentRss><trackback:ping>http://www.blogjava.net/paulwong/services/trackbacks/403826.html</trackback:ping><description><![CDATA[Install Storm<br /><a href="http://www.jansipke.nl/installing-a-storm-cluster-on-centos-hosts/" target="_blank">http://www.jansipke.nl/installing-a-storm-cluster-on-centos-hosts/</a><br /><a href="http://www.cnblogs.com/kemaswill/archive/2012/10/24/2737833.html" target="_blank">http://www.cnblogs.com/kemaswill/archive/2012/10/24/2737833.html</a><br /><a href="http://abentotoro.blog.sohu.com/197023262.html" target="_blank">http://abentotoro.blog.sohu.com/197023262.html</a><br /><a href="http://www.cnblogs.com/panfeng412/archive/2012/11/30/how-to-install-and-deploy-storm-cluster.html" target="_blank">http://www.cnblogs.com/panfeng412/archive/2012/11/30/how-to-install-and-deploy-storm-cluster.html</a><br /><br /><br />使用 Twitter Storm 处理实时的大数据<br /><a href="http://www.ibm.com/developerworks/cn/opensource/os-twitterstorm/" target="_blank">http://www.ibm.com/developerworks/cn/opensource/os-twitterstorm/</a><br /><br /><br />Storm数据流模型的分析及讨论<br /><a href="http://www.cnblogs.com/panfeng412/archive/2012/07/29/storm-stream-model-analysis-and-discussion.html" target="_blank">http://www.cnblogs.com/panfeng412/archive/2012/07/29/storm-stream-model-analysis-and-discussion.html</a><br /><a href="http://www.cnblogs.com/panfeng412/tag/Storm/" target="_blank">http://www.cnblogs.com/panfeng412/tag/Storm/</a><br /><br /><br />storm-kafka<br /><a href="https://github.com/nathanmarz/storm-contrib/tree/master/storm-kafka" target="_blank">https://github.com/nathanmarz/storm-contrib/tree/master/storm-kafka</a><br /><br /><br />使用Storm实现实时大数据分析！<br /><a href="http://www.csdn.net/article/2012-12-24/2813117-storm-realtime-big-data-analysis" target="_blank">http://www.csdn.net/article/2012-12-24/2813117-storm-realtime-big-data-analysis</a><br /><br /><br />storm-deploy-aws<br /><a href="https://github.com/nathanmarz/storm-deploy/wiki" target="_blank">https://github.com/nathanmarz/storm-deploy/wiki</a><br /><br /><br />!!!知乎网站上的Twitter Storm<br /><a href="http://www.zhihu.com/topic/19673110" target="_blank">http://www.zhihu.com/topic/19673110</a><br /><br /><br />storm-elastic-search<br /><a href="https://github.com/hmsonline/storm-elastic-search" target="_blank">https://github.com/hmsonline/storm-elastic-search</a><br /><br /><br />storm-examples<br /><a href="https://github.com/stormprocessor/storm-examples" target="_blank">https://github.com/stormprocessor/storm-examples</a><br /><br /><br />kafka-aws<br /><a href="https://github.com/nathanmarz/kafka-deploy" target="_blank">https://github.com/nathanmarz/kafka-deploy</a><br /> 
 
 
 
 
<br /><br />Next Gen Real-time Streaming with Storm-Kafka Integration<br /><a href="http://blog.infochimps.com/2012/10/30/next-gen-real-time-streaming-storm-kafka-integration/" target="_blank">http://blog.infochimps.com/2012/10/30/next-gen-real-time-streaming-storm-kafka-integration/</a><br /><br /><br />flume+kafka+storm+mysql 数据流 <br /><a href="http://blog.csdn.net/baiyangfu/article/details/8096088" target="_blank">http://blog.csdn.net/baiyangfu/article/details/8096088</a><br /><a href="http://blog.csdn.net/baiyangfu/article/category/1244640" target="_blank">http://blog.csdn.net/baiyangfu/article/category/1244640</a><br /><br /><br />Kafka学习笔记 <br /><a href="http://blog.csdn.net/baiyangfu/article/details/8096084" target="_blank">http://blog.csdn.net/baiyangfu/article/details/8096084</a><br /><br /><br />STORM+KAFKA<br /><a href="https://github.com/buildlackey/cep" target="_blank">https://github.com/buildlackey/cep</a><br /><br /><br />STORM+KETTLE<br /><a href="https://github.com/buildlackey/kettle-storm" target="_blank">https://github.com/buildlackey/kettle-storm</a><br /><br /><img src ="http://www.blogjava.net/paulwong/aggbug/403826.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://www.blogjava.net/paulwong/" target="_blank">paulwong</a> 2013-09-08 19:59 <a href="http://www.blogjava.net/paulwong/archive/2013/09/08/403826.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>STORM与HADOOP的比较</title><link>http://www.blogjava.net/paulwong/archive/2013/09/08/403824.html</link><dc:creator>paulwong</dc:creator><author>paulwong</author><pubDate>Sun, 08 Sep 2013 11:49:00 GMT</pubDate><guid>http://www.blogjava.net/paulwong/archive/2013/09/08/403824.html</guid><wfw:comment>http://www.blogjava.net/paulwong/comments/403824.html</wfw:comment><comments>http://www.blogjava.net/paulwong/archive/2013/09/08/403824.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://www.blogjava.net/paulwong/comments/commentRss/403824.html</wfw:commentRss><trackback:ping>http://www.blogjava.net/paulwong/services/trackbacks/403824.html</trackback:ping><description><![CDATA[对于一堆时刻在增长的数据，如果要统计，可以采取什么方法呢？<br /><ol><li>等数据增长到一定程度的时候，跑一个统计程序进行统计。适用于实时性要求不高的场景。<br />如将数据导到HDFS，再运行一个MAP REDUCE JOB。<br /></li><li>如果实时性要求高的，上面的方法就不行了。因此就带来第二种方法。<br />在数据每次增长一笔的时候，就进行统计JOB，结果放到DB或搜索引擎的INDEX中。<br />STORM就是完成这种工作的。</li></ol><br />HADOOP与STORM比较<br /><ol><li>数据来源：HADOOP是HDFS上某个文件夹下的可能是成TB的数据，STORM是实时新增的某一笔数据</li><li>处理过程：HADOOP是分MAP阶段到REDUCE阶段，STORM是由用户定义处理流程，<br />流程中可以包含多个步骤，每个步骤可以是数据源(SPOUT)或处理逻辑(BOLT)</li><li>是否结束：HADOOP最后是要结束的，STORM是没有结束状态，到最后一步时，就停在那，直到有新<br />数据进入时再从头开始</li><li>处理速度：HADOOP是以处理HDFS上大量数据为目的，速度慢，STORM是只要处理新增的某一笔数据即可<br />可以做到很快。</li><li>适用场景：HADOOP是在要处理一批数据时用的，不讲究时效性，要处理就提交一个JOB，STORM是要处理<br />某一新增数据时用的，要讲时效性<br /></li><li>与MQ对比：HADOOP没有对比性，STORM可以看作是有N个步骤，每个步骤处理完就向下一个MQ发送消息，<br />监听这个MQ的消费者继续处理<br /><br /></li></ol><img src ="http://www.blogjava.net/paulwong/aggbug/403824.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://www.blogjava.net/paulwong/" target="_blank">paulwong</a> 2013-09-08 19:49 <a href="http://www.blogjava.net/paulwong/archive/2013/09/08/403824.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>为什么 Storm 比 Hadoop 快？是由哪几个方面决定的？</title><link>http://www.blogjava.net/paulwong/archive/2013/09/08/403822.html</link><dc:creator>paulwong</dc:creator><author>paulwong</author><pubDate>Sun, 08 Sep 2013 10:12:00 GMT</pubDate><guid>http://www.blogjava.net/paulwong/archive/2013/09/08/403822.html</guid><wfw:comment>http://www.blogjava.net/paulwong/comments/403822.html</wfw:comment><comments>http://www.blogjava.net/paulwong/archive/2013/09/08/403822.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://www.blogjava.net/paulwong/comments/commentRss/403822.html</wfw:commentRss><trackback:ping>http://www.blogjava.net/paulwong/services/trackbacks/403822.html</trackback:ping><description><![CDATA[首先要明白Storm和Hadoop的应用领域，注意加粗、标红的关键字。<br /><br />Hadoop是基于Map/Reduce模型的，处理海量数据的离线分析工具。<br />Storm是分布式的、实时数据流分析工具，数据是源源不断产生的，例如Twitter的Timeline。<br /><br />再回到你说的速度问题，只能说Storm更适用于实时数据流，Map/Reduce模型在实时领域很难有所发挥，不能简单粗暴的说谁快谁慢。<br /><br /><hr /><br />这里的快主要是指的时延。<br /><br />storm的网络直传、内存计算，其时延必然比hadoop的通过hdfs传输低得多；当计算模型比较适合流式时，storm的流式处理，省去了批处理的收集数据的时间；因为storm是服务型的作业，也省去了作业调度的时延。所以从时延上来看，storm要快于hadoop。<br /><br />说一个典型的场景，几千个日志生产方产生日志文件，需要进行一些ETL操作存入一个数据库。<br /><br />假设利用hadoop，则需要先存入hdfs，按每一分钟切一个文件的粒度来算（这个粒度已经极端的细了，再小的话hdfs上会一堆小文件），hadoop开始计算时，1分钟已经过去了，然后再开始调度任务又花了一分钟，然后作业运行起来，假设机器特别多，几钞钟就算完了，然后写数据库假设也花了很少的时间，这样，从数据产生到最后可以使用已经过去了至少两分多钟。<br /><br />而流式计算则是数据产生时，则有一个程序去一直监控日志的产生，产生一行就通过一个传输系统发给流式计算系统，然后流式计算系统直接处理，处理完之后直接写入数据库，每条数据从产生到写入数据库，在资源充足时可以在毫秒级别完成。<br /><br /><br />当然，跑一个大文件的wordcount，本来就是一个批处理计算的模型，你非要把它放到storm上进行流式的处理，然后又非要让等所有已有数据处理完才让storm输出结果，这时候，你再把它和hadoop比较快慢，这时，其实比较的不是时延，而是比较的吞吐了。<br /><br /><hr /><br />Hadoop M/R基于HDFS，需要切分输入数据、产生中间数据文件、排序、数据压缩、多份复制等，效率较低。<br /><br />Storm 基于ZeroMQ这个高性能的消息通讯库，不持久化数据。<img src ="http://www.blogjava.net/paulwong/aggbug/403822.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://www.blogjava.net/paulwong/" target="_blank">paulwong</a> 2013-09-08 18:12 <a href="http://www.blogjava.net/paulwong/archive/2013/09/08/403822.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>linkedin高吞吐量分布式消息系统kafka使用手记</title><link>http://www.blogjava.net/paulwong/archive/2013/09/08/403821.html</link><dc:creator>paulwong</dc:creator><author>paulwong</author><pubDate>Sun, 08 Sep 2013 09:32:00 GMT</pubDate><guid>http://www.blogjava.net/paulwong/archive/2013/09/08/403821.html</guid><wfw:comment>http://www.blogjava.net/paulwong/comments/403821.html</wfw:comment><comments>http://www.blogjava.net/paulwong/archive/2013/09/08/403821.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://www.blogjava.net/paulwong/comments/commentRss/403821.html</wfw:commentRss><trackback:ping>http://www.blogjava.net/paulwong/services/trackbacks/403821.html</trackback:ping><description><![CDATA[kafka是一种高吞吐量的分布式发布订阅消息系统，她有如下特性：<br /><br />通过O(1)的磁盘数据结构提供消息的持久化，这种结构对于即使数以TB的消息存储也能够保持长时间的稳定性能。<br />高吞吐量：即使是非常普通的硬件kafka也可以支持每秒数十万的消息。<br />支持通过kafka服务器和消费机集群来分区消息。<br />支持Hadoop并行数据加载。<br /><br />设计侧重高吞吐量，用于好友动态，相关性统计，排行统计，访问频率控制，批处理等系统。大部分的消息中间件能够处理实时性要求高的消息/数据，但是对于队列中大量未处理的消息/数据在持久性方面比较弱。<br /><br />kakfa的consumer使用拉的方式工作。<br /><br /><br /><strong>安装kafka</strong><br />下载：http://people.apache.org/~nehanarkhede/kafka-0.7.0-incubating/kafka-0.7.0-incubating-src.tar.gz<br /><br />&gt; tar xzf kafka-.tgz<br />&gt; cd kafka-<br />&gt; ./sbt update<br />&gt; ./sbt package<br />启动zkserver:<br />bin/zookeeper-server-start.sh config/zookeeper.properties<br />启动server:<br />bin/kafka-server-start.sh config/server.properties<br />就是这么简单。<br /><br /><br /><strong>使用kafka<br /></strong><div style="background-color: #eeeeee; font-size: 13px; border: 1px solid #cccccc; padding: 4px 5px 4px 4px; width: 98%; word-break: break-all;"><!--<br /><br />Code highlighting produced by Actipro CodeHighlighter (freeware)<br />http://www.CodeHighlighter.com/<br /><br />--><span style="color: #0000FF; ">import</span>&nbsp;java.util.Arrays;&nbsp;&nbsp;<br /><span style="color: #0000FF; ">import</span>&nbsp;java.util.List;&nbsp;&nbsp;<br /><span style="color: #0000FF; ">import</span>&nbsp;java.util.Properties;&nbsp;&nbsp;<br /><span style="color: #0000FF; ">import</span>&nbsp;kafka.javaapi.producer.SyncProducer;&nbsp;&nbsp;<br /><span style="color: #0000FF; ">import</span>&nbsp;kafka.javaapi.message.ByteBufferMessageSet;&nbsp;&nbsp;<br /><span style="color: #0000FF; ">import</span>&nbsp;kafka.message.Message;&nbsp;&nbsp;<br /><span style="color: #0000FF; ">import</span>&nbsp;kafka.producer.SyncProducerConfig;&nbsp;&nbsp;<br />&nbsp;&nbsp;<br /><img src="http://www.blogjava.net/Images/dot.gif" alt="" />&nbsp;&nbsp;<br />&nbsp;&nbsp;<br />Properties&nbsp;props&nbsp;=&nbsp;<span style="color: #0000FF; ">new</span>&nbsp;Properties();&nbsp;&nbsp;<br />props.put(&#8220;zk.connect&#8221;,&nbsp;&#8220;127.0.0.1:2181&#8221;);&nbsp;&nbsp;<br />props.put("serializer.class",&nbsp;"kafka.serializer.StringEncoder");&nbsp;&nbsp;<br />ProducerConfig&nbsp;config&nbsp;=&nbsp;<span style="color: #0000FF; ">new</span>&nbsp;ProducerConfig(props);&nbsp;&nbsp;<br />Producer&lt;String,&nbsp;String&gt;&nbsp;producer&nbsp;=&nbsp;<span style="color: #0000FF; ">new</span>&nbsp;Producer&lt;String,&nbsp;String&gt;(config);&nbsp;&nbsp;<br />&nbsp;&nbsp;<br />Send&nbsp;a&nbsp;single&nbsp;message&nbsp;&nbsp;<br />&nbsp;&nbsp;<br /><span style="color: #008000; ">//</span><span style="color: #008000; ">&nbsp;The&nbsp;message&nbsp;is&nbsp;sent&nbsp;to&nbsp;a&nbsp;randomly&nbsp;selected&nbsp;partition&nbsp;registered&nbsp;in&nbsp;ZK&nbsp;&nbsp;</span><span style="color: #008000; "><br /></span>ProducerData&lt;String,&nbsp;String&gt;&nbsp;data&nbsp;=&nbsp;<span style="color: #0000FF; ">new</span>&nbsp;ProducerData&lt;String,&nbsp;String&gt;("test-topic",&nbsp;"test-message");&nbsp;&nbsp;<br />producer.send(data);&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<br />&nbsp;&nbsp;<br />producer.close(); &nbsp;</div><strong><br /><br /></strong>这样就是一个标准的producer。<br /><br />consumer的代码<br /><div style="background-color:#eeeeee;font-size:13px;border:1px solid #CCCCCC;padding-right: 5px;padding-bottom: 4px;padding-left: 4px;padding-top: 4px;width: 98%;word-break:break-all"><!--<br /><br />Code highlighting produced by Actipro CodeHighlighter (freeware)<br />http://www.CodeHighlighter.com/<br /><br />--><span style="color: #008000; ">//</span><span style="color: #008000; ">&nbsp;specify&nbsp;some&nbsp;consumer&nbsp;properties&nbsp;&nbsp;</span><span style="color: #008000; "><br /></span>Properties&nbsp;props&nbsp;=&nbsp;<span style="color: #0000FF; ">new</span>&nbsp;Properties();&nbsp;&nbsp;<br />props.put("zk.connect",&nbsp;"localhost:2181");&nbsp;&nbsp;<br />props.put("zk.connectiontimeout.ms",&nbsp;"1000000");&nbsp;&nbsp;<br />props.put("groupid",&nbsp;"test_group");&nbsp;&nbsp;<br />&nbsp;&nbsp;<br /><span style="color: #008000; ">//</span><span style="color: #008000; ">&nbsp;Create&nbsp;the&nbsp;connection&nbsp;to&nbsp;the&nbsp;cluster&nbsp;&nbsp;</span><span style="color: #008000; "><br /></span>ConsumerConfig&nbsp;consumerConfig&nbsp;=&nbsp;<span style="color: #0000FF; ">new</span>&nbsp;ConsumerConfig(props);&nbsp;&nbsp;<br />ConsumerConnector&nbsp;consumerConnector&nbsp;=&nbsp;Consumer.createJavaConsumerConnector(consumerConfig);&nbsp;&nbsp;<br />&nbsp;&nbsp;<br /><span style="color: #008000; ">//</span><span style="color: #008000; ">&nbsp;create&nbsp;4&nbsp;partitions&nbsp;of&nbsp;the&nbsp;stream&nbsp;for&nbsp;topic&nbsp;&#8220;test&#8221;,&nbsp;to&nbsp;allow&nbsp;4&nbsp;threads&nbsp;to&nbsp;consume&nbsp;&nbsp;</span><span style="color: #008000; "><br /></span>Map&lt;String,&nbsp;List&lt;KafkaMessageStream&lt;Message&gt;&gt;&gt;&nbsp;topicMessageStreams&nbsp;=&nbsp;&nbsp;&nbsp;<br />&nbsp;&nbsp;&nbsp;&nbsp;consumerConnector.createMessageStreams(ImmutableMap.of("test",&nbsp;4));&nbsp;&nbsp;<br />List&lt;KafkaMessageStream&lt;Message&gt;&gt;&nbsp;streams&nbsp;=&nbsp;topicMessageStreams.get("test");&nbsp;&nbsp;<br />&nbsp;&nbsp;<br /><span style="color: #008000; ">//</span><span style="color: #008000; ">&nbsp;create&nbsp;list&nbsp;of&nbsp;4&nbsp;threads&nbsp;to&nbsp;consume&nbsp;from&nbsp;each&nbsp;of&nbsp;the&nbsp;partitions&nbsp;&nbsp;&nbsp;</span><span style="color: #008000; "><br /></span>ExecutorService&nbsp;executor&nbsp;=&nbsp;Executors.newFixedThreadPool(4);&nbsp;&nbsp;<br />&nbsp;&nbsp;<br /><span style="color: #008000; ">//</span><span style="color: #008000; ">&nbsp;consume&nbsp;the&nbsp;messages&nbsp;in&nbsp;the&nbsp;threads&nbsp;&nbsp;</span><span style="color: #008000; "><br /></span><span style="color: #0000FF; ">for</span>(<span style="color: #0000FF; ">final</span>&nbsp;KafkaMessageStream&lt;Message&gt;&nbsp;stream:&nbsp;streams)&nbsp;{&nbsp;&nbsp;<br />&nbsp;&nbsp;executor.submit(<span style="color: #0000FF; ">new</span>&nbsp;Runnable()&nbsp;{&nbsp;&nbsp;<br />&nbsp;&nbsp;&nbsp;&nbsp;<span style="color: #0000FF; ">public</span>&nbsp;<span style="color: #0000FF; ">void</span>&nbsp;run()&nbsp;{&nbsp;&nbsp;<br />&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span style="color: #0000FF; ">for</span>(Message&nbsp;message:&nbsp;stream)&nbsp;{&nbsp;&nbsp;<br />&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span style="color: #008000; ">//</span><span style="color: #008000; ">&nbsp;process&nbsp;message&nbsp;&nbsp;</span><span style="color: #008000; "><br /></span>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;}&nbsp;&nbsp;&nbsp;<br />&nbsp;&nbsp;&nbsp;&nbsp;}&nbsp;&nbsp;<br />&nbsp;&nbsp;});&nbsp;&nbsp;<br />} &nbsp;</div><br /><strong><br /><br /><br /><br /></strong><img src ="http://www.blogjava.net/paulwong/aggbug/403821.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://www.blogjava.net/paulwong/" target="_blank">paulwong</a> 2013-09-08 17:32 <a href="http://www.blogjava.net/paulwong/archive/2013/09/08/403821.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>LOG ANALYST BIG DATA SYSTEM资源</title><link>http://www.blogjava.net/paulwong/archive/2013/09/08/403819.html</link><dc:creator>paulwong</dc:creator><author>paulwong</author><pubDate>Sun, 08 Sep 2013 08:21:00 GMT</pubDate><guid>http://www.blogjava.net/paulwong/archive/2013/09/08/403819.html</guid><wfw:comment>http://www.blogjava.net/paulwong/comments/403819.html</wfw:comment><comments>http://www.blogjava.net/paulwong/archive/2013/09/08/403819.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://www.blogjava.net/paulwong/comments/commentRss/403819.html</wfw:commentRss><trackback:ping>http://www.blogjava.net/paulwong/services/trackbacks/403819.html</trackback:ping><description><![CDATA[日志抓取端：<br /><br /><p style="padding: 0px; margin: 12px 0px; line-height: 25px; letter-spacing: 1px; color: #333333; font-family: 微软雅黑, Verdana, sans-serif, 宋体; background-color: #ffffff;">apache kafka在数据处理中特别是日志和消息的处理上会有很多出色的表现，这里写个索引，关于kafka的文章暂时就更新到这里，最近利用空闲时间在对kafka做一些功能性增强，并java化，虽然现在已经有很多这样的版本，但是根据实际需求来改变才是最适合的。</p><p style="padding: 0px; margin: 12px 0px; line-height: 25px; letter-spacing: 1px; color: #333333; font-family: 微软雅黑, Verdana, sans-serif, 宋体; background-color: #ffffff;">首先当然推荐的是kafka的官网&nbsp;<a href="http://kafka.apache.org/" target="_blank" rel="nofollow" style="padding: 0px; margin: 0px; color: #0072bb; outline: 0px;">http://kafka.apache.org/</a>&nbsp;</p><p style="padding: 0px; margin: 12px 0px; line-height: 25px; letter-spacing: 1px; color: #333333; font-family: 微软雅黑, Verdana, sans-serif, 宋体; background-color: #ffffff;">在官网最值得参考的文章就是kafka design：<a href="http://kafka.apache.org/design.html" target="_blank" rel="nofollow" style="padding: 0px; margin: 0px; color: #0072bb; outline: 0px;">http://kafka.apache.org/design.html</a>，我的文章也基本都是参照这里的说明，大家要特别重视这篇文章，里面有好多理念都特别好，推荐多读几遍。</p><p style="padding: 0px; margin: 12px 0px; line-height: 25px; letter-spacing: 1px; color: #333333; font-family: 微软雅黑, Verdana, sans-serif, 宋体; background-color: #ffffff;"><span style="padding: 0px; margin: 0px; line-height: 1.5; font-size: 10pt;">在OSC的翻译频道有kafka design全中文的翻译，翻得挺好的，推荐一下：</span><a href="http://www.oschina.net/translate/kafka-design" target="_blank" rel="nofollow" style="padding: 0px; margin: 0px; color: #0072bb; outline: 0px;">http://www.oschina.net/translate/kafka-design</a></p><p style="padding: 0px; margin: 12px 0px; line-height: 25px; letter-spacing: 1px; color: #333333; font-family: 微软雅黑, Verdana, sans-serif, 宋体; background-color: #ffffff;">kafka的wiki是很不错的学习文档：<a href="https://cwiki.apache.org/confluence/display/KAFKA/Index" target="_blank" rel="nofollow" style="padding: 0px; margin: 0px; color: #0072bb; outline: 0px;">https://cwiki.apache.org/confluence/display/KAFKA/Index</a></p><p style="padding: 0px; margin: 12px 0px; line-height: 25px; letter-spacing: 1px; color: #333333; font-family: 微软雅黑, Verdana, sans-serif, 宋体; background-color: #ffffff;"><span style="padding: 0px; margin: 0px;">&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;</span></p><p style="padding: 0px; margin: 12px 0px; line-height: 25px; letter-spacing: 1px; color: #333333; font-family: 微软雅黑, Verdana, sans-serif, 宋体; background-color: #ffffff;">接下来就是我写的一系列文章，文章都是循序渐进的方式带你了解kafka：</p><p style="padding: 0px; margin: 12px 0px; line-height: 25px; letter-spacing: 1px; color: #333333; font-family: 微软雅黑, Verdana, sans-serif, 宋体; background-color: #ffffff;">关于kafka的基本知识，分布式的基础：<a href="http://my.oschina.net/ielts0909/blog/92972" target="_blank" rel="nofollow" style="padding: 0px; margin: 0px; color: #0072bb; outline: 0px;">《分布式消息系统Kafka初步》</a></p><p style="padding: 0px; margin: 12px 0px; line-height: 25px; letter-spacing: 1px; color: #333333; font-family: 微软雅黑, Verdana, sans-serif, 宋体; background-color: #ffffff;">kafka的分布式搭建，quick start：<a href="http://my.oschina.net/ielts0909/blog/93190" target="_blank" rel="nofollow" style="padding: 0px; margin: 0px; color: #0072bb; outline: 0px;">《kafka分布式环境搭建》</a></p><p style="padding: 0px; margin: 12px 0px; line-height: 25px; letter-spacing: 1px; color: #333333; font-family: 微软雅黑, Verdana, sans-serif, 宋体; background-color: #ffffff;">关于kafka的实现细节，这主要就是讲design的部分：<a href="http://my.oschina.net/ielts0909/blog/94153" target="_blank" rel="nofollow" style="padding: 0px; margin: 0px; color: #0072bb; outline: 0px;">《细节上》</a>、<a href="http://my.oschina.net/ielts0909/blog/94733" target="_blank" rel="nofollow" style="padding: 0px; margin: 0px; color: #0072bb; outline: 0px;">《细节下》</a></p><p style="padding: 0px; margin: 12px 0px; line-height: 25px; letter-spacing: 1px; color: #333333; font-family: 微软雅黑, Verdana, sans-serif, 宋体; background-color: #ffffff;">关于kafka开发环境，scala环境的搭建：<a href="http://my.oschina.net/ielts0909/blog/95440" target="_blank" rel="nofollow" style="padding: 0px; margin: 0px; color: #0072bb; outline: 0px;">《开发环境搭建》</a></p><p style="padding: 0px; margin: 12px 0px; line-height: 25px; letter-spacing: 1px; color: #333333; font-family: 微软雅黑, Verdana, sans-serif, 宋体; background-color: #ffffff;">数据生产者，producer的用法：<a href="http://my.oschina.net/ielts0909/blog/100645" target="_blank" rel="nofollow" style="padding: 0px; margin: 0px; color: #0072bb; outline: 0px;">《producer的用法》</a>、<a href="http://my.oschina.net/ielts0909/blog/114579" target="_blank" rel="nofollow" style="padding: 0px; margin: 0px; color: #0072bb; outline: 0px;">《producer使用注意》</a></p><p style="padding: 0px; margin: 12px 0px; line-height: 25px; letter-spacing: 1px; color: #333333; font-family: 微软雅黑, Verdana, sans-serif, 宋体; background-color: #ffffff;">数据消费者，consumer的用法：<a href="http://my.oschina.net/ielts0909/blog/110280" target="_blank" rel="nofollow" style="padding: 0px; margin: 0px; color: #0072bb; outline: 0px;">《consumer的用法》</a></p><p style="padding: 0px; margin: 12px 0px; line-height: 25px; letter-spacing: 1px; color: #333333; font-family: 微软雅黑, Verdana, sans-serif, 宋体; background-color: #ffffff;">还有些零碎的，关于通信段的源码解读：<a href="http://my.oschina.net/ielts0909/blog/102336" target="_blank" rel="nofollow" style="padding: 0px; margin: 0px; color: #0072bb; outline: 0px;">《net包源码解读》</a>、<a href="http://my.oschina.net/ielts0909/blog/94997" target="_blank" rel="nofollow" style="padding: 0px; margin: 0px; color: #0072bb; outline: 0px;">《broker配置》</a></p><p style="padding: 0px; margin: 12px 0px; line-height: 25px; letter-spacing: 1px; color: #333333; font-family: 微软雅黑, Verdana, sans-serif, 宋体; background-color: #ffffff;"><span style="padding: 0px; margin: 0px;">&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;</span></p><p style="padding: 0px; margin: 12px 0px; line-height: 25px; letter-spacing: 1px; color: #333333; font-family: 微软雅黑, Verdana, sans-serif, 宋体; background-color: #ffffff;"><span style="padding: 0px; margin: 0px;">扩展的阅读还有下面这些：</span></p><p style="padding: 0px; margin: 12px 0px; line-height: 25px; letter-spacing: 1px; color: #333333; font-family: 微软雅黑, Verdana, sans-serif, 宋体; background-color: #ffffff;">我的好友写的关于kafka和jafka的相关博客，特别好，我有很多问题也都找他解决的，大神一般的存在：<a href="http://rockybean.github.com/" target="_blank" rel="nofollow" style="padding: 0px; margin: 0px; color: #0072bb; outline: 0px;">http://rockybean.github.com/</a>&nbsp; &nbsp;<a href="http://my.oschina.net/rockybean" target="_blank" rel="nofollow" style="padding: 0px; margin: 0px; color: #0072bb; outline: 0px;">@rockybean</a></p><p style="padding: 0px; margin: 12px 0px; line-height: 25px; letter-spacing: 1px; color: #333333; font-family: 微软雅黑, Verdana, sans-serif, 宋体; background-color: #ffffff;">kafka的java化版本jafka：<a href="https://github.com/adyliu/jafka" target="_blank" rel="nofollow" style="padding: 0px; margin: 0px; color: #0072bb; outline: 0px;">https://github.com/adyliu/jafka</a></p><p style="padding: 0px; margin: 12px 0px; line-height: 25px; letter-spacing: 1px; color: #333333; font-family: 微软雅黑, Verdana, sans-serif, 宋体; background-color: #ffffff;">淘宝的metaQ：<a href="https://github.com/killme2008/Metamorphosis" target="_blank" rel="nofollow" style="padding: 0px; margin: 0px; color: #0072bb; outline: 0px;">https://github.com/killme2008/Metamorphosis</a></p><p style="padding: 0px; margin: 12px 0px; line-height: 25px; letter-spacing: 1px; color: #333333; font-family: 微软雅黑, Verdana, sans-serif, 宋体; background-color: #ffffff;"><span style="padding: 0px; margin: 0px;">我最近在写的inforQ，刚开始写，我也纯粹是为了读下源码，不定期更新哈：<a href="https://github.com/ielts0909/inforq" target="_blank" rel="nofollow" style="padding: 0px; margin: 0px; color: #0072bb; outline: 0px;">https://github.com/ielts0909/inforq</a></span></p><p style="padding: 0px; margin: 12px 0px; line-height: 25px; letter-spacing: 1px; color: #333333; font-family: 微软雅黑, Verdana, sans-serif, 宋体; background-color: #ffffff;">后面一阶段可能更新点儿关于cas的东西吧，具体也没想好，最近一直出差，写代码的时间都很少</p><p style="padding: 0px; margin: 12px 0px; line-height: 25px; letter-spacing: 1px; color: #333333; font-family: 微软雅黑, Verdana, sans-serif, 宋体; background-color: #ffffff;">--------------------------------------------------------------------------------</p><p style="padding: 0px; margin: 12px 0px; line-height: 25px; letter-spacing: 1px; color: #333333; font-family: 微软雅黑, Verdana, sans-serif, 宋体; background-color: #ffffff;">0.8版本的相关更新如下：</p><p style="padding: 0px; margin: 12px 0px; line-height: 25px; letter-spacing: 1px; color: #333333; font-family: 微软雅黑, Verdana, sans-serif, 宋体; background-color: #ffffff;">0.8更新内容介绍：<a href="http://my.oschina.net/ielts0909/blog/143543" target="_blank" rel="nofollow" style="padding: 0px; margin: 0px; color: #0072bb; outline: 0px;">《kafka0.8版本的一些更新》</a></p><img src ="http://www.blogjava.net/paulwong/aggbug/403819.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://www.blogjava.net/paulwong/" target="_blank">paulwong</a> 2013-09-08 16:21 <a href="http://www.blogjava.net/paulwong/archive/2013/09/08/403819.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item></channel></rss>