﻿<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:trackback="http://madskills.com/public/xml/rss/module/trackback/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/"><channel><title>BlogJava-paulwong-随笔分类-AI-REFORCE LEARNING</title><link>http://www.blogjava.net/paulwong/category/55404.html</link><description /><language>zh-cn</language><lastBuildDate>Mon, 19 May 2025 14:40:43 GMT</lastBuildDate><pubDate>Mon, 19 May 2025 14:40:43 GMT</pubDate><ttl>60</ttl><item><title>强化学习资源</title><link>http://www.blogjava.net/paulwong/archive/2025/04/30/451616.html</link><dc:creator>paulwong</dc:creator><author>paulwong</author><pubDate>Wed, 30 Apr 2025 06:15:00 GMT</pubDate><guid>http://www.blogjava.net/paulwong/archive/2025/04/30/451616.html</guid><wfw:comment>http://www.blogjava.net/paulwong/comments/451616.html</wfw:comment><comments>http://www.blogjava.net/paulwong/archive/2025/04/30/451616.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://www.blogjava.net/paulwong/comments/commentRss/451616.html</wfw:commentRss><trackback:ping>http://www.blogjava.net/paulwong/services/trackbacks/451616.html</trackback:ping><description><![CDATA[<div>蘑菇书EasyRL<br />
李宏毅老师的《深度强化学习》是强化学习领域经典的中文视频之一。李老师幽默风趣的上课风格让晦涩难懂的强化学习理论变得轻松易懂，他会通过很多有趣的例子来讲解强化学习理论。比如老师经常会用玩 Atari 游戏的例子来讲解强化学习算法。此外，为了教程的完整性，我们整理了周博磊老师的《强化学习纲要》、李科浇老师的《世界冠军带你从零实践强化学习》以及多个强化学习的经典资料作为补充。对于想入门强化学习又想看中文讲解的人来说绝对是非常推荐的。<br />
<br />
本教程也称为&#8220;蘑菇书&#8221;，寓意是希望此书能够为读者注入活力，让读者&#8220;吃&#8221;下这本蘑菇之后，能够饶有兴致地探索强化学习，像马里奥那样愈加强大，继而在人工智能领域觅得意外的收获。</div>
<div><a href="https://github.com/datawhalechina/easy-rl?tab=readme-ov-file" target="_blank">https://github.com/datawhalechina/easy-rl?tab=readme-ov-file</a><br />
</div>
<div><br />
</div>
<div><br />
</div>
<img src ="http://www.blogjava.net/paulwong/aggbug/451616.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://www.blogjava.net/paulwong/" target="_blank">paulwong</a> 2025-04-30 14:15 <a href="http://www.blogjava.net/paulwong/archive/2025/04/30/451616.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item></channel></rss>