楼主: SleepyTom
542 6

Reinforcement Learning Theory and Python Implementation [推广有奖]

  • 3关注
  • 12粉丝

已卖:13978份资源
好评率:99%
商家信誉:极好

教授

39%

还不是VIP/贵宾

-

威望
0
论坛币
195635 个
通用积分
1930.5213
学术水平
139 点
热心指数
186 点
信用等级
159 点
经验
5704 点
帖子
828
精华
0
在线时间
471 小时
注册时间
2007-5-8
最后登录
2026-2-7

楼主
SleepyTom 发表于 2025-8-10 09:01:55 |AI写论文

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币
Reinforcement Learning Theory and Python Implementation
Authors: Zhiqing Xiao

ISBN: 978-981-19-4932-6
Published: 29 September 2024

DOI: https://doi.org/10.1007/978-981-19-4933-3

Introduces not only algorithms and mathematical theory behind them, but also implementation details and usage examples

Covers both classical and modern RL algorithms, including algorithms for large models such as PPO, RLHF, PbRL, and IRL

Provides coding examples in all chapters, and all deep RL implementations have both TensorFlow and PyTorch versions

Reinforcement Learning: Theory and Python Implementation is a tutorial book on reinforcement learning, with explanations of both theory and applications. Starting from a uniform mathematical framework, this book derives the theory of modern reinforcement learning systematically and introduces all mainstream reinforcement learning algorithms such as PPO, SAC, and MuZero. It also covers key technologies of GPT training such as RLHF, IRL, and PbRL. Every chapter is accompanied by high-quality implementations, and all implementations of deep reinforcement learning algorithms are with both TensorFlow and PyTorch. Codes can be found on GitHub along with their results and are runnable on a conventional laptop with either Windows, macOS, or Linux.

This book is intended for readers who want to learn reinforcement learning systematically and apply reinforcement learning to practical applications. It is also ideal to academical researchers who seek theoretical foundation or algorithm enhancement in their cutting-edge AI research.

Zhiqing Xiao obtained doctoral degree from Tsinghua University in 2016 and has more than 15 years in academic research and industrial practices on data-analytics and AI. He is the author of two AI bestsellers in Chinese: “Reinforcement Learning” and “Application of Neural Network and PyTorch” and published many academic papers. He also contributed to recent versions of the open-source software Gym.
二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:Implementa implement Learning earning Theory

沙发
babylaugh(未真实交易用户) 发表于 2025-8-10 20:44:04
点赞分享

藤椅
yiyijiayuan(未真实交易用户) 在职认证  发表于 2025-8-11 08:05:16
坚决路过。

板凳
bloodfi(未真实交易用户) 发表于 2025-8-11 15:39:28
谢谢分享!

报纸
cre8(未真实交易用户) 发表于 2025-8-11 20:23:11
点赞分享 !

地板
512661101(未真实交易用户) 发表于 2025-8-11 20:50:03
谢谢分享!

7
Edwardu(未真实交易用户) 发表于 2025-8-11 21:53:07
感谢分享

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
扫码
拉您进交流群
GMT+8, 2026-2-7 18:18