Deep Reinforcement Learning with Python 2nd Edition by Nimish Sanghi

SleepyTom posted on 2025-9-29 03:06:22

The archive contains the book's PDF and its code.

Deep Reinforcement Learning with Python: RLHF for Chatbots and Large Language Models, 2nd Edition

by Nimish Sanghi

ISBN 979-8-8688-0272-0
Published: 15 July 2024

Gain a theoretical understanding of the most popular libraries in deep reinforcement learning (deep RL). This new edition focuses on the latest advances in deep RL using a learn-by-coding approach, allowing readers to assimilate and replicate the latest research in this field. New agent environments, ranging from games and robotics to finance, are explained to help you try different ways to apply reinforcement learning. A chapter on multi-agent reinforcement learning covers how multiple agents compete, while another chapter focuses on the widely used deep RL algorithm, proximal policy optimization (PPO). You'll see how reinforcement learning from human feedback (RLHF) has been used by chatbots built on large language models, such as ChatGPT, to improve conversational capabilities. You'll also review the steps for using the code on multiple cloud systems and deploying models on platforms such as Hugging Face Hub. The code is in Jupyter Notebook, which can be run on Google Colab and other similar deep learning cloud platforms, allowing you to tailor the code to your own needs. Whether it's for applications in gaming, robotics, or generative AI, Deep Reinforcement Learning with Python will help keep you ahead of the curve.