楼主: tmdzhu
5392 12

Approximate dynamic programming [推广有奖]

  • 1关注
  • 2粉丝

博士生

8%

还不是VIP/贵宾

-

威望
0
论坛币
58985 个
通用积分
56.9848
学术水平
2 点
热心指数
1 点
信用等级
1 点
经验
9071 点
帖子
136
精华
0
在线时间
250 小时
注册时间
2009-1-15
最后登录
2023-6-8

相似文件 换一批

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币
Approximate dynamic programming : solving the curses of dimensionality (Wiley Series in Probability and Statistics) / Warren B. Powell.

CONTENTS
Preface xi
Acknowledgments xv
1 The challenges of dynamic programming 1
1.1 A dynamic programming example: a shortest path problem 2
1.2 The three curses of dimensionality 3
1.3 Some real applications 6
1.4 Problem classes 9
1.5 The many dialects of dynamic programming 12
1.6 What is new in this book? 14
1.7 Bibliographic notes 15
2 Some illustrative models 17
2.1 Deterministic problems 18
2.2 Stochastic problems 23
2.3 Information acquisition problems 36
2.4 A simple modeling framework for dynamic programs 39
2.5 Bibliographic notes 42
Problems 43
3 Introduction to Markov decision processes 47
3.1 The optimality equations 48
3.2 Finite horizon problems 53
3.3 Infinite horizon problems 55
3.4 Value iteration 56
3.5 Policy iteration 61
3.6 Hybrid valuepolicy
iteration 62
3.7 The linear programming method for dynamic programs 63
3.8 Monotone policies* 64
3.9 Why does it work?** 70
3.10 Bibliographic notes 85
Problems 85
4 Introduction to approximate dynamic programming 91
4.1 The three curses of dimensionality (revisited) 92
4.2 The basic idea 93
4.3 Sampling random variables 100
4.4 ADP using the postdecision
state variable 101
4.5 Lowdimensional
representations of value functions 107
4.6 So just what is approximate dynamic programming? 110
4.7 Experimental issues 112
4.8 Dynamic programming with missing or incomplete models 117
4.9 Relationship to reinforcement learning 117
4.10 But does it work? 119
4.11 Bibliographic notes 120
Problems 121
5 Modeling dynamic programs 127
5.1 Notational style 129
5.2 Modeling time 130
5.3 Modeling resources 133
5.4 The states of our system 137
5.5 Modeling decisions 144
5.6 The exogenous information process 149
5.7 The transition function 157
5.8 The contribution function 164
5.9 The objective function 166
5.10 A measuretheoretic
view of information** 168
5.11 Bibliographic notes 170
Problems 171
6 Stochastic approximation methods 177
6.1 A stochastic gradient algorithm 179
6.2 Some stepsize recipes 181
6.3 Stochastic stepsizes 188
6.4 Computing bias and variance 193
6.5 Optimal stepsizes 195
6.6 Some experimental comparisons of stepsize formulas 202
6.7 Convergence 207
6.8 Why does it work?** 209
6.9 Bibliographic notes 218
Problems 219
7 Approximating value functions 225
7.1 Approximation using aggregation 226
7.2 Approximation methods using regression models 235
7.3 Recursive methods for regression models 246
7.4 Neural networks 252
7.5 Batch processes 257
7.6 Why does it work?** 261
7.7 Bibliographic notes 264
Problems 266
8 ADP for finite horizon problems 269
8.1 Strategies for finite horizon problems 270
8.2 Qlearning
8.3 Temporal difference learning 277
8.4 Policy iteration 280
8.5 Monte Carlo value and policy iteration 282
8.6 The actorcritic
paradigm 283
8.7 Bias in value function estimation 284
8.8 State sampling strategies 288
8.9 Starting and stopping 291
8.10 A taxonomy of approximate dynamic programming strategies 294
8.11 Why does it work** 296
8.12 Bibliographic notes 296
Problems 297
9 Infinite horizon problems 301
9.1 From finite to infinite horizon 302
9.2 Algorithmic strategies 302
9.3 Stepsizes for infinite horizon problems 311
9.4 Error measures 313
9.5 Direct ADP for online
applications 315
9.6 Finite horizon models for steady state applications 315
9.7 Why does it work?** 317
9.8 Bibliographic notes 317
Problems 317
10 Exploration vs. exploitation 321
10.1 A learning exercise: the nomadic trucker 321
10.2 Learning strategies 324
10.3 A simple information acquisition problem 328
10.4 Gittins indices and the information acquisition problem 330
10.5 Variations 335
10.6 The knowledge gradient algorithm 337
10.7 Information acquisition in dynamic programming 340
10.8 Bibliographic notes 343
Problems 344
11 Value function approximations for special functions 349
11.1 Value functions versus gradients 350
11.2 Linear approximations 351
11.3 Piecewise linear approximations 353
11.4 The SHAPE algorithm 357
11.5 Regression methods 360
11.6 Cutting planes* 363
11.7 Why does it work?** 375
11.8 Bibliographic notes 381
Problems 382
12 Dynamic resource allocation 385
12.1 An asset acquisition problem 386
12.2 The blood management problem 390
12.3 A portfolio optimization problem 399
12.4 A general resource allocation problem 402
12.5 A fleet management problem 414
12.6 A driver management problem 420
12.7 Bibliographic references 424
Problems 425
13 Implementation challenges 431
13.1 Will ADP work for your problem? 431
13.2 Designing an ADP algorithm for complex problems 432
13.3 Debugging an ADP algorithm 434
13.4 Convergence issues 435
13.5 Modeling your problem 436
13.6 Online

Approximate_Dynamic_Programming_Solving_the_Curses_of_Dimensionality.rar (22.73 MB, 需要: 100 个论坛币) 本附件包括:
  • Approximate Dynamic Programming Solving the Curses of Dimensionality.pdf
二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:Programming Approximate Dynamic Program Approx Programming Dynamic Approximate

沙发
zs3644 发表于 2009-7-14 22:12:15 |只看作者 |坛友微信交流群
感谢楼主分享!!

使用道具

藤椅
klshang82 发表于 2009-7-14 22:32:21 |只看作者 |坛友微信交流群
So expensive!

使用道具

板凳
tmdzhu 发表于 2009-7-15 19:57:33 |只看作者 |坛友微信交流群
自己顶下。。。。。

使用道具

报纸
闪电之云 发表于 2009-8-7 00:04:21 |只看作者 |坛友微信交流群
帖子不错,但是主人买好贵哦~
都穷了!

使用道具

地板
Xaero 发表于 2009-12-28 16:46:33 |只看作者 |坛友微信交流群
这年头, 太阳的!啥书都能找到!
下之!
十年一觉扬州梦。
智不足以Academy,才尚不够Industry,情无力于Life。

使用道具

7
jinjintang 发表于 2010-12-12 00:47:10 |只看作者 |坛友微信交流群
这书也太贵了吧!没有币呀!
6# Xaero

使用道具

8
sortout 发表于 2010-12-13 11:32:22 |只看作者 |坛友微信交流群
每次想下个书,都整这么贵
啥论坛啊

使用道具

9
inventory 发表于 2011-1-29 08:39:21 |只看作者 |坛友微信交流群
好书,可是好贵呀

使用道具

10
chenxuan07 发表于 2011-4-2 04:59:13 |只看作者 |坛友微信交流群
1# *****zhu


http://ishare.iask.sina.com.cn/f/13663645.html?retcode=0

使用道具

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注jltj
拉您入交流群

京ICP备16021002-2号 京B2-20170662号 京公网安备 11010802022788号 论坛法律顾问:王进律师 知识产权保护声明   免责及隐私声明

GMT+8, 2024-4-28 11:40