欢迎光临当当，请登录免费注册

男频| 女频

当当云阅读

当当云阅读文字

万本电子书0元读

万本电子书0元读

搜索

购物车

图书分类

小说: 侦探/悬疑/推理; 情感/都市; 科幻/魔幻; 作品集; 外国小说

文艺: 文学; 青春文学; 传记; 艺术; 动漫/幽默

历史文化: 哲学/宗教; 历史; 政治/军事; 文化; 社会科学; 古籍; 法律

经济/管理: 管理; 经济; 投资理财; 市场/营销; 商务沟通; 中国经济; 国际经济

心理/励志: 心理学; 女性心理学; 儿童心理学; 情绪管理; 职场/人际交往; 人生哲学

生活: 两性关系; 亲子/家教; 旅游/地图; 烹饪/美食; 保健/养生

童书: 儿童文学; 启蒙读物; 少儿英语; 动漫/图画书

科技/教育: 科普读物; 计算机/网络; 自然科学; 中小学教辅; 考试; 外语; 工具书

原版书: 外文原版书; 港台圖書; 小语种

我要充值赠送20%

顶部广告

当当云阅读 > 科技 > 计算机/网络 > 软件系统 > 多主体强化学习协作策略研究

多主体强化学习协作策略研究

| | 手机阅读

扫描下载当当云阅读App

多主体强化学习协作策略研究电子书

多主体的研究与应用是近年来备受关注的热点领域，多主体强化学习理论与方法、多主体协作策略的研究是该领域重要研究方向，其理论和应用价值极为广泛，备受广大从事计算机应用、人工智能、自动控制、以及经济管理等领域研究者的关注。

售价：¥

纸质售价：¥37.90购买纸书

127人正在读 | 1人评论

6.2

作者：孙若莹,赵刚

出版社：清华大学出版社

出版时间：2014-08-01

字数：26.8万

所属分类：科技 > 计算机/网络 > 软件系统

温馨提示：数字商品不支持退换货，不提供源文件，不支持导出打印

为你推荐

读书简介
目录
累计评论(1条)

读书简介
目录
累计评论(1条)

多主体的研究与应用是近年来备受关注的热领域，多主体强化学习理论与方法、多主体协作策略的研究是该领域重要研究方向，其理论和应用价值极为广泛，备受广大从事计算机应用、人工智能、自动控制、以及经济管理等领域研究者的关注。孙若莹、赵刚所著的《多主体强化学习协作策略研究》清晰地介绍了多主体、强化学习及多主体协作等基本概念和基础内容，明确地阐述了有关多主体强化学习、协作策略研究的发展过程及*动向，深地探讨了多主体强化学习与协作策略的理论与方法，具体地分析了多主体强化学习与协作策略在相关研究领域的应用方法。全书系统脉络清晰、基本概念清楚、图表分析直观，注重内容的体系化和实用性。通过本书的阅读和学习，读者即可掌握多主体强化学习及协作策略的理论和方法，更可了解在实际工作中应用这些研究成果的手段。本书可作为从事计算机应用、人工智能、自动控制、以及经济管理等领域研究者的学习和阅读参考，同时高等院校相关专业研究生以及人工智能爱好者也可从中获得借鉴。<br/>

目录展开

About the Authors

Preface

Chapter 1 Introduction

1.1 Reinforcement Learning

1.1.1 Generality of Reinforcement Learning

1.1.2 Reinforcement Learning on Markov Decision Processes

1.1.3 Integrating Reinforcement Learning into Agent Architecture

1.2 Multiagent Reinforcement Learning

1.2.1 Multiagent Systems

1.2.2 Reinforcement Learning in Multiagent Systems

1.2.3 Learning and Coordination in Multiagent Systems

1.3 Ant System for Stochastic Combinatorial Optimization

1.3.1 Ants Forage Behavior

1.3.2 Ant Colony Optimization

1.3.3 MAX-MIN Ant System

1.4 Motivations and Consequences

1.5 Book Summary

Bibliography

Chapter 2 Reinforcement Learning and Its Combination with Ant Colony System

2.1 Introduction

2.2 Investigation into Reinforcement Learning and Swarm Intelligence

2.2.1 Temporal Differences Learning Method

2.2.2 Active Exploration and Experience Replay in Reinforcement Learning

2.2.3 Ant Colony System for Traveling Salesman Problem

2.3 The Q-ACS Multiagent Learning Method

2.3.1 The Q-ACS Learning Algorithm

2.3.2 Some Properties of the Q-ACS Learning Method

2.3.3 Relation with Ant-Q Learning Method

2.4 Simulations and Results

2.5 Conclusions

Bibliography

Chapter 3 Multiagent Learning Methods Based on Indirect Media Information Sharing

3.1 Introduction

3.2 The Multiagent Learning Method Considering Statistics Features

3.2.1 Accelerated K-certainty Exploration

3.2.2 The T-ACS Learning Algorithm

3.3 The Heterogeneous Agents Learning

3.3.1 The D-ACS Learning Algorithm

3.3.2 Some Discussions about the D-ACS Learning Algorithm

3.4 Comparisons with Related State-of-the-arts

3.5 Simulations and Results

3.5.1 Experimental Results on Hunter Game

3.5.2 Experimental Results on Traveling Salesman Problem

3.6 Conclusions

Bibliography

Chapter 4 Action Conversion Mechanism in Multiagent Reinforcement Learning

4.1 Introduction

4.2 Model-Based Reinforcement Learning

4.2.1 Dyna-Q Architecture

4.2.2 Prioritized Sweeping Method

4.2.3 Minimax Search and Reinforcement Learning

4.2.4 RTP-Q Learning

4.3 The Q-ac Multiagent Reinforcement Learning

4.3.1 Task Model

4.3.2 Converting Action

4.3.3 Multiagent Cooperation Methods

4.3.4 Q-value Update

4.3.5 The Q-ac Learning Algorithm

4.3.6 Using Adversarial Action Instead of ε Probability Exploration

4.4 Simulations and Results

4.5 Conclusions

Bibliography

Chapter 5 Multiagent Learning Approaches Applied to Vehicle Routing Problems

5.1 Introduction

5.2 Related State-of-the-arts

5.2.1 Some Heuristic Algorithms

5.2.2 The Vehicle Routing Problem with Time Windows

5.3 The Multiagent Learning Applied to CVRP and VRPTW

5.4 Simulations and Results

5.5 Conclusions

Bibliography

Chapter 6 Multiagent learning Methods Applied to Multicast Routing Problems

6.1 Introduction

6.2 Multiagent Q-learning Applied to the Network Routing

6.2.1 Investigation into Q-routing

6.2.2 AntNet Investigation

6.3 Some Multicast Routing in Mobile Ad Hoc Networks

6.4 The Multiagent Q-learning in the Q-MAP Multicast Routing Method

6.4.1 Overview of the Q-MAP Multicast Routing

6.4.2 Join Query Packet, Join Reply Packet and Membership Maintenance

6.4.3 Convergence Proof of Q-MAP Method

6.5 Simulations and Results

6.6 Conclusions

Bibliography

Chapter 7 Multiagent Reinforcement Learning for Supply Chain Management

7.1 Introduction

7.2 Related Issues of Supply Chain Management

7.3 SCM Network Scheme with Multiagent Reinforcement Learning

7.3.1 SCM with Multiagent

7.3.2 The RL Agents in SCM Network

7.4 Application of the Q-ACS Method to SCM

7.4.1 The Application Model in SCM

7.4.2 The Q-ACS Learning Applied to the SCM System

7.5 Conclusion

Bibliography

Chapter 8 Multiagent Learning Applied in Supply Chain Ordering Management

8.1 Introduction

8.2 Supply Chain Management Model

8.3 The Multiagent Learning Model for SC Ordering Management

8.4 Simulations and Results

8.5 Conclusions

Bibliography

累计评论(1条) 1个书友正在讨论这本书 发表评论

发表评论

发表评论，分享你的想法吧！

当当云阅读

买过这本书的人还买过

读了这本书的人还在读

支持设备

同类图书排行榜

01

AI效率手册:从ChatGPT开启*能

AI效率手册:从ChatGPT开启*能￥51.07

常青著

￥51.07

02

AI时代,学什么,怎么学

AI时代,学什么,怎么学￥17.99

和渊著

￥17.99

03

这就是DeepSeek:普通人如何抓住AI红利

这就是DeepSeek:普通人如何抓住AI红利￥38.80

何华平编著

￥38.80

04

AI帮你赢

AI帮你赢￥44.67

谭少卿著

￥44.67

05

大模型导论

大模型导论￥44.67

张成文编著

￥44.67

06

ChatGPT+AI文案写作实战108招

ChatGPT+AI文案写作实战108招￥55.86

苏海

￥55.86

07

万物皆计算:科学奇才的探索之旅

万物皆计算:科学奇才的探索之旅￥64.90

[美]斯蒂芬·沃尔弗拉姆(Stephen Wolfram) 著

￥64.90

08

AI助理:用ChatGPT轻松搞定工作

AI助理:用ChatGPT轻松搞定工作￥34.90

杜雨,刁盛鑫著

￥34.90

09

人工智能和深度学习导论

人工智能和深度学习导论￥44.67

[美] 奥斯瓦尔德·坎佩萨托(Oswald Campesato ) 著

￥44.67

10

为什么伟大不能被计划

为什么伟大不能被计划￥59.00

[美]肯尼斯·斯坦利;[美]乔尔·雷曼

￥59.00

更多同类图书 >

电子书排行榜

新书排行榜

5元封顶

关注我们

最受欢迎的阅读产品

关注我们：
- 新浪微博
- 官方微信
关于我们

欢迎反馈宝贵意见给我们

客服书吧：当当读书5.0问答

意见反馈

Copyright (C) 当当网 2004-2021, All Rights Reserved

京ICP备17043473号-1|出版物经营许可证新出发京批字第直0673号

当当网收录的免费小说作品、频道内容、书友评论、用户上传文字、图片等其他一切内容及在当当网所做之广告均属用户个人行为，与当当网无关。

当当云阅读

二维码

0元畅读数万本精选电子书