Deep Reinforcement Learning Hands-On (English-Language Reprint Edition)

Synopsis

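Recent developments in reinforcement learning (RL), combined with deep learning (DL), have brought unprecedented progress in training agents to solve complex problems in a human-like way. Google's use of algorithms to play and win at the well-known Atari arcade games propelled the field to prominence, and researchers continue to produce a steady stream of new ideas.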

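Deep Reinforcement Learning Hands-On (English-Language Reprint Edition) introduces the fundamentals of RL and gives you the principles you need to code intelligent learning agents that can take on a range of demanding practical tasks. You will learn how to implement Q-learning in "grid world" environments, teach your agent to buy and trade stocks, and find out how natural language models are driving the boom in chatbots.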

Preface

Chapter 1: What is Reinforcement Learning?

Learning - supervised, unsupervised, and reinforcement

RL formalisms and relations

Reward

The agent

The environment

Actions

Observations

Markov decision processes

Markov process

Markov reward process

Markov decision process

Summary

Chapter 2: OpenAI Gym

The anatomy of the agent

Hardware and software requirements

OpenAI Gym API

Action space

Observation space

The environment

Creation of the environment

The CartPole session

The random CartPole agent

The extra Gym functionality - wrappers and monitors

Wrappers

Monitor

Summary

Chapter 3: Deep Learning with PyTorch

Tensors

Creation of tensors

Scalar tensors

Tensor operations

GPU tensors

Gradients

Tensors and gradients