MARC 닫기
03159cam a2200529Ii 4500
000001334906
20210114163614
m d |
cr |||||||||||
190402s2018 maua ob 001 0 eng
▼a 1175918416
▼a 9780262352703
▼q (electronic bk.)
▼a 0262352702
▼q (electronic bk.)
▼z 9780262039246
▼q (hardcover
▼q alkaline paper)
▼z 0262039249
▼q (hardcover
▼q alkaline paper)
▼a 2517937
▼b (N$T)
▼a (OCoLC)1091191532
▼z (OCoLC)1175918416
▼a INA
▼b eng
▼e rda
▼e pn
▼c INA
▼d YDX
▼d UKAHL
▼d OCLCQ
▼d N$T
▼d EBLCP
▼d 248023
▼a Q325.6
▼b .R45 2018
▼a 006.3/1
▼2 23
▼a Sutton, Richard S.
▼a Reinforcement learning:
▼b an introduction /:
▼c Richard S. Sutton and Andrew G. Barto.
▼a Second edition.
▼a Cambridge, Massachusetts:
▼b The MIT Press,
▼c [2018].
▼a 1 online resource (xxii, 526 pages).
▼a text
▼b txt
▼2 rdacontent
▼a computer
▼b c
▼2 rdamedia
▼a online resource
▼b cr
▼2 rdacarrier
▼a Adaptive computation and machine learning
▼a Includes bibliographical references and index.
▼g 1.
▼t Introduction --
▼g I.
▼t Tabular Solution Methods:
▼g 2.
▼t Multi-armed Bandits --
▼g 3.
▼t Finite Markov Decision processes --
▼g 4.
▼t Dynamic programming --
▼g 5.
▼t Monte Carlo methods --
▼g 6.
▼t Temporal-difference learning --
▼g 7.
▼t n-step Bootstrapping --
▼g 8.
▼t Planning and learning with tabular methods--
▼g I.
▼t Approximate Solution Methods:
▼g 9.
▼t On-policy Prediction with Approximation--
▼g 10.
▼t On-policy Control with Approximation--
▼g 11.
▼t O↵-policy Methods with Approximation --
▼g 12.
▼t Eligibility Traces--
▼g 13.
▼t Policy Gradient Methods--
▼g III.
▼t Looking Deeper:
▼g 14.
▼t Psychology --
▼g 15.
▼t Neuroscience --
▼g 16.
▼t Applications and Case Studies --
▼g 17.
▼t Frontiers
▼a "Reinforcement learning, one of the most active research areas in artificial intelligence, is a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives while interacting with a complex, uncertain environment. In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the field's key ideas and algorithms."--
▼c Provided by publisher.
▼a OCLC control number change
▼a Reinforcement learning.
▼a Reinforcement learning.
▼2 fast
▼0 (OCoLC)fst01732553
▼a Electronic books.
▼a Barto, Andrew G.
▼i Print version:
▼a Sutton, Richard S.
▼t Reinforcement learning.
▼b Second edition.
▼d Cambridge, Massachusetts : The MIT Press, [2018],
▼z 0262039249,
▼z 9780262039246
▼w (DLC) 2018023826
▼w (OCoLC)1043175824
▼a Adaptive computation and machine learning.
▼3 EBSCOhost
▼u http://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&db=nlabk&AN=2517937
▼a Askews and Holts Library Services
▼b ASKH
▼n AH37519960
▼a YBP Library Services
▼b YANK
▼n 301368137
▼a ProQuest Ebook Central
▼b EBLB
▼n EBL6260249
▼a EBSCOhost
▼b EBSC
▼n 2517937
▼a 강리원
▼a eBook
▼a 92
▼b N$T
| 자료유형 : | eBook |
|---|---|
| ISBN : | 9780262352703 |
| ISBN : | 0262352702 |
| ISBN : | |
| ISBN : | |
| 개인저자 : | Sutton, Richard S. |
| 서명/저자사항 : | Reinforcement learning: an introduction /: Richard S. Sutton and Andrew G. Barto. |
| 판사항 : | Second edition. |
| 발행사항 : | Cambridge, Massachusetts: The MIT Press, [2018]. |
| 형태사항 : | 1 online resource (xxii, 526 pages). |
| 총서사항 : | Adaptive computation and machine learning |
| 서지주기 : | Includes bibliographical references and index. |
| 내용주기 : | 1. Introduction -- I. Tabular Solution Methods: 2. Multi-armed Bandits -- 3. Finite Markov Decision processes -- 4. Dynamic programming -- 5. Monte Carlo methods -- 6. Temporal-difference learning -- 7. n-step Bootstrapping -- 8. Planning and learning with tabular methods-- I. Approximate Solution Methods: 9. On-policy Prediction with Approximation-- 10. On-policy Control with Approximation-- 11. O↵-policy Methods with Approximation -- 12. Eligibility Traces-- 13. Policy Gradient Methods-- III. Looking Deeper: 14. Psychology -- 15. Neuroscience -- 16. Applications and Case Studies -- 17. Frontiers |
| 요약 : | "Reinforcement learning, one of the most active research areas in artificial intelligence, is a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives while interacting with a complex, uncertain environment. In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the field's key ideas and algorithms."-- Provided by publisher. |
| 일반주제명 : | Reinforcement learning. -- |
| 일반주제명 : | Reinforcement learning. -- |
| 개인저자 : | Barto, Andrew G. |
| 기타형태 저록 : | Print version: Sutton, Richard S. Reinforcement learning. Second edition. Cambridge, Massachusetts : The MIT Press, [2018], 0262039249, 9780262039246 |
| 언어 | 영어 |
| URL : |
|---|
서평쓰기