Search: 해군사관학교 학술정보원

MARC

LDR
03159cam a2200529Ii 4500
001
000001334906
005
20210114163614
006
m d |
007
cr |||||||||||
008
190402s2018 maua ob 001 0 eng
019
▼a 1175918416
020
▼a 9780262352703
▼q (electronic bk.)
020
▼a 0262352702
▼q (electronic bk.)
020
▼z  9780262039246
▼q  (hardcover
▼q  alkaline paper)
020
▼z  0262039249
▼q  (hardcover
▼q  alkaline paper)
035
▼a 2517937
▼b (N$T)
035
▼a (OCoLC)1091191532
▼z (OCoLC)1175918416
040
▼a  INA
▼b  eng
▼e  rda
▼e  pn
▼c  INA
▼d  YDX
▼d  UKAHL
▼d  OCLCQ
▼d  N$T
▼d  EBLCP
▼d  248023
050
▼a Q325.6
▼b .R45 2018
082
▼a 006.3/1
▼2 23
100
▼a Sutton, Richard S.
245
▼a  Reinforcement learning:
▼b  an introduction /
▼c  Richard S. Sutton and Andrew G. Barto.
250
▼a Second edition.
260
▼a  Cambridge, Massachusetts:
▼b  The MIT Press,
▼c  [2018].
300
▼a 1 online resource (xxii, 526 pages).
336
▼a  text
▼b  txt
▼2  rdacontent
337
▼a  computer
▼b  c
▼2  rdamedia
338
▼a  online resource
▼b  cr
▼2  rdacarrier
490
▼a Adaptive computation and machine learning
504
▼a Includes bibliographical references and index.
505
▼g  1.
▼t  Introduction --
▼g  I.
▼t  Tabular Solution Methods:
▼g  2.
▼t  Multi-armed Bandits --
▼g  3.
▼t  Finite Markov Decision processes --
▼g  4.
▼t  Dynamic programming --
▼g  5.
▼t  Monte Carlo methods --
▼g  6.
▼t  Temporal-difference learning --
▼g  7.
▼t  n-step Bootstrapping --
▼g  8.
▼t  Planning and learning with tabular methods --
▼g  II.
▼t  Approximate Solution Methods:
▼g  9.
▼t  On-policy Prediction with Approximation --
▼g  10.
▼t  On-policy Control with Approximation --
▼g  11.
▼t  Off-policy Methods with Approximation --
▼g  12.
▼t  Eligibility Traces --
▼g  13.
▼t  Policy Gradient Methods --
▼g  III.
▼t  Looking Deeper:
▼g  14.
▼t  Psychology --
▼g  15.
▼t  Neuroscience --
▼g  16.
▼t  Applications and Case Studies --
▼g  17.
▼t  Frontiers
520
▼a "Reinforcement learning, one of the most active research areas in artificial intelligence, is a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives while interacting with a complex, uncertain environment. In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the field's key ideas and algorithms."--
▼c Provided by publisher.
590
▼a OCLC control number change
650
▼a Reinforcement learning.
650
▼a  Reinforcement learning.
▼2  fast
▼0  (OCoLC)fst01732553
655
▼a Electronic books.
700
▼a Barto, Andrew G.
776
▼i  Print version:
▼a  Sutton, Richard S.
▼t  Reinforcement learning.
▼b  Second edition.
▼d  Cambridge, Massachusetts : The MIT Press, [2018],
▼z  0262039249,
▼z  9780262039246
▼w  (DLC) 2018023826
▼w  (OCoLC)1043175824
830
▼a Adaptive computation and machine learning.
856
▼3 EBSCOhost
▼u http://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&db=nlabk&AN=2517937
938
▼a  Askews and Holts Library Services
▼b  ASKH
▼n  AH37519960
938
▼a  YBP Library Services
▼b  YANK
▼n  301368137
938
▼a  ProQuest Ebook Central
▼b  EBLB
▼n  EBL6260249
938
▼a  EBSCOhost
▼b  EBSC
▼n  2517937
990
▼a 강리원
991
▼a eBook
994
▼a 92
▼b N$T

Material type :	eBook
ISBN :	9780262352703
ISBN :	0262352702
Personal author :	Sutton, Richard S.
Title / statement of responsibility :	Reinforcement learning : an introduction / Richard S. Sutton and Andrew G. Barto.
Edition :	Second edition.
Publication :	Cambridge, Massachusetts: The MIT Press, [2018].
Physical description :	1 online resource (xxii, 526 pages).
Series :	Adaptive computation and machine learning
Bibliography note :	Includes bibliographical references and index.
Contents note :	1. Introduction -- I. Tabular Solution Methods: 2. Multi-armed Bandits -- 3. Finite Markov Decision processes -- 4. Dynamic programming -- 5. Monte Carlo methods -- 6. Temporal-difference learning -- 7. n-step Bootstrapping -- 8. Planning and learning with tabular methods -- II. Approximate Solution Methods: 9. On-policy Prediction with Approximation -- 10. On-policy Control with Approximation -- 11. Off-policy Methods with Approximation -- 12. Eligibility Traces -- 13. Policy Gradient Methods -- III. Looking Deeper: 14. Psychology -- 15. Neuroscience -- 16. Applications and Case Studies -- 17. Frontiers
Summary :	"Reinforcement learning, one of the most active research areas in artificial intelligence, is a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives while interacting with a complex, uncertain environment. In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the field's key ideas and algorithms."-- Provided by publisher.
Subject :	Reinforcement learning.
Added personal author :	Barto, Andrew G.
Other format entry :	Print version: Sutton, Richard S. Reinforcement learning. Second edition. Cambridge, Massachusetts : The MIT Press, [2018], 0262039249, 9780262039246
Language :	English
