Black Bg

정회원신청

정회원 신청은 대출이 가능한 소속 부대 도서관 홈페이지에서 요청하셔야 합니다.
정회원 신청 하시겠습니까?

닫기
검색

검색

  • Home
  • 기능목록
  • 검색

상세정보

Python Reinforcement Learning : Solve Complex Real-World Problems by Mastering Reinforcement Learning Algorithms Using OpenAI Gym and TensorFlow

QR코드
도서 상세정보
자료유형 : eBook
ISBN : 1838640142 
ISBN : 9781838640149 
서명/저자사항 : Python Reinforcement Learning:  Solve Complex Real-World Problems by Mastering Reinforcement Learning Algorithms Using OpenAI Gym and TensorFlow /:  Sudharsan Ravichandiran, Sean Saito, Rajalingappaa Shanmugamani and Yang Wenzhuo. 
발행사항 : Birmingham:  Packt Publishing, Limited,  2019. 
형태사항 : 1 online resource (484 pages). 
일반주기 : Implementation of the Atari emulator 
내용주기 : Cover; Title Page; Copyright and Credits; About Packt; Contributors; Table of Contents; Preface; Chapter 1: Introduction to Reinforcement Learning; What is RL?; RL algorithm; How RL differs from other ML paradigms; Elements of RL; Agent; Policy function; Value function; Model; Agent environment interface; Types of RL environment; Deterministic environment; Stochastic environment; Fully observable environment; Partially observable environment; Discrete environment; Continuous environment; Episodic and non-episodic environment; Single and multi-agent environment; RL platforms 
내용주기 : OpenAI Gym and UniverseDeepMind Lab; RL-Glue; Project Malmo; ViZDoom; Applications of RL; Education; Medicine and healthcare; Manufacturing; Inventory management; Finance; Natural Language Processing and Computer Vision; Summary; Questions; Further reading; Chapter 2: Getting Started with OpenAI and TensorFlow; Setting up your machine; Installing Anaconda; Installing Docker; Installing OpenAI Gym and Universe; Common error fixes; OpenAI Gym; Basic simulations; Training a robot to walk; OpenAI Universe; Building a video game bot; TensorFlow; Variables, constants, and placeholders; Variables 
내용주기 : ConstantsPlaceholders; Computation graph; Sessions; TensorBoard; Adding scope; Summary; Questions; Further reading; Chapter 3: The Markov Decision Process and Dynamic Programming; The Markov chain and Markov process; Markov Decision Process; Rewards and returns; Episodic and continuous tasks; Discount factor; The policy function; State value function; State-action value function (Q function); The Bellman equation and optimality; Deriving the Bellman equation for value and Q functions; Solving the Bellman equation; Dynamic programming; Value iteration; Policy iteration 
내용주기 : Solving the frozen lake problemValue iteration; Policy iteration; Summary; Questions; Further reading; Chapter 4: Gaming with Monte Carlo Methods; Monte Carlo methods; Estimating the value of pi using Monte Carlo; Monte Carlo prediction; First visit Monte Carlo; Every visit Monte Carlo; Let's play Blackjack with Monte Carlo; Monte Carlo control; Monte Carlo exploration starts; On-policy Monte Carlo control; Off-policy Monte Carlo control; Summary; Questions; Further reading; Chapter 5: Temporal Difference Learning; TD learning; TD prediction; TD control; Q learning 
내용주기 : Solving the taxi problem using Q learningSARSA; Solving the taxi problem using SARSA; The difference between Q learning and SARSA; Summary; Questions; Further reading; Chapter 6: Multi-Armed Bandit Problem; The MAB problem; The epsilon-greedy policy; The softmax exploration algorithm; The upper confidence bound algorithm; The Thompson sampling algorithm; Applications of MAB; Identifying the right advertisement banner using MAB; Contextual bandits; Summary; Questions; Further reading; Chapter 7: Playing Atari Games; Introduction to Atari games; Building an Atari emulator; Getting started 
요약 : Reinforcement learning and deep reinforcement learning are the trending and most promising branches of artificial intelligence. This Learning Path will enable you to master not only the basic reinforcement learning algorithms but also the advanced deep reinforcement learning algorithms and their limitations. 
일반주제명 : Python (Computer program language) -- 
일반주제명 : Reinforcement learning. -- 
개인저자 : Ravichandiran, Sudharsan, author.
개인저자 : Saito, Sean, author.
개인저자 : Shanmugamani, Rajalingappaa, author.
개인저자 : Wenzhuo, Yang, author.
기타형태 저록 : Print version: Ravichandiran, Sudharsan. Python Reinforcement Learning : Solve Complex Real-World Problems by Mastering Reinforcement Learning Algorithms Using OpenAI Gym and TensorFlow. Birmingham : Packt Publishing, Limited, ©2019, 9781838649777
언어 영어
원문
URL :
    • 예약
    • 인쇄
    • SSMS
    • 서가부재
    • 보존서고
    • 우선정리예약
    • 무인예약대출

    예약

    1. 1. 예약현황은 홈페이지 로그인 후 예약 페이지에 확인 가능합니다.
    2. 2. 도착 통보된 예약자료 대출을 원하지 않는 경우에는 예약 현황에서 취소할 수 있습니다.
    3. 3. 기타 문의사항은 도서관에 문의 바랍니다.
    닫기

    무인예약대출

    1. 1. 무인예약대출 현황은 홈페이지 로그인 후 무인예약대출 페이지에 확인 가능합니다.
    2. 2. 무인예약대출자료 대출을 원하지 않는 경우에는 무인예약대출 페이지에서 신청 또는 접수상태인 경우만 취소할 수 있습니다.
    3. 3. 희망대출일은 신청일로부터 최대 1주일 까지 가능합니다.
    4. 4. 희망대출일을 선택하지 않은 경우 대출대기 통보 후 1주일까지 기기에서 대출가능합니다.
    5. 5. 기타 문의사항은 도서관에 문의 바랍니다.
    닫기
    서평쓰기

    서평쓰기

    서평쓰기
    닫기
    태그추가

    태그추가

    닫기

    QR코드

    닫기
    챗봇
    • 도서관 대화형 검색봇 서비스 앤디입니다.