Black Bg

정회원신청

정회원 신청은 대출이 가능한 소속 부대 도서관 홈페이지에서 요청하셔야 합니다.
정회원 신청 하시겠습니까?

닫기
검색

검색

  • Home
  • 기능목록
  • 검색

상세정보

Reinforcement Learning Algorithms with Python : Learn, Understand, and Develop Smart Algorithms for Addressing AI Challenges

QR코드
도서 상세정보
자료유형 : eBook
ISBN : 1789139708 
ISBN : 9781789139709 
개인저자 : Lonza, Andrea.
서명/저자사항 : Reinforcement Learning Algorithms with Python:  Learn, Understand, and Develop Smart Algorithms for Addressing AI Challenges /:  Andrea Lonza. 
발행사항 : Birmingham:  Packt Publishing, Limited,  2019. 
형태사항 : 1 online resource (356 pages). 
일반주기 : Implementing REINFORCE with baseline 
내용주기 : Cover; Title Page; Copyright and Credits; Dedication; About Packt; Contributors; Table of Contents; Preface; Section 1: Algorithms and Environments; Chapter 1: The Landscape of Reinforcement Learning; An introduction to RL; Comparing RL and supervised learning; History of RL; Deep RL; Elements of RL; Policy; The value function; Reward; Model; Applications of RL; Games; Robotics and Industry 4.0; Machine learning; Economics and finance; Healthcare; Intelligent transportation systems; Energy optimization and smart grid; Summary; Questions; Further reading 
내용주기 : Chapter 2: Implementing RL Cycle and OpenAI GymSetting up the environment; Installing OpenAI Gym; Installing Roboschool; OpenAI Gym and RL cycles; Developing an RL cycle; Getting used to spaces; Development of ML models using TensorFlow; Tensor; Constant; Placeholder; Variable; Creating a graph; Simple linear regression example; Introducing TensorBoard; Types of RL environments; Why different environments?; Open source environments; Summary; Questions; Further reading; Chapter 3: Solving Problems with Dynamic Programming; MDP; Policy; Return; Value functions; Bellman equation 
내용주기 : Categorizing RL algorithmsModel-free algorithms; Value-based algorithms; Policy gradient algorithms; Actor-Critic algorithms; Hybrid algorithms; Model-based RL; Algorithm diversity; Dynamic programming; Policy evaluation and policy improvement; Policy iteration; Policy iteration applied to FrozenLake; Value iteration; Value iteration applied to FrozenLake; Summary; Questions; Further reading; Section 2: Model-Free RL Algorithms; Chapter 4: Q-Learning and SARSA Applications; Learning without a model; User experience; Policy evaluation; The exploration problem; Why explore?; How to explore 
내용주기 : TD learningTD update; Policy improvement; Comparing Monte Carlo and TD; SARSA; The algorithm; Applying SARSA to Taxi-v2; Q-learning; Theory; The algorithm; Applying Q-learning to Taxi-v2; Comparing SARSA and Q-learning; Summary; Questions; Chapter 5: Deep Q-Network; Deep neural networks and Q-learning; Function approximation; Q-learning with neural networks; Deep Q-learning instabilities; DQN; The solution; Replay memory; The target network; The DQN algorithm; The loss function; Pseudocode; Model architecture; DQN applied to Pong; Atari games; Preprocessing; DQN implementation; DNNs 
내용주기 : The experienced bufferThe computational graph and training loop; Results; DQN variations; Double DQN; DDQN implementation; Results; Dueling DQN; Dueling DQN implementation; Results; N-step DQN; Implementation; Results; Summary; Questions; Further reading; Chapter 6: Learning Stochastic and PG Optimization; Policy gradient methods; The gradient of the policy; Policy gradient theorem; Computing the gradient; The policy; On-policy PG; Understanding the REINFORCE algorithm; Implementing REINFORCE; Landing a spacecraft using REINFORCE; Analyzing the results; REINFORCE with baseline 
요약 : With this book, you will understand the core concepts and techniques of reinforcement learning. You will take a look into each RL algorithm and will develop your own self-learning algorithms and models. You will optimize the algorithms for better precision, use high-speed actions and lower the risk of anomalies in your applications. 
일반주제명 : Computer algorithms. -- 
일반주제명 : Python (Computer program language) -- 
일반주제명 : Computer algorithms. -- 
일반주제명 : Python (Computer program language) -- 
기타형태 저록 : Print version: Lonza, Andrea. Reinforcement Learning Algorithms with Python : Learn, Understand, and Develop Smart Algorithms for Addressing AI Challenges. Birmingham : Packt Publishing, Limited, ©2019, 9781789131116
언어 영어
원문
URL :
    • 예약
    • 인쇄
    • SSMS
    • 서가부재
    • 보존서고
    • 우선정리예약
    • 무인예약대출

    예약

    1. 1. 예약현황은 홈페이지 로그인 후 예약 페이지에 확인 가능합니다.
    2. 2. 도착 통보된 예약자료 대출을 원하지 않는 경우에는 예약 현황에서 취소할 수 있습니다.
    3. 3. 기타 문의사항은 도서관에 문의 바랍니다.
    닫기

    무인예약대출

    1. 1. 무인예약대출 현황은 홈페이지 로그인 후 무인예약대출 페이지에 확인 가능합니다.
    2. 2. 무인예약대출자료 대출을 원하지 않는 경우에는 무인예약대출 페이지에서 신청 또는 접수상태인 경우만 취소할 수 있습니다.
    3. 3. 희망대출일은 신청일로부터 최대 1주일 까지 가능합니다.
    4. 4. 희망대출일을 선택하지 않은 경우 대출대기 통보 후 1주일까지 기기에서 대출가능합니다.
    5. 5. 기타 문의사항은 도서관에 문의 바랍니다.
    닫기
    서평쓰기

    서평쓰기

    서평쓰기
    닫기
    태그추가

    태그추가

    닫기

    QR코드

    닫기
    챗봇
    • 도서관 대화형 검색봇 서비스 앤디입니다.