Policy Based Methods

Continuous Control: Deep Deterministic Policy Gradient

This project repository contains my work for the Udacity’s Deep Reinforcement Learning Nanodegree Project 2: Continuous Control. Project’s goal In this environment, a double-jointed arm can move to target locations. A reward of +0.