
DDPG: Deep Deterministic Policy Gradient (TensorFlow 2.x)
July 2025 · Completed
A robust, modular, and extensible reimplementation of the DDPG reinforcement learning algorithm (Lillicrap et al., 2015) using TensorFlow 2.x. Built for reproducibility, research comparison, and interactive experimentation with Gym environments.
- Modular architecture: separate agent, networks, noise, buffer, wrappers
- Hyperparameter sweeps, YAML config system, and TensorBoard logging
- Includes a full unit test suite and reproducibility controls (seeding, version logging)
- Supports Gym ≥ 0.26 and Apple Silicon acceleration (tensorflow-macos, tensorflow-metal)
Reinforcement Learning · DDPG · TensorFlow · OpenAI Gym
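
To make the modular split more concrete, here is a minimal sketch of what two of the listed components, the buffer and the noise, might look like. It is not taken from the project's source: the names `Transition`, `ReplayBuffer`, and `OUNoise`, their signatures, and the default parameters are all assumptions for illustration. The choice of an Ornstein-Uhlenbeck process is also an assumption; it follows the exploration noise described by Lillicrap et al. (2015), and the project may use something else. The sketch uses only the Python standard library so it runs without TensorFlow, and each component takes its own seed to illustrate one way to keep sampling repeatable.

```python
import math
import random
from collections import deque, namedtuple

# All names below are hypothetical; they are not the project's actual API.
Transition = namedtuple("Transition", "state action reward next_state done")


class ReplayBuffer:
    """Fixed-capacity FIFO store of transitions with uniform sampling."""

    def __init__(self, capacity, seed=None):
        self._storage = deque(maxlen=capacity)  # oldest transitions drop out first
        self._rng = random.Random(seed)         # private RNG so sampling is repeatable

    def add(self, state, action, reward, next_state, done):
        self._storage.append(Transition(state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform sampling without replacement; raises ValueError if
        # batch_size exceeds the number of stored transitions.
        return self._rng.sample(list(self._storage), batch_size)

    def __len__(self):
        return len(self._storage)


class OUNoise:
    """Ornstein-Uhlenbeck process: temporally correlated exploration noise."""

    def __init__(self, size, mu=0.0, theta=0.15, sigma=0.2, dt=1.0, seed=None):
        # Default theta, sigma, and dt are assumed values for illustration.
        self.mu, self.theta, self.sigma, self.dt = mu, theta, sigma, dt
        self._rng = random.Random(seed)
        self.state = [mu] * size

    def reset(self):
        # Return the process to its mean, e.g. at the start of an episode.
        self.state = [self.mu] * len(self.state)

    def sample(self):
        # Euler-Maruyama step: x += theta * (mu - x) * dt + sigma * sqrt(dt) * N(0, 1)
        self.state = [
            x + self.theta * (self.mu - x) * self.dt
            + self.sigma * math.sqrt(self.dt) * self._rng.gauss(0.0, 1.0)
            for x in self.state
        ]
        return list(self.state)
```

In a training loop, an agent built along these lines would add each sampled noise vector to the actor's deterministic action before stepping the environment, store the resulting transition with `add`, and draw minibatches with `sample` once the buffer holds at least one batch.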