Recent posts

RL Algorithms – PPO, DDPG and TRPO

    less than 1 minute read    

This week I learned the details of some RL algorithms (PPO, DDPG, TRPO) and implemented them. Please see my CoLab Python Notebook for more information.

Medi.RL: Final Project Proposal

    less than 1 minute read    

During week Mar 18th to Mar 22nd, I surveyed current literature in the applications of RL in business, health care, and other real life applications ( RLinRL...

Train and Test Policies in OpenAI Gym Environments

    less than 1 minute read    

This week I played around with OpenAI Gym. Specifically, I explored most environments in Gym, tested a random policy, deterministic heuristic policy and t...