r
rupalibhati

rupalibhati

5,0(14)

Reinforcement Learning Researcher

India
Engels, Hindi
Sommige informatie wordt in het Engels weergegeven.
Over mij
Reinforcement Learning projects in python. I have experience in the following reinforcement learning algorithms: 1. Value Iteration 2. Policy Iteration 3. Q-learning 4. DQN (Deep Q Network) 5. DDPG (Deep Deterministic Policy Gradient) 6. TRPO (Trust Region Policy Optimisation) 7. PPO (Proximal Policy Optimisation) I am very comfortable with OpenAI gym and can create any kind of custom environment. I can work with Tensorflow and Pytorch. ... Lees meer

Skills

r
rupalibhati
rupalibhati
offline