Ik train een deep reinforcement learning agent voor jou


Over deze dienst
Automatische vertaling
Ervaren Research Engineer in Computer Vision in Reinforcement Learning die bedreven is in het trainen van reinforcement learning agents.
Voorgaand werk omvat:
- Implementatie van onderzoeksartikelen.
- Q learning agents voor het spelen van single- en multiplayer games.
- Training van de meeste OpenAI Gym-omgevingen.
- DQN training met alleen NumPy vanaf nul.
- Training van meerdere op maat gemaakte agents.
- Training van elke Reinforcement Learning agent op aangepaste omgevingen.
Biedt state-of-the-art implementaties van reinforcement learning algoritmes voor jouw aangepaste omgevingen of omgevingen van OpenAI gymnasium.
In staat om zowel eenvoudige als complexe omgevingen te beheren.
- Bedreven in MDPs, TD en Q-learning.
- DQN (Deep Q-Networks)
- PPO (Proximal Policy Optimization)
- TRPO (Trust Region Policy Optimization)
- Actor-Critic methoden
- A2C (Advantage Actor-Critic)
- A3C (Asynchronous Advantage Actor-Critic)
- Monte Carlo methoden
- DDPG (Deep Deterministic Policy Gradient)
- SAC (Soft Actor-Critic)
- HER (Hindsight Experience Replay)
- ACER (Actor-Critic met Experience Replay)
Neem contact op voordat je je bestelling plaatst voor snelle hulp.
Wees gerust, je ontvangt snel een reactie.
Maak kennis met Hakim Ali
- Afkomstig uitPakistan
- Lid sindsjan 2023
- Laatste levering2 jaar
Talen
Engels
Automatische vertaling
4 reviews van deze dienst
| (4) | ||
| (0) | ||
| (0) | ||
| (0) | ||
| (0) |
Specificering van de beoordeling
- Communicatieniveau van de freelancer
- Aanbevelingswaardig
- Dienst zoals beschreven
Sorteer op
A ash5355
Terugkerende klant

Verenigd Koninkrijk
This is second time I work with him..Great job..Indeed a smart person who deliver the work in a day or two as quick as posdibke covering all the requirements.
US$ 50-US$ 100
Prijs
4 dagen
Looptijd
Nuttig?N 
nemosu
Terugkerende klant

Verenigde Arabische Emiraten
Hakim is very brilliant and talented. He gave me a perfect MLP built from scratch without using any Python libraries as I asked him and implemented TD correctly to the game. He was very patient with me in changing anything I point to him and answer any questions I had. He cares about his clients and...
US$ 50-US$ 100
Prijs
4 dagen
Looptijd
Nuttig?A ash5355
Terugkerende klant

Verenigd Koninkrijk
Best work ever..He delivered within few hours..Perfectionist..would definitely recommend him ..He trained an RL agent for me to race car..
Nuttig?Z zagato5800

Duitsland
Thanks again for your fast and good work, ihkali! I am very satisfied with the results achieved and would also let you solve RL tasks in the future! In addition, a nice contact who also responds to questions very well.
Nuttig?
4 reviews van deze dienst
| (4) | ||
| (0) | ||
| (0) | ||
| (0) | ||
| (0) |
Specificering van de beoordeling
- Communicatieniveau van de freelancer
- Aanbevelingswaardig
- Dienst zoals beschreven
Sorteer op
A ash5355
Terugkerende klant

Verenigd Koninkrijk
This is second time I work with him..Great job..Indeed a smart person who deliver the work in a day or two as quick as posdibke covering all the requirements.
US$ 50-US$ 100
Prijs
4 dagen
Looptijd
Nuttig?N 
nemosu
Terugkerende klant

Verenigde Arabische Emiraten
Hakim is very brilliant and talented. He gave me a perfect MLP built from scratch without using any Python libraries as I asked him and implemented TD correctly to the game. He was very patient with me in changing anything I point to him and answer any questions I had. He cares about his clients and...
US$ 50-US$ 100
Prijs
4 dagen
Looptijd
Nuttig?A ash5355
Terugkerende klant

Verenigd Koninkrijk
Best work ever..He delivered within few hours..Perfectionist..would definitely recommend him ..He trained an RL agent for me to race car..
Nuttig?Z zagato5800

Duitsland
Thanks again for your fast and good work, ihkali! I am very satisfied with the results achieved and would also let you solve RL tasks in the future! In addition, a nice contact who also responds to questions very well.
Nuttig?
