Marc Velay – Medium

Marc Velay

Custom Gymnasium Envs and Multi-Input Stable-Baselines3 Agents

Create a custom Gymnasium environment using Dict spaces and process them with stable-baselines3's MultiInput Policy.

5 min readAug 7, 2023

--

Custom Gymnasium Envs and Multi-Input Stable-Baselines3 Agents

--

Marc Velay
in
Towards Data Science

Easily Tune RL HyperParameters with Hydra's Optuna sweeper

Reinforcement Learning agents are very susceptible to their hyperparameters. Easily find the best combo with Hydra and Optuna!

8 min readFeb 1, 2023

--

Easily Tune RL HyperParameters with Hydra's Optuna sweeper

--

Marc Velay
in
Towards Data Science

Reinforcement Learning Intro: Markov Decision Process

A Markov Decision Process is fundamental in RL. We focus on presenting it as a general mathematical framework and its main difficulties.

9 min readAug 16, 2022

--

1

Decision Making at a crossroads

--

1

Marc Velay
in
Towards Data Science

Target Networks: Slow and Steady Wins the Race

Dive deep into Target Networks and why they have become so popular in state-of-the-art Reinforcement Learning algorithms.

5 min readFeb 17, 2022

--

1

Target Networks: Slow and Steady Wins the Race

--

1

Marc Velay

Implementing Hawkes Processes in Python using tick

Few online ressources present how to implement HPs in Python. Here is a tutorial on exactly that, in python using the tick library!

2 min readDec 2, 2020

--

Implementing Hawkes Processes in Python using tick

--

Marc Velay

A brief introduction to Hawkes Processes

Hawkes processes are a type of stochastic processes. These are used to model stochastic — hear random — Point Processes. Hawkes processes…

3 min readNov 19, 2020

--

1

A brief introduction to Hawkes Processes

--

1

Marc Velay

Marc Velay

PhD student in reinforcement learning. I share my journey here and at https://velaylearning.com

Following

Help
Status
About
Careers
Blog
Privacy
Terms
Text to speech
Teams