Marc Velay

256 Followers

Published in Towards Data Science · Feb 1

Tuning RL HyperParameters with Hydra's Optuna sweeper

Configure your Stable-Baselines3 tuning pipeline with ease using Hydra and Optuna. Reinforcement Learning (RL) agents are very sensitive to their hyperparameters. The same agent can go from utterly worthless after training to the top of the leaderboard with the correct hyperparameters! Yet finding the correct combination can be very time-consuming or even futile without the right tools. The main goal of…

Reinforcement Learning

8 min read

Published in Towards Data Science · Aug 16, 2022

Reinforcement Learning Intro: Markov Decision Process

The Markov Decision Process (MDP) is one of the most fundamental concepts in Reinforcement Learning. It is used to represent decision making in optimization problems. The version we present here is the finite MDP, which models discrete-time problems with discrete actions and some amount of stochasticity. …

Reinforcement Learning

9 min read

Published in Towards Data Science · Feb 17, 2022

Target Networks: Slow and Steady Wins the Race

Reinforcement Learning is the third family of Machine Learning algorithms, after supervised and unsupervised learning. The aim is to find optimal behavior in an environment through interactions and learning from them. In a perfect world, we can efficiently measure the state of the environment, predict the outcome of each of…

Machine Learning

5 min read

Dec 2, 2020

Implementing Hawkes Processes in Python using tick

Hawkes processes are a type of stochastic process used to model stochastic point processes. I have presented this algorithm in more theoretical detail in this post. There are few resources that go into great depth about this widely used algorithm. There are fewer yet that cover its use…

Python

2 min read

Nov 19, 2020

A brief introduction to Hawkes Processes

Hawkes processes are a type of stochastic process used to model stochastic (here, random) point processes. Hawkes processes are a random and finite series of events governed by a probabilistic rule. They were originally created to mitigate a disadvantage of Poisson processes, another type of…

Financial Modelling

3 min read

PhD student in reinforcement learning. I share my journey here and at https://velaylearning.com
