Skip to content
View bay3s's full-sized avatar
🎧
🎧
  • Amsterdam, Netherlands

Block or report bay3s

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. rl-squared rl-squared Public

    RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning

    Python 12 3

  2. auto-dr auto-dr Public

    Automatic Domain Randomization (ADR) proposed in "Solving Rubik's Cube with a Robot Hand"

    Python 7

  3. ppo-parallel ppo-parallel Public

    Parallelized implementation of Proximal Policy Optimization (PPO).

    Python 1

  4. reinforce-rl reinforce-rl Public

    Vanilla Policy Gradient (REINFORCE) implementation with PyTorch

    Jupyter Notebook 1

  5. meta-rl meta-rl Public

    An in-depth exploration and comparative analysis of representative methods for Meta Reinforcement Learning and Curriculum Design.

    Jupyter Notebook

  6. maml maml Public

    Implementation of supervised model-agnostic meta-learning in PyTorch

    Jupyter Notebook 1