hetankevin
Pinned

  1. hybridcov Public

    Source code for a paper on how optimism can boost sample efficiency within hybrid RL, accepted to RLC 2024.

    Jupyter Notebook 1 1

  2. mdpmix Public

    Source code for learning mixtures of Markov chains and MDPs, accepted as a short oral presentation at ICML 2023.

    Jupyter Notebook 1

  3. off-policy Public

    Source code for off-policy evaluation and optimization in the presence of unobserved confounding, accepted to AISTATS 2024.

    Jupyter Notebook 1

  4. diffPomp Public

    Estimation, filtering, and inference for partially-observed Markov processes via a two-stage algorithm involving gradient descent (with a novel gradient estimate for the particle filter) warm-start…

    Jupyter Notebook 2 1

  5. probono Public

    Algorithms for accelerating personalization (in the context of linear bandit recommendations) for new users, given access to embeddings from other users with heterogeneous tastes and preferences. Nov…

    Jupyter Notebook 1