Research Recap -

Concept Bottleneck Models

human-AI collaboration

Imagine deploying a neural network. That’s it, that’s the joke.

Vaibhav Balloli

Offline Reinforcement Learning

RL

Reinforcement Learning(RL), as simply stated in Sutton and Barto Sutton and Barto (2018) , is learning what to do i.e map situations to actions to maximize the cumulative…

Vaibhav Balloli

Lessons from writing Research Code

code

setup

research

Having written a good amount of research code for a while now, I was wondering what “good research code” can be construed as. This topic has been discussed heavily with a…

Vaibhav Balloli

Development Setup

code

setup

My programming language of choice has mostly been Python lately, with dev both on Linux and Windows (and WSL) - both when I’m working on personal and professional projects.…

Vaibhav Balloli