Work

Research projects, experiments, and creative applications.

2025

Nova Act: SOTA browser-use model

Nova Act: SOTA browser-use model

2025

In my internship with Amazon AGI, I worked on RL for a browser-use model. I led model performance on two benchmarks & worked on algorithms/performance.

LLMs can invent their own compression

LLMs can invent their own compression

2025

As a constrained optimization problem, LLMs can use RL to invent their own compression schemes to increase its context window.

Natural Deception with RL

Natural Deception with RL

2025

Language models, when trained on hidden-information games, naturally learn deceptive techniques to win the game by any means.

Cross Lingual Alignment

Cross Lingual Alignment

2025

Research under Cohere Labs for a compute-efficient post training to represent different languages as modalities for multilingual language models.

Cursor Observe

Cursor Observe

2025

Let agents actively observe the terminal and proactively fix training runs, security alerts and potential bugs.

PokeOS

PokeOS

2025

Local tunnel MCP for text agents to interact with your OS. Poke can text others, fill out forms on your browser, play music and fix code.

nanochatVL

nanochatVL

2025

Giving vision to Karpathy's nanochat for <$10 of compute, by implementing LLaVA via SIGLIP encoder injection and fine-tuning on vision Q&A.

Shadow

Shadow

2025

Open-source background coding agent with 1.4k stars on GitHub. Feature-filled agent that works in a MicroVM with full codebase understanding.

Kino AI: Hollywood Video Editing Agent

Kino AI: Hollywood Video Editing Agent

2025

Multimodal agent and long-context video understanding to help hollywood editors. Worked on the infra, video retrieval & the agent.

Local VLM on a Samsung Galaxy

Local VLM on a Samsung Galaxy

2025

Tricked a Galaxy S24 to run Moondream 3B VLM locally, with quantization + local linux setup on phone. Built at TreeHacks 2025

GPU optimized voxel grids

GPU optimized voxel grids

2025

Designed and implemented GPU-optimized voxel grids for humanoid design team in Waterloo. Co-led ML team.

2024

2023

2022