Rajan Agarwal

Research Engineer, currently focused on post-training. Software Engineering Student @ University of Waterloo.

I work on RL for web agents as a Research Engineer at Amazon AGI Lab. Previously, I built multimodal video editing agents for Hollywood at Kino AI and low-level train safety systems at Hitachi Rail.

I am a deeply technical person. I'm constantly building, learning and breaking things. I'm obsessed with learning how things work and designing novel solutions to problems I can't get out of my head. Right now, I'm most curious about multimodal models and coding agents.

Reinforcement Learning @ Amazon AGI
LLMs can invent their own compression

2025

Framed as a constrained optimization problem, LLMs can use RL to invent their own compression schemes to increase their effective context window.

Shadow

2025

An open-source background coding agent with 1.4k stars on GitHub. A feature-filled agent that runs in a MicroVM with full codebase understanding.

Nova Act: SOTA browser-use model

2025

In my internship with Amazon AGI, I worked on RL for a browser-use model. I led model performance on two public benchmarks and contributed to the underlying algorithms.

Natural Deception with RL

2025

When trained on hidden-information games, language models naturally learn deceptive techniques to win by any means.

Cross Lingual Alignment

2025

Research under Cohere Labs on compute-efficient post-training that represents different languages as modalities in multilingual language models.

nanochatVL

2025

Giving vision to Karpathy's nanochat for <$10 of compute, by implementing LLaVA via SigLIP encoder injection and fine-tuning on vision Q&A.

Kino AI: Hollywood Video Editing Agent

2025

A multimodal agent with long-context video understanding to help Hollywood editors. Worked on the most powerful video retrieval and editing agent.

Local VLM on a Samsung Galaxy

2025

Tricked a Galaxy S24 into running the Moondream 3B VLM locally, with quantization and a local Linux setup on the phone. Built at TreeHacks 2025.

GPU optimized voxel grids

2025

Designed and implemented GPU-optimized voxel grids for the humanoid design team at Waterloo. Co-led the ML team.

Arceus: Distributed Training on Macbooks

2024

A decentralized cross-device model training system with model and tensor parallelism to reduce the compute needed to train large models.

Interoperable Coding Subagents

2024

One of the first implementations of coding subagents to work together to solve hard, diverse coding problems.

Multimodal Memory Architecture

2023

Long-term memory with multimodal knowledge graphs to search 7 days of video and audio within 5 seconds. Winners @ Hack the North 2023.

Shapeshift

2023

Deep learning analysis of seismic frequencies and local policy to design affordable earthquake-resistant buildings. Built under the RippleX Fellowship and RBCx.

Offline Mesh Network

2022

An offline mesh network written in Swift using Multipeer Connectivity, enabling cross-device file transfer entirely offline through a chain of encrypted nodes.