My works

These are projects conducted in my classes at MIT and beyond

KINA Benchmark

KINA Benchmark

A high-density benchmark spanning 261 disciplines with game-theoretic annotation
CogGym

CogGym

A scalable framework for comparing AI models against humans using cognitive science experiments
40Hz Toolbox

40Hz Toolbox

VR-based 40Hz light therapy for adult amblyopia and Alzheimer's disease
Caveman Papers

Caveman Papers

Animation agents that turn research papers into short-form reels
Persona Collapse

Persona Collapse

Why many “different” AI personas end up sounding strangely the same
Team Mulan

Team Mulan

The first all-girls robotics team in China