I've worked as an ML Engineer at various startups, and I am based in the Bay Area.
I studied computer science and mathematics at Princeton University.
My senior thesis focuses on applying the Spectral Transform Unit (STU) to state prediction in world models for Atari environments. I have had the privilege of being advised by Prof. Elad Hazan.
Some other interests: marimba, karaoke, swimming, languages, reading, rowing. Feel free to reach out by emailing shivamkak3 [ at ] gmail [ dot ] com.
Student Researcher @ Google DeepMind
Working under Prof. Elad Hazan with a research focus on spectral transformers
Machine Learning Engineer @ MrBeast
Founding ML engineer for a stealth startup within Beast Industries
Software Engineer Intern @ Niantic, Inc.
Researched geospatial reasoning tasks and asset generation models
Research Engineer @ Hillspire
Worked with Sebastian Thrun and Eric Schmidt to incubate a funded consumer video generation startup
Selected Projects
SocraticSeminar: Reasoning in LLMs via Multiagent Debate
Built a multi-agent debate framework for LLMs, evaluated it on GSM8K, MMLU, BIG-bench Chess, and arithmetic benchmarks, and deployed it as a native iOS app
Dual-Class Share Arbitrage Algorithm
Built a trading bot with the Alpaca API that finds price discrepancies in 9 dual-class shares and executes simultaneous long/short positions
Selected Talks
Intro to Recommendation Algorithms Infrastructure: Understanding the Magic of ByteDance's Real-Time ML Production Pipeline, Monolith
Outlined Monolith's distributed worker system, fault-tolerant collisionless embedding table, and online training schedules for sparse vs. dense parameters
Genetic Algorithms for ML
A survey of genetic algorithms, followed by a deep dive on the NeuroEvolution of Augmenting Topologies (NEAT) algorithm and a few notable variants (HyperNEAT, coevolutionary NEAT)