Yash Raj Pandey

AI systems engineer. I build local-first LLM infrastructure, agent platforms, evaluation systems, and developer tools.

At UF IFAS, I lead the AI systems function I proposed. I also reproduce new model research, contribute fixes upstream, and write about what broke. Outside work, I follow football and build small tools I want to use myself.

Selected Work

BatonDeveloper toolA kit that installs a reliable code-delegation workflow into Claude Code
LoomaAI infrastructureLocal-first project memory for coding agents
mddocsCollaboration toolGit-native collaborative Markdown, with an agent API

All work ->

I also have 45+ pull requests merged across 30+ open-source projects, including llama.cpp, Apple MLX, and Google Research's TabFM. See the pull requests.

Writing

Baton: I Built a Tool for Delegating Code to an AI, Then Used It to Build ItselfJul 13, 2026A standalone kit that installs a reliable orchestrator-delegates-to-executor workflow into Claude Code - and the dogfooding loop that wrote it
I Tried to Break Google's New Tabular Foundation Model. Then I Fixed It.Jul 1, 2026An independent, reproducible evaluation of TabFM - what held up, what did not, and the bug fix that got merged into Google's repo
From 0% to 100%: Debugging a KV Cache Compression Algorithm on a 16GB MacBookMar 27, 2026TurboQuant on an M1 Pro - five bugs across two codebases, one coupled math fix, and 100% needle retrieval at 16K with 3.5x less KV memory

All writing ->