Mini-LLM Pretraining Framework
Self-contained PyTorch codebase for transformer LLMs from scratch: RoPE/NoPE, MoE, KV cache, LoRA, mixed-precision training. Config-driven scaling from small to billion-parameter models.
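A minimal sketch of one of the framework's components, rotary position embeddings (RoPE), in plain PyTorch; the tensor layout and function name are illustrative, not the repo's actual API.

```python
import torch

def rope(x: torch.Tensor, base: float = 10000.0) -> torch.Tensor:
    """Rotary position embeddings for queries/keys.

    x: (batch, n_heads, seq_len, head_dim), head_dim even.
    Rotates channel pairs by position-dependent angles so attention
    scores depend only on relative positions.
    """
    b, h, t, d = x.shape
    half = d // 2
    # Per-pair frequencies: theta_i = base^(-i / (d/2))
    freqs = base ** (-torch.arange(half, device=x.device) / half)
    angles = torch.arange(t, device=x.device)[:, None] * freqs[None, :]  # (t, half)
    cos, sin = angles.cos(), angles.sin()
    x1, x2 = x[..., :half], x[..., half:]
    # 2-D rotation applied pairwise (rotate-half convention)
    return torch.cat([x1 * cos - x2 * sin, x1 * sin + x2 * cos], dim=-1)
```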
Themes & Publications
My work spans causal inference, model evaluation, and large-scale ML systems. The throughline is using rigorous statistical methodology to make black-box models easier to trust and reason about.
With Andrea Montanari at Granica. We estimate leave-one-out (LOO) Hessians via low-rank updates of the full-sample Hessian and mask eigen-directions associated with negative curvature in the LOO Hessian. For single-index data-generating processes with a mismatched teacher–student setup, the resulting prediction-error estimates stay within single-digit percent.
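A toy sketch of the core computation, assuming a GLM-style per-example loss so the leave-one-out update is rank-one; the names and masking threshold are illustrative.

```python
import numpy as np

def loo_hessian_masked(H_full: np.ndarray, x_i: np.ndarray, curv_i: float,
                       eps: float = 0.0):
    """LOO Hessian via a rank-one downdate, with negative-curvature
    eigen-directions masked before inversion.

    H_full: full-sample Hessian sum_j curv_j * x_j x_j^T   (p x p)
    x_i:    held-out example's features                    (p,)
    curv_i: per-example loss curvature at x_i^T theta
    """
    # Remove example i's contribution (rank-one downdate).
    H_loo = H_full - curv_i * np.outer(x_i, x_i)
    # Keep only eigen-directions with positive curvature.
    vals, vecs = np.linalg.eigh(H_loo)
    keep = vals > eps
    # Pseudo-inverse restricted to the positive-curvature subspace.
    H_loo_pinv = (vecs[:, keep] / vals[keep]) @ vecs[:, keep].T
    return H_loo, H_loo_pinv
```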
Foundation models for tabular data that incorporate column semantics, with dataset generalization measured via per-example permutation of columns.
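A minimal sketch of the permutation probe, assuming the model consumes (column-name, value) pairs; the representation is a placeholder.

```python
import numpy as np

def permute_columns_per_example(X: np.ndarray, col_names: list[str],
                                rng: np.random.Generator) -> list:
    """Independently shuffle column order for each example, keeping
    (name, value) pairs intact. A model that truly uses column semantics
    should be invariant to this; a position-dependent one will degrade."""
    batch = []
    for row in X:
        perm = rng.permutation(len(col_names))
        batch.append([(col_names[j], row[j]) for j in perm])
    return batch
```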
Core contributor to LLM-as-judge evaluation. Designed a self-critique framework with embedded rubrics that achieves inter-rater reliability comparable to human annotators; proposed model-assisted estimation to combine human and model scores into an unbiased, lower-variance estimator.
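One way to write such a combined score is the classical difference estimator from survey sampling; a sketch assuming a uniformly random human-labeled subset (the paper's exact estimator may differ).

```python
import numpy as np

def model_assisted_mean(model_all: np.ndarray,
                        human_sub: np.ndarray,
                        model_sub: np.ndarray) -> float:
    """Cheap model scores on every item, plus a bias correction from a
    random human-labeled subset. Unbiased for the mean human score when
    the subset is uniform; variance shrinks as the model tracks humans."""
    return float(model_all.mean() + (human_sub - model_sub).mean())
```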
Quasi-experiment with Guido Imbens. Conditioning on bookmaker spreads absorbs anticipated effects, so residual performance reflects unanticipated ones; teams visiting cities with higher nightlife indices consistently underperform the spread. The effect replicates across the NBA and MLB.
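The design can be sketched as a regression of score margin on the closing spread plus the host city's nightlife index: if the spread absorbs everything the market anticipated, a nonzero nightlife coefficient is an unanticipated effect. Column and file names are hypothetical.

```python
import pandas as pd
import statsmodels.formula.api as smf

games = pd.read_csv("away_games.csv")  # hypothetical game-level data

# Conditioning on the spread absorbs anticipated factors; the nightlife
# coefficient then measures systematic under/over-performance.
fit = smf.ols("margin ~ spread + nightlife_index", data=games).fit(
    cov_type="cluster", cov_kwds={"groups": games["host_city"]})
print(fit.params["nightlife_index"], fit.bse["nightlife_index"])
```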
Summer research at Lawrence Livermore with Kaiser Research: machine learning and Bayesian models for sepsis trajectory and prognosis from EHR signals.
Draft paper from a summer of research with Reza Zadeh at Stanford on a distributed algorithm for graph min-cut.
Quasi-experimental causal estimate using walk-on entry and mid-career injury as sources of within-subject identification. Heterogeneous treatment effects by sport and entering SAT score.
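A sketch of the within-subject design as a two-way fixed-effects regression; the outcome and variable names are placeholders, since the blurb leaves them unspecified.

```python
import pandas as pd
import statsmodels.formula.api as smf

panel = pd.read_csv("athlete_terms.csv")  # hypothetical person-by-term panel

# Person fixed effects absorb stable traits; term fixed effects absorb
# common shocks. `active` flips at walk-on entry or mid-career injury,
# so the coefficient is identified within person.
fit = smf.ols("outcome ~ active + C(person_id) + C(term)", data=panel).fit(
    cov_type="cluster", cov_kwds={"groups": panel["person_id"]})
print(fit.params["active"])
```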