Welcome to Learn AutoResearch
Learn AutoResearch is a project-based course on automating research using the autoresearch framework — a generalization of Karpathy's autonomous ML training loop to any domain with a measurable metric.
"Set the GOAL → The agent runs the LOOP → You wake up to results."
What you will learn
- Define measurable research goals — turn vague objectives into mechanical metrics any agent can optimize.
- Run autonomous improvement loops — one change per iteration, automatic rollback, git as memory.
- Debug scientifically — falsifiable hypotheses, evidence-based investigation, zero-error termination.
- Predict before acting — five expert perspectives before committing to any major change.
- Audit security autonomously — STRIDE + OWASP + red-team analysis with code-level evidence.
- Ship with confidence — 8-phase pipeline covering code, content, and deployments.
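The "one change per iteration, automatic rollback" idea from the list above can be sketched in a few lines. This is a minimal illustration, not the framework's implementation: `improvement_loop`, `propose`, and the history tuples are hypothetical names, the metric is assumed to be lower-is-better, and an in-memory list stands in for the git history the course uses as memory.

```python
import random
from typing import Callable, List, Tuple, TypeVar

T = TypeVar("T")


def improvement_loop(
    candidate: T,
    metric: Callable[[T], float],              # lower is better (assumption)
    propose: Callable[[T, random.Random], T],  # makes exactly one change
    iterations: int,
    seed: int = 0,
) -> Tuple[T, float, List[Tuple[str, float, str]]]:
    """Keep a change only if it improves the metric; otherwise roll back."""
    rng = random.Random(seed)
    best, best_score = candidate, metric(candidate)
    # The history list plays the role of a commit log in this toy version.
    history = [("baseline", best_score, "keep")]
    for i in range(1, iterations + 1):
        trial = propose(best, rng)        # one change per iteration
        score = metric(trial)
        if score < best_score:            # strict improvement: keep it
            best, best_score = trial, score
            history.append((f"iter-{i}", score, "keep"))
        else:                             # no improvement: discard the trial
            history.append((f"iter-{i}", score, "rollback"))
    return best, best_score, history


if __name__ == "__main__":
    # Toy goal: find x that minimizes (x - 3)^2, starting from x = 0.
    best, score, history = improvement_loop(
        candidate=0.0,
        metric=lambda x: (x - 3.0) ** 2,
        propose=lambda x, rng: x + rng.uniform(-1.0, 1.0),
        iterations=50,
    )
    kept = sum(1 for _, _, decision in history if decision == "keep")
    print(f"best x = {best:.3f}, metric = {score:.5f}, kept {kept} of {len(history)} entries")
```

Because a trial is accepted only on strict improvement, the metric of the kept state never gets worse, which is what makes an unattended run safe to leave overnight.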
Get started
Lectures
Twelve lectures, from first principles (why manual iteration fails) to advanced overnight runs and CI/CD integration.
Projects
Six hands-on projects — each with a starter codebase and a reference solution, building to a full end-to-end pipeline.
Resource Library
Copy-ready templates: research.md, evaluate.py, results.tsv, and metric cheat sheets for 15 domains.
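To give a feel for how an evaluation script and a results log might fit together, here is a small sketch of appending one row per iteration to a tab-separated file. The column names (`iteration`, `metric`, `decision`) and the helpers `append_result` and `read_results` are assumptions made for illustration; the actual results.tsv and evaluate.py templates in the library may use a different layout.

```python
import csv
from pathlib import Path
from typing import Dict, List, Union

# Hypothetical column layout; the course template may differ.
FIELDS = ["iteration", "metric", "decision"]


def append_result(
    path: Union[str, Path], iteration: int, metric: float, decision: str
) -> None:
    """Append one row to a TSV log, writing the header if the file is new."""
    path = Path(path)
    is_new = not path.exists() or path.stat().st_size == 0
    with path.open("a", newline="") as f:
        writer = csv.writer(f, delimiter="\t", lineterminator="\n")
        if is_new:
            writer.writerow(FIELDS)
        # Fixed precision keeps rows easy to diff between runs.
        writer.writerow([iteration, f"{metric:.6f}", decision])


def read_results(path: Union[str, Path]) -> List[Dict[str, str]]:
    """Load the log back as a list of dicts keyed by column name."""
    with Path(path).open(newline="") as f:
        return list(csv.DictReader(f, delimiter="\t"))


if __name__ == "__main__":
    log = Path("results_example.tsv")  # example file name, not the template's
    append_result(log, 0, 9.0, "keep")
    append_result(log, 1, 9.5, "rollback")
    for row in read_results(log):
        print(row)
```

An append-only, plain-text log like this can be committed alongside each change, so the history of what was tried stays readable with ordinary tools.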
The Core Loop
Every autoresearch command is built on the same five-stage loop.
Course Structure
The course is organized into six phases, each containing two lectures and one hands-on project:
| Phase | Theme | Lectures | Project |
|---|---|---|---|
| 1 | Why AutoResearch Works | L01–L02 | Sort optimization |
| 2 | Master the Core Loop | L03–L04 | Function fitting |
| 3 | Debug & Fix | L05–L06 | FastAPI debugging |
| 4 | Predict & Reason | L07–L08 | Architecture debate |
| 5 | Security & Scenarios | L09–L10 | Security audit pipeline |
| 6 | Ship & Advanced Patterns | L11–L12 | End-to-end research |
Next steps
- Lecture 01: Why Manual Iteration Fails — Start with Karpathy's original insight.
- Project 01: Your First Research Loop — Run the sort optimization example hands-on.
- Templates — Grab the research.md and evaluate.py templates for your own projects.