-
Auditing a Codebase for 87 cents in 50 lines of code using RLMs
-
AI's Hedonic Treadmill vs. Task Horizon Exponentials
-
Achieving 20 percentage-point improvement in structured extraction tasks using DSPy and GEPA
-
Using DSPy to Detect Document Boundaries
-
Acceleration: Notes on 'Measuring AI Ability to Complete Long Tasks'
-
Summarizing video transcripts with an LLM
-
Measuring LLM Confidence