Vanishing Gradients
a data podcast with hugo bowne-anderson
Displaying all 2 Episode of Vanishing Gradients with the tag “evals”.
-
Episode 60: 10 Things I Hate About AI Evals with Hamel Husain
September 30th, 2025 | 1 hr 13 mins
ai, data science, evals, genai, llms, machine learning
Most AI teams find "evals" frustrating, but ML Engineer Hamel Husain argues they’re just using the wrong playbook. In this episode, he lays out a data-centric approach to systematically measure and improve AI, turning unreliable prototypes into robust, production-ready systems.
-
Episode 50: A Field Guide to Rapidly Improving AI Products -- With Hamel Husain
June 17th, 2025 | Season 1 | 27 mins 42 secs
ai, data science, evals, llms, machine learning
Hugo talks with Hamel Hussain (ex-Airbnb, GitHub, DataRobot) about how to improve AI products through evaluation, error analysis, and iteration. They discuss why most teams overlook debugging LLM systems, how to prioritize what to fix, and why evals are not just metrics—but a full development process.