Vanishing Gradients

A data podcast with Hugo Bowne-Anderson

Displaying 2 items of Vanishing Gradients with the tag "evals".

Episode 60: 10 Things I Hate About AI Evals with Hamel Husain

September 30th, 2025 | 1 hr 13 mins

ai, data science, evals, genai, llms, machine learning

Most AI teams find "evals" frustrating, but ML Engineer Hamel Husain argues they’re just using the wrong playbook. In this episode, he lays out a data-centric approach to systematically measure and improve AI, turning unreliable prototypes into robust, production-ready systems.

Episode 50: A Field Guide to Rapidly Improving AI Products -- With Hamel Husain

June 17th, 2025 | Season 1 | 27 mins 42 secs

ai, data science, evals, llms, machine learning

Hugo talks with Hamel Husain (ex-Airbnb, GitHub, DataRobot) about how to improve AI products through evaluation, error analysis, and iteration. They discuss why most teams overlook debugging LLM systems, how to prioritize what to fix, and why evals are not just metrics but a full development process.