Vanishing Gradients

a data podcast with hugo bowne-anderson

About the show

A podcast about all things data, brought to you by data scientist Hugo Bowne-Anderson.

It's time for more critical conversations about the challenges in our industry in order to build better compasses for the solution space! To this end, this podcast will consist of long-format conversations between Hugo and other people who work broadly in the data science, machine learning, and AI spaces. We'll dive deep into all the moving parts of the data world, so if you're new to the space, you'll have an opportunity to learn from the experts. And if you've been around for a while, you'll find out what's happening in many other parts of the data world.

Episodes

  • Poster Image

    Episode 39: From Models to Products: Bridging Research and Practice in Generative AI at Google Labs

    November 26th, 2024  |  Season 1  |  1 hr 43 mins
    ai, data science, llms, machine learning

    From building rockets at SpaceX to advancing generative AI at Google Labs, Ravin Kumar has carved a unique path through the world of technology. In this episode, we explore how to build scalable, reliable AI systems, the skills needed to work across the AI/ML pipeline, and the real-world impact of tools like open-weight models such as Gemma. Ravin also shares insights into designing AI tools like Notebook LM with the user journey at the forefront.

  • Poster Image

    Episode 38: The Art of Freelance AI Consulting and Products: Data, Dollars, and Deliverables

    November 5th, 2024  |  Season 1  |  1 hr 23 mins
    consulting, data science, genai, llms, machine learning

    Hugo speaks with Jason Liu, an independent AI consultant with experience at Meta and Stitch Fix. At Stitch Fix, Jason developed impactful AI systems, like a $50 million product similarity search and the widely adopted Flight recommendation framework. Now, he helps startups and enterprises design and deploy production-level AI applications, with a focus on retrieval-augmented generation (RAG) and scalable solutions.

  • Poster Image

    Episode 37: Prompt Engineering, Security in Generative AI, and the Future of AI Research Part 2

    October 8th, 2024  |  Season 1  |  50 mins 36 secs
    data science, genai, llms, machine learning, nlp, prompt engineering

    Hugo speaks with three leading figures from the world of AI research: Sander Schulhoff, a recent University of Maryland graduate and lead contributor to the Learn Prompting initiative; Philip Resnik, professor at the University of Maryland, known for his pioneering work in computational linguistics; and Dennis Peskoff, a researcher from Princeton specializing in prompt engineering and its applications in the social sciences.

  • Poster Image

    Episode 36: Prompt Engineering, Security in Generative AI, and the Future of AI Research Part 1

    September 30th, 2024  |  Season 1  |  1 hr 3 mins
    data science, genai, llms, machine learning, nlp, prompt engineering

    Hugo speaks with three leading figures from the world of AI research: Sander Schulhoff, a recent University of Maryland graduate and lead contributor to the Learn Prompting initiative; Philip Resnik, professor at the University of Maryland, known for his pioneering work in computational linguistics; and Dennis Peskoff, a researcher from Princeton specializing in prompt engineering and its applications in the social sciences.

  • Poster Image

    Episode 35: Open Science at NASA -- Measuring Impact and the Future of AI

    September 19th, 2024  |  Season 1  |  58 mins 13 secs
    data science, genai, llms, machine learning

    Hugo speaks with Dr. Chelle Gentemann, Open Science Program Scientist for NASA’s Office of the Chief Science Data Officer, about NASA’s ambitious efforts to integrate AI across the research lifecycle. In this episode, we’ll dive deeper into how AI is transforming NASA’s approach to science, making data more accessible and advancing open science practices.

  • Poster Image

    Episode 34: The AI Revolution Will Not Be Monopolized

    August 22nd, 2024  |  Season 1  |  1 hr 42 mins
    data science, genai, llms, machine learning, nlp

    Hugo speaks with Ines Montani and Matthew Honnibal, the creators of spaCy and founders of Explosion AI. Collectively, they've had a huge impact on the fields of industrial natural language processing (NLP), ML, and AI through their widely-used open-source library spaCy and their innovative annotation tool Prodigy.

  • Poster Image

    Episode 33: What We Learned Teaching LLMs to 1,000s of Data Scientists

    August 12th, 2024  |  Season 1  |  1 hr 25 mins

    Hugo speaks with Dan Becker and Hamel Husain, two veterans in the world of data science, machine learning, and AI education. Collectively, they’ve worked at Google, DataRobot, Airbnb, Github (where Hamel built out the pre-cursor to copilot and more). And they both currently work as independent LLM and Generative AI consultants.

    Dan and Hamel recently taught a course on fine-tuning large language models that evolved into a full-fledged conference, attracting over 2,000 participants.

    In this episode, we dive deep into their experience and the unique insights it gave them into the current state and future of AI education and application.

  • Poster Image

    Episode 32: Building Reliable and Robust ML/AI Pipelines

    July 27th, 2024  |  Season 1  |  1 hr 15 mins
    data science, genai, llms, machine learning

    Hugo speaks with Shreya Shankar, a researcher at UC Berkeley focusing on data management systems with a human-centered approach. Shreya's work is at the cutting edge of human-computer interaction (HCI) and AI, particularly in the realm of large language models (LLMs). Her impressive background includes being the first ML engineer at Viaduct, doing research engineering at Google Brain, and software engineering at Facebook.

  • Poster Image

    Episode 31: Rethinking Data Science, Machine Learning, and AI

    July 9th, 2024  |  Season 1  |  1 hr 36 mins
    data science, genai, llms, machine learning

    Hugo speaks with Vincent Warmerdam, a senior data professional and machine learning engineer at :probabl, the exclusive brand operator of scikit-learn. Vincent is known for challenging common assumptions and exploring innovative approaches in data science and machine learning.

  • Poster Image

    Episode 30: Lessons from a Year of Building with LLMs (Part 2)

    June 26th, 2024  |  Season 1  |  1 hr 15 mins

    Hugo speaks about Lessons Learned from a Year of Building with LLMs with Eugene Yan from Amazon, Bryan Bischof from Hex, Charles Frye from Modal, Hamel Husain from Parlance Labs, and Shreya Shankar from UC Berkeley (Part 2).

  • Poster Image

    Episode 29: Lessons from a Year of Building with LLMs (Part 1)

    June 26th, 2024  |  Season 1  |  1 hr 30 mins

    Hugo speaks about Lessons Learned from a Year of Building with LLMs with Eugene Yan from Amazon, Bryan Bischof from Hex, Charles Frye from Modal, Hamel Husain from Parlance Labs, and Shreya Shankar from UC Berkeley (Part 1).

  • Poster Image

    Episode 28: Beyond Supervised Learning: The Rise of In-Context Learning with LLMs

    June 10th, 2024  |  Season 1  |  1 hr 5 mins
    data science, genai, llms, machine learning

    Hugo speaks with Alan Nichol, co-founder and CTO of Rasa, where they build software to enable developers to create enterprise-grade conversational AI and chatbot systems across industries like telcos, healthcare, fintech, and government.

  • Poster Image

    Episode 27: How to Build Terrible AI Systems

    May 31st, 2024  |  Season 1  |  1 hr 32 mins
    data science, genai, llms, machine learning

    Hugo speaks with Jason Liu, an independent consultant who uses his expertise in recommendation systems to help fast-growing startups build out their RAG applications. He was previously at Meta and Stitch Fix is also the creator of Instructor, Flight, and an ML and data science educator.

  • Poster Image

    Episode 26: Developing and Training LLMs From Scratch

    May 15th, 2024  |  Season 1  |  1 hr 51 mins
    data science, genai, machine learning

    Hugo speaks with Sebastian Raschka, a machine learning & AI researcher, programmer, and author.They’ll tell you everything you need to know about LLMs, but were too afraid to ask: from covering the entire LLM lifecycle, what type of skills you need to work with them, what type of resources and hardware, prompt engineering vs fine-tuning vs RAG, how to build an LLM from scratch, and much more.

  • Poster Image

    Episode 25: Fully Reproducible ML & AI Workflows

    March 18th, 2024  |  Season 1  |  1 hr 20 mins
    data science, genai, machine learning

    Hugo speaks with Omoju Miller, a machine learning guru and founder and CEO of Fimio, where she is building 21st century dev tooling.

  • Poster Image

    Episode 24: LLM and GenAI Accessibility

    February 27th, 2024  |  Season 1  |  1 hr 30 mins
    data science, genai, machine learning

    Hugo speaks with Johno Whitaker, a Data Scientist/AI Researcher doing R&D with answer.ai, about where we’ve come from regarding tooling and accessibility for foundation models, ML, and AI, where we are, and where we’re going.