AI Engineering with Chip Huyen
On today’s episode of The Pragmatic Engineer, I’m joined by Chip Huyen, a computer scientist, author of the freshly published O’Reilly book AI Engineering, and an expert in applied machine learning.
Before we start: a rare giveaway! To celebrate the podcast’s YouTube channel crossing 100K subscribers, I’m doing a giveaway of 100 books (10 of them signed and shipped) and 100 newsletter subscriptions. To enter, just subscribe to the podcast on the platform you use, leave a star review, and fill out this form, indicating you’d like to participate. The giveaway closes in 72 hours. More details here.

Stream the Latest Episode
Available now on YouTube, Apple and Spotify. See the episode transcript at the top of this page, and a summary at the bottom.

Brought to You By
• Swarmia: The engineering intelligence platform for modern software organizations.
• Graphite: The AI developer productivity platform.
• Vanta: Automate compliance and simplify security with Vanta.

In This Episode
Chip Huyen has worked as a researcher at Netflix, was a core developer at NVIDIA (building NeMo, NVIDIA’s GenAI framework), and co-founded Claypot AI. She also taught Machine Learning at Stanford University.
In this conversation, we dive into the evolving field of AI Engineering and explore key insights from Chip’s book, including:
• How AI Engineering differs from Machine Learning Engineering
• Why fine-tuning is usually not a tactic you’ll want (or need) to use
• The spectrum of solutions to customer support problems, some not even involving AI!
• The challenges of LLM evals (evaluations)
• Why project-based learning is valuable, but even better when paired with structured learning
• Exciting potential use cases for AI in education and entertainment
• And more!

Takeaways
My biggest takeaways from this episode:
1. “AI engineering” feels closer to software engineering than to ML engineering. The term is very new, and the discipline evolved from ML engineering. A big difference is that, because LLMs are easy to use (both via APIs and locally), AI engineering is much more about building a product first and only later getting around to tweaking the model itself. ML engineering was the other way around: spend a lot of time building a model, then build an application on top of it.
2. To get good at AI engineering, focus on the basics. Understand what an LLM is (and how it works), how to evaluate one, how to use RAG, what fine-tuning is, and how to optimize inference. All of these techniques are foundational and will remain important in a few years’ time as well. Chip’s book AI Engineering covers all these topics.
3. “The more we want to not miss out on things, the more things we will miss.” When I asked Chip what she thinks about the fear of missing out (FOMO) in AI, she said it’s ironic that when you worry about missing out and try to keep up with everything new, you stay shallow! Chip doesn’t read news; she chooses to focus on deep research or learning instead. If something is important today, it will still be important next week, after she has finished learning whatever she was focused on.
4. Solving the problem is more important than using the latest AI tools. Amusingly, a lot of teams miss this part and build overcomplicated AI solutions that do practically nothing for the business.
Before the interview, I started going through Chip’s newly published book, AI Engineering. It feels to me that this book will become a desk reference, and I recommend it.

The Pragmatic Engineer deepdives relevant for this episode
• Applied AI Software Engineering: RAG
• How do AI software engineering agents work?
• AI Tooling for Software Engineers in 2024: Reality Check
• IDEs with GenAI features that Software Engineers love

Timestamps
(00:00) Intro
(01:31) A quick overview of AI Engineering
(06:45) How Chip ensured her book stays current amidst the rapid advancements in AI
(11:35) A definition of AI Engineering and how it differs from Machine Learning Engineering
(18:15) Simple first steps in building AI applications
(24:38) An explanation of BM25 (retrieval system)
(25:28) The problems associated with fine-tuning
(29:40) Simple customer support solutions for rolling out AI thoughtfully
(35:29) Chip’s thoughts on staying focused on the problem
(37:04) The challenge in evaluating AI systems
(40:03) Use cases in evaluating AI
(43:09) The importance of prioritizing users’ needs and experience
(48:09) Common mistakes made with Gen AI
(53:57) A case for systematic problem solving
(54:57) Project-based learning vs. structured learning
(1:00:07) Why AI is not the end of engineering
(1:04:56) How AI is helping education and the future use cases we might see
(1:08:58) Rapid fire round

A summary of the conversation
What is AI engineering?
Ways to build AI applications
Practical approaches and avoiding pitfalls
Evaluating AI systems
Learning AI Engineering
Resources & Mentions
Where to find Chip Huyen:
• X: https://x.com/chipro
• LinkedIn: https://www.linkedin.com/in/chiphuyen/
• Website: https://huyenchip.com/
Mentions during the episode:
• AI Engineering: Building Applications with Foundation Models: https://www.amazon.com/AI-Engineering-Building-Applications-Foundation/dp/1098166302/r
• DeepSeek: https://www.deepseek.com/
• Multimodality and Large Multimodal Models (LMMs): https://huyenchip.com/2023/10/10/multimodal.html
• BM25: https://en.wikipedia.org/wiki/Okapi_BM25
• Microsoft Bot Framework: https://dev.botframework.com/
• Python: https://www.python.org/
• JavaScript: https://www.javascript.com/
• ChatGPT: https://chatgpt.com/
• Claude: https://claude.ai/
• Llama: https://www.llama.com/
• Complex Adaptive Systems: An Introduction to Computational Models of Social Life (Princeton Studies in Complexity): https://www.amazon.com/Complex-Adaptive-Systems-Introduction-Computational/dp/0691127026/
• The Selfish Gene: https://www.amazon.com/Selfish-Gene-Anniversary-Landmark-Science/dp/0198788606/
• Antifragile: Things That Gain from Disorder: https://www.amazon.com/dp/0812979680/

Production and marketing by Pen Name. For inquiries about sponsoring the podcast, email podcast@pragmaticengineer.com.