Prev play stop Next mute max volume 00:00 00:00 repeat Update Required To play the media you will need to either update your browser to a recent version or update your Flash plugin. Sonnet 4.5 & the AI Plateau Myth — Sholto Douglas (Anthropic) The MAD Podcast with Matt Turck Kho Tổng Hợp 41,799 5 tháng trước Xem video Facebook Tweet XEM MÔ TẢ Sholto Douglas, a key researcher at Anthropic, reveals the breakthroughs behind Claude Sonnet 4.5—the world's leading coding model—and why we might be just 2-3 years from AI matching human-level performance on most computer-facing tasks. You'll discover why RL on language models suddenly started working in 2024, how agents maintain coherency across 30-hour coding sessions through self-correction and memory systems, and why the "bitter lesson" of scale keeps proving clever priors wrong. Sholto shares his path from top-50 world fencer to Google's Gemini team to Anthropic, explaining why great blog posts sometimes matter more than PhDs in AI research. He discusses the culture at big AI labs and why Anthropic is laser-focused on coding (it's the fastest path to both economic impact and AI-assisted AI research). Sholto also discusses how the training pipeline is still "held together by duct tape" with massive room to improve, and why every benchmark created shows continuous rapid progress with no plateau in sight. Bold predictions: individuals will soon manage teams of AI agents working 24/7, robotics is about to experience coding-level breakthroughs, and policymakers should urgently track AI progress on real economic tasks. A clear-eyed look at where AI stands today and where it's headed in the next few years. Anthropic Website - https://www.anthropic.com Twitter - https://x.com/AnthropicAI Sholto Douglas LinkedIn - https://www.linkedin.com/in/sholto Twitter - https://x.com/_sholtodouglas FIRSTMARK Website - https://firstmark.com Twitter - https://twitter.com/FirstMarkCap Matt Turck (Managing Director) LinkedIn - https://www.linkedin.com/in/turck/ Twitter - https://twitter.com/mattturck LISTEN ON: Spotify - https://open.spotify.com/show/7yLATDSaFvgJG80ACcRJtq Apple - https://podcasts.apple.com/us/podcast/the-mad-podcast-with-matt-turck/id1686238724 00:00 - Intro 01:09 - The Rapid Pace of AI Releases at Anthropic 02:49 - Understanding Opus, Sonnet, and Haiku Model Tiers 04:14 - Shelto's Journey: From Australian Fencer to AI Researcher 12:01 - The Growing Pool of AI Talent 16:16 - Breaking Into AI Research Without Traditional Credentials 18:29 - What "Taste" Means in AI Research 23:05 - Moving to Google and Building Gemini's Inference Stack 25:08 - How Anthropic Differs from Other AI Labs 31:46 - Why Anthropic Is Laser-Focused on Coding 36:40 - Inside a 30-Hour Autonomous Coding Session 38:41 - Examples of What AI Can Build in 30 Hours 43:13 - The Breakthroughs That Enabled 30-Hour Runs 46:28 - What's Actually Driving the Performance Gains 47:42 - Pre-Training vs Reinforcement Learning Explained 52:11 - Test-Time Compute and the New Scaling Paradigm 55:55 - Why RL on LLMs Finally Started Working 59:38 - Are We on Track to AGI? 1:02:05 - Why the "Plateau" Narrative Is Wrong 1:03:41 - Sonnet's Performance Across Economic Sectors 1:05:47 - Preparing for a World of 10-100x Individual Leverage Mp3 liên quan 6:24 The Aircrete Building System Aircrete Europe 473,044 view 9 năm trước 11:40 HOW TO INSULATE A CABIN FLOOR AND KEEP IT RODENT FREE OFF GRID HOMESTEADING With The Boss Of The Swamp 946,544 view 7 năm trước 42:02 Đóng ghe rập p26☆cách thợ vô cặp áp ra và bắt dè nước Thanh Điền NTĐ 61,975 view 1 năm trước 18:02 Why do our brains love music? | Dr. John Rehner Iversen | TEDxMcMasterU TEDx Talks 36,715 view 9 tháng trước 34:52 The LAST Anglo-Saxon kings of England | Harold II and the road to 1066 HistoryExtra Podcasts 13,908 view 6 tháng trước 16:21 The 13 Tricks Smokey Yunick Used That NASCAR Never Forgave... REVVED UP REVEALS 181,105 view 5 tháng trước 4:51 CAT 374 Großbagger - 72 Tonnen Einsatzgewicht -Stahlkoloss auf Kette- Autobahnbau- ACHTUNG BAUSTELLE Bryce Media / ffarmer99 81,766 view 1 năm trước 8:54 BUY SMART: The 4 Worst and 3 Best Midsize Pickup Trucks of 2025! Let's Drive 19,537 view 1 năm trước 1:25:56 Văn Luận Hát Rong | 2 Cha Con Hát Rong Có Giọng Hát Cực Hay | Không Còn Nhớ Người Yêu - Văn luận Ca Sĩ Đường Phố 4,048 view 6 tháng trước 12:41 Is the Highway Code Law? | Can Cyclists Run Red Lights? | BlackBeltBarrister BlackBeltBarrister 278,278 view 4 năm trước 45:59 The Mysterious Origins Of The Great Lakes | Naked Science Season 6 Episode 3 Spark 2,791,771 view 1 năm trước 28:47 The CEO Of Ford Made Some Very Interesting Statements… Here’s Our Response! Royalty Auto Service 454,249 view 4 tháng trước 25:03 Amazing Auditions That Will Make Your Jaw DROP! | AGT: Extreme 2022 America's Got Talent 7,857,172 view 3 năm trước 14:37 Why I Bought A Nissan and Not Toyota (Biggest Toyota Fan) OverlandNomad 357,985 view 1 năm trước 47:02 They Spent Millennia on a Sword—Human Fixed It With a Rock in Minutes | Sci-Fi | HFY Stories Galactic Heroes United 55 view 4 tháng trước 17:11 Rendering Series #1 SCRATCH COAT (How To Mix Render/ Application/ Rendering Tips & Tricks) Plastering For Beginners 61,734 view 4 năm trước 4:10:18 The WORST Medieval Plagues You’ve NEVER Heard Of Eternal Mysteries 23,713 view 6 tháng trước 32:32 STORMS & Smoke & STUNNING VIEWS OH MY! | BACKPACKING the WIND RIVER RANGE | Titcomb Basin Catherine Gregory 72,757 view 4 năm trước 53:53 Đời này không có gì là mãi mãi, hãy vui vẻ chấp nhận để sống trọn kiếp người - SC. Thích Nữ Tâm Tâm Phật Pháp Online 9,688 view 1 năm trước 15:22 The Joker Was Always Evil | Batman The Animated Series Serum Lake 326,560 view 1 năm trước Thể loại nhạc