Sonnet 4.5 & the AI Plateau Myth — Sholto Douglas (Anthropic) The MAD Podcast with Matt Turck Kho Tổng Hợp 41,816 8 tháng trước Add Nghe mp3 Facebook Tweet XEM MÔ TẢ Sholto Douglas, a key researcher at Anthropic, reveals the breakthroughs behind Claude Sonnet 4.5—the world's leading coding model—and why we might be just 2-3 years from AI matching human-level performance on most computer-facing tasks. You'll discover why RL on language models suddenly started working in 2024, how agents maintain coherency across 30-hour coding sessions through self-correction and memory systems, and why the "bitter lesson" of scale keeps proving clever priors wrong. Sholto shares his path from top-50 world fencer to Google's Gemini team to Anthropic, explaining why great blog posts sometimes matter more than PhDs in AI research. He discusses the culture at big AI labs and why Anthropic is laser-focused on coding (it's the fastest path to both economic impact and AI-assisted AI research). Sholto also discusses how the training pipeline is still "held together by duct tape" with massive room to improve, and why every benchmark created shows continuous rapid progress with no plateau in sight. Bold predictions: individuals will soon manage teams of AI agents working 24/7, robotics is about to experience coding-level breakthroughs, and policymakers should urgently track AI progress on real economic tasks. A clear-eyed look at where AI stands today and where it's headed in the next few years. Anthropic Website - https://www.anthropic.com Twitter - https://x.com/AnthropicAI Sholto Douglas LinkedIn - https://www.linkedin.com/in/sholto Twitter - https://x.com/_sholtodouglas FIRSTMARK Website - https://firstmark.com Twitter - https://twitter.com/FirstMarkCap Matt Turck (Managing Director) LinkedIn - https://www.linkedin.com/in/turck/ Twitter - https://twitter.com/mattturck LISTEN ON: Spotify - https://open.spotify.com/show/7yLATDSaFvgJG80ACcRJtq Apple - https://podcasts.apple.com/us/podcast/the-mad-podcast-with-matt-turck/id1686238724 00:00 - Intro 01:09 - The Rapid Pace of AI Releases at Anthropic 02:49 - Understanding Opus, Sonnet, and Haiku Model Tiers 04:14 - Shelto's Journey: From Australian Fencer to AI Researcher 12:01 - The Growing Pool of AI Talent 16:16 - Breaking Into AI Research Without Traditional Credentials 18:29 - What "Taste" Means in AI Research 23:05 - Moving to Google and Building Gemini's Inference Stack 25:08 - How Anthropic Differs from Other AI Labs 31:46 - Why Anthropic Is Laser-Focused on Coding 36:40 - Inside a 30-Hour Autonomous Coding Session 38:41 - Examples of What AI Can Build in 30 Hours 43:13 - The Breakthroughs That Enabled 30-Hour Runs 46:28 - What's Actually Driving the Performance Gains 47:42 - Pre-Training vs Reinforcement Learning Explained 52:11 - Test-Time Compute and the New Scaling Paradigm 55:55 - Why RL on LLMs Finally Started Working 59:38 - Are We on Track to AGI? 1:02:05 - Why the "Plateau" Narrative Is Wrong 1:03:41 - Sonnet's Performance Across Economic Sectors 1:05:47 - Preparing for a World of 10-100x Individual Leverage Video liên quan 33:57 Hunter Pauley’s Basecamp Setup | Vecel Outdoors Truck Tour Vecel Outdoors 36 view 7 tháng trước Add 1:19:55 Pháp Thoại Mới Đăng Ngày 05. 11. 2025 - Thầy Thích Pháp Hòa Tu Viện Trúc Lâm Thượng Tọa Thích Pháp Hòa 38 view 7 tháng trước Add 29:27 💥Йому скидали ліки, щоб не помер. Пакет підписували "руzкому". Він плакав | Невигадані історії 5 канал 31 view 7 tháng trước Add 59:50 300 Graves Revealed Beneath This Cemetery—Unbelievable Discovery! Sidestep: Adventures Into History 34 view 7 tháng trước Add 7:13 2024 Host Mammoth Truck Camper The RV Guy 34 view 7 tháng trước Add 1:42:03 Michael Bolton, Foreigner, Phil Collins, Lionel Richie, Elton John 🎧 Grandes Éxitos 70s, 80s, 90s Grandes Éxitos 80s 90s 36 view 7 tháng trước Add 53:36 20-Year-Old Manipulator Realizes She's Going To Prison Judge Williams 33 view 7 tháng trước Add 1:30:42 FULL EPISODE | "You Look Like Human Shrek" | Big Fat Quiz of The Year 2022 The Big Fat Quiz Channel 57 view 7 tháng trước Add 36:09 Tutorial 58. Hard sudoku puzzle with lots of techniques. Sudoku Guy 38 view 7 tháng trước Add 1:49:29 Chinese Dj 2025 - 25 首中国 DJ Remix 歌曲将让你动起来 💥【拥抱你离去 ♪ 情火 ♪ 公蝦米 ♪ 怎麼愛都愛不夠 ...】年最劲爆的DJ歌曲 👍 2025夜店舞曲 重低音 最火歌曲chinese dj remix 36 view 7 tháng trước Add 36:12 Hat Friedrich Merz sein neues Kabinett richtig besetzt? | Markus Lanz vom 29. April 2025 ZDFheute Nachrichten 41 view 7 tháng trước Add 25:39 유럽 최후의 독재국가, 벨라루스 서재로36 19 view 7 tháng trước Add 18:00 JOHNNY CASH interrupted Elvis mid-song – what happened next shocked 40,000 people Elvis Presley: Behind the Legend 39 view 7 tháng trước Add 4:40 Elvis Presley Home Movies - RARE FOOTAGE at Hillcrest Home Terry Stephenson Music 39 view 7 tháng trước Add 7:59 Muhammad Ali vs Women 1979 - Rare Interview Afud101 43 view 7 tháng trước Add 8:12 Whitney Houston talks about Elvis in surprising, rare interview Elvis News Examiner 40 view 7 tháng trước Add 16:38 Żniwa W STARYM STYLU | Snopowiązałka Warta 2 | Kombajn Claas SF | Prasa Welger AP50 RetroTRAKTOR 33 view 7 tháng trước Add 9:15 Jak wyglądają żniwa Vistulą z 1968 roku? Rolnicy Z Kujawskiej Dzielnicy 42 view 7 tháng trước Add 1:26:59 7 True Small Town Horror Stories | "We Should’ve Never Moved There" 😱 Nightmare Grimroot 42 view 7 tháng trước Add 32:52 Not everyone knows this secret! Boil CD disc and see what most people don't even imagine happens! Creation Tips 35 view 7 tháng trước Add