Prev play stop Next mute max volume 00:00 00:00 repeat Update Required To play the media you will need to either update your browser to a recent version or update your Flash plugin. Sonnet 4.5 & the AI Plateau Myth — Sholto Douglas (Anthropic) The MAD Podcast with Matt Turck Kho Tổng Hợp 41,790 3 tháng trước Xem video Facebook Tweet XEM MÔ TẢ Sholto Douglas, a key researcher at Anthropic, reveals the breakthroughs behind Claude Sonnet 4.5—the world's leading coding model—and why we might be just 2-3 years from AI matching human-level performance on most computer-facing tasks. You'll discover why RL on language models suddenly started working in 2024, how agents maintain coherency across 30-hour coding sessions through self-correction and memory systems, and why the "bitter lesson" of scale keeps proving clever priors wrong. Sholto shares his path from top-50 world fencer to Google's Gemini team to Anthropic, explaining why great blog posts sometimes matter more than PhDs in AI research. He discusses the culture at big AI labs and why Anthropic is laser-focused on coding (it's the fastest path to both economic impact and AI-assisted AI research). Sholto also discusses how the training pipeline is still "held together by duct tape" with massive room to improve, and why every benchmark created shows continuous rapid progress with no plateau in sight. Bold predictions: individuals will soon manage teams of AI agents working 24/7, robotics is about to experience coding-level breakthroughs, and policymakers should urgently track AI progress on real economic tasks. A clear-eyed look at where AI stands today and where it's headed in the next few years. Anthropic Website - https://www.anthropic.com Twitter - https://x.com/AnthropicAI Sholto Douglas LinkedIn - https://www.linkedin.com/in/sholto Twitter - https://x.com/_sholtodouglas FIRSTMARK Website - https://firstmark.com Twitter - https://twitter.com/FirstMarkCap Matt Turck (Managing Director) LinkedIn - https://www.linkedin.com/in/turck/ Twitter - https://twitter.com/mattturck LISTEN ON: Spotify - https://open.spotify.com/show/7yLATDSaFvgJG80ACcRJtq Apple - https://podcasts.apple.com/us/podcast/the-mad-podcast-with-matt-turck/id1686238724 00:00 - Intro 01:09 - The Rapid Pace of AI Releases at Anthropic 02:49 - Understanding Opus, Sonnet, and Haiku Model Tiers 04:14 - Shelto's Journey: From Australian Fencer to AI Researcher 12:01 - The Growing Pool of AI Talent 16:16 - Breaking Into AI Research Without Traditional Credentials 18:29 - What "Taste" Means in AI Research 23:05 - Moving to Google and Building Gemini's Inference Stack 25:08 - How Anthropic Differs from Other AI Labs 31:46 - Why Anthropic Is Laser-Focused on Coding 36:40 - Inside a 30-Hour Autonomous Coding Session 38:41 - Examples of What AI Can Build in 30 Hours 43:13 - The Breakthroughs That Enabled 30-Hour Runs 46:28 - What's Actually Driving the Performance Gains 47:42 - Pre-Training vs Reinforcement Learning Explained 52:11 - Test-Time Compute and the New Scaling Paradigm 55:55 - Why RL on LLMs Finally Started Working 59:38 - Are We on Track to AGI? 1:02:05 - Why the "Plateau" Narrative Is Wrong 1:03:41 - Sonnet's Performance Across Economic Sectors 1:05:47 - Preparing for a World of 10-100x Individual Leverage Mp3 liên quan 31:39 I Got FIRED at 6:30 AM for Attitude—By Noon I Signed with Their Competitor for Double Office Revenge 7,118 view 2 tháng trước 1:21:48 Three Men Attacked A Mafia Boss In A Restaurant — Then Waitress’s Hidden Skill Changed Everything Mafia Boss Stories 5,659 view 2 tháng trước 52:19 Baltimore’s IG Murder Crew That Caught 40 Bodies & Made Killing a Requirement Codeside 32,581 view 3 tháng trước 1:02:51 There Is A Dogman War Coming, And We Will Kill Them All Dr. Whisper 10,375 view 2 tháng trước 21:01 W5: Frances best-kept secret in North America Official W5 1,161,839 view 7 năm trước 1:16:24 Wie Schnell Können Wir Derzeit im Weltraum Reisen? | Dokumentation zum Einschlafen Universum Mysterien 1,105 view 2 tháng trước 31:23 Trung Quốc Đụng Độ Mỹ – Philippines Trên Biển Đông, Cuộc Chiến Căng Như Dây Đàn Quân Sự Tối Mật 93,444 view 2 tháng trước 3:56:44 full video, The kind man gave the girl an abandoned house and set up a tent by the river. Lý Phúc Ca 200,711 view 2 tháng trước 30:11 Grouse Camp 2025, Day 7 (October 12, 2025) American Falconry 1,181 view 2 tháng trước 13:08 LADA NIVA bei LADA-EMS (Doc Slawa) abholen! @Kaynert ist SCHULD! IGARJOK 5,224 view 11 tháng trước 2:09:03 Laissez-vous guider - Le Paris du Moyen-Age - Reconstitution historique 3D - MG Notre Histoire 1,706,727 view 2 năm trước 31:32 He Sold Vegetables, Now Runs a ₹750 Cr Real Estate Empire | Stories From Bharat E51 | Curly Tales Curly Tales 436,407 view 7 tháng trước 35:22 Watch this before buying Laptop | Best Laptops for all Students under Rs 30K to Rs 1lakh | 2024-25 Aman Dhattarwal 1,202,621 view 1 năm trước 2:00:47 Spanish Deep House Mix Vol. 3 | SOLWAV SOLWAV 27,419 view 2 tháng trước 36:50 70+ ans ? 4 vitamines plus efficaces que le magnésium pour reconstruire les muscles UNIQUEMENT DES VITAMINES 1 3,874 view 2 tháng trước 53:56 The Emperor Who Couldn’t Stop 1,000 Rabbits Facinati 43 view 2 tháng trước 11:39 Spark Autumn Love #30dscbl19 day 29 and #Countefeitkitchallenge Amanda Creates 53 view 2 tháng trước 39:26 Building on 5,000 Years of Innovation — Sanjeev Sanyal on India’s Shipbuilding Revival Observer Research Foundation 41,873 view 2 tháng trước 38:15 Knit Kind, Episode 60: Shawls, and FOs, and WIPs, Oh My! Knit Kind with Erika Field, Grapefruit & Gardenia 1,013 view 2 tháng trước 1:01:17 We BINGED ALL of Dungeon Soup...Season 1 Sorta Stupid 156,761 view 2 tháng trước Thể loại nhạc