
Content provided by Shimin Zhang & Dan Lasky. All podcast content, including episodes, graphics, and podcast descriptions, is uploaded and provided directly by Shimin Zhang & Dan Lasky or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here: https://no.player.fm/legal.

Claude Opus 4.5, Olmo 3, and a Paper on Diffusion + Auto Regression

47:45
 

In this episode of Artificial Developer Intelligence, hosts Shimin and Dan explore the latest advancements in AI models, including the release of Claude Opus 4.5 and Gemini 3. They discuss the implications of these models for software engineering, the rise of open-source models like Olmo 3, and the enhancements in the Claude Developer Platform. The conversation also covers the challenges of relying on AI for coding tasks, the potential pitfalls of the AI bubble, and the future of written exams in the age of AI.

Takeaways

  • Claude Opus 4.5 sets benchmarks, enhances usability, and reduces token consumption.
  • The introduction of open-source models like Olmo 3 is a significant development in AI.
  • The future of written exams may be challenged by AI's ability to generate human-like responses.
  • Relying too heavily on AI can lead to a lack of critical thinking and problem-solving skills.
  • The AI bubble is at 25 seconds to midnight.
  • Recent research suggests that AI models can improve their performance by emulating query-based search.
  • The importance of prompt engineering in AI interactions is highlighted.

Resources Mentioned
Introducing Claude Opus 4.5
Build with Nano Banana Pro, our Gemini 3 Pro Image model
Andrej Karpathy's Post about Nano Banana Pro
Olmo 3: Charting a path through the model flow to lead open-source AI
Introducing advanced tool use on the Claude Developer Platform
TiDAR: Think in Diffusion, Talk in Autoregression
SSRL: SELF-SEARCH REINFORCEMENT LEARNING
Mira Murati's Thinking Machines seeks $50 billion valuation in funding talks, Bloomberg News reports
Boom, bubble, bust, boom. Why should AI be different?
Nvidia didn’t save the market. What’s next for the AI trade?

Chapters

  • (00:00) - Introduction to Artificial Developer Intelligence
  • (01:25) - Claude Opus 4.5
  • (07:02) - Exploring Gemini 3 and Image Models
  • (11:24) - Olmo 3 and The Rise of Open Flow Models
  • (15:46) - Innovations in AI Tools and Platforms
  • (19:33) - Research Insights: Diffusion and Auto-Regression Models
  • (23:39) - Advancements in AI Output Efficiency
  • (25:45) - Exploring Self Search Reinforcement Learning
  • (27:48) - The Dilemma of Language Models
  • (30:11) - Prompt Engineering and Search Integration
  • (32:55) - Dan's Rants on AI Limitations
  • (38:17) - 2 Minutes to Midnight
  • (46:41) - Outro

Connect with ADIPod

