『Last Week in AI』のカバーアート

Last Week in AI

Last Week in AI

著者: Skynet Today
無料で聴く

このコンテンツについて

Weekly summaries of the AI news that matters!Copyright 2024 All rights reserved. 政治・政府
エピソード
  • #212 - o3 pro, Cursor 1.0, ProRL, Midjourney Sued
    2025/06/17
    Our 212th episode with a summary and discussion of last week's big AI news! Recorded on 06/33/2025 Hosted by Andrey Kurenkov and Jeremie Harris. Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai Read out our text newsletter and comment on the podcast at https://lastweekin.ai/. In this episode: OpenAI introduces O3 PRO for ChatGPT, highlighting significant improvements in performance and cost-efficiency.Anthropic sees an influx of talent from OpenAI and DeepMind, with significantly higher retention rates and competitive advantages in AI capabilities.New research indicates that reinforcing negative responses in LLMs significantly improves performance across all metrics, highlighting novel approaches in reinforcement learning.A security flaw in Microsoft Copilot demonstrates the growing risk of AI agents being hacked, emphasizing the need for robust protection against zero-click attacks. Timestamps + Links: (00:00:11) Intro / Banter(00:01:31) News Preview(00:02:46) Response to Listener ReviewsTools & Apps(00:04:48) OpenAI adds o3 Pro to ChatGPT and drops o3 price by 80 per cent, but open-source AI is delayed(00:09:10) Cursor AI editor hits 1.0 milestone, including BugBot and high-risk background agents(00:13:07) Mistral releases a pair of AI reasoning models(00:16:18) Elevenlabs' Eleven v3 lets AI voices whisper, laugh and express emotions naturally(00:19:00) ByteDance's Seedance 1.0 is trading blows with Google's Veo 3(00:22:42) Google Reveals $20 AI Pro Plan With Veo 3 Fast Video Generator For Budget Creators Applications & Business(00:25:42) OpenAI and DeepMind are losing engineers to Anthropic in a one-sided talent war(00:34:32) OpenAI slams court order to save all ChatGPT logs, including deleted chats(00:37:24) Nvidia’s Biggest Chinese Rival Huawei Struggles to Win at Home(00:43:06) Huawei Expected to Break Semiconductor Barriers with Development of High-End 3nm GAA Chips; Tape-Out by 2026(00:45:21) TSMC’s 1.4nm Process, Also Called Angstrom, Will Make Even The Most Lucrative Clients Think Twice When Placing Orders, With An Estimate Claiming That Each Wafer Will Cost $45,000(00:47:43) Mistral AI Launches Mistral Compute To Replace Cloud Providers from US, China Projects & Open Source(00:51:26) ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models Research & Advancements(00:57:27) Kinetics: Rethinking Test-Time Scaling Laws(01:05:12) The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning(01:10:45) Predicting Empirical AI Research Outcomes with Language Models(01:15:02) EXP-Bench: Can AI Conduct AI Research Experiments? Policy & Safety(01:20:07) Large Language Models Often Know When They Are Being Evaluated(01:24:56) Beyond Induction Heads: In-Context Meta Learning Induces Multi-Phase Circuit Emergence(01:31:16) Exclusive: New Microsoft Copilot flaw signals broader risk of AI agents being hacked—‘I would be terrified’(01:35:01) Claude Gov Models for U.S. National Security Customers Synthetic Media & Art(01:37:32) Disney And NBCUniversal Sue AI Company Midjourney For Copyright Infringement(01:40:39) AMC Networks is teaming up with AI company Runway
    続きを読む 一部表示
    1 時間 46 分
  • #211 - Claude Voice, Flux Kontext, wrong RL research?
    2025/06/03
    Our 211th episode with a summary and discussion of last week's big AI news! Recorded on 05/31/2025 Hosted by Andrey Kurenkov and Jeremie Harris. Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai Read out our text newsletter and comment on the podcast at https://lastweekin.ai/. Join our Discord here! https://discord.gg/nTyezGSKwP In this episode: Recent AI podcast covers significant AI news: startups, new tools, applications, investments in hardware, and research advancements.Discussions include the introduction of various new tools and applications such as Flux's new image generating models and Perplexity's new spreadsheet and dashboard functionalities.A notable segment focuses on OpenAI's partnership with the UAE and discussions on potential legislation aiming to prevent states from regulating AI for a decade.Concerns around model behaviors and safety are discussed, highlighting incidents like Claude Opus 4's blackmail attempt and Palisade Research's tests showing AI models bypassing shutdown commands. Timestamps + Links: (00:00:10) Intro / Banter(00:01:39) News Preview(00:02:50) Response to Listener Comments Tools & Apps (00:07:10) Anthropic launches a voice mode for Claude(00:10:35) Black Forest Labs’ Kontext AI models can edit pics as well as generate them(00:15:30) Perplexity’s new tool can generate spreadsheets, dashboards, and more(00:18:43) xAI to pay Telegram $300M to integrate Grok into the chat app(00:22:42) Opera’s new AI browser promises to write code while you sleep(00:24:17) Google Photos debuts redesigned editor with new AI tools Applications & Business (00:25:13) Top Chinese memory maker expected to abandon DDR4 manufacturing at the behest of Beijing(00:30:04) Oracle to Buy $40 Billion Worth of Nvidia Chips for First Stargate Data Center(00:31:47) UAE makes ChatGPT Plus subscription free for all residents as part of deal with OpenAI(00:35:34) NVIDIA Corporation (NVDA) to Launch Cheaper Blackwell AI Chip for China, Says Report(00:38:39) The New York Times and Amazon ink AI licensing deal Projects & Open Source (00:41:11) DeepSeek’s distilled new R1 AI model can run on a single GPU(00:45:19) Google Unveils SignGemma, an AI Model That Can Translate Sign Language Into Spoken Text(00:47:08) Open-sourcing circuit tracing tools(00:49:42) Hugging Face unveils two new humanoid robots Research & Advancements (00:52:33) PANGU PRO MOE: MIXTURE OF GROUPED EXPERTS FOR EFFICIENT SPARSITY(00:58:55) DataRater: Meta-Learned Dataset Curation(01:05:05) Incorrect Baseline Evaluations Call into Question Recent LLM-RL Claims (01:10:17) Maximizing Confidence Alone Improves Reasoning(01:11:00) Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic Confidence(01:11:44) One RL to See Them All(01:15:05) Efficient Reinforcement Finetuning via Adaptive Curriculum Learning Policy & Safety (01:17:58) Trump's 'Big Beautiful Bill' could ban states from regulating AI for a decade(01:24:31) Researchers claim ChatGPT o3 bypassed shutdown in controlled test(01:30:10) Anthropic’s new AI model turns to blackmail when engineers try to take it offline(01:31:09) Anthropic Faces Backlash As Claude 4 Opus Can Autonomously Alert Authorities(01:35:37) Claude helps users make bioweapons(01:35:49) The Claude 4 System Card is a Wild Read
    続きを読む 一部表示
    1 時間 38 分
  • #210 - Claude 4, Google I/O 2025, OpenAI+io, Gemini Diffusion
    2025/05/26
    Our 210th episode with a summary and discussion of last week's big AI news! Recorded on 05/23/2025 Hosted by Andrey Kurenkov and Jeremie Harris. Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai Read out our text newsletter and comment on the podcast at https://lastweekin.ai/. Join our Discord here! https://discord.gg/nTyezGSKwP In this episode: Google's Gemini diffusion technology showcases significant improvements in speed and efficiency for generating text, potentially revolutionizing the auto-regressive generation paradigm.Anthropic activates AI Safety Level 3 protections for Claude Opus 4, implementing robust measures such as bug bounties, synthetic jailbreak data, and preliminary egress bandwidth controls to mitigate bio-risk threats.OpenAI responds to the California Attorney General, refuting claims by the not-for-private-gain coalition and defending their controversial restructuring plans amidst ongoing criticism.Mistral delays the release of its Llama 4 Behemoth model due to training challenges, while Meta faces similar obstacles in rolling out its large-scale AI models, signaling difficulties in reaching frontier level performance. Timestamps + Links: (00:00:00) Intro / Banter(00:01:43) News PreviewTools & Apps (00:02:58) Anthropic’s new Claude 4 AI models can reason over many steps (00:09:58) Google Unveils A.I. Chatbot, Signaling a New Era for Search (00:14:04) Google rolls out Project Mariner, its web-browsing AI agent (00:16:40) Veo 3 can generate videos — and soundtracks to go along with them (00:21:26) Imagen 4 is Google’s newest AI image generator (00:23:15) Google Meet is getting real-time speech translation (00:25:36) Google’s new Jules AI agent will help developers fix buggy code (00:26:43) GitHub’s new AI coding agent can fix bugs for you (00:28:50) Mistral’s new Devstral model was designed for codingApplications & Business (00:29:53) OpenAI Unites With Jony Ive in $6.5 Billion Deal to Create A.I. Devices (00:36:10) OpenAI’s planned data center in Abu Dhabi would be bigger than Monaco (00:41:18) LM Arena, the organization behind popular AI leaderboards, lands $100M (00:45:21) Nvidia CEO says next chip after H20 for China won't be from Hopper series (00:46:39) Google’s Gemini AI app has 400M monthly active users (00:51:15) AI Servers: End demand intact, but rising gap between upstream build and system production (2025.5.18) Projects & Open Source (00:53:46) Meta Is Delaying the Rollout of Its Flagship AI ModelResearch & Advancements (00:57:53) Gemini Diffusion (01:03:07) Chain-of-Model Learning for Language Model (01:09:16) Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space (01:15:38) Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training (01:20:16) Lessons from Defending Gemini Against Indirect Prompt Injections (01:23:35) How Fast Can Algorithms Advance Capabilities? (01:30:20) Reinforcement Learning Finetunes Small Subnetworks in Large Language ModelsPolicy & Safety(01:31:12) Exclusive: What OpenAI Told California's Attorney General(01:38:25) Activating AI Safety Level 3 Protections
    続きを読む 一部表示
    1 時間 45 分

Last Week in AIに寄せられたリスナーの声

カスタマーレビュー:以下のタブを選択することで、他のサイトのレビューをご覧になれます。