AI Updates on 2025-10-06

AI Model Announcements

  • At DevDay, OpenAI announces that GPT-5 Pro and Sora 2 are both available in the API starting today @AndrewCurran_
  • OpenAI launches AgentKit, a complete set of building blocks for developers to build, deploy and optimize agent workflows with visual builder, evals, and guardrails @gdb
  • OpenAI introduces Apps in ChatGPT, allowing users to chat with apps like Canva, Booking.com, Spotify, and Figma directly within conversations @OpenAI
  • OpenAI makes Codex generally available with a new SDK and enterprise features, demonstrated through live vibe coding, including a voice interface @gdb
  • Anthropic releases Petri, an open-source automated auditing tool for testing AI models across diverse scenarios for behaviors like sycophancy and deception @AnthropicAI
  • Google DeepMind announces CodeMender, an AI agent using Gemini Deep Think that automatically patches critical software vulnerabilities, having already submitted 72 high-quality fixes to major open-source projects @GoogleDeepMind
  • Microsoft updates Copilot memory to allow users to add, modify, and delete what Copilot knows about them, with the ability to direct both remembering and forgetting @Copilot

AI Industry Analysis

  • ChatGPT reaches 800 million weekly active users and OpenAI's API processes over 6 billion tokens per minute, with 4 million developers now building with OpenAI tools @AndrewCurran_
  • Private AI startups raised $377 billion in H1 2025, more than in any full year in history, with twice the capital per company, averaging $36M @deedydas
  • OpenAI partners with AMD to deploy 6GW of AMD GPUs, beginning with a 1GW deployment in the second half of 2026, as part of scaling next-gen AI infrastructure @OpenAINewsroom
  • Perplexity expands internationally by opening an office in Berlin, Germany, with 4 members of technical staff (MTS) onboarded @AravSrinivas
  • Engineering leaders interviewing for AI product positions often lack actual AI knowledge beyond using ChatGPT, according to a recruiter at a publicly traded tech company @GergelyOrosz
  • AI infrastructure spending may be driven partly by a lack of ways to gain market exposure to transformative AI, with data centers being one of the few ways to get "AGI" hedges into portfolios @emollick
  • 2026 is expected to be when recent massive AI infrastructure investments start becoming available as usable compute @natolambert

AI Ethics & Society

  • Microsoft researchers reveal a confidential research effort exploring how open-source AI tools could bypass biosecurity checks, helping create fixes now influencing global standards @MSFTResearch
  • Concerns raised about the trajectory of open AI models in America, with debates about potential bans on open weights models despite practical implementation challenges @natolambert
  • Discussion of whether interacting with AIs might actually be better for human flourishing in some cases, challenging assumptions about AI interaction being inherently negative @jeffclune

AI Applications

  • Figma launches integration with ChatGPT allowing users to create FigJam diagrams through natural language prompts @figma
  • Mattel uses Sora 2 for instant sketch to toy concept generation, demonstrating AI video applications in product design @gdb
  • Comet browser gives rise to a new "addiction" pattern: users open long YouTube videos and ask the AI assistant to jump to specific timestamps based on their questions, rather than watching linearly @AravSrinivas
  • AI-assisted online shopping continues to boom, according to new U.S. holiday e-commerce forecasts @TechCrunch
  • Stanford introduces MedAgentBench, a virtual environment to test whether AI agents can handle complex clinical workflows like retrieving patient data, ordering tests, and prescribing medications @StanfordHAI

AI Research

  • GPT-5 Pro achieves breakthrough results in mathematics, solving a problem previously unsolved by LLMs and solved by only 60 humans, and also solving an open problem in real analysis @deedydas
  • Research shows small Transformers perform better at multiplication when trained to stop relying on explicit Chain-of-Thought steps, suggesting hidden-thought circuits might emerge spontaneously in frontier-scale training @davidad
  • A 7B model fine-tuned for forms and documents beats GPT-4.1 on 1,000 extraction tasks, trained for only $196 using synthetic training data and LoRA with Group Relative Policy Optimization @rohanpaul_ai
  • GLM-4.6 becomes the new #1 open model on Hugging Face Arena, ranking #4 overall and surpassing DeepSeek R1, which had been champion for months @arena
  • Research confirms LoRA rank=1 closely matches full fine-tuning performance on many RL fine-tuning problems, with successful reproductions showing significant parameter efficiency @johnschulman2
  • A new lightweight open-source text-to-speech model, kani-tts-370m, is released with 370M parameters, achieving natural and expressive voice with real-time inference on an RTX 3060 @Tu7uruu
  • Science systems are breaking under a flood of human-created knowledge, raising concerns about how to handle a potential flood of AI-generated discoveries and translate them into streams of inquiry and practice @emollick