AI Updates on 2025-07-19

AI Model Announcements

  • OpenAI achieves gold medal-level performance on the 2025 International Mathematical Olympiad with an experimental reasoning LLM that uses general-purpose reinforcement learning and test-time compute scaling @OpenAI
  • OpenAI clarifies that GPT-5 will be released soon, but the IMO gold model is a separate experimental system that won't be released for many months @OpenAI
  • OpenAI rolls out Advanced Voice upgrades to ChatGPT free users with more natural and expressive speech and improved translation capabilities @OpenAI
  • Perplexity launches Comet, a new AI interface that lets users build custom widgets and tasks on a hybrid client-server compute architecture @AravSrinivas

AI Industry Analysis

  • Meta's Superintelligence team consists of 44 people: 50% are from China, 75% hold PhDs, and 40% came from OpenAI, with each member likely earning $10-100M per year @deedydas
  • Perplexity's Comet reaches #5 on India's Play Store across all app categories and #2 in Productivity, showing rapid adoption @AravSrinivas
  • Lee Robinson joins Cursor to focus on developer education, emphasizing the need to teach both new and experienced developers how to effectively use AI coding tools @leerob
  • Greptile raises Series A at $180M valuation backed by Benchmark, highlighting intensifying competition in the AI-powered code review space @TechCrunch
  • Section 174 tax changes that have plagued US tech businesses since 2023 are mostly reversed, which is expected to incentivize more US hiring and less international hiring @GergelyOrosz

AI Ethics & Society

  • Simon Willison warns about prompt injection vulnerabilities in the GitHub MCP server, where attackers can use malicious instructions to trick AI agents into stealing private data @simonw
  • Scott Belsky predicts data wars as companies cut off API/MCP access while users demand portability of memory and data, questioning whether customers will ultimately win @scottbelsky
  • TechCrunch advises users to think twice before granting AI access to personal data for privacy and security reasons @TechCrunch

AI Applications

  • Ethan Mollick demonstrates Veo 3 Fast creating video game scenes as community theater productions, showcasing creative AI video generation capabilities @emollick
  • Perplexity's Comet enables automated Reddit mining for structured review analysis and can play chess through self-play functionality @AravSrinivas
  • ChatGPT's platform now includes agents that can plan meals and purchase ingredients, generate editable presentations based on industry competitors, and accomplish real-world tasks @TechCrunch
  • Jack Dorsey releases two apps in less than a week, one for messaging and one for sun exposure tracking, both built by vibe coding with the AI tool Goose @TechCrunch
  • Hamel Husain observes that blog posts are now being written for computers: users can paste a URL into Claude and ask it to set up the project automatically @HamelHusain

AI Research

  • OpenAI's experimental model achieves IMO gold medal performance using natural language proofs under human competition rules without tools, representing a major milestone in mathematical reasoning @gdb
  • The IMO achievement relies on general-purpose reinforcement learning and test-time compute scaling rather than a narrow, task-specific methodology, marking progress toward general intelligence @AndrewCurran_
  • François Chollet defines intelligence as efficiency in acquiring new skills rather than a collection of skills, warning that benchmark scores can be misleading about actual AI system intelligence @fchollet
  • Nathan Lambert suggests OpenAI may have achieved very-long-episode RL with 1M-100M tokens per answer, combining extended reinforcement learning with massive test-time compute scaling @krishnakaasyap
  • Jared Friedman observes a divergence between skills that can be benchmarked and trained with reinforcement learning and those that cannot, noting that ChatGPT excels at math but struggles with writing cold emails @snowmaker
  • Ethan Mollick notes that the IMO achievement was viewed as unlikely, with prediction markets giving it only a 20% chance of happening this year, and emphasizes its significance as a hard test done without tools @emollick