AI Updates on 2025-07-19
AI Model Announcements
- OpenAI achieves gold medal-level performance on the 2025 International Mathematical Olympiad with an experimental reasoning LLM that uses general-purpose reinforcement learning and test-time compute scaling @OpenAI
- OpenAI clarifies that GPT-5 will be released soon, but the IMO gold model is a separate experimental system that won't be released for many months @OpenAI
- OpenAI rolls out Advanced Voice upgrades to ChatGPT free users with more natural and expressive speech and improved translation capabilities @OpenAI
- Perplexity launches Comet, a new AI interface that allows users to build custom widgets and tasks with hybrid client-server compute architecture @AravSrinivas
AI Industry Analysis
- Meta's Superintelligence team consists of 44 people: 50% are from China, 75% hold PhDs, and 40% came from OpenAI, and each member likely earns $10-100M per year @deedydas
- Perplexity's Comet reaches #5 on India's Play Store across all app categories and #2 in Productivity, showing rapid adoption @AravSrinivas
- Lee Robinson joins Cursor to focus on developer education, emphasizing the need to teach both new and experienced developers how to effectively use AI coding tools @leerob
- Greptile raises Series A at $180M valuation backed by Benchmark, highlighting intensifying competition in the AI-powered code review space @TechCrunch
- Section 174 tax changes that have plagued US tech businesses since 2023 are mostly reversed, which is expected to incentivize more US hiring and less international hiring @GergelyOrosz
AI Ethics & Society
- Simon Willison warns about prompt injection vulnerabilities in the GitHub MCP server, where attackers can use malicious instructions to trick AI agents into stealing private data @simonw
- Scott Belsky predicts data wars as companies cut off API/MCP access while users demand portability of memory and data, questioning whether customers will ultimately win @scottbelsky
- TechCrunch advises users to think twice before granting AI access to personal data for privacy and security reasons @TechCrunch
AI Applications
- Ethan Mollick demonstrates Veo 3 Fast creating video game scenes as community theater productions, showcasing creative AI video generation capabilities @emollick
- Perplexity's Comet enables automated Reddit mining for structured review analysis and can play chess through self-play functionality @AravSrinivas
- ChatGPT's platform now includes agents that can plan meals and purchase ingredients, generate editable presentations based on industry competitors, and accomplish real-world tasks @TechCrunch
- Jack Dorsey releases two apps in less than a week, one for messaging and one for sun exposure tracking, both vibe coded with the AI tool Goose @TechCrunch
- Hamel Husain observes that blog posts are now written for computers: users can paste URLs into Claude and ask it to set up projects automatically @HamelHusain
AI Research
- OpenAI's experimental model achieves IMO gold medal performance using natural language proofs under human competition rules without tools, representing a major milestone in mathematical reasoning @gdb
- The IMO achievement uses general-purpose reinforcement learning and test-time compute scaling rather than narrow task-specific methodology, marking progress toward general intelligence @AndrewCurran_
- François Chollet defines intelligence as efficiency in acquiring new skills rather than a collection of skills, warning that benchmark scores can be misleading about actual AI system intelligence @fchollet
- Nathan Lambert suggests OpenAI may have achieved very-long-episode RL with 1M-100M tokens per answer, combining extended reinforcement learning with massive test-time compute scaling @krishnakaasyap
- Jared Friedman observes a divergence between skills that can be benchmarked and trained with reinforcement learning and those that cannot, noting ChatGPT excels at math but struggles with writing cold emails @snowmaker
- Ethan Mollick notes the IMO achievement was viewed as unlikely, with prediction markets giving it only a 20% chance of happening this year, and emphasizes its significance as a hard test done without tools @emollick