AI Updates on 2025-07-24
AI Model Announcements
- Alibaba releases Qwen3-Coder-480B-A35B, a 480B-parameter MoE model with 35B active parameters that achieves 70% on SWE-Bench Verified and supports a 1M context length, potentially the best coding model yet @deedydas
- Alibaba launches Qwen3-MT, their most powerful translation model supporting 92+ languages and covering 95%+ of the world's population, trained on trillions of multilingual tokens @Alibaba_Qwen
- Tom Warren reports GPT-5 will launch in August with GPT-5-mini launching simultaneously in both client and API, and GPT-5-nano planned for API only @AndrewCurran_
- OpenAI plans to launch an open source model before GPT-5, described as similar to o3-mini with reasoning capabilities @AndrewCurran_
AI Industry Analysis
- Google processes over 980 trillion tokens monthly across its surfaces, roughly double the 480 trillion processed in May, with the Gemini app reaching 450M monthly active users @AndrewCurran_
- Over 70 million user videos have been created with Veo 3, demonstrating significant adoption of Google's video generation model @AndrewCurran_
- Safe Superintelligence (Ilya Sutskever's company) will exclusively use Google TPUs for their AI development @AndrewCurran_
- Meta adopts novel approach of building weather-proof tents to house GPU clusters, enabling new data centers to come online in months instead of years @AIatMeta
- Financial Times reports over $1 billion worth of NVIDIA chips have reached China in the last three months, including Blackwell chips, despite export controls @AndrewCurran_
- China now has 5 frontier AI labs competing globally: DeepSeek, Alibaba Qwen, ByteDance, Hailuo, and Kimi, developing rapidly and likely at lower cost than their US counterparts @deedydas
- Research shows developers save the most time with AI tools through stack trace analysis and refactoring rather than code generation, based on DX research with 180 companies @GergelyOrosz
- Forward-looking tech companies like GitHub and Shopify are hiring more interns because of AI, observing CS students use AI tools more fluently than before @GergelyOrosz
- Jack Dorsey releases two apps in less than a week using AI tool Goose for rapid development, demonstrating the "vibe coding" trend @TechCrunch
AI Ethics & Society
- President Trump's comments on copyright at the AI Summit suggest that AI should be able to learn from content without paying for each use, comparing this to human learning and noting that China doesn't follow such restrictions @AndrewCurran_
- New government requirements state that to be eligible for agency contracts, an LLM must be developed with truth-seeking and ideological neutrality principles @AndrewCurran_
- Ethan Mollick demonstrates that over 60% of older links from New York Times articles are now broken, suggesting only LLMs will "remember" much of the web's ephemeral content @emollick
- Careful review of Humanity's Last Exam benchmark reveals many questions have incorrect "right" answers, highlighting ongoing challenges in AI measurement and benchmarking @emollick
- François Chollet warns against the tendency to anthropomorphize AI systems, emphasizing the importance of understanding their true nature @fchollet
AI Applications
- Perplexity launches Comet browser with AI assistant capabilities that can distribute itself and onboard new users, receiving positive reviews for its functionality @testingcatalog
- Cursor releases Bugbot, which found over 1M bugs in human-written PRs in the past month, more than half of them real logic issues that were fixed before merging @cursor_ai
- GitHub launches Spark, a prompt-to-app platform for creating and iterating on React apps with user authentication and persistent storage @simonw
- Figma releases Make to everyone, a prompt-to-app solution that allows users to create prototypes and publish to Figma Community @figma
- Google introduces a photo-to-video feature coming to Google Photos and YouTube Shorts @sundarpichai
- Google launches an AI-powered virtual try-on feature for clothes @TechCrunch
- Linear introduces Dashboards feature allowing users to create custom views to monitor key metrics @linear
- xAI partners with Kalshi to bring Grok to prediction markets @xai
AI Research
- Anthropic develops three AI agents for alignment auditing that can autonomously uncover hidden goals, build safety evaluations, and surface concerning behaviors, with their investigator agent winning 42% of auditing challenges @AnthropicAI
- Google achieves gold-medal level performance in the International Mathematical Olympiad using an advanced version of Gemini with Deep Think mode @sundarpichai
- Research introduces Rubrics as Rewards (RaR) framework using structured, checklist-style rubrics as interpretable reward signals for on-policy training, yielding relative improvements on HealthBench-1k @iScienceLuvr
- Cameron Wolfe explains that reward models remain relevant in the age of reasoning models, as most systems still use both RLHF for human preference alignment and RLVR for verifiable reasoning tasks @cwolferesearch
- Anthropic launches "AI psychiatry" team as part of interpretability efforts to research model personas, motivations, and situational awareness and how they lead to concerning behaviors @Jack_W_Lindsey
- MIT scientists program living cells with logic gates like biological computers to detect and destroy cancer with precision @MIT
- PyTorch demonstrates SmolLM3-3B running at 15 tokens/sec on Galaxy S22 using TorchAO and ExecuTorch for on-device deployment @PyTorch