AI Updates on 2025-08-20
AI Model Announcements
- Google announces Veo 3 video generation model with sound capabilities, allowing users to turn words or photos into videos with audio @AndrewCurran_
- Google releases new Gemini Nano model powering the Pixel 10 series, featuring improved personalization and proactive assistance @Google
- ByteDance releases Seed-OSS 36B LLM on Hugging Face, featuring powerful long-context, reasoning, and agentic capabilities @HuggingPapers
- IBM and NASA release Surya, the first open-source AI foundation model for heliophysics with 366M parameters, trained on 9 years of Solar Dynamics Observatory data to predict space weather @ClementDelangue
- NVIDIA's Cosmos Reason 7B-parameter VLM achieves over 500,000 downloads on Hugging Face, designed for physical AI and robotics applications @NVIDIAAIDev
AI Industry Analysis
- Perplexity reports serving over 300 million user queries weekly, representing 3x growth in approximately 9 months from their previous 100M weekly milestone @AravSrinivas
- EliseAI raises $250M Series E led by a16z, surpassing $100M ARR as an AI property manager and healthcare administrator addressing friction in housing and healthcare industries @aleximm
- Gergely Orosz observes peak AI hype with investors funding questionable AI startups like mattress companies using AI to "fix sleep" and AI-powered jewelry, suggesting FOMO-driven investment decisions @GergelyOrosz
- Microsoft announces expanded partnership with NFL, bringing Copilot and Azure AI Foundry to football operations both on and off the field @satyanadella
- Anthropic launches Claude Code for Team and Enterprise plans with flexible pricing, allowing organizations to mix standard and premium seats across their teams @claudeai
AI Ethics & Society
- Harvard students who previously developed facial recognition app for Meta's Ray-Ban glasses are launching a startup making smart glasses with always-on microphones, raising privacy concerns @TechCrunch
- Gergely Orosz suggests AI tooling going mainstream will help non-technical people understand why building good software is difficult, as they experience the gap between expectations and reality @GergelyOrosz
AI Applications
- Google introduces Magic Cue on Pixel phones, using Gemini capabilities to proactively surface helpful information and actions across apps when needed @GoogleAI
- Google Photos launches conversational editing feature allowing users to make photo changes by describing them in natural language @TechCrunch
- Google announces Voice Translate for Pixel phones, enabling real-time call translation using the caller's voice for more authentic multilingual conversations @GoogleAI
- Google introduces Camera Coach using Gemini models to read scenes and provide guidance for perfect photography shots @GoogleAI
- Perplexity launches SuperMemory feature in final testing stages, claiming superior performance compared to existing memory solutions @AravSrinivas
- Perplexity introduces Max Assistant mode on Comet for subscribers, capable of running long-horizon research tasks contextually to reading content @AravSrinivas
- Sierra demonstrates AI agent simulations for testing, including voice simulations with background noise to harden agent performance before deployment @btaylor
- Brex's AI agent built on Sierra platform answers customer questions 90% faster, saving customers 15,000 hours annually @btaylor
- Carbon Robotics uses AI-powered laser weeding robots that have destroyed 15 billion weeds across 100+ crops without herbicides, delivering dramatic yield increases @NVIDIAAI
- Google introduces Pixel Journal, a new journaling app using on-device AI to suggest personalized writing prompts @TechCrunch
- Google announces AI-powered personal health coach built with Gemini coming to Fitbit devices @TechCrunch
AI Research
- Microsoft Research introduces GPT-5 Pro demonstrating capability to prove new mathematical theorems, successfully proving a better bound than published in a convex optimization paper @SebastienBubeck
- Berkeley AI Research presents XQuant, achieving 10-12.5x memory savings versus FP16 with near-zero accuracy loss by leveraging underutilized compute units for KV cache rematerialization @adityastomar_
- Cursor team rebuilds MoE layers at kernel level with MXFP8, achieving 3.5x faster MoE layer performance and 1.5x end-to-end training speedup @stuart_sul
- PyTorch introduces ZenFlow for LLM training with offloading, delivering 5x faster training, 85% fewer GPU stalls, and 2x lower I/O overhead @PyTorch
- Microsoft Research releases MindJourney enabling AI to navigate and interpret 3D environments from limited visual input for improved navigation and planning tasks @MSFTResearch
- Nathan Lambert analyzes the spectrum of reasoning effort in AI models, noting that all current models use similar reinforcement learning techniques with varying token usage rather than binary reasoning classifications @natolambert
- Ethan Mollick demonstrates AI video generation capabilities by creating music videos from academic paper abstracts, showcasing evolving consistency in character generation and lip syncing @emollick
- Simon Willison tests Qwen-Image-Edit model on 64GB M2 MacBook Pro, generating rainbow-colored pelican images in 25 minutes with 10 inference steps, compared to 2 hours 59 minutes for full 50 steps @simonw