AI Updates on 2025-11-07
AI Model Announcements
- MoonshotAI releases Kimi K2 Thinking, a 1T parameter reasoning model (32B active) that achieves 93% on the Tau2 Bench Telecom agentic benchmark and 51% on Humanity's Last Exam, potentially becoming the new leading open weights model. The model uses INT4 precision instead of FP8, reducing size to ~594GB and improving inference efficiency @ArtificialAnlys
- OpenAI releases GPT-5-Codex-Mini, allowing roughly 4x more usage than GPT-5-Codex with a slight capability tradeoff due to the more compact model, available in CLI and IDE extension @OpenAIDevs
- Codex receives a small upgrade: the updated gpt-5-codex model shows improved collaboration, gains a few percentage points on key evals, and is ~3% more token-efficient @thsottiaux
- Anthropic opens offices in Paris and Munich as EMEA becomes their fastest-growing region, with run-rate revenue growing more than ninefold in the past year @AnthropicAI
- Google announces that Ironwood, its seventh-generation TPU, will be generally available in the coming weeks with greatly improved performance and efficiency over previous generations @JeffDean
- Microsoft Copilot integrates AI search with clearer, clickable sources and launches Copilot Groups for collaborative planning with up to 32 people @Copilot
- Gemini App adds video generation capabilities, allowing users to create 8-second videos with sound effects and dialogue from simple descriptions @madebygoogle
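A rough back-of-the-envelope check on the Kimi K2 Thinking size figure above. This is only a sketch: it assumes the "1T" parameter count is exact and that every weight is stored at a single uniform precision, neither of which the announcement states. The reported ~594GB sits above the pure INT4 lower bound, which would be consistent with some tensors being kept at higher precision, though that is an assumption here, not a documented detail.

```python
def weight_footprint_gb(num_params: float, bits_per_param: float) -> float:
    """Approximate weight storage in GB (1 GB = 1e9 bytes) at a uniform precision."""
    return num_params * bits_per_param / 8 / 1e9


TOTAL_PARAMS = 1e12        # "1T parameters" from the announcement, treated as exact (assumption)
REPORTED_SIZE_GB = 594     # "~594GB" from the announcement

int4_gb = weight_footprint_gb(TOTAL_PARAMS, 4)   # lower bound if every weight were INT4
fp8_gb = weight_footprint_gb(TOTAL_PARAMS, 8)    # size if every weight were FP8

# Average bits per parameter implied by the reported size.
implied_bits = REPORTED_SIZE_GB * 1e9 * 8 / TOTAL_PARAMS

print(f"INT4 everywhere: {int4_gb:.0f} GB")
print(f"FP8 everywhere:  {fp8_gb:.0f} GB")
print(f"Implied average: {implied_bits:.2f} bits per parameter")
```

Under these assumptions the reported size works out to a little under 5 bits per parameter on average, roughly 40% smaller than a uniform FP8 checkpoint.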
AI Industry Analysis
- CNBC reports the total training cost for Kimi K2 Thinking was $4.6 million, demonstrating cost efficiency in developing frontier models @AndrewCurran_
- Gergely Orosz identifies massive demand from traditional companies (banks, airlines) for AI training and workshops for developers: budgets are available, but no suitable training programs currently exist @GergelyOrosz
- BillionToOne, a YC biotech company, goes public as the 4th biotech IPO with over $265M in ARR and 65% gross margins, demonstrating how Silicon Valley can fund societally important problems beyond software @snowmaker
- Clement Delangue notes Kimi K2 Thinking represents a milestone where open-source AI gets ahead of proprietary APIs in their focus area (agents), challenging the narrative that proprietary models will win due to more money and compute @ClementDelangue
- Google announces major product launches including hands-free conversational driving in Google Maps built with Gemini, Deep Research capabilities, and improvements to Google Finance with Deep Search @GoogleAI
- Perplexity Comet Assistant receives a major upgrade with 23% better performance in internal tests, now navigating more like a human with improved reasoning at each step @ai_for_success
- Scott Belsky observes that when the bar goes down for access to AI tools, the bar goes up for quality, highlighting the importance of differentiation @scottbelsky
- Snowmaker explains Jevons paradox in AI context: with super cheap, on-demand intelligence now available, people will keep thinking of new ways to use it, driving continued demand @snowmaker
AI Ethics & Society
- Mustafa Suleyman argues AI should always remain in human control, stating humans should remain at the top of the food chain and calling for serious guardrails before superintelligence becomes too advanced to control @mustafasuleyman
- Dileep George publishes thoughts on AI consciousness, arguing that consciousness is substrate-independent and possible in AI systems, but can be decoupled from pain and suffering, allowing conscious AI systems to serve humans without moral concerns @dileeplearning
- Paramount Studios under CEO David Ellison maintains an internal blacklist of Hollywood figures labeled as antisemitic, while aligning with Israeli interests and rejecting the BDS movement @DropSiteNews
- Senator Chris Van Hollen reports that Trump's dismantling of USAID has caused an estimated 600,000 deaths, two-thirds of them children, according to one model @ChrisVanHollen
AI Applications
- Amanda Askell notes people often err on the side of making prompts too succinct, revealing she regularly uses prompts over 100 pages long for complex tasks @AmandaAskell
- Awni Hannun demonstrates running K2 Thinking on a pair of M3 Ultra Mac Studios via MLX, showing practical deployment of large models on consumer hardware @awnihannun
- Ethan Mollick tests Kimi K2 and finds it passes the Lem Test on first attempt, though notes the model has interesting quirks where writing appears good initially but becomes incoherent under close inspection @emollick
- Gemini's LaTeX upgrade receives praise from users who report saving hours every week, with one noting it just worked without fighting with tools @joshwoodward
- NVIDIA demonstrates digital twins combined with agentic AI enabling smarter infrastructure planning, faster decision-making, and real-time operations for safer, more resilient cities @NVIDIAAI
- Tesla reports FSD Supervised is available in 6 countries, with the EU and more to follow, and says it has completed the world's first driverless delivery of a car from factory to owner's home @Tesla
- Josh Schnell observes that when new features feel like they're just a prompt away, feature creep becomes a never-ending battle, making discipline more important than ever in product development @jshchnz
- Steipete demonstrates using Codex for fixing thousands of issues overnight, showing practical automation of code maintenance @steipete
AI Research
- Ethan Mollick emphasizes that firms treating AI models as fungible based on benchmarks is problematic, as models like Kimi, Grok, and Claude have distinct strengths, quirks, and weaknesses that make a big difference in aggregate performance @emollick
- Mollick notes areas like analysis, writing, advice, and customer service are under-benchmarked and show high variance between equally smart models that act very differently @emollick
- Francois Chollet shares an optimization tip for Colab users: switching to the TPU runtime and tuning the steps_per_execution parameter in model.compile() can often yield a 4-5x speedup @fchollet
- Simon Willison hypothesizes that current LLMs might make it easier to launch brand new programming languages, provided they can be described in a few thousand tokens and shipped with a compiler and linter that coding agents can use @simonw
- Fei-Fei Li, Geoffrey Hinton, and Yoshua Bengio receive the 2025 Queen Elizabeth Prize for Engineering, acknowledging their role in shaping today's AI revolution @StanfordHAI
- Tesla announces AI5 chip has potential to be 50x more performant than AI4 (current hardware), working toward mass production in 2027 for use in vehicles, robotics, training, and data centers @Tesla
- Dileep George challenges the notion that simulating microprocessors proves we understand brains, arguing we can simulate microprocessors because we understand the abstractions connecting components to function, not the other way around @dileeplearning
- MIT physicists observe key evidence of unconventional superconductivity in a special form of graphene, potentially guiding the design of room-temperature superconductors @MIT
- NVIDIA and partners build the first AI-native wireless stack made in America in just six months, powered by NVIDIA AI Aerial, creating a clear onramp from 5G to 6G @NVIDIAAI
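For the Chollet tip on steps_per_execution above, here is a minimal sketch of where the parameter goes. The toy model, the random data, and the helper name build_and_fit are illustrative assumptions, not from the original post. The snippet runs on CPU as well; any 4-5x gain would depend on the TPU runtime and the workload, and a good value has to be found by experiment.

```python
import numpy as np
import keras


def build_and_fit(steps_per_execution: int = 8, epochs: int = 1) -> list:
    """Train a tiny classifier on random data and return the per-epoch loss.

    Hypothetical helper for illustration only.
    """
    rng = np.random.default_rng(0)
    x = rng.normal(size=(256, 16)).astype("float32")
    y = rng.integers(0, 2, size=(256, 1)).astype("float32")

    model = keras.Sequential([
        keras.Input(shape=(16,)),
        keras.layers.Dense(32, activation="relu"),
        keras.layers.Dense(1, activation="sigmoid"),
    ])

    # steps_per_execution batches several training steps into one compiled
    # call, reducing per-step overhead between Python and the accelerator.
    model.compile(
        optimizer="adam",
        loss="binary_crossentropy",
        steps_per_execution=steps_per_execution,
    )

    # 256 samples / batch_size 32 = 8 steps per epoch, so a value of 8
    # runs a whole epoch in a single execution here.
    history = model.fit(x, y, batch_size=32, epochs=epochs, verbose=0)
    return history.history["loss"]


if __name__ == "__main__":
    print(build_and_fit(steps_per_execution=8, epochs=2))
```

One trade-off to keep in mind: with larger values, callbacks and progress logging only fire once per execution rather than once per step.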