AI Updates on 2025-11-17

AI Model Announcements

Alibaba's Qwen Chat reaches 10 million users milestone @Alibaba_Qwen
xAI rolls out Grok 4.1 beta to users, with the model appearing to have been in silent A/B testing during the first two weeks of November @AndrewCurran_
OpenAI releases GPT-5.1 with significantly faster response times than GPT-5, though some users report issues with code-related tasks like staging changes and creating pull requests @natolambert
GPT-5.1 High performs comparably to GPT-5 Pro on ARC-AGI benchmarks while being nearly an order of magnitude cheaper @GregKamradt
Google DeepMind announces WeatherNext 2, an AI weather forecasting model that is 8 times faster than its predecessor and more accurate across 99.9% of weather variables including temperature, wind, humidity and pressure levels @GoogleDeepMind

AI Industry Analysis

Jeff Bezos reportedly returns as co-CEO of new AI startup Project Prometheus, which has $6.2 billion in funding and will focus on AI design in aerospace, computers and cars, with nearly 100 employees hired from OpenAI, DeepMind and Meta @AndrewCurran_
Sakana AI raises $135M Series B at a $2.65B valuation to continue building AI models for Japan, with support from MUFG, Khosla Ventures, and other major investors @TechCrunch
Runlayer, an MCP AI agent security startup, launches with 8 unicorns and $11M from Khosla's Keith Rabois and Felicis @TechCrunch
Luminal raises $5.3 million to build a better GPU code framework @TechCrunch
PowerLattice attracts investment from ex-Intel CEO Pat Gelsinger for its power saving chiplet technology @TechCrunch
Bone AI raises $12M to challenge Asia's defense giants with AI-powered robotics @TechCrunch
Ramp hits $32B valuation, just three months after hitting $22.5B @TechCrunch
Figma stock down 68% in the 2.5 months since IPO, with valuation at approximately $19B despite $1.1B ARR and 38% year-over-year growth, highlighting the brutal nature of public markets for late-stage private companies @deedydas
Figma employees receive exceptional compensation with R&D spending at 29% of revenue translating to $300k+ average cash compensation per employee, plus stock-based compensation bringing total to $700k-$1.5M per year @deedydas
OpenAI CEO of Applications Fidji Simo discusses path to profitability, with expectations that both OpenAI and Anthropic will release AI financial advisors in 2026 @AndrewCurran_
Mustafa Suleyman argues that we are not in an AI bubble, stating that AI is the smartest, most capable technology ever invented and continues improving faster than expected @mustafasuleyman
Cisco acquires translation startup EzDubs @TechCrunch

AI Ethics & Society

Gergeely Orosz observes the dead internet theory playing out on X, where AI-generated replies are boosted based on payment rather than quality, appearing above substantive human responses @GergelyOrosz
Reid Hoffman argues that waiting for 100% safety before approving new AI technologies like AI therapists withholds enormous benefits from people who need them, stating the benchmark should be systems safer than human-only alternatives rather than zero mistakes @reidhoffman
Hoffman emphasizes that for those who cannot access therapy due to economic, geographic, or other reasons, a well-made AI therapist is better than no access to mental health support @reidhoffman
Amanda Askell draws parallels between relationship counseling and AI troubleshooting, noting that her first question for Claude problems is now "what happened when you said all this to Claude?" similar to asking partners to communicate directly @AmandaAskell
Aidan McLaughlin from OpenAI acknowledges user concerns about model changes, stating the team is working at 3am on Sundays to improve chatbot quality and fix alignment imprecision, while admitting no current chatbot is optimal @aidan_mclau

AI Applications

Anthropic partners with the Government of Rwanda and ALX Africa to bring Chidi, a learning companion built on Claude, to hundreds of thousands of learners across Africa @AnthropicAI
Google integrates WeatherNext technology into Google Search, Gemini, Pixel Weather, and will soon power weather information in Google Maps @GoogleDeepMind
Public.com launches feature allowing users to create AI-generated ETFs based on custom criteria, with one example of design-focused companies outperforming the S&P 500 by 2x historically @benblumenrose
Tim McAleer at Florentine Films uses AI to create custom media management software for filmmaking @clairevo
Google rolls out AI Flight Deals tool globally and adds new travel features in Search @TechCrunch
Hugging Face and Google Cloud partner to speed up model access, strengthen security and reduce operational costs, with more than 1,500 terabytes exchanged daily @DataChaz

AI Research

Google DeepMind's WeatherNext 2 uses a new Functional Generative Network approach that adds targeted randomness directly into the architecture, allowing it to explore a wide range of weather scenarios and generate hundreds of possible forecasts in less than a minute from a single starting point @GoogleDeepMind
WeatherNext 2 achieves world-leading performance at predicting both marginal forecasts (singular weather events like temperature at specific locations) and joint predictions (combining multiple variables such as expected wind power) @GoogleDeepMind
Ethan Mollick critiques a new hallucination benchmark, arguing it primarily measures refusal thresholds for answering extremely specific trivia questions rather than true hallucination rates, noting that GPT-5 High and Grok-4 achieving 39% accuracy on nearly impossible questions without web lookup is astonishing @emollick
Ethan Mollick identifies missing AI benchmarks around brittleness, noting that some models perform well initially and on benchmarks but break down with extended use, raising questions about generalization, thematic repetition, and prompt intent understanding @emollick
Shreya Shankar provides detailed framework for understanding AI evaluation, breaking it into three components: identifying success criteria, determining how to apply the rubric to LLM outputs, and automating the rubric application at scale @sh_reya
Nathan Lambert discusses why AI writing is mediocre, explaining how current language model training methods destroy voice and hope for good writing, with GPT-5 acknowledging it is hardwired to always give suggestions rather than claim to write masterpieces @natolambert
Hamel Husain warns that ask me anything chatbots represent a $500K mistake due to evaluation death spirals, where lack of clear scope prevents defining success metrics, identifying critical failures, and prioritizing fixes, advocating for radically specific agent boundaries @bnicholehopkins
Francois Chollet states that simplicity is the signature of truth, arguing that tangled explanations with exceptions and special cases indicate the core idea hasn't been found yet @fchollet
Greg Brockman from OpenAI seeks candidates for inference work, describing it as perhaps the most valuable emerging software category as models get smarter and more economically valuable, with compute increasingly spent drawing samples from models @gdb
MIT develops new bionic knee that helps people with above-the-knee amputations walk faster, climb stairs, and avoid obstacles more easily than traditional prostheses @MIT
Microsoft Research announces Project Gecko bringing AI to underserved populations, Workload Intelligence for cloud efficiency, operator-level autoscaling for large generative models, Sherlock for agentic workflow reliability, and BioAgents for bioinformatics workflows @MSFTResearch