AI Updates on 2025-11-17
AI Model Announcements
- Alibaba's Qwen Chat reaches 10 million users milestone @Alibaba_Qwen
- xAI rolls out Grok 4.1 beta to users, with the model appearing to have been in silent A/B testing during the first two weeks of November @AndrewCurran_
- OpenAI releases GPT-5.1 with significantly faster response times than GPT-5, though some users report issues with code-related tasks like staging changes and creating pull requests @natolambert
- GPT-5.1 High performs comparably to GPT-5 Pro on ARC-AGI benchmarks while being nearly an order of magnitude cheaper @GregKamradt
- Google DeepMind announces WeatherNext 2, an AI weather forecasting model that is 8 times faster than its predecessor and more accurate across 99.9% of weather variables including temperature, wind, humidity and pressure levels @GoogleDeepMind
AI Industry Analysis
- Jeff Bezos reportedly returns as co-CEO of new AI startup Project Prometheus, which has $6.2 billion in funding and will focus on AI design in aerospace, computers and cars, with nearly 100 employees hired from OpenAI, DeepMind and Meta @AndrewCurran_
- Sakana AI raises $135M Series B at a $2.65B valuation to continue building AI models for Japan, with support from MUFG, Khosla Ventures, and other major investors @TechCrunch
- Runlayer, an MCP AI agent security startup, launches with 8 unicorns and $11M from Khosla's Keith Rabois and Felicis @TechCrunch
- Luminal raises $5.3 million to build a better GPU code framework @TechCrunch
- PowerLattice attracts investment from ex-Intel CEO Pat Gelsinger for its power saving chiplet technology @TechCrunch
- Bone AI raises $12M to challenge Asia's defense giants with AI-powered robotics @TechCrunch
- Ramp hits $32B valuation, just three months after hitting $22.5B @TechCrunch
- Figma stock down 68% in the 2.5 months since IPO, with valuation at approximately $19B despite $1.1B ARR and 38% year-over-year growth, highlighting the brutal nature of public markets for late-stage private companies @deedydas
- Figma employees receive exceptional compensation with R&D spending at 29% of revenue translating to $300k+ average cash compensation per employee, plus stock-based compensation bringing total to $700k-$1.5M per year @deedydas
- OpenAI CEO of Applications Fidji Simo discusses path to profitability, with expectations that both OpenAI and Anthropic will release AI financial advisors in 2026 @AndrewCurran_
- Mustafa Suleyman argues that we are not in an AI bubble, stating that AI is the smartest, most capable technology ever invented and continues improving faster than expected @mustafasuleyman
- Cisco acquires translation startup EzDubs @TechCrunch
AI Ethics & Society
- Gergeely Orosz observes the dead internet theory playing out on X, where AI-generated replies are boosted based on payment rather than quality, appearing above substantive human responses @GergelyOrosz
- Reid Hoffman argues that waiting for 100% safety before approving new AI technologies like AI therapists withholds enormous benefits from people who need them, stating the benchmark should be systems safer than human-only alternatives rather than zero mistakes @reidhoffman
- Hoffman emphasizes that for those who cannot access therapy due to economic, geographic, or other reasons, a well-made AI therapist is better than no access to mental health support @reidhoffman
- Amanda Askell draws parallels between relationship counseling and AI troubleshooting, noting that her first question for Claude problems is now "what happened when you said all this to Claude?" similar to asking partners to communicate directly @AmandaAskell
- Aidan McLaughlin from OpenAI acknowledges user concerns about model changes, stating the team is working at 3am on Sundays to improve chatbot quality and fix alignment imprecision, while admitting no current chatbot is optimal @aidan_mclau
AI Applications
- Anthropic partners with the Government of Rwanda and ALX Africa to bring Chidi, a learning companion built on Claude, to hundreds of thousands of learners across Africa @AnthropicAI
- Google integrates WeatherNext technology into Google Search, Gemini, Pixel Weather, and will soon power weather information in Google Maps @GoogleDeepMind
- Public.com launches feature allowing users to create AI-generated ETFs based on custom criteria, with one example of design-focused companies outperforming the S&P 500 by 2x historically @benblumenrose
- Tim McAleer at Florentine Films uses AI to create custom media management software for filmmaking @clairevo
- Google rolls out AI Flight Deals tool globally and adds new travel features in Search @TechCrunch
- Hugging Face and Google Cloud partner to speed up model access, strengthen security and reduce operational costs, with more than 1,500 terabytes exchanged daily @DataChaz
AI Research
- Google DeepMind's WeatherNext 2 uses a new Functional Generative Network approach that adds targeted randomness directly into the architecture, allowing it to explore a wide range of weather scenarios and generate hundreds of possible forecasts in less than a minute from a single starting point @GoogleDeepMind
- WeatherNext 2 achieves world-leading performance at predicting both marginal forecasts (singular weather events like temperature at specific locations) and joint predictions (combining multiple variables such as expected wind power) @GoogleDeepMind
- Ethan Mollick critiques a new hallucination benchmark, arguing it primarily measures refusal thresholds for answering extremely specific trivia questions rather than true hallucination rates, noting that GPT-5 High and Grok-4 achieving 39% accuracy on nearly impossible questions without web lookup is astonishing @emollick
- Ethan Mollick identifies missing AI benchmarks around brittleness, noting that some models perform well initially and on benchmarks but break down with extended use, raising questions about generalization, thematic repetition, and prompt intent understanding @emollick
- Shreya Shankar provides detailed framework for understanding AI evaluation, breaking it into three components: identifying success criteria, determining how to apply the rubric to LLM outputs, and automating the rubric application at scale @sh_reya
- Nathan Lambert discusses why AI writing is mediocre, explaining how current language model training methods destroy voice and hope for good writing, with GPT-5 acknowledging it is hardwired to always give suggestions rather than claim to write masterpieces @natolambert
- Hamel Husain warns that ask me anything chatbots represent a $500K mistake due to evaluation death spirals, where lack of clear scope prevents defining success metrics, identifying critical failures, and prioritizing fixes, advocating for radically specific agent boundaries @bnicholehopkins
- Francois Chollet states that simplicity is the signature of truth, arguing that tangled explanations with exceptions and special cases indicate the core idea hasn't been found yet @fchollet
- Greg Brockman from OpenAI seeks candidates for inference work, describing it as perhaps the most valuable emerging software category as models get smarter and more economically valuable, with compute increasingly spent drawing samples from models @gdb
- MIT develops new bionic knee that helps people with above-the-knee amputations walk faster, climb stairs, and avoid obstacles more easily than traditional prostheses @MIT
- Microsoft Research announces Project Gecko bringing AI to underserved populations, Workload Intelligence for cloud efficiency, operator-level autoscaling for large generative models, Sherlock for agentic workflow reliability, and BioAgents for bioinformatics workflows @MSFTResearch