AI Updates on 2025-12-15

AI Model Announcements

NVIDIA releases Nemotron 3 Nano, a 30B hybrid Mamba-Transformer mixture-of-experts reasoning model with a 1M-token context window and leading performance on SWE-Bench, reasoning, and chat benchmarks @ctnzr
NVIDIA announces the full Nemotron 3 family with unprecedented openness, releasing training data, the NeMo Gym reinforcement learning library, and complete training code alongside the models, with Super and Ultra variants coming in the following months @nvidianewsroom
Alibaba releases Qwen Code v0.5.0 with VSCode integration, native TypeScript SDK, support for OpenAI-compatible reasoning models including DeepSeek V3.2 and Kimi-K2, and Russian language support @Alibaba_Qwen
Apple releases Sharp, a monocular view synthesis model capable of generating views in less than a second @_akhaliq
AI2 introduces Bolmo, the first fully open byte-level language model, built by byteifying Olmo 3 and matching or surpassing state-of-the-art subword models across a wide range of tasks @allen_ai

AI Industry Analysis

Senior engineers at top tech companies report that their jobs now consist primarily of prompting Cursor or Claude Code with Opus 4.5 and sanity-checking the output, suggesting AI has crossed the threshold of generalizing to most software tasks @deedydas
Developer reports spending $260 in tokens to complete in three days a migration that was estimated to take weeks, raising questions about whether companies will absorb $12-35K in annual token costs per developer on top of salaries @GergelyOrosz
Companies pushing for 20% productivity increases to justify AI spending, with unpredictability of metered costs driving preference for fixed-price AI coding plans over pay-per-use models @GergelyOrosz
Experienced developers extract significantly more value from AI tools than less experienced developers because they specify tasks precisely rather than relying on generic prompts @GergelyOrosz
President Trump launches US Tech Force to hire 1,000 engineers for high-impact technology initiatives, in partnership with OpenAI, Oracle, Palantir, Anduril, Apple, Amazon, Google, Microsoft, NVIDIA, and xAI @AndrewCurran_
Mirelo raises $41M seed round led by a16z and Index for a foundation model focused on the sound layer of video generation @a16z
First Voyage raises $2.5M for AI companion that helps users build habits @TechCrunch
Sierra announces new office in Paris as company expands internationally @btaylor
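
The token-cost figures in the migration item above can be sanity-checked with a little arithmetic. The sketch below scales the reported $260 over three days to a full year and compares the result with the reported $12-35K range. The 230 working days per year is an assumption made for illustration, not a figure from the source, and a single migration may not reflect typical daily usage.

```python
# Back-of-the-envelope check of the reported token-cost figures.
# WORKING_DAYS_PER_YEAR is an illustrative assumption, not from the source.

MIGRATION_COST_USD = 260      # reported token spend for the migration
MIGRATION_DAYS = 3            # reported duration of the migration
WORKING_DAYS_PER_YEAR = 230   # assumed


def annualized_token_cost(cost_usd: float, days: float, working_days: int) -> float:
    """Scale an observed token spend to a full working year."""
    return cost_usd / days * working_days


daily = MIGRATION_COST_USD / MIGRATION_DAYS
annual = annualized_token_cost(MIGRATION_COST_USD, MIGRATION_DAYS, WORKING_DAYS_PER_YEAR)

print(f"Daily spend: ${daily:.2f}")
print(f"Annualized:  ${annual:,.0f}")
print(f"Within reported $12-35K range: {12_000 <= annual <= 35_000}")
```

Under that assumption, sustained usage at the migration's rate would land inside the reported annual range.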

AI Research

Olmo 3 release sets new standard for transparency with full data release, 100-page report, open training infrastructure, and reproducible evaluations, enabling rigorous experiments with zero barrier to entry @cwolferesearch
Nemotron 3 Nano achieves Intelligence Index score of 52 with only 3.6B active parameters out of 31.6B total, representing 6-point lead over similarly-sized Qwen3 30B and 15-point improvement over previous Nemotron Nano 9B V2 @ArtificialAnlys
All frontier AI models now pass all levels of the challenging Chartered Financial Analyst exam in an evaluation that used paywalled mock exams to reduce leakage risk, with prompting strategy showing minimal impact on most question types @emollick
MIT's DisCIPL uses an LLM to steer smaller language models to collaborate on open-ended tasks with constraints like advanced puzzles and math proofs, achieving accuracy and efficiency comparable to leading models @MIT_CSAIL
A professor historically skeptical of model usefulness reports that GPT 5.2 Pro represents a step change in usefulness for algebraic geometry and number theory research applications @AndrewCurran_
NVIDIA's Parallel-Distill-Refine framework achieves 93.3% accuracy on AIME 2024 compared with 79.4% for standard long chain-of-thought at matched latency, demonstrating that bounded-memory iteration can substitute for long reasoning traces @rsalakhu
Prime Intellect collaborates with NVIDIA to integrate NeMo Gym's RL environments into their Environments Hub, making it easier for teams to scale reinforcement learning @AndrewCurran_
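
The Nemotron 3 Nano figures reported above imply a few numbers the item does not state outright. The sketch below derives the fraction of parameters active per token and the comparison scores implied by the stated leads. The implied scores follow only from the arithmetic in the item and are not independently reported values.

```python
# Derive the quantities implied by the reported Nemotron 3 Nano figures.
# Implied scores are simple subtractions, not independently reported numbers.

ACTIVE_PARAMS_B = 3.6        # reported active parameters (billions)
TOTAL_PARAMS_B = 31.6        # reported total parameters (billions)
NEMOTRON_3_NANO_SCORE = 52   # reported Intelligence Index score
LEAD_OVER_QWEN3_30B = 6      # reported lead, in index points
GAIN_OVER_NANO_9B_V2 = 15    # reported improvement, in index points

active_fraction = ACTIVE_PARAMS_B / TOTAL_PARAMS_B
implied_qwen3_30b = NEMOTRON_3_NANO_SCORE - LEAD_OVER_QWEN3_30B
implied_nano_9b_v2 = NEMOTRON_3_NANO_SCORE - GAIN_OVER_NANO_9B_V2

print(f"Active parameter fraction: {active_fraction:.1%}")
print(f"Implied Qwen3 30B score: {implied_qwen3_30b}")
print(f"Implied Nemotron Nano 9B V2 score: {implied_nano_9b_v2}")
```

Roughly one parameter in nine is active per token, which is the sense in which the item describes the score as achieved with "only" 3.6B active parameters.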

AI Applications

Google's Gemini Agent is now available to Google AI Ultra users in the US and can handle tasks such as renting a car by comparing prices, gathering information from the inbox, and booking within a budget @GeminiApp
Figma Slides and Figma Buzz now available in ChatGPT for creating presentations and invites through conversational interface @figma
IBM releases CUGA, an open-source enterprise agent that automates tasks by writing and executing code against workspace files, with built-in tools for enterprise tasks and MCP support @huggingface
Zapier's Executive Business Partner implements an AI-powered meeting prep agent, a meeting coach for exec team alignment, and a pre-doc review system that enables CEO-level feedback before meetings @clairevo
Developer reports running two complex tasks through Codex with GPT 5.2 Extra High for 2.5 and 1.75 hours respectively, completing all acceptance criteria with full test coverage and zero broken code @gdb
Zoom brings AI assistant to web with access for free users @TechCrunch

AI Ethics & Society

Merriam-Webster names "slop" its 2025 Word of the Year, reflecting concerns about AI-generated content quality @TechCrunch
Chatbots struggle with file management in ways CLI versions do not, with Gemini frequently confusing which files are referenced and ChatGPT often misplacing generated files @emollick
Claude's conversation compacting feature works worse for knowledge work than for coding, abruptly resetting tone and flow in a way that rolling context windows do not @emollick