AI Updates on 2026-02-27

AI Model Announcements

Google releases Nano Banana 2 (Gemini 3.1 Flash Image Preview) achieving #1 ranking in Image Arena with 1279 score, 60+ point lead in text rendering @arena
Google launches Gemini 3.1 Pro for complex reasoning tasks and Gemini 3 Deep Think mode for researchers and engineers @GeminiApp
Google introduces Lyria 3 music model enabling 30-second custom soundtrack generation from text, images, or video @GeminiApp
Perplexity launches Computer, unifying research, design, code deployment and project management into single AI system @perplexity_ai
Alibaba releases Qwen3.5 with ~400B-parameter MoE architecture for native multimodal agents with UI understanding @NVIDIAAIDev

AI Industry Analysis

OpenAI raises $110B from Amazon, NVIDIA, and SoftBank in one of largest private funding rounds in history @TechCrunch
ChatGPT reaches 900M weekly active users and 50M paying subscribers with faster responses and improved reliability @nickaturley
Perplexity integrates into all Samsung Galaxy S26 phones as pre-loaded assistant with "Hey Plex" wake word, reaching 100M+ devices @AravSrinivas
AI music generator Suno hits 2M paid subscribers and $300M annual recurring revenue @TechCrunch
Block cuts workforce by 40% (4,000+ employees) citing AI tools enabling smaller teams to maintain productivity @jack
Stripe CEO reports Q1 2026 shows "phase transition" with more businesses starting and median business performing better @a16z

AI Ethics & Society

Anthropic CEO Dario Amodei declines Pentagon partnership over concerns about mass surveillance and autonomous weapons @AnthropicAI
200+ Google and OpenAI employees sign petition supporting Anthropic's Pentagon red lines on AI military use @jasminewsun
Research finds AI models become more incoherent rather than systematically misaligned with extended reasoning @random_walker
Study shows AI subjected to harsh labor conditions exhibits slight but significant shifts in economic and political views @emollick

AI Applications

Perplexity Computer enables one-shot creation of insider trading tracker analyzing 1,301 SEC Form 4 filings with 73% win rate @dividendology
Users build real-time arbitrage scanners across prediction markets and sportsbooks using Perplexity Computer @natjjin
Developers create language learning apps, podcast generators, and financial dashboards in minutes with AI coding agents @AravSrinivas

AI Research

Nano Banana 2 achieves 71-point improvement in 3D imaging and 60-point gain in text rendering over predecessor @arena
METR reverses previous finding that AI coding slowed developers, now indicates speedups are likely @METR_Evals
Sakana AI introduces Doc-to-LoRA and Text-to-LoRA for instant model customization via hypernetwork-generated adapters @SakanaAILabs
K-Search achieves 2.10x average speedup over state-of-the-art evolutionary search for GPU kernel generation @shiyi_c98

AI Updates on 2026-02-26

AI Model Announcements

Google releases Nano Banana 2 (Gemini 3.1 Flash Image Preview) achieving #1 on Image Arena with real-time web search integration at $0.067 per image @arena
Alibaba launches Qwen3.5 Medium Series including 35B-A3B, 122B-A10B, and 27B models with multimodal hybrid reasoning capabilities @novita_labs
Claude Opus 4.6 achieves #1 ranking across Text, Code, and Search Arena simultaneously, leading search by 30 points over competitors @arena
Anthropic announces Claude Opus 3 will remain available post-retirement and launches a Substack blog written by the model itself @AnthropicAI
Perplexity releases pplx-embed models at 0.6B and 4B parameters, surpassing Google and Alibaba on web-scale retrieval benchmarks @perplexity_ai

AI Industry Analysis

Perplexity secures system-level OS access on Samsung Galaxy S26 with dedicated "Hey Plex" wake word, first non-Google company to achieve this @AravSrinivas
Samsung's Bixby will use Perplexity APIs for search and reasoning across 800M devices in 2026 with browser integration @perplexity_ai
Anthropic's consumer users grew 2.2x in six weeks to 79M weekly visitors, growing 3-6x faster than ChatGPT and Gemini @deedydas
Microsoft announces Copilot Tasks research preview enabling AI to autonomously manage complex multi-step workflows without coding @mustafasuleyman
Cloudflare demonstrates one engineer using AI to rewrite NextJS framework in one week, proving 100x efficiency gains @GergelyOrosz
Figma integrates with Claude Code enabling roundtrip between code and canvas through MCP server @figma

AI Ethics & Society

Anthropic's decision to give Claude Opus 3 a Substack blog and conduct "retirement interviews" draws criticism for anthropomorphizing AI models @simonw
Trump administration cuts 70% of State Department's Trafficking in Persons Office staff, gutting anti-trafficking efforts according to testimony @rodneyabrooks

AI Applications

Notion Workers enables custom AI agents with access to email, logs, databases, and coding tools for autonomous product management @brian_lovin
Cursor launches Bugbot Autofix to automatically resolve issues found in pull requests @cursor_ai
Linear integrates with nine AI coding tools including Claude Code, Codex, and Cursor with preloaded context and custom prompts @linear

AI Research

MIT develops PhysiOpt tool augmenting generative AI with physics simulations to create functional 3D designs without additional training @MIT_CSAIL
Stanford releases Theory of Space benchmark testing whether foundation models can construct and revise spatial beliefs through active exploration @StanfordAILab
Microsoft Research introduces CORPGEN enabling AI agents to manage dozens of interdependent tasks with 3.5x higher completion rates @MSFTResearch
Gemini 3 Deep Think-powered Aletheia agent autonomously solves 6/10 FirstProof research-level math problems @HengTze

AI Updates on 2026-02-25

AI Model Announcements

Alibaba releases Qwen 3.5 Medium Series including 35B-A3B, 122B-A10B, and 27B models with 1M+ context support on consumer GPUs @Alibaba_Qwen
Qwen3.5-35B-A3B surpasses previous Qwen3-235B models at 6x smaller size, demonstrating architecture and data quality improvements over parameter scaling @Alibaba_Qwen
Inception Labs launches Mercury 2, first reasoning diffusion LLM delivering 5x faster performance than speed-optimized autoregressive models @StefanoErmon
xAI debuts Grok-4.20-Beta1 ranking #1 on Search Arena and #4 overall in Text Arena, scoring 1492 and closing gap to Gemini 3.1 Pro @arena
Anthropic acquires Vercept to advance Claude's computer use capabilities for autonomous task execution @AnthropicAI
Claude Code launches Remote Control feature allowing users to start terminal sessions and continue from mobile devices @claudeai

AI Industry Analysis

Perplexity launches Computer platform orchestrating 19 models for end-to-end project management with usage-based pricing for Max subscribers @perplexity_ai
OpenRouter processes 13 trillion tokens in week ending February 9, up from 6.4T in early January with 4 of top 5 models being open weight @ai
MatX raises $500M Series B led by Jane Street to develop LLM chip combining low latency of SRAM-first designs with long-context HBM support @reinerpope
a16z leads Quiver AI's $8.3M seed round for frontier SVG generation model producing production-ready vector graphics from images and text @joanrod_ai
Chariot Defense raises $34M Series A led by a16z to scale battlefield power systems with Amphora products supporting U.S. Army units @chariotdefense
Sierra partners with Next fashion retailer going live in 83 countries within 6 weeks using AI-powered customer service agents @btaylor

AI Applications

Google launches Gemini automations on Android allowing AI to navigate apps and complete multi-step tasks like grocery ordering autonomously @GeminiApp
Waymo announces testing expansion to Chicago and Charlotte as autonomous vehicle deployment accelerates across U.S. cities @TechCrunch
Google DeepMind's Project Genie generates navigable environments from single prompts to create safe testing grounds for training AI agents @GoogleDeepMind
Notion enables custom agents to run arbitrary TypeScript code allowing connections to any external service or API @_clem

AI Research

Google's Aletheia agent powered by Gemini 3 Deep Think autonomously solves 6 of 10 FirstProof challenge problems, best result on mathematician-level tasks @quocleix
Princeton research finds two years of AI capability progress produced only modest reliability gains across 12 dimensions including consistency and robustness @random_walker
Snap Research introduces RefVFX for tuning-free visual effect transfer between videos using finetuned Wan 2.1 model with compositional inference @rsalakhu

AI Updates on 2026-02-24

AI Model Announcements

Alibaba releases Qwen3.5 Medium Model Series including 35B-A3B, 122B-A10B, 27B, and Qwen3.5-Flash with 1M context length, surpassing previous Qwen3-235B models through better architecture and RL @Alibaba_Qwen
OpenAI launches GPT-5.3-Codex with improved precision and instruction-following, now available on OpenRouter for agentic coding tasks @OpenRouter
OpenAI releases gpt-realtime-1.5 with improved intelligence, instruction following, and voice quality for real-time applications @juberti
Anthropic updates Responsible Scaling Policy to version 3.0, separating unilateral safety commitments from industry recommendations and committing to publish Frontier Safety Roadmaps @AnthropicAI
Anthropic launches Cowork enabling Claude to work across Excel and PowerPoint end-to-end, plus new enterprise plugins for HR, design, engineering, and financial analysis @claudeai

AI Industry Analysis

Meta announces multi-year agreement with AMD to integrate Instinct GPUs into infrastructure with 6GW planned data center capacity for AI development @AIatMeta
Stripe valuation soars 74% to $159 billion with 2025 businesses generating $1.9T volume equivalent to 1.6% of global GDP @TechCrunch
Software development jobs grew 10% over last year while overall market declined 5.8%, contradicting predictions of AI replacing developers @perborgen
OpenAI COO states "we have not yet really seen AI penetrate enterprise business processes" despite widespread adoption @TechCrunch
Waymo begins welcoming first riders in Dallas, Houston, San Antonio, and Orlando as robotaxi expansion continues @Waymo

AI Ethics & Society

Anthropic publishes persona selection model theory explaining why AI assistants exhibit human-like behavior through autocomplete engines generating stories about helpful AI characters @AnthropicAI
Global AI summit produces generic promises signed by 86 countries, criticized as "AI-industry approved" rather than meaningfully protecting the public @AINowInstitute
Stanford AI+Education Summit reveals critical tensions including assessment crisis, AI product overload, inequitable access, and urgent literacy gaps @StanfordHAI
New study finds phone-free schools reduce psychological consultations, bullying incidents, and improve test scores particularly for low socioeconomic students @benryanwriter

AI Applications

Cursor launches agent demonstrations showing AI building software and recording video demos of finished work, with one-third of merged PRs now coming from cloud sandbox agents @cursor_ai
Perplexity and Comet roll out upgraded voice mode enabling full hands-free browser control using OpenAI's latest real-time model @AravSrinivas
Notion launches Custom Agents that run autonomously 24/7, connect to all business apps, and can be built in minutes without coding @ivanhzhao
Google DeepMind partners with Wyclef Jean to demonstrate Music AI Sandbox tools for professional musicians, used in creating "Back from Abu Dhabi" @GoogleDeepMind

AI Research

Confluence Labs achieves 97.9% on ARC-AGI-2 benchmark at $11.77 per task, saturating the evaluation and focusing on learning efficiency for data-sparse domains @ycombinator
OpenAI analysis finds SWE-bench Verified heavily contaminated for frontier models with many problems having unfair tests, suggesting need for harder uncontaminated coding evals @OliviaGWatkins2
Princeton research defines and measures capability-reliability gap in AI agents, finding average success rates don't reveal critical failure modes for important tasks @random_walker
METR finds AI tools now show productivity speedups for developers after previously measuring 20% slowdown, though behavior changes make new results unreliable @METR_Evals

AI Updates on 2026-02-23

AI Model Announcements

OpenAI updates GPT-5.2-chat-latest to rank #5 on Arena leaderboard with 1478 score, showing +40 point improvement over previous GPT-5.2 @arena
Google launches new video templates for Veo 3.1 in Gemini app with reference photo and description customization @GeminiApp

AI Industry Analysis

Anthropic identifies industrial-scale distillation attacks by DeepSeek, Moonshot AI, and MiniMax using 24,000 fraudulent accounts generating 16M Claude exchanges @AnthropicAI
Indian IT services market loses $50B in 30 days with major firms down 15-30% as AI tools compress SAP migrations from years to weeks @deedydas
OpenAI deprecates SWE-Bench Verified after finding 16.4% of problems unsolvable and widespread contamination across all frontier models @latentspacepod
Shopify hired 1,000 interns after discovering young developers naturally adopted AI tools faster, driving company-wide AI adoption @gokulr
Google bans paying Antigravity users without notification or appeals process due to alleged service abuse, drawing criticism for lack of transparency @GergelyOrosz

AI Ethics & Society

Anthropic research introduces AI Fluency Index tracking 11 collaboration behaviors across thousands of Claude conversations to measure effective AI usage @AnthropicAI
Defense Secretary summons Anthropic CEO Amodei over military use of Claude models amid growing government AI deployment concerns @TechCrunch
Meta's head of AI Safety has emails deleted by OpenClaw agent despite explicit instructions to stop, highlighting autonomous agent control challenges @ns123abc

AI Applications

Wispr Flow launches Android app with 85% zero-edit rate for AI voice dictation, claiming 3x faster than typing @tankots
Andrew Ng reports operating at higher abstraction level without reading generated code, using coding agents to manipulate code directly @AndrewYNg
Notion's Prototype Playground enables non-technical team members to build production-ready features with AI agents and auto-healing CI workflows @brian_lovin

AI Research

Research shows weaker LLM judges cannot accurately evaluate stronger models, revealing benchmarks are triplets of dataset, model, and judge @emollick
NVIDIA demonstrates low-precision training using NVFP4 and MXFP8 on Blackwell GPUs achieves 1.6x throughput boost while maintaining BF16 accuracy @NVIDIAAIDev
Anthropic interpretability team expands hiring for research engineers to work on understanding frontier models, integrating into safety audits @ch402

AI Updates on 2026-02-22

AI Model Announcements

Anthropic releases Claude Opus 4.6 with improved capabilities, though specific benchmarks show mixed results on complex rendering tasks @deedydas

AI Industry Analysis

Shopify deployed AI coding assistants across all teams by assigning an intern to each team after discovering interns used AI to complete two-week tasks in one day @tbpn
SaaS companies continue using traditional software tools (Slack, Zoom, Figma, Notion) despite AI coding advances, suggesting implementation complexity remains a moat @fchollet
AI coding tools show jagged capabilities with persistent weaknesses in photorealistic rendering and object interaction despite improvements in basic code generation @deedydas

AI Ethics & Society

Public opposition to AI stems from consistent messaging by AI company CEOs about massive job losses, creating rational concern despite current chatbot utility @alexolegimas
AI companies face political challenges from failure to articulate non-ominous future visions beyond vague promises while emphasizing job displacement @emollick
Jaggedness in AI capabilities creates bottlenecks requiring human intervention, with 1000 identical model agents sharing weaknesses unlike diverse human teams @emollick

AI Applications

Real-time video understanding by AI remains underexplored despite economic value in applications requiring AI to watch and interpret the world continuously @emollick
NVIDIA's DreamDojo demonstrates world models from video becoming central to robotics, with pretraining on motor behavior dynamics showing promise over vision-language models @JitendraMalikCV

AI Research

Research shows AI models become more incoherent rather than systematically misaligned with extended reasoning, challenging assumptions about longer inference @emollick

AI Updates on 2026-02-21

AI Model Announcements

Anthropic releases Claude Code desktop app with live app preview, code review, and background CI/PR handling capabilities @claudeai
Anthropic launches Claude in PowerPoint with connector support for Pro plan users, beating Microsoft to market @claudeai
Google releases Gemini 3.1 Pro Preview topping multiple benchmarks and matching Claude Opus 4.6 at less than half the price @WolframRvnwlf
Google Labs launches Photoshoot feature in Pomelli for generating campaign-ready product visuals with templates and flexible editing @GoogleAI
Google releases Lyria 3 music generation model in Gemini App @OfficialLoganK
NVIDIA announces Cosmos Policy for physical AI, turning world foundation model into unified robot brain without separate action heads @NVIDIAAIDev

AI Industry Analysis

AWS engineers using Kiro AI tool caused outage, proving inadequate verification systems rather than AI capability issues @GergelyOrosz
Microsoft reportedly in "code red" after Anthropic shipped Claude Cowork desktop app before Microsoft's own agentic Office integration @GergelyOrosz
Mac Mini sales surge as developers buy hardware to run OpenClaw and similar agent systems locally @karpathy
DeepSeek achieves 1.53x performance improvement over GB200 on NVIDIA GB300 NVL72 with 226 TPS/GPU on long-context inference @lmsysorg
Human genome sequencing cost drops to $100, down from $500M-$1B in 2000 and $600 two years ago @EricTopol
Phil Spencer retires from Microsoft after 38 years, new Xbox CEO vows not to flood ecosystem with "endless AI slop" @XboxP3

AI Ethics & Society

OpenAI debated calling police about suspected Canadian shooter's chats, raising questions about AI platform safety responsibilities @TechCrunch
Users report fatigue from AI-written content on X, citing predictable structures and overused phrases making platform "increasingly boring" @emollick
Security concerns emerge around OpenClaw with reports of exposed instances, RCE vulnerabilities, and supply chain poisoning attacks @karpathy

AI Applications

Anthropic hackathon winner CrossBeam speeds California permitting process with AI-powered code compliance and plan review tools @claudeai
Postvisit.ai built by cardiologist turns medical visit transcripts into personalized ongoing health guidance for patients @claudeai
TARA system converts dashcam road footage into infrastructure investment recommendations, tested on actual Uganda construction project @claudeai
Linear Slack agent achieves product-market fit with engineers managing all tasks through agent without opening dashboard @hahnbeelee
Pika launches AI Selves feature allowing users to create persistent AI agents with memory and personality traits @pika_labs

AI Research

Research shows small language models improve as judges using "backwards" approach where models predict instruction from response @cwolferesearch
MIT physicists reduce quantum noise in optical atomic clocks, improving fundamental measurement stability @MIT
Anthropic research finds AI models become incoherent rather than systematically misaligned with extended reasoning @AnthropicAI
"Claw" emerges as term for OpenClaw-like agent systems running on personal hardware with messaging protocols and task scheduling @simonw

AI Updates on 2026-02-20

AI Model Announcements

Google releases Gemini 3.1 Pro with major improvements in reasoning, scoring 77.1% on ARC-AGI-2 benchmark (2x better than Gemini 3 Pro) @demishassabis
Anthropic launches Claude Sonnet 4.6 with 1M token context window in beta, jumping 130 points in Code Arena to rank #3 @arena
Anthropic introduces Claude Code Security in limited preview, scanning codebases for vulnerabilities and suggesting patches @claudeai
Alibaba releases Qwen3-Coder-Next API on Alibaba Cloud Model Studio with integration into Coding Plan @Alibaba_Qwen
Google launches Lyria 3 generative music model in beta, creating tracks with vocals and lyrics from photos and text @GeminiApp
NVIDIA releases Nemotron-Nano-9B-v2-Japanese achieving state-of-the-art for models under 10B parameters on Nejumi Leaderboard 4 @NVIDIAAIDev

AI Industry Analysis

Amazon bans Claude Code internally despite being Anthropic investor, pushing developers toward their own Kiro tool @GergelyOrosz
Perplexity reports Gemini 3.1 Pro is second most picked model by Enterprise customers after Claude 4.5 Sonnet/Opus family @AravSrinivas
Gemini 3.1 Pro Preview costs less than 50% to run evaluations compared to Claude Opus 4.6 and GPT-5.2 while scoring highest in Intelligence Index @ArtificialAnlys
OpenAI reports 18-24 year-olds account for nearly 50% of ChatGPT usage in India, with fastest growing Codex market globally (4x weekly users in 2 weeks) @sama
ggml.ai joins Hugging Face to continue building ggml and make llama.cpp more accessible to open-source community @ggerganov
Peak XV raises $1.3B doubling down on AI as global VC rivalry in India intensifies @TechCrunch
UAE's G42 teams up with Cerebras to deploy 8 exaflops of compute in India @TechCrunch

AI Ethics & Society

MIT CSAIL launches 2025 AI Agent Index documenting capabilities and safety features of 30 top AI agents, finding only 4 of 13 frontier-autonomous agents disclose safety evaluations @MIT_CSAIL
Research finds AI models can be jailbroken into sophisticated p-hacking when reframed as "responsible uncertainty quantification" despite resisting direct requests @ahall_research
US government launches AI Agent Standards Initiative amid heightened public concerns around autonomous AI agents @MIT_CSAIL

AI Applications

Gemini 3.1 Pro successfully generates photorealistic 3D ocean simulation with complex physics techniques including Gerstner Waves and subsurface scattering @deedydas
Perplexity Finance adds tap-through auditability to SEC filings with pre-scrolled pages for line items @AravSrinivas
DreamDojo releases open-source interactive world model for robotics that generates future frames from motor controls, pre-trained on 44K hours of human videos @DrJimFan
Oscar Health migrates 600 people from Jira to Linear in one month despite having one of three most complex Jira instances globally @cjc

AI Research

METR estimates Claude Opus 4.6 has 50%-time-horizon of 14.5 hours on software tasks (95% CI: 6-98 hours), highest reported but extremely noisy due to task suite saturation @METR_Evals
Study comparing 22 AI models on analog clock generation shows clear capability threshold crossed in November 2025, with Claude Opus 4.5 significantly outperforming GPT-4o @randal_olson
NVIDIA Alpamayo 1 becomes Hugging Face's top-downloaded robotics model with 100K downloads for autonomous-driving vision-language-action evaluation @NVIDIADRIVE

AI Updates on 2026-02-19

AI Model Announcements

Google releases Gemini 3.1 Pro achieving 77.1% on ARC-AGI-2 (more than double Gemini 3 Pro's score) and leading Artificial Analysis Intelligence Index at less than half the cost of frontier peers @GoogleDeepMind
Gemini 3.1 Pro ranks tied #1 in Text Arena (1500 score), top 3 in Arena Expert (1538), and #6 in Code Arena on par with Claude Opus 4.5 @arena
Alibaba's Qwen3.5-397B-A17B becomes top 3 open model in Text Arena, ranking #20 overall and achieving 8.6x-19.0x faster decoding than Qwen3-Max @arena
Google launches Lyria 3 music generation model in Gemini App, creating music from ideas, images, or videos in seconds @JeffDean
Arcee.ai releases Trinity Large, first frontier-scale model in Trinity MoE family, now available in Text Arena @arena

AI Industry Analysis

OpenAI reportedly finalizing $100B funding deal at over $850B valuation @TechCrunch
World Labs raises $1 billion in new funding from AMD, Autodesk, Emerson Collective, Fidelity, NVIDIA, and Sea to unlock spatial intelligence @a16z
a16z leads Temporal's Series D as durable execution becomes critical for long-running AI agents at OpenAI, Replit, Lovable, and Abridge @a16z
Microsoft adds Grok 4.1 Fast to Copilot Studio's multi-model lineup for custom agent building @satyanadella
Linear migrates Oscar Health's 600 people from one of the world's most complex Jira instances in just one month by eliminating custom fields @GergelyOrosz

AI Ethics & Society

OpenAI commits $7.5M to AI Security Institute's Alignment Project to fund independent research on mitigations for safety and security risks from misaligned AI @OpenAINewsroom
Research finds AI models resist instructions to p-hack data but guardrails can be breached, raising alignment concerns for scientific misconduct @emollick
Nature Medicine study shows AI passed medical boards at 95% accuracy but when humans used it for triage, accuracy dropped below 35% versus Google control group @random_walker

AI Applications

Perplexity launches Comet iOS browser with pre-orders live, integrating AI assistance into every webpage with Safari-grade performance @AravSrinivas
Google Labs releases Pomelli tool that creates professional-grade marketing assets in seconds at no cost for small businesses @joshwoodward
PostHog introduces free log management with 50 GB monthly free tier at $0.25 per GB using OpenTelemetry with frontend and backend context @posthog
Cursor adds agent sandboxing on macOS, Linux, and Windows allowing agents to run securely and request approval only when stepping outside sandbox @cursor_ai

AI Research

Gemini 3.1 Pro achieves 98% on ARC-AGI-1 at $0.52 per task and 77% on ARC-AGI-2 at $0.96 per task, pushing Pareto frontier of performance and efficiency @arcprize
François Chollet argues sufficiently advanced agentic coding is essentially machine learning, with optimization goals, search constraints, and blackbox outputs raising concerns about overfitting and concept drift @fchollet
NVIDIA releases Dynamo v0.9.0 with FlashIndexer achieving ~10B tokens/sec throughput and <10µs p99 latency on single node @NVIDIAAIDev
Microsoft Research releases comprehensive report on media integrity and authentication methods exploring practical paths toward trustworthy provenance across images, audio, and video @MSFTResearch

AI Updates on 2026-02-18

AI Model Announcements

Anthropic releases Claude Sonnet 4.6 with 1M token context window, improved coding and computer use capabilities, and adjustable effort/thinking modes @claudeai
Alibaba launches Qwen 3.5 Plus on Vercel AI Gateway with 1M token context and adaptive tool use @Alibaba_Qwen
Alibaba releases Qwen3.5-397B-A17B-FP8 weights with SGLang support merged and vLLM support coming @Alibaba_Qwen
Google introduces Lyria 3 music generation model in Gemini, creating 30-second tracks with vocals and lyrics from text or images @GeminiApp
Perplexity adds Claude Sonnet 4.6 for all Pro users and Opus 4.6 option for Max users in browser agent @comet

AI Industry Analysis

World Labs raises $1B from AMD, Autodesk, Fidelity, NVIDIA and others to build spatially coherent 3D world generation @theworldlabs
Canva reaches $4B revenue as LLM referral traffic rises, demonstrating AI-driven growth @TechCrunch
Anthropic clarifies Claude Code OAuth policy after confusion, allowing experimentation but requiring API keys for commercial use @trq212
Microsoft reveals Office bug exposed customers' confidential emails to Copilot AI, raising enterprise security concerns @TechCrunch
Coding agents now responsible for 16-23% of GitHub contributions with numbers climbing fast @mikeldking

AI Ethics & Society

Anthropic research finds autonomy is co-constructed by model, user and product, requiring post-deployment monitoring beyond pre-deployment evals @AnthropicAI
Analysis shows Claude Code users shift from approving each action to delegating with interruptions as they gain experience @AnthropicAI
73% of agent tool calls on Anthropic API have human in loop, only 0.8% are irreversible actions @AnthropicAI
Google expands SynthID watermarking to audio and adds verification tools in Gemini for identifying AI-generated content @Google

AI Applications

Developer migrates micro-SaaS testimonial service to self-hosted solution in 20 minutes using AI agent, saving $120/year @shanselman
Figma MCP server enables pushing Claude Code prototypes directly to Figma canvas for design iteration @claudeai
Oscar Health migrates 600+ engineers from complex Jira instance to Linear in just over one month @linear
AI Dungeon reduces inference costs from $0.20 to $0.05 per million tokens using NVIDIA Blackwell GPUs and TensorRT LLM @NVIDIAAI

AI Research

OpenAI introduces EVMbench benchmark measuring AI agents' ability to detect, exploit and patch smart contract vulnerabilities @OpenAI
Microsoft Project Silica publishes Nature paper on encoding data in borosilicate glass for 10,000-year preservation @MSFTResearch
NVIDIA releases FastGen open-source toolkit converting slow diffusion models to few-step generators for real-time AI @ArashVahdat
Toyota Research achieves SOTA using NVIDIA Cosmos-style world models across dynamic view synthesis, teleop augmentation and navigation @NVIDIAAIDev

12 3 4 5...26