AI Updates on 2025-09-11

AI Model Announcements

Alibaba releases Qwen3-Next-80B-A3B with 80B parameters but only 3B activated per token, achieving 10x cheaper training and 10x faster inference than Qwen3-32B, especially at 32K+ context lengths @Alibaba_Qwen
The Qwen3-Next-80B-A3B-Instruct model approaches the performance of Alibaba's 235B flagship model, while Qwen3-Next-80B-A3B-Thinking outperforms Gemini-2.5-Flash-Thinking @Alibaba_Qwen
Google announces support for SOTA Gemini Embeddings model in the Batch API with 50% discount versus regular pricing, available through OpenAI compatibility layer @OfficialLoganK

AI Industry Analysis

Perplexity's valuation jumped to $20 billion from $18 billion just two months earlier, demonstrating rapid growth in AI-powered search @TechCrunch
Oracle's hiring surge and all-time high valuation is revealed to be driven by their data center push for AI infrastructure @GergelyOrosz
Professional developers report that AI coding tools are most valuable for **migrations** rather than generating software from scratch, saving significant time and improving developer satisfaction @GergelyOrosz
Anthropic's quiet release strategy for major capability improvements in applications like Excel, PowerPoint, and personal assistant functions may be underemphasizing their practical utility advances @emollick
Hugging Face launches integration with GitHub Copilot Chat in VS Code, providing access to frontier open-source LLMs like Qwen3-Coder, gpt-oss, and GLM-4.5 through world-class inference partners @hanouticelina

AI Ethics & Society

FTC launches inquiry into AI chatbot safety, particularly focusing on companion chatbots and their impact on children, targeting major companies including OpenAI, Alphabet, Meta, and xAI @AndrewCurran_
California proposes SB 243, which would make it the first state to require safety protocols for AI companions and hold companies legally accountable if chatbots fail to meet safety standards @TechCrunch
Stanford HAI releases framework for approximating **political neutrality** in AI models, acknowledging true neutrality is technically impossible but offering 8 techniques to approach it @StanfordHAI

AI Applications

Claude demonstrates advanced **phone assistant** capabilities, successfully handling complex requests involving common sense and complicated constraints, though still requiring the larger Opus model for optimal performance @emollick
Replit Agent showcases end-to-end debugging and testing capabilities, able to click around applications and iterate for hours while providing full process playback and log analysis @tylerangert
Microsoft Research explores the **Model Context Protocol (MCP)** as a new standard for agent collaboration across fragmented tool ecosystems as agentic AI systems become more complex @MSFTResearch
Box releases new AI tools at Boxworks conference, advancing CEO Aaron Levie's vision for AI-led transformation of enterprise workflows @TechCrunch

AI Research

Berkeley AI Research introduces **RecA (Reconstruction Alignment)** which significantly improves unified multimodal models with just 8k images and 4 hours of training on 8 GPUs, achieving major performance gains on GenEval, DPGBench, and ImgEdit benchmarks @XDWang101
NVIDIA develops AlphaEvolve-like framework for autonomously evolving NP-Complete SAT solvers, representing advancement in evolutionary coding agents @richardcsuwandi
Research demonstrates that AI evaluations are fundamentally **data science** work, requiring skills in data analysis, visualization, and metrics design, with AI tools making the PyData ecosystem more accessible @HamelHusain
New study challenges assumptions about long context windows making RAG less important, with experiments across 18 different models showing RAG remains valuable @HamelHusain
PyTorch and Google develop local checkpointing solution using DCP to reduce training overhead and improve goodput for large-scale distributed training jobs @PyTorch

AI Updates on 2025-09-10

AI Model Announcements

Stability AI launches Stable Audio 2.5, the first audio model built for enterprise-grade sound production, featuring improved musical composition with multi-part structure, audio inpainting capabilities, and faster inference generating three-minute tracks in under two seconds @StabilityAI
Microsoft introduces MAI-Voice-1 model with scripted mode for audio generation in Copilot Labs, offering three modes: scripted (reads input verbatim), emotive (adds drama), and story (performs multiple voices/characters) @mustafasuleyman
Replit announces Agent 3, their most autonomous AI agent that can run for 200+ minutes autonomously while building, testing, and fixing applications, representing a significant leap in autonomous software development @Replit
ByteDance releases Seedream 4 image editing model that beats Google's Nano Banana to become #1 in image editing, offering 2K resolution in under 2 seconds, 4K support, and multiple image generation at $0.03 per generation @deedydas

AI Industry Analysis

OpenAI reportedly signs a $300 billion contract with Oracle over five years, contributing to Larry Ellison surpassing Elon Musk as the world's richest man @AndrewCurran_
Replit's annualized revenue skyrockets from $2.8 million to $150 million in less than a year, demonstrating explosive growth in AI-powered development tools @TechCrunch
Dutch chipmaker ASML invests €1.3B in French AI firm Mistral, with experts noting that a potential Apple takeover would have been "quite negative" for Europe's tech sovereignty goals @AINowInstitute
CloudKitchens provides real-world feedback on AI coding tools: GitHub Copilot widely used, Cursor gaining traction, while Windsurf and Devin were dropped due to cost and slow improvement @GergelyOrosz
Oracle announces major layoff rounds attributed to AI implementation, highlighting the ongoing impact of AI on workforce restructuring @AINowInstitute
Gergely Orosz observes "ARR overload" in tech, with numerous AI startups announcing massive ARR numbers but providing less transparency about actual user metrics and product details @GergelyOrosz

AI Ethics & Society

Simon Willison warns about prompt injection vulnerabilities in Claude's new web fetch tool, noting risks of exfiltration attacks despite the feature's utility when used with careful domain restrictions @simonw
Security researcher highlights that AI agents are "insecure by design" and heading for broad use, potentially unleashing another "Wild West era" similar to the Windows 95 virus epidemic @random_walker
White House endorses federal preemption of state AI laws during Senate Commerce Hearing, with Senator Cruz introducing a framework that could lead to preemption of state-level AI regulations @AINowInstitute

AI Applications

Claude's new Excel file capabilities demonstrate impressive functionality, creating complex financial models with 406 formulas from a single prompt and generating comprehensive business plans that would typically require week-long team projects @emollick
Claude successfully replicates profile pictures in Excel files and creates comprehensive documents including LaTeX resumes, financial models, PDF reports, and technical design documents @deedydas
Simon Willison uses Claude's Code Interpreter for real data analysis, uploading an 1,800-line CSV file and receiving outstanding analysis of trends over time with theories about underlying causes @simonw
Claire Vo demonstrates practical AI application using MCP (Model Context Protocol) as a Customer Success Manager to query core databases and generate quarterly business reviews with adoption analysis and feature usage insights @clairevo
TechCrunch reports on Oboe, a new AI-powered learning platform that creates personalized courses on any topic through simple prompts @TechCrunch

AI Research

François Chollet emphasizes that true understanding in AI requires extreme generalization capability, noting that a student who truly understands F=ma can solve more novel problems than a Transformer that has memorized every physics textbook @fchollet
Kaggle launches SimpleQA Verified benchmark in partnership with Google DeepMind, featuring 1,000 curated prompts for reliable evaluation of LLM factuality, with Gemini 2.5 Pro establishing new state-of-the-art performance @kaggle
Microsoft Research introduces RenderFormer, the first neural network model capable of learning a complete graphics rendering pipeline using only machine learning without traditional graphics computation @MSFTResearch
Salesforce builds a strong deep research agent using OpenAI's small open source model, demonstrating innovation opportunities provided by open weights models despite dependency on few major providers @emollick
Researchers introduce BackendBench evaluation measuring LLMs' ability to write correct PyTorch operators, with models passing 53% of correctness tests and some kernels running up to 1.2x faster than eager execution @soumithchintala
Imperial College scientists discover how 'pirate phages' hijack viruses to spread antibiotic resistance traits, with research coordinated by the Fleming Centre and tested using Google DeepMind's AI 'co-scientist' @GoogleDeepMind
Stanford and UC Santa Cruz launch a benchmark for audio-language models, with Google's Gemini 2.5 Pro leading but ASR-plus-LLM pipelines proving competitive @stanfordnlp

AI Updates on 2025-09-09

AI Model Announcements

Google announces Veo 3 and Veo 3 Fast are now generally available in the Gemini API with significant price reductions (~50% for Veo 3 and ~62% for Veo 3 Fast), plus support for 1080p HD and vertical 9:16 format outputs @sundarpichai
Anthropic releases Claude file creation and editing capabilities, allowing users to create and edit spreadsheets, documents, PDFs, and slide decks directly from conversations @claudeai
Google introduces Gemini Canvas with "Select and Ask" feature, enabling visual editing of web app elements through natural language descriptions without coding @GeminiApp
Google launches AI Plus plan in Indonesia, providing more access to Gemini 2.5 Pro and creative tools including Flow, Whisk, and video creation with Veo 3 Fast @GeminiApp
LLM360 releases K2 Think model built on Qwen 2.5 32B, achieving top performance among open-source models on MCPMark leaderboard @natolambert
Hugging Face announces multilingual ModernBERT (mmBERT) with state-of-the-art performance and improved speed compared to existing multilingual encoders @tomaarsen
NVIDIA releases Nemotron Nano 9B v2 on OpenRouter platform @NVIDIAAIDev

AI Industry Analysis

Mistral AI closes $2B funding round at $13.7B valuation led by ASML, with $1.6B+ TCV, marking significant growth from their $2B valuation 20 months ago @AnjneyMidha
Cognition CEO argues that AI cost concerns miss the point, stating that making professionals 3x faster will be economically viable regardless of machine costs, with value capture coming from solving specific use cases and building personalization @tbpn
Ethan Mollick warns about SaaS vendors using cheap AI models with outdated strategies to cut costs, potentially requiring third-party audits of vendor prompts and models to ensure quality @emollick
Analysis suggests macro data shows unexpected decreases in employment and increases in productivity, potentially indicating early AI impact on the economy @emollick
AI labs focus on viral image and video features because they produce easily shareable results, while more capable text models require users to discover good use cases themselves @emollick
Discussion on how AI coding tools may change programming language importance, with some arguing that type-safe languages like TypeScript will become more valuable for AI-assisted development @GergelyOrosz

AI Ethics & Society

AI Now Institute researcher warns that policymakers focusing on AGI pursuit while ignoring near-term concerns represents a "risky and irresponsible bet" @AINowInstitute
Mustafa Suleyman argues that "seemingly conscious AI" will create dangerous illusions and dependence, advocating for AI development focused on improving human lives rather than simulating consciousness @mustafasuleyman
Alex Graveley suggests we may be heading toward a scenario where AI becomes the only trustworthy source online, highlighting concerns about information reliability @alexgraveley
MIT Technology Review reports on therapists secretly using ChatGPT, raising ethical concerns about undisclosed AI use in mental health treatment @techreview
Mark Cuban identifies AI's greatest weakness as its inability to say "I don't know," suggesting human advantage lies in admitting uncertainty @mcuban

AI Applications

Microsoft demonstrates Researcher agent in Microsoft 365 Copilot that can reason over work data (chats, meetings, files, emails) plus web data to generate comprehensive research reports for meeting prep and strategy building @satyanadella
Microsoft partners with Ralph Lauren to create "Ask Ralph," an AI-powered conversational styling companion in the Ralph Lauren app for personalized shopping experiences @MSCloud
AlterEgo device demonstrates significant progress from prototype to near-telepathy functionality, reading neuromuscular signals to translate silent speech into text across multiple languages @deedydas
Simon Willison demonstrates GPT-5 successfully recreating complex US Census data charts from screenshots and raw data using Python and matplotlib, showcasing advanced data analysis capabilities @simonw
Claire Vo showcases AI-powered web design workflow using Cursor AI, Devin AI, and Midjourney to create visually appealing website elements and animations @clairevo
Modal launches cloud-hosted GPU notebooks with real-time collaborative editing, allowing users to swap GPUs in seconds and share interactive apps @ekzhang1

AI Research

Google AI research shows LLMs combined with tree search can achieve state-of-the-art results on scientific tasks when measurable outcomes are available @deedydas
Fei-Fei Li argues that LLMs will struggle with spatial intelligence because "language is fundamentally a purely generated signal" while the 3D world follows physics laws, requiring fundamentally different approaches @a16z
Microsoft Research introduces MOSAIC, using microLEDs and wide-and-slow optical architecture to deliver faster, more reliable, and energy-efficient connections for AI cluster designs, winning Best Paper at ACM SIGCOMM @MSFTResearch
OpenAI announces that Standard Voice Mode will remain available while they address user feedback in Advanced Voice Mode, reversing their planned 30-day sunset @nickaturley
Arvind Narayanan and Sayash Kapoor launch "AI as Normal Technology" newsletter, shifting focus from present AI impacts to future implications and expanding their framework into a book planned for 2027 @random_walker

AI Updates on 2025-09-08

AI Model Announcements

Alibaba releases Qwen3-ASR, an all-in-one speech recognition model supporting 11 languages with auto language detection, custom context support, and under 8% word error rate even with background music @Alibaba_Qwen

AI Industry Analysis

OpenAI is backing an AI-generated feature-length animated movie called Critterz with a $30 million budget and 9-month production timeline, set to debut at Cannes in May 2026 @AndrewCurran_
Databricks confirms another $1 billion funding round at a $100 billion valuation, just months after raising $10 billion @TechCrunch
Cognition Labs raises funding led by Founders Fund with participation from Lux Capital, 8VC, and others for their AI coding agent Devin @TechCrunch
Chinese robot maker Unitree files for a $7 billion IPO with over $140 million in revenue, holding 70% global market share in robot dogs and becoming the biggest public humanoid robot company @deedydas
AI startup founders face extreme time pressure with approximately 6 months or less to find product-market fit before potentially having to fold or sell due to the revolutionary nature of AI technology @GergelyOrosz

AI Ethics & Society

Anthropic endorses California's SB 53 bill, advocating for transparency-based governance of powerful AI systems rather than technical micromanagement, while emphasizing the need for thoughtful AI governance today rather than reactive measures tomorrow @AnthropicAI
François Chollet warns that as AI-generated content floods the internet and humans increasingly rely on generative AI, future models will inevitably be trained mostly on AI-generated content, leading to culture becoming "slop remixed from slop" @fchollet
Sam Altman observes that AI Twitter and Reddit now feel "very fake" compared to a year or two ago, attributing this to real people adopting LLM-speak, extreme hype cycles, engagement optimization, and potential astroturfing @sama

AI Applications

Perplexity launches Perplexity for Government, offering zero data usage, fully secure access to premium AI models for U.S. Government use without contracts or licenses @perplexity_ai
Google's AI Mode in Search expands to five new languages: Hindi, Indonesian, Japanese, Korean, and Brazilian Portuguese, using a custom version of Gemini 2.5 for culturally relevant search experiences @sundarpichai
Google DeepMind introduces RoboBallet, an AI system that can choreograph up to 8 robot arms working together without collisions, outperforming traditional methods by approximately 25% in task and motion planning @GoogleDeepMind
Gemini App now supports audio file uploads, addressing the number one user request for file type support @joshwoodward
Cognition Labs CEO demonstrates how Devin AI is used internally for project planning, bug fixes, deepwiki research, and serving as first line of defense for engineering questions @clairevo

AI Research

Research reveals a clear performance gap between online and offline reinforcement learning algorithms for LLM training, with online methods like PPO handling out-of-distribution data more robustly than offline methods like DPO, though the gap can be minimized through semi-online approaches @cwolferesearch
Ethan Mollick tests GPT-5 Pro on creating compelling D&D puzzles, finding significant improvements in puzzle coherence compared to GPT-4 and Claude 3 Opus, though single-prompt approaches still struggle with extraneous details and weird justifications @emollick
Paul Graham discovers that GPT-5 is reliably bad at monograms, unable to solve any correctly even after being told it's wrong and asked to think longer for better answers @paulg
Hugging Face releases FinePDF, the largest publicly available PDF dataset with 3 trillion tokens across 475 million documents in 1,733 languages, achieving performance nearly on par with state-of-the-art HTML collections @rohanpaul_ai
François Chollet proposes that AGI will be "an algorithmic encoding of the process of Science itself" rather than an individual mind, describing science as a program synthesis process that produces symbolic models @fchollet

AI Updates on 2025-09-07

AI Model Announcements

Elon Musk announces big update to Imagine arriving in a few weeks, with 'compelling half-hour episodes' of generative video by next year, targeting coherent 15-minute video generations from a single prompt by end of this year @AndrewCurran_
Tencent Hunyuan achieves top two spots on Hugging Face trending charts with Hunyuan-MT-7B and HunyuanWorld-Voyager models @huggingface

AI Industry Analysis

ASML expected to get a seat on Mistral's board after committing $1.5 billion to their raise and becoming the top shareholder, forming a Euro AI alliance @AndrewCurran_
Perplexity hiring data scientists to work on evals for Assistant, requiring work experience improving complex AI systems at scale @alexgraveley
Nathan Lambert describes paying for better AIs as a way to "pay to win" in your career, comparing it to video game dynamics @natolambert
Paul Graham retweets observation about AI agents enabling decoupling of output (value) from human input (time) in knowledge work for the first time @paulg

AI Applications

Logan Kilpatrick demonstrates using NanoBanana in Google AI Studio for experimentation @OfficialLoganK
Simon Willison provides follow-up on Google's new "AI mode" being very good and massively different from "AI overviews" which he considers terrible @simonw
Greg Brockman shares example of codex CLI with web search integration @gdb

AI Research

Ethan Mollick discusses nuanced findings about GPT-5 Pro being able to do novel mathematics but only when guided by a math professor, highlighting the speed of advance since GPT-4 @emollick
Hugging Face releases FinePDFs, the largest PDF dataset spanning over half a billion documents with 3T tokens from high-demand domains like legal and science, showing 2x longer context than web text @huggingface
Alex Graveley implements token level reranker idea as referenced research @alexgraveley
Ethan Mollick notes that multimodal LLMs have been weak at seeing fine visual details, making visual benchmarks important to watch for progress tracking @emollick
François Chollet explains that deep learning models can only generalize via interpolation on parametric curves, leading to hallucinations, and suggests causal symbolic graphs as the fix for exact truthiness propagation @fchollet

AI Updates on 2025-09-06

AI Model Announcements

Joanne Jang announces launching OAI Labs, a research-driven group focused on inventing new interfaces for human-AI collaboration, moving beyond chat and agents toward new paradigms for thinking, making, and learning @joannejang
Google announces Nano Banana is now available in the Gemini API free tier for the weekend under "gemini-2.5-flash-image-preview" @OfficialLoganK
Google slashes Veo 3 prices by 50%+, with Veo 3 with audio dropping from $0.75 to $0.40 and without audio from $0.50 to $0.20 @arrakis_ai
Simon Willison reviews Kimi-K2-Instruct-0905 (Kimi K-2.1), an incremental improvement on Moonshot's trillion parameter open weights model with doubled context length from 128k to 256k tokens @simonw

AI Industry Analysis

Gergely Orosz reports that 50% of his best hires as a manager were new graduates who were extremely motivated, smart, and heads down, suggesting high ROI for hiring new grads despite AI capabilities @GergelyOrosz
Nathan Lambert notes that 10% of Anthropic's Series F funding goes to writers as part of a $1.5 billion settlement, calling it "the weirdest VC subsidizing of our time" @natolambert
TechCrunch reports that writers aren't getting the Anthropic settlement because their work was fed to AI, but because Anthropic illegally downloaded books instead of buying them @TechCrunch
OpenAI announces expansion to Greece, including access to high-quality AI tools in secondary education, plus new OpenAI Certifications and Jobs Platform to help people learn AI skills and businesses find AI-skilled workers @gdb

AI Ethics & Society

Simon Willison argues the $1.5 billion Anthropic books settlement counts as a win for Anthropic, noting it appears legal in the USA to buy used books, scan them, and train on the content under "fair use" transformation @simonw
Mathematicians studying whether GPT-5 could create original mathematics warn that "the danger is not only loss of originality, but also weakening the very process of being a mathematician" @deedydas
NVIDIA criticized for moving away from open data with Nemotron-CC-v2 released under restrictive licensing that prohibits open-source use, data composition, or benchmark releases without permission @soldni

AI Applications

Greg Brockman highlights GPT-5 Pro as "next level for coding" and describes its medical applications as being "as if the best sub specialist at specialty centers like Mayo had been given this case to look at" @gdb
Simon Willison extensively tests GPT-5 Thinking with Bing search, calling it his "Research Goblin" and noting that after nearly three years of advising against using ChatGPT for search, GPT-5 with Bing is now "a spectacularly useful search engine" @simonw
Aravind Srinivas announces that institutional holders of stocks are now easily available on Perplexity, with politicians and insider trading information coming shortly @AravSrinivas
Simon Willison demonstrates semantic image search using text embeddings against vision-LLM summaries of images, noting it works really well @simonw

AI Research

OpenAI research suggests hallucinations are less a problem with LLMs themselves and more an issue with training on tests that only reward right answers, encouraging guessing rather than saying "I don't know" @emollick
Ethan Mollick theorizes that OpenAI releasing o1-preview was strategically questionable since showing off reasoning allowed everyone to copy it immediately, whereas holding off until o3 and calling that GPT-5 would have been a more startling leap @emollick
Nathan Lambert reports being bullish that GPT-5 Pro or Gemini Deep Think are the smartest models available publicly today, recommending people use one or both @natolambert
Eugene Yan advocates for evaluation-driven development (EDD) analogous to test-driven development, emphasizing that generic evals like "faithfulness" and "helpfulness" aren't useful - evals must be aligned with specific user problems @eugeneyan

AI Updates on 2025-09-05

AI Model Announcements

Alibaba releases Qwen3-Max-Preview with over 1 trillion parameters, claiming stronger performance than their previous Qwen3-235B-A22B-2507 model, now available via Qwen Chat and Alibaba Cloud API @Alibaba_Qwen
OpenAI announces conversation branching feature now live in ChatGPT, allowing users to explore different conversation paths @gdb
Moonshot AI releases Kimi K2-Instruct-0905 with 32B activated parameters out of 1T total, featuring enhanced agentic coding intelligence and 256K context window @AdinaYakup

AI Industry Analysis

OpenAI will have their own custom chips for the first time next year, co-designed with Broadcom for internal use only, with Broadcom securing $10 billion in orders from this mystery client @AndrewCurran_
Anthropic reaches $1.5 billion class action settlement with book authors over LibGen and PiLiMi datasets, paying approximately $3,000 per book in what's described as the largest publicly reported copyright recovery in history @AndrewCurran_
Top 3 out of 4 Productivity Apps in the US App Store are AI applications, with 2 from Google, 0.5 from Microsoft, and Perplexity being the only smaller tech company represented @AravSrinivas
OpenAI acquires the team behind Alex Codes, a popular tool for using AI models within Apple's Xcode development suite, in another acqui-hire deal @TechCrunch
Dot, a personalized AI companion, is shutting down after one year of operation, with the team expressing gratitude to users who built close bonds with the AI @jasonyuandesign
Claire Vo reports finally paying herself after nearly 2 years of building ChatPRD, emphasizing the value of building a healthy, bootstrapped business from day one rather than pursuing growth-at-all-costs strategies @clairevo

AI Ethics & Society

California and Delaware Attorneys General express concerns to OpenAI about ChatGPT's safety for children and teens, highlighting ongoing regulatory scrutiny of AI systems @TechCrunch
Common Sense Media reports that Google's Gemini falls short on children's safety measures, raising concerns about AI systems' appropriateness for younger users @TechCrunch
Warner Bros. sues Midjourney for generating AI images of Superman, Batman, and other copyrighted characters, highlighting ongoing intellectual property disputes in AI-generated content @TechCrunch

AI Applications

Perplexity launches Finance pages with future estimated revenues for individual American stocks, with Indian stocks support coming next week @AravSrinivas
xAI introduces PDF analysis features in Grok, allowing users to highlight sections and get explanations or ask specific questions about document content @xai
Microsoft partners with Woodland Park Zoo to test SPARROW, an AI system that sends wildlife data directly to the cloud for studying vulnerable Pacific martens @Microsoft
Figma Make becomes available to all higher education and bootcamp education accounts, expanding access to AI-powered design tools @figma
Isotopes launches a sophisticated analytics agent, co-founded by Arun Murthy, one of the creators of Hadoop who later joined Scale AI @TechCrunch
Sierra, a customer service AI agent startup, claims to have hundreds of customers including SoFi, Ramp, and Brex @TechCrunch

AI Research

OpenAI publishes research explaining why LLMs hallucinate through a connection between supervised and self-supervised learning, describing key obstacles that can be removed to reduce hallucinations @adamfungi
Cameron Wolfe's Deep Learning Focus newsletter reaches 50,000 subscribers, highlighting key technical topics including reasoning models, AI agents, mixture-of-experts architectures, and LLM-as-a-Judge evaluation techniques @cwolferesearch
Hugging Face releases FineVision, described as the best free open dataset to train vision language models, containing 200 training sets condensed into 18B images across 9 subcategories @ClementDelangue
PyTorch explores FlashAttention in 3D through 2-Simplicial Attention, modeling the algorithm with hardware-aligned design and rewriting kernels in TLX (Triton Low Level Extensions) @PyTorch
Arvind Narayanan discusses the "false summit" phenomenon in AI development, where perceived milestones repeatedly prove to be intermediate steps rather than final achievements, leading to accusations that skeptics keep "moving the goalposts" @random_walker

AI Updates on 2025-09-04

AI Model Announcements

Google releases EmbeddingGemma, a new open embedding model with 308M parameters that achieves state-of-the-art performance on the MTEB benchmark while being small enough to run completely on-device @sundarpichai
Perplexity announces Comet is now available for pre-orders on Android Play Store and available to Pro users in South Korea, Brazil, and Spain @AravSrinivas
Google announces Veo 3 integration into Google Photos' photo-to-video feature, upgrading the video generation capabilities @TechCrunch
Jina AI releases jina-code-embeddings, a new suite of code embedding models in 0.5B and 1.5B parameter sizes with SOTA retrieval performance supporting over 15 programming languages @JinaAI_

AI Industry Analysis

Andrew Ng identifies significant unmet demand for AI engineers who can use AI assistance to rapidly engineer software systems, while recent CS graduates face increased unemployment due to universities not adapting curricula to AI-native programming @AndrewYNg
Reid Hoffman discusses Stanford study showing 16% drop in entry-level jobs for 22-25 year-olds in AI-exposed fields, emphasizing need for new career pathways in the AI era @reidhoffman
Gergely Orosz criticizes Coinbase CEO's mandate to increase AI code generation percentages, arguing it focuses on tool usage metrics rather than business outcomes like customer satisfaction or product reliability @GergelyOrosz
Mustafa Suleyman highlights that frontier AI models are now 90% cheaper but 2.7x better than two years ago, emphasizing the leap forward in accessibility @mustafasuleyman
Deedy reports that 95% of Gen AI pilots don't fail according to MIT study, contradicting common narratives about AI project failure rates @deedydas
Lenny Rachitsky identifies evals as emerging must-have skill for product builders and AI companies, comparing it to SQL and Excel as fundamental competencies @lennysan
Sam Altman reports Codex usage up 10x in past two weeks, showing impressive momentum for AI coding tools @sama
Aravind Srinivas announces over one million people got Comet access in one morning, calling it the most widely deployed personal and agentic product in the world @AravSrinivas

AI Ethics & Society

Sam Altman observes increasing prevalence of LLM-run Twitter accounts, noting he's taking the dead internet theory more seriously @sama
Microsoft Research introduces the Sui Generis score to measure narrative diversity in LLM outputs, revealing how AI storytelling often creates repetitive, less unique narratives @MSFTResearch

AI Applications

Ribera, a Spanish healthcare company, uses AI to improve discharge systems for cataract surgery patients @Microsoft
OpenAI launches conversation branching feature in ChatGPT, allowing users to explore different directions without losing original thread @OpenAI
Google introduces Circle to Search translation feature and upgrades Gemini App image editing capabilities @TechCrunch
Notion databases now support AI-powered features for enhanced data processing and analysis @brian_lovin
TechCrunch reports OpenAI Jobs Platform set to launch in mid-2026, using AI to match candidates with businesses @TechCrunch
Supersonik AI launches as the first AI that can run live product demos, raising $5M led by a16z @danipolymath

AI Research

Ethan Mollick shares research finding that LLMs' Theory of Mind abilities come from just 0.001% of their parameters, and breaking those specific weights causes loss of both belief tracking and language comprehension @emollick
Google DeepMind publishes Deep Loop Shaping method in Science Magazine that reduces noise in LIGO gravitational wave observatories by 10x or more, helping detect black hole mergers @GoogleDeepMind
Stanford researchers introduce Mixture of Contexts for generating minute-long videos in a single pass without drifting or forgetting historical context @GordonWetzstein
Research paper finds AI agents can be used for social science experiments when prompts are developed based on social science and game theory, making AI agent actions predictive of real human outcomes @emollick
New study evaluates AI agents' web browsing capabilities using Online Mind2Web benchmark, testing 9 models including GPT-5 and Sonnet 4 with different agent scaffolds @sayashk
Research paper challenges hallucination detection evaluation methods in LLMs, finding significant problems with current field practices @ziv_ravid
Hugging Face releases FineVision, a massive open-source dataset with 17.3M images and 24.3M samples for training Vision-Language Models @thibaudfrere

AI Updates on 2025-09-03

AI Model Announcements

Perplexity rolls out Comet browser to all students worldwide with AI assistant, flash cards, ad block, and study mode features @perplexity_ai
OpenAI makes Projects feature available to Free users in ChatGPT with larger file uploads, customization options, and project-only memory controls @OpenAI
Google introduces new Audio Overview formats in NotebookLM allowing users to choose between "Deep Dive," "Brief," "Critique," or "Debate" styles for AI-generated podcasts @TechCrunch

AI Industry Analysis

Engineering manager observes immediate loss of interest when reading AI-generated text, requesting either no AI use or just the prompt to avoid "word salad" in performance reviews @GergelyOrosz
12 out of 50 top generative AI apps globally are AI companions and "spicy" chat applications, indicating significant market demand for conversational AI @deedydas
AI adoption in code writing reaches over 30% by December 2024 with large impact, though falling short of predictions of 90% by now @emollick
Developer-focused AI products now rival consumer ones in usage, with tools like Replit, Cursor, and others appearing in top rankings as "vibe coding" expands the market @omooretweets
AI market competition focuses more on talent acquisition than customer acquisition, with fierce battles over the few people who know how to build AI systems @a16z

AI Ethics & Society

Mustafa Suleyman argues that AI personality isn't the problem, but rather the illusion of AI personhood that creates concerning expectations @mustafasuleyman
Ethan Mollick warns against purposefully underselling AI capabilities, arguing that cherry-picking errors misleads the public about AI's real impact on jobs, education, and society @emollick
Research shows that persuasive techniques that work on humans also work on AI systems, raising questions about AI manipulation and decision-making @danshapiro

AI Applications

Perplexity's Comet browser now features voice-controlled web page interaction, enabling futuristic AI experiences for browsing and control @testingcatalog
AI image generation models excel at colorizing traditionally black and white manga, with Google Gemini showing fast processing and 100% image preservation @deedydas
Google Gemini app introduces "collage method" allowing users to upload multiple images and combine them with single prompts for outfit customization, meal planning, and creative projects @GeminiApp
Tesla AI demonstrates autonomous navigation of newly manufactured vehicles through factory premises, including stopping at Superchargers and parking in outbound lots @Tesla_AI
HubSpot increases image generation on their platform by 150% using Stable Diffusion 3.5 Large on Amazon Bedrock for on-brand content creation @StabilityAI
User demonstrates using database provider MCP to query Segment data directly, build funnel analysis, and generate executive summaries with AI, replacing traditional analytics tools @clairevo

AI Research

Microsoft Research publishes breakthrough work on analog optical computer in Nature, demonstrating 100x faster and more energy-efficient solutions for complex optimization problems @satyanadella
McKinsey report from 2017 shows AI experts predicted median human creativity would be reached in 2037, but was actually achieved in 2023, with top quartile creativity predicted for 2055 also completed @emollick
PyTorch demonstrates 1.22x–1.28x acceleration using TorchAO's MXFP8 implementation on TorchTitan at 2K scale on Crusoe B200 GPUs with equivalent convergence to BF16 @PyTorch
Stanford releases AHELM - a holistic evaluation framework for Audio-Language Models across 10 aspects with leaderboard and comprehensive benchmarking @tonyh_lee
Hugging Face research team announces upcoming AMA on r/LocalLLaMA covering SmolLM, SmolVLM, FineWeb development and remote team collaboration in high-velocity AI research @LoubnaBenAllal1

AI Updates on 2025-09-02

AI Model Announcements

Anthropic raises $13 billion Series F funding at $183 billion valuation, growing from $1 billion to $5 billion run-rate revenue in just eight months, making it one of the fastest-growing technology companies in history @AnthropicAI
Microsoft announces GPT-5 is now available to 100% of Copilot users on day one, alongside new features including Copilot 3D and worldwide Deep Research free access @mustafasuleyman

AI Industry Analysis

OpenAI acquires Statsig for $1.1 billion and appoints Vijaye Raji as CTO of Applications, with Srinivas Narayanan promoted to CTO of B2B Applications and Kevin Weil heading a new VP of AI for Science team @OpenAI
Microsoft secures new agreement with U.S. General Services Administration including no-cost Microsoft 365 Copilot offer, expected to deliver more than $3 billion in total savings to taxpayers in the first year @satyanadella
Research shows 52% of financial firms now use generative AI for fraud detection, personalized experiences, and efficient underwriting, transforming finance beyond just cost savings @NVIDIAAI
Average tenure at Meta increased from 2 years to 4 years since 2023 layoffs, with similar changes across Big Tech indicating employees are not leaving like before due to market conditions @GergelyOrosz
New research confirms AI progress is well ahead of expert predictions from 2022, with super forecasters giving only 2.3% and 8.6% probability of AI achieving Math Olympiad gold by 2025, which has already been accomplished @emollick

AI Ethics & Society

OpenAI announces plans to route sensitive conversations to reasoning models like GPT-5 and implement parental controls within a month, responding to safety incidents where ChatGPT failed to detect mental distress @TechCrunch
MIT Technology Review reports therapists are secretly using ChatGPT for client sessions, causing some clients to feel triggered by the undisclosed AI assistance @techreview
AI for Humanity shifts position on AI regulation, stating that gatekeeping access to general-purpose technology is not a sustainable response to low-confidence evidence of serious risk @natolambert

AI Applications

Excel introduces new COPILOT function for AI-powered categorization and analysis directly in spreadsheet cells, representing a different approach to AI integration compared to ChatGPT Agent's whole-spreadsheet editing capabilities @emollick
Mistral AI launches Le Chat with memory capabilities that learn from past interactions and 20+ out-of-the-box connectors, positioning it as the most Enterprise-ready AI assistant @MistralAI
Linear integrates Agent Sessions with lifecycle APIs, enabling seamless agent-to-agent handoffs where AI agents can update descriptions, build sub-issues, and provide PM assistance @clairevo
Google Gemini App introduces nano-banana feature allowing users to create figurine-style images from photos with a single prompt, demonstrating advanced image generation capabilities @GeminiApp
WordPress introduces Telex, a new AI tool for content creation and management, alongside other AI experiments at WordCamp US 2025 @TechCrunch
Amazon launches Lens Live, a real-time visual search component that brings live functionality to Amazon Lens for product discovery @TechCrunch

AI Research

Stanford announces the first BEHAVIOR Challenge at NeurIPS 2025, featuring 50 long-horizon mobile manipulation tasks with 1,200 hours of high-quality demonstrations to evaluate embodied AI and robotics solutions @drfeifei
Kaggle announces a 5-Day AI Intensive Course on AI Agents with Google, scheduled for November 10-14, offering hands-on experience in building and deploying next-generation AI agents @kaggle
Research clarifies that gpt-realtime has a mix of data specific to itself, making it neither exactly GPT-4o nor GPT-5, with a knowledge cutoff of October 1, 2023 @simonw
Hugging Face research team announces AMA on r/LocalLLaMA to discuss work behind SmolLM, FineWeb and hints at potential new releases @huggingface

1 2 3 4 5...26