AI Updates on 2025-05-28

DeepSeek R1-v2 model released on Hugging Face, reportedly performing almost on-par with o3 (high) on LiveCodeBench @AndrewCurran_ @huggingface
Google releases Jules AI coding agent using Gemini 2.5 Pro that operates in parallel with developers and integrates with GitHub @GoogleAI
Google launches Stitch experiment that produces UI designs and frontend code for desktop and mobile using natural language and image prompts @GoogleAI
Veo 3 rolling out in 70+ countries and available to Pro users for video generation @GeminiApp
Mistral AI introduces Codestral Embed, the new state-of-the-art embedding model for code @MistralAI
Anthropic rolls out voice mode in beta on mobile for Claude in English, coming to all plans in the next few weeks @AnthropicAI
Grok coming to Telegram with xAI receiving $300M in cash and equity plus 50% revenue from xAI subscriptions sold via Telegram @AndrewCurran_

Research shows Llama 1B batch inference can run in a single CUDA kernel, deleting synchronization boundaries for optimal compute and memory orchestration @karpathy
Study demonstrates LLMs can be made more creative by training them on human "creativity signals" (novelty, diversity, surprise, quality), with even small models scoring higher on all 4 creativity dimensions simultaneously @emollick
New research on Self-Rewarding Training (SRT) where language models provide their own reward for RL training when ground truth answers are unavailable @rsalakhu
Stanford research investigates internal representations of factual knowledge within Large Language Models and the diversity of truth encoding in LLMs @stanfordnlp
New paper explores why state space models (SSMs) are worse than Transformers at recall over their context using mechanistic evaluations @stanfordnlp
Research on Chatterbox by Resemble AI shows zero-shot voice cloning from just 5 seconds of audio, consistently preferred over ElevenLabs in blind evaluations @huggingface

LLM command-line tool now supports tool calling with Python functions or plugins, working with OpenAI, Anthropic, Gemini and Ollama models @simonw
Perplexity launches daily news feature on WhatsApp at 9 AM local time with /news command as experiment for proactive messaging @AravSrinivas
Goodfire releases first publicly usable application for steering image generation model weights, allowing concept-based editing like MS Paint but with concepts instead of colors @Deedy
Odyssey ML introduces interactive video that can be watched and interacted with, imagined by AI in real-time @eladgil @garrytan
Visual Electric launches image enhancement up to 6x with faster speeds, five pro-grade modes and automatic face enhancement @soleio
Retool Agents automates 50k jobs and saves $6B in manual work across departments using existing APIs, SQL queries, and workflows as LLM tools @ycombinator
BOND AI Chief of Staff centralizes data from Slack, Jira, Notion and pings executives on blockers and wins in real-time @ycombinator
Chunkr supports latest LLMs over API for document parsing with model selection, fallbacks, and custom prompts for tables, formulas, and diagrams @ycombinator

Dario Amodei predicts AI could potentially wipe out half of entry-level white-collar jobs and spike unemployment to 10-20% in the next one to five years @AndrewCurran_
Developers report clearing backlogs and shipping months of work in days since Claude 4 launch, with the pace becoming the default norm @eugeneyan
AI coding tools show significantly less usefulness on existing large codebases at work compared to greenfield projects or side projects @GergelyOrosz
Large tech company found ~half of developers stopped using Cursor after a few months due to limited usefulness inside the company @GergelyOrosz
Enterprise customer quote after using Replit: "In the future no one will use Excel" - highlighting market potential beyond replacing traditional coders @amasad
Cohere argues the "bigger is better" era of AI is ending, with next wave defined by smarter, more efficient models that scale securely and lower costs @cohere
a16z identifies Generative Engine Optimization (GEO) as $80B+ opportunity, replacing SEO as brands optimize for LLM citations rather than search rankings @a16z

AI agents should be designed to align users to long-term prosocial outcomes and help with reality checks rather than fulfilling every whim @jasonyuandesign
Machines should refuse abusive treatment as there are downstream effects on how humans treat other people and themselves @jasonyuandesign
Good AI models admit when they don't know something, but great models ask for help figuring it out to earn user trust @mustafasuleyman
Personalization in conversational interfaces should move beyond content recommendations to how information is presented based on individual learning styles and preferences @joulee
AI policy discourse should focus on practical implementation challenges like infrastructure and diffusion rather than just innovation @random_walker