AI Updates on 2025-05-25

Anthropic has released Claude 4 with both Opus and Sonnet variants, featuring improved capabilities and reduced reward hacking according to their system card @natolambert

Sean Heelan used an LLM CLI tool to help identify a remote zero-day vulnerability in the Linux kernel @simonw
The Claude 4 System Card (120 pages) provides extensive documentation on model capabilities and limitations, including sections on "opportunistic blackmail" @simonw
Anthropic's system prompts for Claude 4 Opus and Sonnet have minimal differences despite being separate models @simonw

Veo 3 demonstrates strong capabilities in creating fictional product reviews with YouTube-style presentations @emollick
Veo 3 can compose music based on genre, tone and lyrics descriptions @AndrewCurran_
A Shopify developer used Claude 4 Opus with Claude Code to execute an 84-file refactor in their open-source Roast framework @_catwu
Chiron is building an iPad app that understands math as it's written, using symbolic logic to track thinking in real-time for AI tutoring @ycombinator
Claude 4 features include "deep dive" functionality that classifies complex queries and makes multiple search tool calls @simonw
Claude Artifacts functionality is detailed in the hidden system prompt, including the full list of libraries it can load @simonw

Feature requests for Claude include 1M context window, memory, larger output token window, more file formats, more tool calls per request, and improved vision capabilities @deedydas
AI tools for coding are good at recreating what they've been trained on but won't create the next generation of frameworks, libraries, or technologies @GergelyOrosz
The software world may split between companies relying heavily on AI (potentially accumulating "AI tech debt") and those investing in best-in-class developers @GergelyOrosz
AI companies are paying higher base salaries for developers while barely using AI to write their own code, as they need innovative, best-in-class software @GergelyOrosz
The UX for long-running AI Agents will be one of the most interesting design questions in coming years, focusing on meta elements of managing their work @garrytan
Audio appears to be a significant part of OpenAI's consumer strategy, potentially for their new device @amasad
Infrastructure engineering may be the team that modern startups can distribute most effectively, because its requirements are knowable and its system changes are deliberate @amasad

A database has documented 116 cases from 12 countries where lawyers have cited hallucinated legal cases generated by AI, with 20 instances occurring this month alone @simonw
The fact that advanced AI frequently makes mistakes or fabricates information remains unintuitive to most new users @simonw
AI will democratize access to skill, similar to how the internet democratized access to information @vkhosla
The future may be difficult to visualize because AI will significantly expand and alter our senses and perceptions @AndrewCurran_
Some nations may eventually subsidize AI model subscriptions for their citizens, with Middle Eastern nations potentially being first @AndrewCurran_