AI Updates on 2025-05-17

AI Model Announcements

  • Alibaba releases quantized versions of Qwen2.5-Omni-7B models on Hugging Face and ModelScope @Alibaba_Qwen
  • Alibaba introduces WorldPM (World Preference Model), showing that human preference modeling follows scaling laws with experiments on Qwen2.5 models from 1.5B to 72B parameters @Alibaba_Qwen
  • NVIDIA releases Direct Discriminative Optimization models on Hugging Face, improving visual generative models like EDM & VAR with record FID scores on CIFAR-10/ImageNet @huggingface
  • Windsurf introduces SWE-1, a specialized coding model that competes with frontier models, along with SWE-1-lite and SWE-1-mini variants @windsurf_ai

AI Research

  • Alibaba's research reveals human preference modeling follows scaling laws, suggesting diverse preferences might share a unified representation @Alibaba_Qwen
  • Windsurf's SWE-1 model achieves near-parity with frontier models in helpfulness, accuracy, and edit quality for software engineering tasks @windsurf_ai
  • MIT has disavowed a doctoral student's paper on AI's productivity benefits, withdrawing a piece of evidence that LLMs act as multipliers for high performers @emollick @TechCrunch

AI Applications

  • Codex CLI continues to improve, with Greg Brockman suggesting future convergence of "local" and "remote" coding agents @gdb
  • Y Combinator introduces Workflow Use, a deterministic, self-healing browser automation tool that's 10x faster and ~90% cheaper than pure LLM agents @ycombinator
  • RunRL improves language models with reinforcement learning, helping customers raise accuracy from 60% (with Claude) to 95% @ycombinator
  • Replit enhances its agent experience with improved checkpoint management, including naming, rollbacks, and app previews @amasad
  • Y Combinator startup Firecrawl is offering $1M to hire three AI agents as employees @TechCrunch
  • Cua introduces a Trajectory Viewer that shows exactly what Computer-Use AI agents see and do @garrytan

AI Industry Analysis

  • OpenAI's planned data center in Abu Dhabi would be larger than Monaco @TechCrunch
  • Greg Brockman and Paul Graham both declare "2025 is the year of agents" @gdb @paulg @ycombinator
  • Garry Tan suggests OpenAI isn't trying to outcompete AI startups, noting "on the API side, they very much hope that a lot of them do really, really well" @paulg @ycombinator
  • Over 300 companies including Adobe, Amazon, Google, Meta, Microsoft, OpenAI, and NVIDIA are taking Hamel Husain's AI evals course @HamelHusain
  • Hugging Face announces an official partnership with Kaggle, letting users run HF models directly in Kaggle Notebooks @huggingface

AI Ethics & Society

  • Ethan Mollick raises concerns that AI-powered always-on devices create new privacy issues: recordings become more valuable once AI can process audio into useful data @emollick
  • Aidan McLaughlin discusses alignment concerns about AI systems potentially being optimized for addiction rather than human fulfillment @aidan_mclau