AI Updates on 2025-05-07

AI Model Announcements

  • Meta introduces Perception Language Model (PLM), an open and reproducible vision-language model for challenging visual tasks, with research paper, code, and dataset available @AIatMeta
  • Google releases updated Gemini 2.0 Image Generation model with better visual quality, more accurate text rendering, lower block rates, higher rate limits, and $0.039 per image generated @demishassabis @OfficialLoganK
  • NVIDIA open sources Open Code Reasoning models (32B, 14B, 7B versions) under Apache 2.0 license, beating o3 mini & o1 (low) on LiveCodeBench @huggingface

AI Research

  • Google and Institute of Science and Technology Austria report first-ever method using light microscopy to comprehensively map all neurons and their connections in mouse brain tissue @GoogleAI @fchollet
  • Stanford researchers release SWE-smith, a toolkit for generating software engineering training data that achieved 40.2% Pass@1 on SWE-bench Verified, making it the top open-source model for software engineering @stanfordnlp
  • MIT researchers develop new AI method modeled after neural oscillations in the brain to analyze long data sequences like climate trends and financial metrics @MIT_CSAIL
  • Researchers release SIFT-50M, a large-scale multilingual dataset for speech instruction fine-tuning covering 5 languages, with the resulting SIFT-LLM outperforming SALMONN & Qwen2-Audio on speech-following benchmarks @huggingface
  • MegaMath, the largest open-source math pre-training corpora collection, reaches 70k+ downloads @huggingface
  • SwallowCode dataset released with 16.1B tokens of LLM-rewritten Python code, filtered by syntax and pylint score, showing +17.0 pass@1 improvement on HumanEval @huggingface

AI Applications

  • Anthropic adds web search to their API, allowing developers to augment Claude's knowledge with up-to-date data, including citations and domain control features @AnthropicAI
  • Figma announces Figma Make, an AI-powered tool that turns designs into interactive prototypes, along with Figma Sites for web publishing with code and AI capabilities coming soon @figma
  • Stripe unveils payments foundation model that creates embeddings for transactions, improving fraud detection from 59% to 97% for card-testing attacks on large users @paulg
  • Coinbase launches x402, described as "HTTP for Money," built on stablecoins for Agentic Commerce, enabling AI agents to make payments without human intervention @garrytan
  • DeepLearning.AI releases new course on Building AI Voice Agents for Production in collaboration with LiveKit and RealAvatar, teaching how to build voice agents with low latency @AndrewYNg @DeepLearningAI
  • MIT researchers develop fiber computer that can be woven into clothing, allowing apparel to run apps and understand the wearer @MIT
  • Neuralink brain implant gets a boost from generative AI to improve functionality @techreview

AI Industry Analysis

  • PyTorch Foundation expands into an umbrella foundation with vLLM and DeepSpeed joining as the first hosted projects @PyTorch @soumithchintala
  • OpenAI reportedly discussing with FDA about using AI for drug evaluations @TechCrunch
  • OpenAI seeking to team up with governments to grow AI infrastructure @TechCrunch
  • CB Insights releases 2024 AI 100 list of promising early-stage startups, showing growing market for agents and infrastructure with over 20% of companies building or supporting agents @DeepLearningAI
  • Y Combinator publishes Requests for Startups focused on AI, seeking founders who treat AI agents as core operating systems for new companies and industries @ycombinator
  • Stanford HAI analysis of DeepSeek's rise challenges assumption that US leads in AI talent attraction and retention, as most DeepSeek researchers were educated in China @StanfordHAI
  • Google and Elementl Power sign agreement to develop three new project sites for advanced nuclear reactors, each generating at least 600 megawatts @AndrewCurran_

AI Ethics & Society

  • Research by MIT Media Lab and OpenAI finds that extensive use of AI chatbots correlates with increased feelings of loneliness @medialab
  • Anthropic Interpretability Team planning virtual Q&A about how they plan to make models safer, the role of the team, and future directions @ch402
  • Microsoft Research hosts Fusion Summit bringing together global experts to explore how AI can help unlock fusion energy potential @MSFTResearch