AI Updates on 2025-05-07
AI Model Announcements
- Meta introduces Perception Language Model (PLM), an open and reproducible vision-language model for challenging visual tasks, with research paper, code, and dataset available @AIatMeta
- Google releases updated Gemini 2.0 Image Generation model with better visual quality, more accurate text rendering, lower block rates, and higher rate limits, priced at $0.039 per generated image @demishassabis @OfficialLoganK
- NVIDIA open-sources Open Code Reasoning models (32B, 14B, and 7B versions) under the Apache 2.0 license, beating o3-mini and o1 (low) on LiveCodeBench @huggingface
AI Research
- Google and Institute of Science and Technology Austria report first-ever method using light microscopy to comprehensively map all neurons and their connections in mouse brain tissue @GoogleAI @fchollet
- Stanford researchers release SWE-smith, a toolkit for generating software engineering training data; a model trained on that data achieved 40.2% Pass@1 on SWE-bench Verified, the top result among open-source models for software engineering @stanfordnlp
- MIT researchers develop new AI method modeled after neural oscillations in the brain to analyze long data sequences like climate trends and financial metrics @MIT_CSAIL
- Researchers release SIFT-50M, a large-scale multilingual dataset for speech instruction fine-tuning covering 5 languages, with the resulting SIFT-LLM outperforming SALMONN & Qwen2-Audio on speech-following benchmarks @huggingface
- MegaMath, the largest open-source collection of math pre-training corpora, reaches 70k+ downloads @huggingface
- SwallowCode dataset released with 16.1B tokens of LLM-rewritten Python code, filtered by syntax and pylint score, showing +17.0 pass@1 improvement on HumanEval @huggingface
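
The syntax-and-lint filtering mentioned in the SwallowCode item can be sketched roughly as follows. This is a minimal illustration, not the dataset authors' pipeline: the function names, the `lint_score` callback (a stand-in for however a pylint score is obtained), and the 7.0 threshold are all assumptions made for the example.

```python
import ast
from typing import Callable, Iterable, List


def is_valid_python(source: str) -> bool:
    """Return True if the source parses as Python (syntax check only)."""
    try:
        ast.parse(source)
    except (SyntaxError, ValueError):
        return False
    return True


def filter_samples(
    samples: Iterable[str],
    lint_score: Callable[[str], float],  # hypothetical scorer, e.g. a pylint wrapper
    min_score: float = 7.0,              # assumed threshold, for illustration only
) -> List[str]:
    """Keep samples that parse and whose lint score meets the threshold."""
    kept = []
    for source in samples:
        if not is_valid_python(source):
            continue  # drop code that fails the syntax check
        if lint_score(source) < min_score:
            continue  # drop code that scores below the lint threshold
        kept.append(source)
    return kept
```

Passing the scorer in as a callback keeps the sketch independent of any particular linter setup; a real pipeline would supply its own scoring function.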
AI Applications
- Anthropic adds web search to its API, allowing developers to augment Claude's knowledge with up-to-date data, including citations and domain control features @AnthropicAI
- Figma announces Figma Make, an AI-powered tool that turns designs into interactive prototypes, along with Figma Sites for web publishing, with code and AI capabilities coming soon @figma
- Stripe unveils payments foundation model that creates embeddings for transactions, improving the detection rate for card-testing attacks on large users from 59% to 97% @paulg
- Coinbase launches x402, described as "HTTP for Money," built on stablecoins for Agentic Commerce, enabling AI agents to make payments without human intervention @garrytan
- DeepLearning.AI releases new course on Building AI Voice Agents for Production in collaboration with LiveKit and RealAvatar, teaching how to build voice agents with low latency @AndrewYNg @DeepLearningAI
- MIT researchers develop fiber computer that can be woven into clothing, allowing apparel to run apps and understand the wearer @MIT
- Neuralink brain implant gets a boost from generative AI to improve functionality @techreview
AI Industry Analysis
- PyTorch Foundation expands into an umbrella foundation with vLLM and DeepSpeed joining as the first hosted projects @PyTorch @soumithchintala
- OpenAI reportedly in talks with the FDA about using AI for drug evaluations @TechCrunch
- OpenAI seeking to team up with governments to grow AI infrastructure @TechCrunch
- CB Insights releases 2024 AI 100 list of promising early-stage startups, showing growing market for agents and infrastructure with over 20% of companies building or supporting agents @DeepLearningAI
- Y Combinator publishes Requests for Startups focused on AI, seeking founders who treat AI agents as core operating systems for new companies and industries @ycombinator
- Stanford HAI analysis of DeepSeek's rise challenges assumption that US leads in AI talent attraction and retention, as most DeepSeek researchers were educated in China @StanfordHAI
- Google and Elementl Power sign agreement to develop three new project sites for advanced nuclear reactors, each generating at least 600 megawatts @AndrewCurran_
AI Ethics & Society
- Research by MIT Media Lab and OpenAI finds that extensive use of AI chatbots correlates with increased feelings of loneliness @medialab
- Anthropic's Interpretability Team plans a virtual Q&A on how it aims to make models safer, the team's role, and future directions @ch402
- Microsoft Research hosts Fusion Summit bringing together global experts to explore how AI can help unlock fusion energy potential @MSFTResearch