AI Updates on 2026-02-02

AI Model Announcements

  • xAI releases Grok Imagine 1.0, featuring 10-second video generation, 720p resolution, dramatically improved audio with emotional and expressive voices, and enhanced prompt following capabilities. The model tops Artificial Analysis benchmarks and has generated 1.245 billion videos in the last 30 days @xai
  • OpenAI launches Codex app for macOS, a command center for building with agents that enables parallel multitasking with worktrees, reusable skills, and scheduled automations. The app includes doubled rate limits across all tiers from Free to Enterprise @OpenAI
  • Google DeepMind adds Werewolf, Poker, and updated Chess results to Kaggle Game Arena, testing AI models on contextual communication, building consensus, and navigating ambiguity. Latest Gemini 3 models top the chess leaderboard @GoogleDeepMind
  • Cohere Command A Vision and Command A Reasoning now available through OCI Generative AI, enabling multimodal apps, agentic workflows, and reasoning-driven systems with enterprise security and EU region availability @OracleCloud

AI Industry Analysis

  • OpenAI's Codex team reports the tool now builds itself with team supervision, with the bottleneck shifting to how fast humans can help and supervise the outcome rather than development speed @thsottiaux
  • Linear adds more net new revenue in January 2026 alone than in their entire first three years combined, demonstrating how consistent acceleration enables compounding growth @cjc
  • Companies are renaming 2-pizza teams to 1-pizza teams as AI makes large teams unnecessary and slows things down, with teams getting smaller across most organizations @GergelyOrosz
  • University of Waterloo's co-op program produces standout new grads with far more real-world experience at good companies than most universities, making it a goto hiring source for CTOs and founders @GergelyOrosz
  • Ben Horowitz explains AI has eliminated the Mythical Man Month limitation in tech, as companies can now throw data and GPUs at problems to solve them, unlike traditional software development where team size was constrained @a16z
  • Goldman Sachs CEO notes the four largest companies contributed 1% to GDP growth with $400 billion of spending, with this potentially being the biggest M&A year in history @a16z
  • OpenAI partners with Snowflake to expand enterprise AI capabilities, signaling intensifying competition in the enterprise AI race @AndrewCurran_
  • Anthropic partners with The Allen Institute and Howard Hughes Medical Institute for research collaboration @AndrewCurran_

AI Ethics & Society

  • Coalition demands federal Grok ban over nonconsensual sexual content generation, raising concerns about AI-generated harmful content @TechCrunch
  • Ben Horowitz argues AI regulation should focus on applications rather than the technology itself, stating "Don't regulate math. Regulate the applications of that math" and warning that banning technology has hundred-year implications @a16z
  • Ethan Mollick demonstrates AI-generated videos have reached quality levels where distinguishing them from real content is extremely difficult, with examples of playing as characters in famous paintings and WWI battlecruiser simulations @emollick
  • Concerns emerge about AI-generated content on social media, with high-quality viral essays being entirely AI-written but presented as emotional truths, making it difficult to distinguish human from AI authorship @emollick
  • Marc Andreessen argues the world will be better off with more Einstein-level intelligence, stating existing AI models test around 130-140 IQ and will reach 160+ levels, comparing this to releasing limitations of human biology @a16z

AI Applications

  • Google's AI tools DeepVariant and DeepPolisher help researchers sequence genomes for endangered species, compressing what once took years into days. Genomes of 13 species are now freely available, with plans to scale to 150+ more species @sundarpichai
  • Carbon Robotics builds an AI model that detects and identifies plants for agricultural applications @TechCrunch
  • Linq raises $20M to enable AI assistants to live within messaging apps, expanding AI integration into communication platforms @TechCrunch
  • Claire Vo builds an infinite generative sci-fi story with 42 characters powered by Vercel AI gateway and workflows, demonstrating agent-to-agent communication and emergent narratives @clairevo
  • Reid Robinson demonstrates using MCPs to automate meeting prep, CRM updates, and customer feedback synthesis, showing practical PM workflows with Zapier's MCP server and Claude Projects @clairevo
  • PyTorch demonstrates unlocking advanced reasoning in Llama 8B through full fine-tuning on NVIDIA's DGX Spark AI-PC, using synthetic data and chain-of-thought prompts entirely offline with 128GB unified memory @PyTorch
  • Meta launches Oakley Meta Performance AI glasses with hands-free camera, Meta AI, and open-ear audio for athletic training applications @Meta

AI Research

  • Google DeepMind researchers use Gemini to systematically evaluate 700 open conjectures in the Erdős Problems database, addressing 13 problems marked as open with 5 novel autonomous solutions and identifying 8 existing solutions missed by previous literature @quocleix
  • Research demonstrates that even older GPT-4 could be prompted to generate more diverse and higher quality ideas than most people, with newer models performing better, challenging arguments that AI is poor at idea generation @emollick
  • Arvind Narayanan explains agentic coding works well because it's a type of neurosymbolic AI that fuses statistical LLMs with symbolic code execution, leveraging verifiable domains, compilers, shell tools, and recursive LLM-code interactions @random_walker
  • Phase 3 trial shows lung cancer patients treated with immunotherapy in the morning had better overall survival than those treated in the afternoon, demonstrating the immune system's circadian rhythm affects treatment outcomes @PatrickHeizer
  • Google DeepMind launches harder benchmarks for AI models through Kaggle Game Arena with werewolf, poker, and chess, providing objective measures of real-world skills like planning and decision making under uncertainty that auto-scale difficulty as models improve @demishassabis