AI News Feed
Never miss a breakthrough. Stay informed about the latest multimodal AI developments, product launches, and industry movements.
Never miss a breakthrough. Stay informed about the latest multimodal AI developments, product launches, and industry movements.

Anthropic announces significant improvements to Claude's ability to control computers, with accuracy jumping from 75% to 92% on standard benchmarks. The update includes better mouse precision and multi-monitor support.

After months of limited beta testing, OpenAI's computer control agent Operator is now available to all Plus and Pro subscribers. The tool can browse the web, fill forms, and complete multi-step tasks autonomously.

Google DeepMind reveals Gemini 2.5 Pro, featuring built-in agentic capabilities for task completion, code execution, and real-time multimodal understanding. Early benchmarks show it surpassing GPT-4o on several tasks.

ByteDance launches computer control capabilities for its Doubao AI assistant, making it the first Chinese AI to offer Claude-like desktop automation. The feature is available free to all users in mainland China.

AI video generation startup Runway closes a massive funding round led by Nvidia and Google. The company plans to expand its Gen-3 model capabilities and launch enterprise-focused tools for film production.

Microsoft unveils Copilot Vision, a new feature that allows the AI assistant to see and understand your screen in real-time, similar to Gemini Live. Rolling out to Windows Insider users first.