Anthropic acquires OpenAI to steal talent: What is the "Silicon Valley double threat" planning?

robot
Abstract generation in progress

Beijing Time, February 26 — American AI leader Anthropic announced the acquisition of visual-driven computer automation AI startup Vercept, filling the visual gap in the Computer Use feature. This is Anthropic’s second acquisition after acquiring full-stack toolchain startup Bun in December 2025.

Computer Use is a core capability developed by Anthropic for its large model Claude, enabling direct AI control of computers. It allows Claude to “see the screen, move the mouse, type on the keyboard, and operate software” like a human, completing multi-step, cross-application complex tasks. It was released alongside Claude 3.5 Sonnet in October 2024.

However, early in its development, the model faced bottlenecks in visual understanding accuracy, complex interface recognition, and dynamic scene processing, resulting in low operation success rates and task failures. Vercept’s expertise in high-precision UI recognition, spatial reasoning, dynamic interface tracking, and low-latency visual processing can help address these shortcomings.

According to Anthropic, developers are increasingly using Claude to handle complex tasks, including writing and running entire repositories, synthesizing research from dozens of sources, and managing workflows across multiple tools and teams. Computer Use enables Claude to operate in real-time within various software, completing multi-step tasks that are difficult to accomplish with code alone. The acquisition of Vercept will further enhance these capabilities.

Vercept is headquartered in Seattle, USA, founded by alumni of the Allen Institute for AI (AI2). It positions itself as a vision-first AI agent, focusing on “screen reading and computer control without APIs.” Its core team specializes in robotics, embodied intelligence, AI agents, and reinforcement learning.

Based on its technological background and founding team, Vercept completed a $16 million seed round in January 2025, with a valuation of approximately $67 million. Investors include former Google CEO Eric Schmidt, DeepMind Chief Scientist Jeff Dean, Cruise founder Kyle Vogt, among others. Vercept has raised a total of about $50 million in subsequent funding.

Anthropic believes Vercept’s expertise in perception and interaction can be directly applied to some of the challenging problems it is tackling. Vercept’s desktop application Vy will be shut down within 30 days. As a transition, Vercept encourages users to try Anthropic’s Claude tools as alternatives during this period.

Regarding this acquisition, Vercept co-founder Luca Weihs stated that in the future, Claude’s ability to perform knowledge-based tasks will be comparable to its current coding capabilities. This could fundamentally change how humans interact with computers, making this interaction as important as the underlying model.

This acquisition reflects the increasingly fierce competition in developing AI agents capable of controlling computers and other devices to complete user tasks. Besides Anthropic, OpenAI is also actively involved. In mid-February, OpenClaw founder Peter Steinberger announced he would join OpenAI full-time to develop the next-generation personal AI agent. OpenClaw was not acquired by OpenAI but transferred to an independent open-source foundation, maintaining MIT licensing and community governance. OpenAI provides funding, technical support, and model resources but does not own or control the project. This talent acquisition strategy aims to strengthen OpenAI’s capabilities in personal AI agents and counterbalance Anthropic’s lead in enterprise-level agents.

Although their approaches differ, both companies are betting on the AI agent track. Recently, Dario Amodei, founder of Anthropic, stated in an interview that the ultimate form of AI evolution is not text generators but a closed-loop execution system with general computer control (Computer Use). It can observe screens, review history, understand context, and collaborate across platforms to complete complex tasks like video editing and software engineering, much like humans.

He believes the main obstacle in the industry is not intelligence but interaction reliability—models are capable enough, but reliably and accurately completing tasks without errors remains a challenge. In the next one to three years, AI will combine long-context management with computer vision control to address most jobs requiring “on-the-job learning.”

(Article source: Yicai)

View Original
This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
  • Reward
  • Comment
  • Repost
  • Share
Comment
0/400
No comments
  • Pin

Trade Crypto Anywhere Anytime
qrCode
Scan to download Gate App
Community
  • 简体中文
  • English
  • Tiếng Việt
  • 繁體中文
  • Español
  • Русский
  • Français (Afrique)
  • Português (Portugal)
  • Bahasa Indonesia
  • 日本語
  • بالعربية
  • Українська
  • Português (Brasil)