Gate News reports that on March 17, NVIDIA officially announced the Vera CPU at GTC, the first processor designed specifically for Agent AI and reinforcement learning. As AI expands from generation and inference to autonomous agent actions, workflows such as planning tasks, tool invocation, code execution, and result verification are rapidly increasing the demands on CPUs. NVIDIA CEO Jensen Huang stated, “CPUs are no longer just supporting models; they are driving models.”
Vera features 88 NVIDIA-designed Olympus cores, each capable of executing two tasks simultaneously through Spatial Multithreading technology, making it suitable for multi-tenant AI factories with large-scale parallel workloads. The memory uses second-generation low-power subsystem LPDDR5X, with a bandwidth of 1.2 TB/s—double that of general-purpose CPUs—and half the power consumption.
Deployed cloud service providers include Alibaba, ByteDance, Cloudflare, CoreWeave, Lambda, Nebius, Oracle Cloud (OCI), Together.AI, Vultr, and others. Hardware partners such as Dell Technologies, HPE, Lenovo, and Supermicro have already begun adapting the Vera CPU. AI programming tool Cursor co-founder and CEO Michael Truell said they will use Vera to improve the throughput and responsiveness of programming agents. After testing Vera running Apache Kafka-compatible loads, the Redpanda streaming data platform observed latency reductions of up to 5.5 times. The Vera CPU has entered mass production and will be shipped through partners in the second half of this year.