Recently, AMD released a series of DeepSeek R1 inference benchmark results that have attracted significant industry attention. The tests reveal that AMD's high-end graphics card, the RX 7900 XTX, surpasses NVIDIA's flagship models, the RTX 4090 and RTX 4080 Super, when executing the DeepSeek model series. The RX 7900 XTX, leveraging the RDNA3 architecture, demonstrated a performance increase of about 13% over the RTX 4090 and a 34% improvement compared to the RTX 4080 Super in the DeepSeek R1 Distill Qwen 7B test. Additionally, AMD evaluated other models like Distill Llama 8B and Distill Qwen 14B, where it consistently outperformed, although the RTX 4090 maintained a slight advantage with the Distill Qwen 32B model.
In response to AMD's data, NVIDIA promptly launched a counterattack. Via its official blog, NVIDIA asserted that the newly introduced RTX 5090 significantly outclassed the RX 7900 XTX in comparable tests, achieving up to 2.2 times the latter's performance. NVIDIA's rebuttal stressed that after accounting for varying test conditions, optimized configurations, and driver versions, its products maintained an unparalleled edge in crucial tasks. This exchange of statements originating from the benchmark tests underscores the immense value of hardware and software co-optimization in contemporary AI inference tasks, while also revealing the potential limitations posed by manufacturers' proprietary test conditions.
Image credit: Nvidia
Beyond the performance battle, the market landscape is subtly transforming. The AMD RX 7900 XTX, with its price advantage, is crafting a competitive niche for cost-effectiveness, altering the traditional rationale behind high-end graphics card selections. Statistics reveal that its pricing significantly undercuts NVIDIA's comparable products, appealing considerably to tech professionals and small to medium-sized AI research teams prioritizing return on investment. Concurrently, the DeepSeek open-source model, launched by a Chinese startup, is redefining the global AI infrastructure value assessment with its low training costs and high inference efficiency. The model's commercial potential in domains such as mathematical derivation and code generation, along with its seamless deployment on consumer-grade hardware, has objectively shifted the conventional AI arithmetic stacking development model.
Industry experts note that competition among hardware providers is evolving beyond sheer arithmetic power to encompass architectural innovation and ecological synergy. AMD's RDNA3 architecture enhances parallel processing efficiency through improved computational unit layouts, whereas NVIDIA continues fortifying the CUDA ecosystem, establishing a robust software-level moat. This differentiated competitive strategy results in performance variances across different testing scenarios, elucidating why NVIDIA's products with enhanced video memory bandwidth regained advantage in the 32B large model test.
The DeepSeek model's rise is causing a ripple effect worthy of exploration. The open-source program diminishes dependency on hardware computing power via algorithmic innovation, enabling medium-sized enterprises to conduct AI application research within constrained budgets. This technological trend not only challenges the traditional GPU cluster procurement model but also encourages the industry to revisit the "computing power monopoly" business logic's sustainability. Some Silicon Valley technology firms are beginning to adjust their hardware procurement tactics, exploring more affordable hybrid computing solutions in specific scenarios.
From an industrial evolution perspective, the current performance dispute is essentially an inevitable outcome of AI computing paradigm shifts. For end-users, an increasingly diverse product range not only signals potential cost optimizations but also indicates that the AI hardware market is poised for accelerated technological evolution. As the open-source model ecosystem continues to mature and heterogeneous computing architecture evolves, it is expected to drive the entire industry's progression toward greater efficiency and inclusivity.