• Send Us A Tip
  • Calling all Tech Writers
  • Advertise
Tuesday, July 8, 2025
  • Login
TechStory
  • News
  • Crypto
  • Gadgets
  • Memes
  • Gaming
  • Cars
  • AI
  • Startups
  • Markets
  • How to
No Result
View All Result
  • News
  • Crypto
  • Gadgets
  • Memes
  • Gaming
  • Cars
  • AI
  • Startups
  • Markets
  • How to
No Result
View All Result
TechStory
No Result
View All Result
Home Future Tech AI

Nvidia Labels China’s DeepSeek R1 Model ‘An Excellent AI Advancement’ in AI Race

by Reshab Agarwal
January 28, 2025
in AI, News
Reading Time: 3 mins read
0
Tech Giants Compete: Tencent, ByteDance Outpace Meta, Google in NVIDIA AI GPU Purchases
TwitterWhatsappLinkedin

Nvidia has acknowledged the breakthrough of DeepSeek’s R1 model, describing it as “an excellent AI advancement,” even as the Chinese startup’s rise led to a sharp 17% drop in Nvidia’s stock price on Monday. Nvidia calls China’s DeepSeek R1 model ‘an excellent AI advancement’ while acknowledging its reliance on compliant Nvidia GPUs.

You might also like

Elon Musk Forms ‘America Party’ After Bitter Rift With Trump

Linkrunner Raises ₹5 Cr to Revolutionize App Attribution in India’s Booming Mobile Market

BigBasket Strengthens Leadership Bench with Deepika Khattar Bhan’s Appointment

A spokesperson for Nvidia told CNBC that DeepSeek represents a notable achievement in AI and highlights the potential of “Test Time Scaling.” According to Nvidia, the model demonstrates how innovative techniques can leverage widely available resources while adhering to export control regulations.

“DeepSeek’s work showcases how new AI models can emerge using techniques like Test Time Scaling, which rely on compliant models and compute infrastructure,” the spokesperson said.

DeepSeek’s R1 Model: Disrupting AI Development Costs

Nvidia calls China’s DeepSeek R1 model ‘an excellent AI advancement’ for showcasing the potential of Test Time Scaling in AI development. DeepSeek, a Hangzhou-based AI startup, released its R1 model last week. This open-source reasoning model has reportedly surpassed the performance levels of leading models from U.S.-based companies, including OpenAI. Impressively, DeepSeek claims its R1 model was trained at a cost of less than $6 million—a fraction of the billions spent by tech giants in Silicon Valley.

The R1 model’s hybrid architecture combines reinforcement learning with chain-of-thought reasoning. It comes in two versions: DeepSeek-R1 and DeepSeek-R1-Zero, the latter being capable of unsupervised fine-tuning for even greater reasoning abilities.

Implications for Nvidia and U.S. Tech Giants

Despite the competition posed by DeepSeek, Nvidia noted that its GPUs are integral to DeepSeek’s operations. The startup reportedly utilized approximately 2,000 Nvidia H800 chips, designed to comply with U.S. export regulations introduced in 2022.

“Inference tasks require significant numbers of Nvidia GPUs and high-performance networking,” the spokesperson added, emphasizing the importance of GPUs in supporting AI advancements like DeepSeek.

This development has raised concerns among analysts over the efficiency of U.S. tech companies’ massive investments in AI infrastructure. Microsoft plans to spend $80 billion on AI infrastructure in 2025, while Meta’s projected capital expenditures for AI are estimated between $60 billion and $65 billion for the same year.

Analysts Question High Costs of AI Development

Experts believe that if training costs for models like R1 remain significantly lower, companies relying on AI services could benefit from cost reductions in the short term. However, the long-term impact on hyperscale AI revenues and investments may be more profound.

Bank of America analyst Justin Post noted that lower model training costs could lead to savings for sectors like advertising and consumer applications. However, this would also reduce the scale of revenues for AI infrastructure providers.

Shifting Focus to Test Time Scaling

In a groundbreaking shift, Nvidia calls China’s DeepSeek R1 model ‘an excellent AI advancement’ for its hybrid architecture and reasoning abilities. Nvidia CEO Jensen Huang, OpenAI CEO Sam Altman, and Microsoft CEO Satya Nadella have recently highlighted a new phase of AI development driven by “Test Time Scaling.”

This concept builds on the 2020 scaling law proposed by OpenAI researchers, which emphasized that increasing computation and data led to better AI models. Test Time Scaling, however, focuses on optimizing the use of compute resources during inference to improve model accuracy and reasoning.

DeepSeek’s R1 model exemplifies this approach, leveraging additional computational power during inference to achieve better outputs. The model’s efficiency and lower training costs have sparked questions about the future direction of AI investments.

Amid its growing recognition, DeepSeek faced a large-scale cyberattack, forcing the startup to restrict user registrations to mainland China phone numbers. The company did not disclose the duration of these restrictions or further details about the attack.

 

Tweet54SendShare15
Previous Post

Trump Names India and China as High-Tariff Nations, Vows to Impose Tariffs on Countries That Harm U.S. Interests

Next Post

Trump Calls AI Development ‘Positive,’ Stresses Need for U.S. Dominance

Reshab Agarwal

Reshab is a tech-enthusiast who likes to write about all things crypto. He is a Bitcoin bull and believes in a decentralized future of finance. Follow him on Twitter for more!

Recommended For You

Elon Musk Forms ‘America Party’ After Bitter Rift With Trump

by Thomas Babychan
July 8, 2025
0
Elon Musk Forms ‘America Party’ After Bitter Rift With Trump

Elon Musk, the billionaire known for his ventures in technology, space exploration, and electric vehicles, has entered the political stage with the launch of a new political party...

Read more

Linkrunner Raises ₹5 Cr to Revolutionize App Attribution in India’s Booming Mobile Market

by Ishaan Negi
July 8, 2025
0
Linkrunner Raises ₹5 Cr to Revolutionize App Attribution in India’s Booming Mobile Market

In a significant milestone for India’s growing app economy, Linkrunner, the country’s first AI-powered mobile measurement partner (MMP), has raised ₹5 crore in a pre-seed funding round. The...

Read more

BigBasket Strengthens Leadership Bench with Deepika Khattar Bhan’s Appointment

by Ishaan Negi
July 8, 2025
0
Tata owned Bigbasket’s revenue goes over ₹10,000 Cr, loss reduces to ₹1,415

In a strategic leadership reshuffle aimed at sharpening its edge in the fiercely competitive quick commerce market, Tata-backed BigBasket is set to welcome Deepika Khattar Bhan to its...

Read more
Next Post
Donald Trump's Re-Election

Trump Calls AI Development ‘Positive,’ Stresses Need for U.S. Dominance

Please login to join discussion

Techstory

Tech and Business News from around the world. Follow along for latest in the world of Tech, AI, Crypto, EVs, Business Personalities and more.
reach us at [email protected]

Advertise With Us

Reach out at - [email protected]

BROWSE BY TAG

#Crypto #howto 2024 acquisition AI amazon Apple bitcoin Business China cryptocurrency e-commerce electric vehicles Elon Musk Ethereum facebook flipkart funding Gaming Google India Instagram Investment ios iPhone IPO Market Markets Meta Microsoft News NFT samsung Social Media SpaceX startup startups tech technology Tesla TikTok trend trending twitter US

© 2024 Techstory.in

No Result
View All Result
  • News
  • Crypto
  • Gadgets
  • Memes
  • Gaming
  • Cars
  • AI
  • Startups
  • Markets
  • How to

© 2024 Techstory.in

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
Are you sure want to unlock this post?
Unlock left : 0
Are you sure want to cancel subscription?