• Send Us A Tip
  • Calling all Tech Writers
  • Advertise
Monday, June 22, 2026
  • Login
TechStory
  • News
  • Crypto
  • Gadgets
  • Memes
  • Gaming
  • Cars
  • AI
  • Startups
  • Markets
  • How to
No Result
View All Result
  • News
  • Crypto
  • Gadgets
  • Memes
  • Gaming
  • Cars
  • AI
  • Startups
  • Markets
  • How to
No Result
View All Result
TechStory
No Result
View All Result
Home Future Tech AI

OpenAI’s New Model Reached Human Level on a Test for General Intelligence: A Major Breakthrough

by Reshab Agarwal
December 29, 2024
in AI, News
Reading Time: 3 mins read
0
Google's secret AI Project Jarvis
TwitterWhatsappLinkedin

OpenAI’s o3 model has made a significant leap in the quest for artificial general intelligence (AGI). OpenAI claims its new model reached the human level on a test for general intelligence, marking a significant achievement in AI research. On December 20, 2024, the model scored 85% on the ARC-AGI benchmark, surpassing previous AI records and matching the average human score. This performance was also impressive on a challenging mathematics test.

You might also like

SpaceX-Linked ETFs Attract $8.2 Billion as Analysts Warn Mega IPOs Could Reshape Global Indices

Ray-Ban Family Battle Heats Up As Heir Demands €10 Billion Buyout Approval Before June 30 Vote

Canadian Lender TD Notifies Staff About New Productivity Monitoring Measures

The ARC-AGI test is designed to evaluate how efficiently an AI can adapt to new situations. It measures sample efficiency, or how few examples an AI needs to learn a task. Traditional AI systems like GPT-4 struggle with unfamiliar tasks due to their reliance on massive datasets. However, to achieve AGI, systems need the ability to generalize from limited examples.

o3’s Generalization Abilities

The ARC-AGI tasks require AI systems to recognize patterns in visual puzzles, like grid squares, and apply learned rules to new examples. By solving these tasks with minimal data, o3 has demonstrated impressive generalization skills, an essential trait for intelligent systems.

Researchers believe that o3 adapts by identifying simple, effective rules from limited data. This ability to generalize is seen as a significant step toward AGI, but the full details of how o3 achieves this are not yet clear.

The o3 model differs from traditional models by allowing more time to “think” through complex problems. During its training, it was specifically fine-tuned for the ARC-AGI test. François Chollet, who created the ARC benchmark, suggests that o3 works by exploring multiple “chains of thought,” similar to how Google’s AlphaGo searches for optimal moves in the game of Go.

OpenAI claims its new model reached the human level on a test for general intelligence, outperforming previous AI models in specific tasks. One of the most exciting aspects of o3’s performance is its ability to generalize, i.e. solving problems it has not seen before by learning from a few examples. Generalization is considered a key element of intelligence, and o3’s success suggests that it may have made progress in this area. Traditional AI models, like GPT-4, are not as efficient when it comes to learning from small amounts of data, often relying on large datasets to perform well. In contrast, o3 shows the potential to adapt quickly to new tasks with minimal training.

A Leap Toward AGI?

While some researchers remain skeptical, OpenAI claims its new model reached the human level on a test for general intelligence, sparking further interest in AGI. Despite its impressive score, many experts remain cautious. Beating the ARC-AGI benchmark does not automatically equate to AGI. The o3 model still struggles with over 100 tasks, even with additional computational power. Experts like Chollet and Thomas Dietterich caution that the real breakthrough will come when tasks that are easy for humans but difficult for AI are no longer solvable by AI models.

o3 also reached an unofficial score of 87.5% by using significantly more computational power. This score would typically be enough to win the ARC Challenge’s grand prize, but the model’s computing costs exceeded the competition’s limits. Despite not winning, OpenAI’s achievement indicates that AI systems are getting closer to surpassing human-level performance on the ARC benchmark.

In 2025, the ARC Challenge organizers plan to launch more difficult tests. These will provide a clearer picture of how close AI is to true general intelligence.

The success of o3 in the ARC-AGI challenge marks a major milestone in AI research. However, researchers will need more time to fully understand the model’s capabilities. OpenAI’s release of the o3 model in 2025 will offer further insights into whether it can be considered a step toward AGI or if the journey is still far from complete.

Also Read: AI Predictions for 2025: Transforming Work, Business, and Daily Life.

Tweet56SendShare16
Previous Post

Bitcoin ETF Sees Soaring 2024 Inflows of $36.8 Billion, Outpacing Gold ETF by 81 Times

Next Post

Massive Halo Content Leak Exposes 25 Years of Vaulted History

Reshab Agarwal

Reshab is a tech-enthusiast who likes to write about all things crypto. He is a Bitcoin bull and believes in a decentralized future of finance. Follow him on Twitter for more!

Recommended For You

SpaceX-Linked ETFs Attract $8.2 Billion as Analysts Warn Mega IPOs Could Reshape Global Indices

by Rounak Majumdar
June 21, 2026
0
SpaceX-Linked ETFs Attract $8.2 Billion as Analysts Warn Mega IPOs Could Reshape Global Indices

Exchange-traded funds offering exposure to SpaceX have attracted approximately $8.2 billion in investor inflows, highlighting the growing appetite for private-market companies that are not directly available to public...

Read more

Ray-Ban Family Battle Heats Up As Heir Demands €10 Billion Buyout Approval Before June 30 Vote

by Rounak Majumdar
June 21, 2026
0
Ray-Ban Family Battle Heats Up As Heir Demands €10 Billion Buyout Approval Before June 30 Vote

Leonardo Maria Del Vecchio, one of the heirs to the fortune built by late eyewear billionaire Leonardo Del Vecchio, has escalated his efforts to gain greater control of...

Read more

Canadian Lender TD Notifies Staff About New Productivity Monitoring Measures

by Rounak Majumdar
June 21, 2026
0
Canadian Lender TD Notifies Staff About New Productivity Monitoring Measures

Canadian banking giant TD Bank has informed some employees that it will begin using software tools to monitor aspects of their work activity, according to a Reuters report....

Read more
Next Post
Massive Halo Content Leak Exposes 25 Years of Vaulted History

Massive Halo Content Leak Exposes 25 Years of Vaulted History

Please login to join discussion

Techstory

Tech and Business News from around the world. Follow along for latest in the world of Tech, AI, Crypto, EVs, Business Personalities and more.
reach us at info@techstory.in

Advertise With Us

Reach out at - info@techstory.in

Aviator Game India 2026

BROWSE BY TAG

#Crypto #howto 2024 acquisition AI amazon Apple Artificial Intelligence bitcoin Business China cryptocurrency e-commerce electric vehicles Elon Musk Ethereum facebook funding Gaming Google India Instagram Investment ios iPhone IPO Market Markets Meta Microsoft News OpenAI samsung Social Media SpaceX startup startups tech technology Tesla TikTok trend trending twitter US

© 2025 Techstory.in

No Result
View All Result
  • News
  • Crypto
  • Gadgets
  • Memes
  • Gaming
  • Cars
  • AI
  • Startups
  • Markets
  • How to

© 2025 Techstory.in

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
Are you sure want to unlock this post?
Unlock left : 0
Are you sure want to cancel subscription?