• Send Us A Tip
  • Calling all Tech Writers
  • Advertise
Sunday, June 21, 2026
  • Login
TechStory
  • News
  • Crypto
  • Gadgets
  • Memes
  • Gaming
  • Cars
  • AI
  • Startups
  • Markets
  • How to
No Result
View All Result
  • News
  • Crypto
  • Gadgets
  • Memes
  • Gaming
  • Cars
  • AI
  • Startups
  • Markets
  • How to
No Result
View All Result
TechStory
No Result
View All Result
Home Tech

OpenAI’s New AI Models Face Troubling Increase in Hallucinations

by Sneha Singh
April 20, 2025
in Tech
Reading Time: 3 mins read
0
OpenAI's New AI Models Face Troubling Increase in Hallucinations
TwitterWhatsappLinkedin

Tech giant OpenAI has hit an unexpected roadblock with its latest artificial intelligence models. The company’s new reasoning models, o3 and o4-mini, are showing a concerning spike in hallucination rates – essentially making up information that isn’t true – compared to their predecessors.

You might also like

Mitsubishi Hints at a Future Worthy of the Lancer Evolution Legacy

How to Increase Gas Mileage: Small Driving Changes That Save Big at the Pump

Paradigms of Luminance and Chemistry The Definitive OLED vs Mini LED Display Audit

This development has stunned both the company’s own engineers and industry watchers alike, as it reverses years of steady improvement in AI reliability. While each previous generation of OpenAI’s large language models had been getting gradually better at avoiding hallucinations, these new models are suddenly performing worse.

A Step Backward in AI Reliability

According to OpenAI’s internal testing, the new o3 model hallucinated in 33% of cases on the company’s PersonQA benchmark. That’s roughly double the rate of previous models like o1 (16%) and o3-mini (14.8%). Even more troubling, the o4-mini model performed worse still, hallucinating in nearly half of all test cases – a staggering 48%.

This setback has raised serious concerns throughout the AI research community. When AI systems confidently present false information as fact, it undermines user trust and limits how these technologies can be safely used in important applications.

OpenAI Takes Aim at 'Hallucinations' as More Businesses Integrate AI
Credits: PYMNTS.com

“What we’re seeing is unusual for a company that has built its reputation on steady, measurable progress in AI safety,” said tech analyst Sarah Chen. “These hallucination rates could potentially undermine years of work building public trust in AI systems.”

Mystery Behind the Decline

Perhaps most concerning is that OpenAI itself doesn’t fully understand why this is happening. In its technical documentation, the company openly admits that “more research is needed” to figure out why scaling up these reasoning models is leading to more frequent hallucinations.

Neil Chowdhury, a researcher at nonprofit AI lab Transluce and former OpenAI employee, suggests the reinforcement learning methods used in developing these models might be amplifying problems that older techniques managed to avoid. His team found that o3 sometimes makes up not just facts, but even fabricates actions it claims to have taken – like pretending to run code on hardware that doesn’t exist.

“It’s as if the models are becoming more confident but not necessarily more accurate,” Chowdhury explained. “They’re generating more claims overall, which means both more correct answers and more incorrect ones.”

Despite these issues, the new models do excel in certain areas. The o3 model achieved an impressive 69.1% score on the SWE-bench coding benchmark, with o4-mini close behind at 68.1%. These are significant improvements in coding and mathematical capabilities.

However, the practical problems are already evident. Kian Katanforoosh, Stanford adjunct professor and CEO of startup Work, noted that while o3 performs exceptionally well for coding tasks compared to competitors, it frequently generates broken website links – URLs that simply don’t exist.

“For businesses relying on these models, such hallucinations can be more than just annoying – they can actively harm productivity and decision-making,” Katanforoosh said. “Imagine building a product roadmap based on AI research that includes references to non-existent studies or tools.”

Industry Impact and Future Challenges

This spike in hallucination rates comes at a crucial moment for OpenAI, which faces intense competition from rivals like Google, Meta, xAI, Anthropic, and DeepSeek. The company had been counting on these new reasoning models to set a new industry standard, but the unexplained rise in hallucinations could damage user trust.

AI ethics researcher Maya Johnson points out the fundamental challenge: “While some creative ‘hallucination’ can be useful for brainstorming or generating novel ideas, these rates are simply too high for enterprise or scientific applications where accuracy is non-negotiable.”

OpenAI has acknowledged the seriousness of the issue and is dedicating resources to understanding and addressing the root causes. The company has also called on the broader AI research community to help investigate this phenomenon.

As the race for more capable AI continues, this development serves as a sobering reminder that as models grow more sophisticated in some ways, they may simultaneously struggle with basic reliability problems. For now, users of these advanced models may need to exercise extra caution and verification when working with their outputs.

Tags: AnrthopicChatGPTDeepSeekGoogleMetaOpenAI
Tweet65SendShare18
Previous Post

How to get Blue Vida in MLB The Show 25?

Next Post

AI Declares Trump’s Reported Physical Results “Virtually Impossible”

Sneha Singh

Sneha is a skilled writer with a passion for uncovering the latest stories and breaking news. She has written for a variety of publications, covering topics ranging from politics and business to entertainment and sports.

Recommended For You

Mitsubishi Hints at a Future Worthy of the Lancer Evolution Legacy

by Samir Gautam
June 21, 2026
0
Mitsubishi Hints at a Future Worthy of the Lancer Evolution Legacy

Mitsubishi Motors has reignited hopes among performance-car fans after its new president said the company wants to become capable of building another great car in the mould of...

Read more

How to Increase Gas Mileage: Small Driving Changes That Save Big at the Pump

by Samir Gautam
June 21, 2026
0
Fuel prices may rise and fall, but one thing stays constant: drivers want to make every litre go further. The good news is that improving gas mileage does not always require buying a new hybrid or changing cars altogether. A few disciplined habits behind the wheel, along with basic maintenance, can make a noticeable difference over time. For most drivers, the biggest gains come from reducing waste. That means less aggressive acceleration, fewer unnecessary trips, correctly inflated tyres and a car that is mechanically healthy. Smooth Driving Uses Less Fuel The quickest way to burn more fuel is to drive as if every traffic light is a starting grid. Hard acceleration, sharp braking and sudden changes in speed force the engine to work harder and consume more petrol. A smoother approach works better. Accelerate gradually, maintain a steady speed where possible and look ahead to anticipate traffic. If a red light is visible in the distance, easing off the accelerator early is usually more efficient than rushing forward and braking hard at the last moment. Speed also matters. As speeds rise, aerodynamic drag increases and the engine needs more energy to keep the vehicle moving. On highways, staying within a sensible cruising range rather than constantly pushing at high speeds can help reduce fuel consumption. Check Tyre Pressure Regularly Tyres are easy to ignore until something goes wrong, but they play a major role in fuel economy. Under-inflated tyres create more rolling resistance, which means the engine has to use more fuel just to move the car forward. Drivers should check tyre pressure at least once a month, preferably when the tyres are cold. The correct pressure is usually listed on the driver-side door frame or in the owner’s manual. It is important not to use the maximum pressure printed on the tyre sidewall as a target. That figure is not necessarily the recommended setting for the vehicle. The US Environmental Protection Agency notes that under-inflation reduces fuel economy, increases tyre wear and adds to emissions. Stop Carrying Extra Weight A car is not a storage room. Heavy items in the boot may seem harmless, but extra weight makes the engine work harder, especially in city traffic where the vehicle is constantly stopping and starting. Clear out unnecessary tools, boxes, sports gear and other items that have been sitting in the car for weeks. Roof racks and cargo boxes can also hurt mileage by increasing aerodynamic drag. If they are not being used, remove them. This is especially relevant for drivers who spend most of their time on highways, where wind resistance becomes a bigger factor. Keep Up With Maintenance A well-maintained vehicle is usually a more fuel-efficient vehicle. Delayed oil changes, worn spark plugs, clogged air filters, dragging brakes and poor wheel alignment can all affect how efficiently a car runs. Following the manufacturer’s service schedule is the safest route. Use the recommended engine oil grade and get warning lights checked instead of ignoring them. A sudden drop in mileage can be an early sign that something needs attention. The EPA advises motorists to follow their vehicle maintenance schedule and use the recommended motor oil to support better fuel efficiency and safer operation. Combine Trips and Avoid Long Idling Short trips can be surprisingly fuel-hungry because the engine has not had enough time to reach its most efficient operating temperature. Combining errands into one planned route can reduce cold starts, unnecessary kilometres and fuel use. Idling is another quiet fuel drain. If you are waiting for an extended period, switching off the engine can be more sensible than leaving it running. Modern cars do not need long warm-up periods before driving. Start, settle for a few seconds and drive gently. The Bottom Line Better gas mileage is less about one miracle trick and more about consistent habits. Drive smoothly, maintain the right tyre pressure, remove excess weight and service the car on time. These small changes may not feel dramatic on a single trip, but over months of commuting, school runs and highway drives, they can add up to real savings.

Fuel prices may rise and fall, but one thing stays constant: drivers want to make every litre go further. The good news is that improving gas mileage does...

Read more

Paradigms of Luminance and Chemistry The Definitive OLED vs Mini LED Display Audit

by Anochie Esther
June 21, 2026
0
OLED vs Mini LED

The global display and consumer electronics sectors are locked in a historic technological civil war. For years, the gold standard of premium visual performance was dictated by a...

Read more
Next Post
AI Declares Trump's Reported Physical Results "Virtually Impossible"

AI Declares Trump's Reported Physical Results "Virtually Impossible"

Please login to join discussion

Techstory

Tech and Business News from around the world. Follow along for latest in the world of Tech, AI, Crypto, EVs, Business Personalities and more.
reach us at info@techstory.in

Advertise With Us

Reach out at - info@techstory.in

Aviator Game India 2026

BROWSE BY TAG

#Crypto #howto 2024 acquisition AI amazon Apple Artificial Intelligence bitcoin Business China cryptocurrency e-commerce electric vehicles Elon Musk Ethereum facebook funding Gaming Google India Instagram Investment ios iPhone IPO Market Markets Meta Microsoft News OpenAI samsung Social Media SpaceX startup startups tech technology Tesla TikTok trend trending twitter US

© 2025 Techstory.in

No Result
View All Result
  • News
  • Crypto
  • Gadgets
  • Memes
  • Gaming
  • Cars
  • AI
  • Startups
  • Markets
  • How to

© 2025 Techstory.in

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
Are you sure want to unlock this post?
Unlock left : 0
Are you sure want to cancel subscription?