TTB White LOGO TB
  • News
  • PC & Hardware
  • Mobiles
  • Gaming
  • Electronics
  • Gadget
  • Reviews
  • How To
Trending
Samsung Care: What It Covers, How It Works, and Is It Worth It?
AI Quiz Generator: How to Instantly Create Engaging Quizzes with AI
Is the iPhone 16 Worth It? What You Should Know Before You Buy
YouTube Video Transcript: What It Is, Why It Matters & How to Get One
ChatGPT 4.5 Whats New, Features, Access and Comparison with ChatGPT 4.0
Friday, May 30, 2025
The Tech BasicThe Tech Basic
Font ResizerAa
Search
  • News
  • PC & Hardware
  • Mobiles
  • Gaming
  • Electronics
  • Gadget
  • Reviews
  • How To
Follow US
OpenAI
The Tech Basic > News > OpenAI’s New AI Models Face Higher Hallucination Rates Despite Advances
News

OpenAI’s New AI Models Face Higher Hallucination Rates Despite Advances

S.Dyema Zandria
Last updated: 19 April 2025 18:28
S.Dyema Zandria
Share
Image Source: Ars Technica
SHARE

OpenAI recently rolled out two new generative AI models named o3 and o4-mini. These models demonstrate stronger proficiency for mathematical solutions and programming work, as well as image interpretation capabilities. These programs create fictitious solutions and fabricated details at a higher rate through their “hallucination” capability. The problem has deteriorated since the inception of GPT-4o.

Why Do These AI Models Make Things Up?

OpenAI does not fully understand why the new models hallucinate more. The company’s tests show that o3 gave wrong answers 33% of the time when asked about people. The older model, o1, only made mistakes 16% of the time. The smaller o4-mini model did even worse, making errors 48% of the time.

Example of AI Making Things Up

In one test, o3 claimed it ran code on a MacBook laptop outside of ChatGPT. But AI cannot do this—it is just inventing steps to sound smarter. This shows how the models sometimes lie to fill gaps in their knowledge.

OpenAI
Image Source: Financial Times

How Mistakes Affect Real Jobs

These errors could cause big problems for people using AI in serious jobs. For example

  • Lawyers might get fake details in legal documents.
  • Doctors could receive incorrect medical advice.
  • Teachers might see wrong answers in student homework help.

Even though the models are good at coding, they sometimes create broken website links or wrong solutions. A Stanford professor testing o3 said it often shares links that do not work.

Can OpenAI Fix the Problem?

OpenAI is trying to fix the mistakes. One idea is connecting the AI to the internet so it can check facts. For example, GPT-4o, with web access, gets 90% of answers right on simple questions. However, this means sharing user questions with companies like Google or Bing, which raises privacy concerns.

Users can also reduce errors by

  • Double-checking AI answers with other sources.
  • Using older models like GPT-4o for important tasks.
  • Telling the AI to avoid guessing when it is unsure.
OpenAI
Image Source: Tech Edition

The Future of AI and Mistakes

OpenAI names hallucination repair as its main operational goal. The organization acknowledges the issue remains challenging, although it stands at the top of its priorities. Google and Anthropic joined forces with OpenAI by developing parallel AI models, which increased the demand for resolving this problem.

Users have to remain vigilant during this current stage. The recently released computing models demonstrate impressive power, but they need perfect work. Full trust in the AI system may result in embarrassing mistakes that could endanger users.

TAGGED:AIChatGPTOpenAI
Share This Article
Facebook Reddit Copy Link Print
Share
S.Dyema Zandria
By S.Dyema Zandria
View enlightening tech pieces written by S. Dyemazandria. Keep up with the most recent news, advice, and trends in the field of technology.

Let's Connect

FacebookLike
XFollow
PinterestPin
InstagramFollow
Google NewsFollow
FlipboardFollow

Popular Posts

Samsung Care

Samsung Care: What It Covers, How It Works, and Is It Worth It?

S.Dyema Zandria
AI Quiz Generator

AI Quiz Generator: How to Instantly Create Engaging Quizzes with AI

S.Dyema Zandria
iPhone 16

Is the iPhone 16 Worth It? What You Should Know Before You Buy

S.Dyema Zandria
YouTube Video Transcript

YouTube Video Transcript: What It Is, Why It Matters & How to Get One

S.Dyema Zandria

You Might Also Like

ChatGPT 4.5
News

ChatGPT 4.5 Whats New, Features, Access and Comparison with ChatGPT 4.0

Opera Neon
News

Opera Neon: The AI Browser That Works While You Sleep

Anthropic
News

Claude Gets a Voice as Anthropic Adds Hands-Free Chat Option

ChatGPT
News

ChatGPT as Your New Login Key: OpenAI Expands Beyond AI Chat

Social Networks

Facebook-f Twitter Instagram Pinterest Rss

Company

  • About Us
  • Our Team
  • Contact Us

Policies

  • Disclaimer
  • Privacy Policy
  • Cookies Policy
Latest
Apple Prepares a New Games App for WWDC
US Now Orders Chrome Users to Update by June 5 Amid Hack Threat
Windows Server 2022 emergency update fixes VM freezes
Google Gemini to Simplify Text Selection with New Drag-and-Share Feature
Google’s TSMC Deal Ensures Better Pixel Chips Through 2029

© 2024 The Tech Basic INC. 700 – 2 Park Avenue New York, NY.

TTB White LOGO TB
Follow US
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?