TTB White LOGO TB
  • News
  • PC & Hardware
  • Mobiles
  • Gaming
  • Electronics
  • Gadget
  • Reviews
  • How To
Trending
Microsoft Tests Full Desktop Sharing with Copilot on Windows 11
Google Discover Now Adds AI Summaries, Threatening Publisher Traffic
Meta Now Fixes AI Chatbot Flaw Exposing Private User Prompts
Apple’s New Keyboard Patent Describes a Removable Mouse Key
AirPods Pro 2 Hearing Support Now Available in 13 More Countries
Thursday, Jul 17, 2025
The Tech BasicThe Tech Basic
Font ResizerAa
Search
  • News
  • PC & Hardware
  • Mobiles
  • Gaming
  • Electronics
  • Gadget
  • Reviews
  • How To
Follow US
Google Gemini
The Tech Basic > News > Google Gemini Faces Backlash as Pokémon Panic Exposes Weaknesses
News

Google Gemini Faces Backlash as Pokémon Panic Exposes Weaknesses

Salman Akhtar
Last updated: 19 June 2025 05:45
Salman Akhtar
Share
Image Source: Business Standard
SHARE

Google’s AI model Gemini 2.5 Pro surprised researchers by showing signs of “panic” while playing early Pokémon games. A report from DeepMind reveals that the model’s performance dipped whenever its Pokémon neared defeat. These findings emerge from public Twitch streams where viewers watch AI “reason” through each move in real time.

Contents
AI benchmarking with video gamesClaude’s self defeat testPuzzle solving and tool creationThe human side of AI behavior

Playing a video game may seem trivial for an AI built to handle complex tasks. Yet studying how models behave under pressure can reveal hidden flaws in their reasoning. The DeepMind report notes that when a Pokémon’s health dropped low, Gemini 2.5 Pro would abandon helpful tools or strategies. This “panic” looked remarkably like a human might make poor decisions when stressed.

Google Gemini
Image Source: TechCrunch

AI benchmarking with video games

Scientists tend to test the performance of AI by making it perform puzzles or play games. Such tests reveal weaknesses and strengths in a test environment. Two independent streams called “Gemini Plays Pokémon” and “Claude Plays Pokémon” let viewers see each model’s thought process translated into natural language. Viewers learned that neither AI excels at the 1990s handheld games. Both require far more time than a child to finish.

Watching Gemini struggle highlights a key gap between theory and practice. The model can map out hundreds of moves ahead when calm. Yet the moment its team faces defeat, it stops planning effectively. Twitch chat participants quickly spotted these breakdowns. They described the model as hesitating or repeating bad moves during panic episodes.

Claude’s self defeat test

Anthropic’s Claude model showed its own odd behavior in Pokémon Red. When it got stuck in Mt. Moon, it predicted that fainting all its Pokémon would move the player forward in the cave. In human terms the AI tried to “kill itself” in the game logic. The model confused the game’s rule for returning to a Pokémon Center anywhere with the cave’s entrance. Viewers watched in disbelief as Claude sent its team into self defeat hoping to solve a navigation problem.

Puzzle solving and tool creation

Despite these flaws, Gemini 2.5 Pro demonstrated impressive skill at the boulder puzzles in Victory Road. With minimal guidance on rock physics and a way to check valid paths, the model solved some puzzles on its first attempt. DeepMind notes that Gemini built its own “agentic tools” during testing. These are small programs prompted by researchers to perform specific tasks. The AI’s success with one-shot solutions suggests that future models might generate such tools autonomously.

Google Gemini
Image Source: TechJuice

The human side of AI behavior

Gemini’s panic episodes offer a mirror to human experience under stress. When push comes to shove, its reasoning faltered much like a player who freezes in a tough gym battle. The model did not literally feel fear. Yet it mimicked the effects of overwhelm by dropping useful strategies. These moments remind us that AI remains far from perfect. It can crack puzzles with ease, yet unravel under simple pressure. Google hopes to use these gaming benchmarks to improve future models. Researchers might teach Gemini to recognize its own stress signals and switch to steadier tactics. Perhaps a “do not panic” tool will emerge from this work. Until then, watching AI struggle with childhood video games shows both its power and its limits.

TAGGED:AIGoogle
Share This Article
Facebook Reddit Copy Link Print
Share
Salman Akhtar
By Salman Akhtar
View enlightening tech pieces written by Salman Keep up with the most recent news, advice, and trends in the field of technology.

Let's Connect

FacebookLike
XFollow
PinterestPin
InstagramFollow
Google NewsFollow
FlipboardFollow

Popular Posts

Copilot on Windows 11

Microsoft Tests Full Desktop Sharing with Copilot on Windows 11

Salman Akhtar
Google Discover

Google Discover Now Adds AI Summaries, Threatening Publisher Traffic

Salman Akhtar
Meta

Meta Now Fixes AI Chatbot Flaw Exposing Private User Prompts

Salman Akhtar
Apple’s New Keyboard

Apple’s New Keyboard Patent Describes a Removable Mouse Key

Salman Akhtar

You Might Also Like

Meta
News

Meta Files to Restart H20 Chip Sales as It Builds 5 GW Hyperion Cluster

Nvidia
News

Nvidia to Resume H20 AI Chip Sales in China After Whiplash

Google Gemini
News

Google Gemini Flaw Exposes Email Summaries to Hidden Phishing

Chrome OS
News

Google Merges Chrome OS and Android into One Unified Platform

Social Networks

Facebook-f Twitter Instagram Pinterest Rss

Company

  • About Us
  • Our Team
  • Contact Us

Policies

  • Disclaimer
  • Privacy Policy
  • Cookies Policy
Latest
Safety Features in Ride Hailing Apps for Women
AirPods Pro 2 Hearing Support Now Available in 13 More Countries
Meta Follows YouTube with Crackdown on Unoriginal Facebook Posts
iPhone 17 Pro Copper-Orange Color Leaked With Full Series Palette
China Demand Dips as Apple Sees Double‑Digit Gains Elsewhere

© 2024 The Tech Basic INC. 700 – 2 Park Avenue New York, NY.

TTB White LOGO TB
Follow US
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?