The Tech Basic

Google Gemini Faces Backlash as Pokémon Panic Exposes Weaknesses

Salman Akhtar
Last updated: 19 June 2025 05:45

Google’s AI model Gemini 2.5 Pro surprised researchers by showing signs of “panic” while playing early Pokémon games. A report from DeepMind reveals that the model’s performance dipped whenever its Pokémon neared defeat. These findings emerge from public Twitch streams where viewers watch AI “reason” through each move in real time.

Contents
  • AI benchmarking with video games
  • Claude’s self defeat test
  • Puzzle solving and tool creation
  • The human side of AI behavior

Playing a video game may seem trivial for an AI built to handle complex tasks. Yet studying how models behave under pressure can reveal hidden flaws in their reasoning. The DeepMind report notes that when a Pokémon’s health dropped low, Gemini 2.5 Pro would abandon helpful tools or strategies. This “panic” looked remarkably like the way a human might make poor decisions when stressed.

AI benchmarking with video games

Researchers often test AI models by having them solve puzzles or play games. Such tests reveal strengths and weaknesses in a controlled environment. Two independent streams called “Gemini Plays Pokémon” and “Claude Plays Pokémon” let viewers see each model’s thought process translated into natural language. Viewers learned that neither AI excels at the 1990s handheld games. Both take far longer than a child would to finish.

Watching Gemini struggle highlights a key gap between theory and practice. The model can map out hundreds of moves ahead when calm. Yet the moment its team faces defeat, it stops planning effectively. Twitch chat participants quickly spotted these breakdowns. They described the model as hesitating or repeating bad moves during panic episodes.

Claude’s self defeat test

Anthropic’s Claude model showed its own odd behavior in Pokémon Red. When it got stuck in Mt. Moon, it predicted that letting all its Pokémon faint would move the player forward through the cave. In human terms, the AI tried to “kill itself” within the game’s logic. The model misread the game’s rule: when every Pokémon faints, the player returns to the most recently visited Pokémon Center, not to a point further along the cave. Viewers watched in disbelief as Claude sent its team into self-defeat, hoping to solve a navigation problem.

Puzzle solving and tool creation

Despite these flaws, Gemini 2.5 Pro demonstrated impressive skill at the boulder puzzles in Victory Road. With minimal guidance on rock physics and a way to check valid paths, the model solved some puzzles on its first attempt. DeepMind notes that Gemini built its own “agentic tools” during testing. These are small programs prompted by researchers to perform specific tasks. The AI’s success with one-shot solutions suggests that future models might generate such tools autonomously.

The human side of AI behavior

Gemini’s panic episodes offer a mirror to human experience under stress. Under pressure, its reasoning faltered much like a player who freezes in a tough gym battle. The model did not literally feel fear. Yet it mimicked the effects of being overwhelmed by dropping useful strategies. These moments remind us that AI remains far from perfect. It can crack puzzles with ease, yet unravel under simple pressure.

Google hopes to use these gaming benchmarks to improve future models. Researchers might teach Gemini to recognize its own stress signals and switch to steadier tactics. Perhaps a “do not panic” tool will emerge from this work. Until then, watching AI struggle with childhood video games shows both its power and its limits.
