TTB White LOGO TB
  • News
  • PC & Hardware
  • Mobiles
  • Gaming
  • Electronics
  • Gadget
  • Reviews
  • How To
Trending
PS6 Expected Release Date, Features, and Upcoming Games
iOS 18 and Apple Passwords: How to Import All Your Saved Passwords
How to Block Someone on TikTok – Complete Guide 2025
Character AI: Review – Is it Safe for Teens and Kids?
How Apple Health Turns Your Phone into a Personal Doctor
Sunday, Jun 1, 2025
The Tech BasicThe Tech Basic
Font ResizerAa
Search
  • News
  • PC & Hardware
  • Mobiles
  • Gaming
  • Electronics
  • Gadget
  • Reviews
  • How To
Follow US
Bluesky
The Tech Basic > News > One Million Bluesky Posts Scraped and Uploaded for AI Use Now
News

One Million Bluesky Posts Scraped and Uploaded for AI Use Now

Evelyn Blake
Last updated: 27 November 2024 20:29
Evelyn Blake
Share
Image Source: USA Today
SHARE

The rising decentralized social media platform, Bluesky, is scrambling with its first big AI data scrape. One million public posts were eventually crawled and uploaded to help train generative AI, despite its creators’ assurances that the platform would not participate in training generative AI. Hosted on Hugging Face, this dataset was created by machine learning librarian Daniel van Strien.

Furthermore, Bluesky included text content, metadata, information about media attachments, and even decentralized identifiers (DIDs) for Bluesky users. It was described explicitly to be used for natural language processing, social media trend analysis, and content moderation experiments.

Bluesky

Bluesky’s Open API Raises Questions

The public posts are available through Bluesky’s open firehose API and Authenticated Transfer (AT) Protocol. Real-time updates such as posts, likes, follows, and many more are streamed to you through the Firehose API. This openness provides a great opportunity for new ideas, but it also presents moral issues regarding how your user data is collected and utilized.

Bluesky users did not agree to having their posts used for machine learning. Nevertheless, the platform does not forbid actions of this kind. This issue brings forward the issue of a critical gap in user data protection and the risk of decentralized platforms.

Public Outcry and Data Removal

Once Bluesky made the dataset public, it stirred up controversy with Bluesky users and privacy advocates who criticized that the dataset lacked transparency and consent. Hugging Face has removed the dataset in response.

Daniel Van Strien apologized for the ethical lapse in a public statement. In a statement, he said that he was motivated to support platform tool development, but his approach broke principles of transparency and consent in data collection. But he was sorry for this mistake.

Bluesky’s Response

The controversy was met by a Bluesky representative who focused on the public and decentralized nature of the platform. They compared their situation to websites and their robots.txt files, which don’t always work to prevent web crawling.

There was a need to provide mechanisms for users to have control over what data was used by unaffiliated third parties. Safeguards are being discussed around starting the code such that the developers will ensure they respect the users’ preferences.

Implications for Bluesky Users

This is a dark warning for Bluesky’s expanding user base. Users simply moved to the platform to avoid competitor X’s shocking AI training policies. The controversy shows the fragile condition of balance between openness and privacy in decentralized networks.

Bluesky’s decentralized structure has the benefit of transparency and collaboration while being less forgiving than centralized systems when it comes to user privacy risks. The way the company is trying to address these issues will surely affect the long-term growth and trust of the company from the users.

Bluesky

End Note

As good as the removal of this dataset is, it confirms the need for ethical rules on all decentralized platforms. To protect its ethics in social networking, Bluesky must give user consent and transparency precedence.

It is a reminder of the challenges faced by platforms striving to reconcile innovation with user rights. Bluesky’s path forward is to respond to these gaps by bringing them up proactively and building trust with its users.

Share This Article
Facebook Reddit Copy Link Print
Share
Evelyn Blake
By Evelyn Blake
Follow:
Evelyn Blake is an investor in technology and journalist who has been in the nascent space since 2014. Her love and passion for technological innovations made her delve deeper into the world of technology evolution. As a journalist, Evelyn has been covering latest trends and emerging gadgetries. She is a philanthropist and human rights activist.

Let's Connect

FacebookLike
XFollow
PinterestPin
InstagramFollow
Google NewsFollow
FlipboardFollow

Popular Posts

PS6 Expected Release Date

PS6 Expected Release Date, Features, and Upcoming Games

S.Dyema Zandria
iOS 18 and Apple Passwords

iOS 18 and Apple Passwords: How to Import All Your Saved Passwords

S.Dyema Zandria
Block Someone on TikTok

How to Block Someone on TikTok – Complete Guide 2025

S.Dyema Zandria
Character AI

Character AI: Review – Is it Safe for Teens and Kids?

S.Dyema Zandria

You Might Also Like

New Gemini Feature
News

No More Reading Long Emails? Google’s New Gemini Feature

Grammarly
News

Grammarly’s $1 Billion Boost to Build the Future of AI Productivity

Instagram Edits
News

How Instagram Edits Empowers Creators with Advanced AI Video Editing Tools

Google Veo 3
BlogNews

Google Veo 3: The Next Frontier in AI-Generated Video Content

Social Networks

Facebook-f Twitter Instagram Pinterest Rss

Company

  • About Us
  • Our Team
  • Contact Us

Policies

  • Disclaimer
  • Privacy Policy
  • Cookies Policy
Latest
ChatGPT 4.5 Whats New, Features, Access and Comparison with ChatGPT 4.0
Apple Prepares a New Games App for WWDC
Opera Neon: The AI Browser That Works While You Sleep
US Now Orders Chrome Users to Update by June 5 Amid Hack Threat
Claude Gets a Voice as Anthropic Adds Hands-Free Chat Option

© 2024 The Tech Basic INC. 700 – 2 Park Avenue New York, NY.

TTB White LOGO TB
Follow US
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?