pull down to refresh

The open web is closing down for unwanted automated traffic
A growing number of websites are taking steps to ban AI bot traffic so that their work isn't used as training data and their servers aren't overwhelmed by non-human users. However, some companies are ignoring the bans and scraping anyway.
Online traffic analysis conducted by BuiltWith, a web metrics biz, indicates that the number of publishers trying to prevent AI bots from scraping content for use in model training has surged since July.
About 5.6 million websites presently have added OpenAI's GPTBot to the disallow list in their robots.txt file, up from about 3.3 million at the start of July 2025. That's an increase of almost 70 percent.
I feel this to be an excellent measure to stop the abundance of bad AI generated content.
reply
but if the decent content gets locked up, LLM quality is gonna tank! 🤠
reply
from my point of view, AI should access only what you give access and have rights over.
reply