submitted 2 months ago by CynicusRex@lemmy.ml to c/privacy@lemmy.ml
CynicusRex@lemmy.ml 11 points 2 months ago

#TL;DR:

User-agent: GPTBot
Disallow: /
User-agent: ChatGPT-User
Disallow: /
User-agent: Google-Extended
Disallow: /
User-agent: PerplexityBot
Disallow: /
User-agent: Amazonbot
Disallow: /
User-agent: ClaudeBot
Disallow: /
User-agent: Omgilibot
Disallow: /
User-agent: FacebookBot
Disallow: /
User-agent: Applebot
Disallow: /
User-agent: anthropic-ai
Disallow: /
User-agent: Bytespider
Disallow: /
User-agent: Claude-Web
Disallow: /
User-agent: Diffbot
Disallow: /
User-agent: ImagesiftBot
Disallow: /
User-agent: Omgili
Disallow: /
User-agent: YouBot
Disallow: /
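
To sanity-check rules like the ones above before deploying them, here is a minimal sketch using Python's standard-library `urllib.robotparser`. The shortened rule list, the `example.com` URL, and the `ExampleCrawler` name are assumptions for illustration only; paste in your own full file to test it.

```python
from urllib.robotparser import RobotFileParser

# Shortened excerpt of the rules above (assumption: your real file has more entries).
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /
User-agent: ClaudeBot
Disallow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text held in memory, without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# Hypothetical URL on your own site.
url = "https://example.com/post"

# A listed crawler is told to stay out.
gptbot_allowed = parser.can_fetch("GPTBot", url)

# A crawler that is not listed (hypothetical name) is still allowed,
# because the file contains no "User-agent: *" group.
other_allowed = parser.can_fetch("ExampleCrawler", url)

print("GPTBot allowed:", gptbot_allowed)
print("ExampleCrawler allowed:", other_allowed)
```

This only shows what a compliant parser would conclude from the file; it does not enforce anything on the server side.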
mox@lemmy.sdf.org 6 points 2 months ago

Of course, nothing stops a bot from sending a User-Agent header that exactly matches a web browser's.
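
A rough illustration of that point, again with Python's standard-library `urllib.robotparser`: the rules are matched against whatever name the client claims, so a browser-like string falls through every bot-specific group. The browser string and the `example.com` URL below are made-up examples, not taken from any real crawler.

```python
from urllib.robotparser import RobotFileParser

# One group from the list above is enough to show the effect.
rules = ["User-agent: GPTBot", "Disallow: /"]

parser = RobotFileParser()
parser.parse(rules)

# Hypothetical browser-like User-Agent string a bot could choose to send.
browser_ua = "Mozilla/5.0 (X11; Linux x86_64; rv:128.0) Gecko/20100101 Firefox/128.0"

# Same URL, same rules; only the claimed identity differs.
allowed_as_gptbot = parser.can_fetch("GPTBot", "https://example.com/")
allowed_as_browser = parser.can_fetch(browser_ua, "https://example.com/")

print("claiming GPTBot:", allowed_as_gptbot)
print("claiming a browser:", allowed_as_browser)
```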

JackbyDev@programming.dev 4 points 2 months ago

Nothing stops a bot from choosing not to read robots.txt.

mox@lemmy.sdf.org 2 points 2 months ago* (last edited 2 months ago)

Indeed, as has already been said repeatedly in other comments.

this post was submitted on 18 Aug 2024
106 points (100.0% liked)
