unboiled.info
  • Communities
  • Create Post
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
Cynicus Rex@lemmy.ml to Privacy@lemmy.mlEnglish · 1 year ago

How to block AI Crawler Bots using robots.txt file

www.cyberciti.biz

external-link
message-square
46
link
fedilink
31
external-link

How to block AI Crawler Bots using robots.txt file

www.cyberciti.biz

Cynicus Rex@lemmy.ml to Privacy@lemmy.mlEnglish · 1 year ago
message-square
46
link
fedilink
Just a moment...
www.cyberciti.biz
external-link
  • Da Bald Eagul@feddit.nl
    link
    fedilink
    arrow-up
    8
    ·
    1 year ago

    That is what they meant, yes. The title promises a block, completely preventing crawlers from accessing the site. That is not what is delivered.

    • JackbyDev@programming.dev
      link
      fedilink
      English
      arrow-up
      2
      arrow-down
      1
      ·
      1 year ago

      Is it a lie or a simplification for beginners?

      • thanks_shakey_snake@lemmy.ca
        link
        fedilink
        arrow-up
        8
        ·
        1 year ago

        Lie. Or at best, dangerously wrong. Like saying “Crosswalks make cars incapable of harming pedestrians who stay within them.”

        • JackbyDev@programming.dev
          link
          fedilink
          English
          arrow-up
          0
          arrow-down
          2
          ·
          1 year ago

          It’s better than saying something like “there’s no point in robots.txt because bots can disobey is” though.

          • thanks_shakey_snake@lemmy.ca
            link
            fedilink
            arrow-up
            2
            ·
            1 year ago

            Maybe? But it’s not like that’s the only alternative thing to say, lol

          • ReversalHatchery@beehaw.org
            link
            fedilink
            English
            arrow-up
            2
            arrow-down
            1
            ·
            edit-2
            1 year ago

            Is it, though?

            I mean, robots.txt is the Do Not Track of the opposite side of the connection.

      • Eager Eagle@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        1 year ago

        the word disallow is right there

Privacy@lemmy.ml

privacy@lemmy.ml

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !privacy@lemmy.ml

A place to discuss privacy and freedom in the digital world.

Privacy has become a very important issue in modern society, with companies and governments constantly abusing their power, more and more people are waking up to the importance of digital privacy.

In this community everyone is welcome to post links and discuss topics related to privacy.

Some Rules

  • Posting a link to a website containing tracking isn’t great, if contents of the website are behind a paywall maybe copy them into the post
  • Don’t promote proprietary software
  • Try to keep things on topic
  • If you have a question, please try searching for previous discussions, maybe it has already been answered
  • Reposts are fine, but should have at least a couple of weeks in between so that the post can reach a new audience
  • Be nice :)

Related communities

  • Lemmy.ml libre_culture
  • Lemmy.ml privatelife
  • Lemmy.ml DeGoogle
  • Lemmy.ca privacy

much thanks to @gary_host_laptop for the logo design :)

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 1 user / day
  • 1 user / week
  • 948 users / month
  • 6.77K users / 6 months
  • 2 local subscribers
  • 40.5K subscribers
  • 3.55K Posts
  • 72K Comments
  • Modlog
  • mods:
  • k_o_t@lemmy.ml
  • tmpod@lemmy.pt
  • Yayannick@lemmy.ml
  • ranok@sopuli.xyz
  • UI: unknown version
  • BE: 0.19.12
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org