Hi, I’m building a personal website and I don’t want it to be used to train AI. In my robots.txt file I blocked:

  • ChatGPT-User
  • GPTBot
  • Google-Extended
  • FacebookBot

What bots should I also add? Are there any other ways to block AI bots?

IMPORTANT: I don’t want to block search engine crawlers, only bots that are used to train AI.
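For reference, here is a minimal sketch of how the robots.txt described above could be generated. It only covers the four user agents listed in this post. The function name `build_robots_txt` and the constant `AI_BOTS` are made up for illustration, and whether a given bot actually honors robots.txt is up to that bot.

```python
# User-agent tokens taken from the list in the post above.
AI_BOTS = ["ChatGPT-User", "GPTBot", "Google-Extended", "FacebookBot"]


def build_robots_txt(blocked_agents):
    """Return robots.txt text that disallows the whole site for each listed agent."""
    # One group per blocked agent: "Disallow: /" blocks every path.
    groups = [f"User-agent: {agent}\nDisallow: /" for agent in blocked_agents]
    # Catch-all group: an empty Disallow value blocks nothing, so every other
    # crawler (including search engine crawlers) stays allowed.
    groups.append("User-agent: *\nDisallow:")
    return "\n\n".join(groups) + "\n"


if __name__ == "__main__":
    print(build_robots_txt(AI_BOTS), end="")
```

The catch-all group at the end is what keeps ordinary search engine crawlers unaffected: only the named agents get `Disallow: /`.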

    • chevy9294 (OP)
      11 months ago

      Nice idea, but a lot of random text that the user doesn’t see would slow down the website.

      • Pantherina@feddit.de
        11 months ago

        I don’t think that’s really a big problem. Simply make every keyword useless, and automate the process somehow.

        There should be a tool for this, damn. There is at least one Unicode character that doesn’t even display as a blank in a terminal.

        Like… modern web crap doesn’t even load without JavaScript or animations, so don’t worry about a bit more HTML.
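A rough sketch of what automating this might look like, assuming the invisible character meant here is the zero-width space (U+200B), which is a guess. The helpers `obfuscate` and `obfuscate_keywords` are hypothetical names, and whether a scraper strips such characters before training is unknown.

```python
# Assumption: U+200B (zero-width space) is the invisible character to use.
ZERO_WIDTH_SPACE = "\u200b"


def obfuscate(word):
    """Insert a zero-width space between every pair of characters in word."""
    return ZERO_WIDTH_SPACE.join(word)


def obfuscate_keywords(text, keywords):
    """Replace each exact occurrence of a keyword with its obfuscated form."""
    for keyword in keywords:
        text = text.replace(keyword, obfuscate(keyword))
    return text


if __name__ == "__main__":
    page = "My notes on gardening and home networking."
    hidden = obfuscate_keywords(page, ["gardening", "networking"])
    # The text renders the same to a reader, but the raw string no longer
    # contains the keywords as contiguous sequences of characters.
    print("gardening" in hidden)
    print(hidden.replace(ZERO_WIDTH_SPACE, "") == page)
```

Removing the zero-width characters restores the original text exactly, so this only costs a few extra bytes per keyword rather than a lot of extra HTML.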