Cynicus Rex@lemmy.ml to Privacy@lemmy.mlEnglish · 4 months agoHow to block AI Crawler Bots using robots.txt filewww.cyberciti.bizexternal-linkmessage-square60fedilinkarrow-up1111arrow-down132
arrow-up179arrow-down1external-linkHow to block AI Crawler Bots using robots.txt filewww.cyberciti.bizCynicus Rex@lemmy.ml to Privacy@lemmy.mlEnglish · 4 months agomessage-square60fedilink
minus-squareasudox@lemmy.worldlinkfedilinkarrow-up6arrow-down1·4 months agoNot sure if that is effective at all. Why would a crawler check the robots.txt if it’s programmed to ignore it anyways?
minus-squareɐɥO@lemmy.ohaa.xyzlinkfedilinkarrow-up14·4 months agocause many crawlers seem to explicitly crawl “forbidden” sites
minus-squareCrashumbc@lemmy.worldlinkfedilinkEnglisharrow-up2·4 months agoGoogle and script kiddies copying code…
Not sure if that is effective at all. Why would a crawler check the robots.txt if it’s programmed to ignore it anyways?
cause many crawlers seem to explicitly crawl “forbidden” sites
Google and script kiddies copying code…