91
I was wrong about robots.txt
(evgeniipendragon.com)
This is a most excellent place for technology news and articles.
Kinda, but also not really. Any major tech player that has billions to lose will make a show of respecting robots.txt when presenting that information to third parties, lest they be exposed by basic journalism.
However, they also have separate networks in R&D that sweep the net all the time and do not care about such restrictions. It's theatre.
And they're still happy to punish people that have the gall to publicly decline their crawlers. Basically they can eat their cake and have it too.