Monthly Archives: January 2006

Log crawling

Once in a while, I wander through the web server logs to see who’s a bot and who’s not, and whether the bots are well-behaved. I’m not all that strict (I don’t even have a robots.txt file yet), but I don’t like bots that suck all the pages down extremely quickly or display other anti-social tendencies. One of the bots I have banned is from an outfit called NameProtect, since any mention of trademarks on this site will fall entirely within the bounds of fair use. I’ve left the bot from TurnItIn alone, since I don’t have any particular objection to plagiarists who are using our stuff getting caught (if my understanding of how TurnItIn works is flawed, please let me know). Every so often, though, I’ve noticed a bot that’s pretending not to be a bot. Frequently, these are spam address harvesters, but I’ve occasionally noticed that the IP range of one of the spoofers is owned by these assholes. Today, I finally looked into who they are and whom they work for, and I’m sorry I didn’t ban them a long time ago.
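
For anyone who wants to do the same kind of log wandering, here is a rough sketch of one way to spot clients that pull pages down too quickly. It is not the method used on this site. It assumes the server writes Common or Combined Log Format, and the function name `fast_clients` and the 60-requests-per-minute threshold are made up for illustration.

```python
import re
from collections import Counter

# Matches the start of a Common/Combined Log Format line (assumed format):
# host ident user [day/month/year:HH:MM:SS zone] "request" ...
LOG_LINE = re.compile(r'^(\S+) \S+ \S+ \[([^\]]+)\]')


def fast_clients(lines, max_per_minute=60):
    """Return {ip: peak requests in any one minute} for clients over the limit.

    max_per_minute is a hypothetical threshold; tune it to your own traffic.
    """
    per_minute = Counter()
    for line in lines:
        m = LOG_LINE.match(line)
        if not m:
            continue  # skip lines that are not in the assumed format
        ip, stamp = m.groups()
        minute = stamp[:17]  # "10/Jan/2006:13:55", seconds and zone dropped
        per_minute[(ip, minute)] += 1

    peaks = {}
    for (ip, _minute), count in per_minute.items():
        if count > max_per_minute:
            peaks[ip] = max(peaks.get(ip, 0), count)
    return peaks
```

Feeding it an open log file (for example `fast_clients(open("access.log"))`, where the file name is a placeholder) gives a short list of addresses worth a closer look. It says nothing about who is pretending not to be a bot; that still takes checking user agents and IP ownership by hand.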

New address obfuscation

I noticed that Outlook Express was doing the wrong thing with the LF character at the beginning of my address, so I’ve changed it to a space. If the spam bots are smart enough to remove the space, I’ll have to do something else. Since they weren’t smart enough to remove the line feed, I’m trying to remain optimistic.
Update: spam bots apparently are smart enough to take out the space. I’m trying a vertical tab character now. Outlook does the right thing; I’ll check OE as soon as I remember. Odds are good I’ll have to change the alias, though, since I have gotten a spam attempt. I’ll keep the mailto links up to date, and try to remember to keep any other in-line mentions clean, too.
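
As a rough illustration of the trick, the sketch below builds a mailto link with a single throwaway character in front of the address. How the character actually gets into the links on this site is not described above, so percent-encoding it into the href is an assumption here, and `obfuscated_mailto` and the example address are hypothetical.

```python
from urllib.parse import quote


def obfuscated_mailto(address, prefix="\x0b"):
    """Build a mailto URL with one junk character ahead of the address.

    prefix defaults to a vertical tab; a line feed ("\n") or a space (" ")
    are the earlier variants. Percent-encoding is an assumed embedding.
    """
    return "mailto:" + quote(prefix, safe="") + address


print(obfuscated_mailto("someone@example.com"))
```

The idea is that a mail client drops the stray character while a naive harvester copies it into the address and ends up with something undeliverable. Whether a given client really does drop it has to be tested client by client.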