Some links below may be affiliate links. BMOW may get paid if you buy something or take an action after clicking one of these.
As an Amazon Associate BMOW earns from qualifying purchases.
|
|
|
Spamalot |
|
Submitted by Rickshaw :: Thu Mar 01, 2007 11:54 pm |
I spent some time de-spamming Runworks today. If you visited in the last couple of days, you probably noticed the appearance of a whole bunch of posts about what Lindsey Lohan is doing with Catherine Zeta Jones, or something else not safe for work viewing. Thanks, but no thanks. I decided I needed to tighten up security around here to help keep spam out, so I opened up my toolbox and got to work.
Since about a year ago, new users registering at Runworks have been required to solve a captcha (an image containing distorted/hard-to-recognize text) and reply to a confirmation email. I figured that would prevent anybody from creating an automated tool to create new user accounts for posting spam. I was wrong.
Solving the captcha in an automated way proved to be easier than I would have thought. Do a Google search for "defeat captcha", and you'll see that the captchas for many popular web sites can be solved in an automated manner by using image filters and pattern recognition programs. It's not something a casual user could automate in 5 minutes, but for someone knowledgable enough and with a big enough potential reward, it's doable.
Part of my problem was that I'm using phpBB, a very popular bulletin board program, so the captchas on Runworks looked identical to all the other phpBB sites in the world. That makes spending the time to defeat the phpBB captcha a worthwhile goal for a spammer, where defeating a captcha that was unique to Runworks wouldn't be. So I removed the phpBB captcha, and with a little experimentation, I created my own text-obfuscating algorithm to use in its place. I'm not sure if it's any more robust than the phpBB-provided one, but since it's unique to this site alone, it's doubtful anyone will be interested enough to try to defeat it.
Can you read this? Neither can I.
The confirmation email proved to be easy to automate as well. Free email sites like hotmail and gmail can be checked for and handled specially, but now spammers have discovered a way to generate randomly-named temporary domains for receiving email. I had lots of registrations from users with email like asterbast@fkdj39jdkh291.info. Spammers (or an automated tool they wrote) would have to actually receive and reply to email at this address in order to complete the registration process. I believe this is possible because of a technicality in the rules governing domain registration. You can "test" a domain name for 24 or 48 hours before you have to pay for it, so spammers register hundreds or thousands of random domain names, use them for a day or two, and then dump them. There's not much I can do about that. Hopefully the new captcha will be enough.
For good measure, I also went through and disabled about 300 relatively recently created user accounts, so already-existing accounts couldn't be used to post new spam. I think these were all bogus accounts, but if I accidentally nuked a real user's account in the process, please let me know.
The most interesting moment in this whole process came when the spammer happened to log in and start posting more spams while I was in the middle of making the security changes. I was able to determine the spammer's IP address and trace it back to an ISP in Hong Kong. Then I permanently banned that IP.
Hopefully this will keep things quiet for a while, but I doubt it's the last we'll see of spam. The problem with solutions like captchas is that they only determine who's a computer and who's a human (and imperfectly at that), not who's got good intentions and who's got bad ones. A spammer could easily employ people in a sweatshop somewhere to solve captchas for target sites, and then turn over the actual posting of the spams to an automated program. You could even imagine a black-hat web site hosting illegal software or porn that required users to solve a captcha (pulled from the spammer's target site list) before the next warez file could be downloaded.
The internet is a bit of a mess. It's amazing that it actually works as well as it does.
Now back to running...
|
|
|
|
All times are GMT - 8 Hours
|
Page 1 of 1
|
|