Goofy Quote Engine Spam....

spambot.jpg
 
It looks like a basic spam bot, other than the post field being so long it looks like an attempt at a buffer overflow on a wordpress form.

I keep thinking I should deny all foreign ips on insurance domains on the server just for security.
 
I keep thinking I should deny all foreign ips on insurance domains on the server just for security.

I did something like that for a while. Everytime I would get a bot from China or any of the Russian states I would ban the IP on the server side. Seems every time I would ban an IP they would hit my site from a new one in China or one of the Russian Federation states. It was like trying to kill roaches.

Talked with Malcolm (Pangaea) and he said searches from Yandex, Baidu and others would help your rank.

Can't say if it is good or bad, but in the last few days I have the following bots that have visited my site.

sefooz
baiduspider
bingbot
googlebot
scoutjet
tweetmebot
lemurwebcrawler
yandexbot

Don't know if that is good or not. Seems I am getting quite a few spam comments that make it to the holding tank, waiting on approval.
 
Seems I am getting quite a few spam comments that make it to the holding tank, waiting on approval.

Yea....those are fun......every morning going through them striping out the links and approving........the way I see it is traffic and new content is always good.......
 
striping out the links and approving........the way I see it is traffic and new content is always good.......

I just mark as spam and sometimes ban that IP.

Be interesting to read Brook's comments on this.
 
Frequency of activity is one of the things that is supposed to be looked at in the google algorithm.

You can even trick a page into thinking duplicated content is the original copy by updating it more frequently than the original, or having comments post to it. It can even be done with a fake comment system, posting to replicated spun content.

They do catch those sites, just saying that the update frequency thing is pretty well known.

There is a way to run a plugin called spam free wordpress and just allow spam through onto the blog but strip the links out of it. I'm not sure if the content is of value, but google does like seeing the updates happening.

Best way is probably to manually review the content and strip part of it out then post it if its spam, or rewrite it to create a fake frequency of update that appears somewhat real.

I watched a guys case study video last week where he took completely duplicated content, then updated it several times based on the google spider hitting his sitemap, each time it hit he updated slightly, and after the 3rd hit he outranked the original post he had copied for the same term, and the original post had dropped off the index.

It's a flaw in the algorithm, probably part of the reason they're using social metrics so much now, but social is just as flawed and easy to cheat on.
 
allow spam through onto the blog but strip the links out of it

Are you suggesting I approve the spam and the plugin automatically strips out the links? Does it also strip out the email addy?

I have been reporting the obvious spam and trashing it.
 
Are you suggesting I approve the spam and the plugin automatically strips out the links? Does it also strip out the email addy?

I have been reporting the obvious spam and trashing it.

I hate to sound like a politician here, but I don't really know the right answer. I don't think there is a right and wrong here.

The best way I can answer it is to say, one of the metrics google does look at is update frequency. A comment, even if it is purely spam, is an update. The email address isn't a link, it wouldn't matter. Past that there are hundreds of valid approaches to how to address the spam.

I personally looked at an approach of not approving any comments, but holding them in the unapproved status, then having a script that checked my database for unapproved comments, approved them IF they didn't have any html links in them, and if they did have links or urls completely replace the comment with a random pre-written comment that randomized itself, then approve. I still haven't written the code to do it, but it wouldn't be terribly hard.

I'd like to make a wordpress plugin that did exactly that, but just like anything else, it would really need a round of testing to prove if it did anything.

As for now, you could strip the links and approve with a plugin, or hold and check manually, or throw them all in the trash, but like I said before, 2-3 rewrites/updates and a few backlinks can make plagurized content become original in the eyes of google.
 
Back
Top