Google mining the web for blog spam comments

It is always a pleasure to come back from holidays and read the comments on your blog. However, not all comments are worth your time. An example of an undesirable comment can be found here. On a first read, it already sounds strange: expressions such as "Hey buddy!" make it feel like spam. If you look more closely at the text, you can see that it says nothing specific about my blog or the topics I cover. This is typical of spam comments.

However, deleting a legitimate comment by mistake is annoying, especially for the person who posted it. If you want to avoid resorting to word verification or comment moderation, a simple solution is to use Google. Just copy and paste the first line of the comment, and Google will search the web for similar comments.

In the case described above, simply search for the first sentence (in quotes, to get an exact match) and Google will link you to this site. You can easily see that the first comment there is exactly the same, so the comment on my blog can safely be considered spam.
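For those who prefer not to build the query by hand, the quoting step can be sketched in a few lines of Python. This is only a rough illustration: the helper names (`first_sentence`, `exact_match_search_url`) and the sample comment are made up for the example, and the sentence splitting is deliberately naive. It only builds the search URL; opening it in a browser and reading the results is still up to you.

```python
import re
from urllib.parse import urlencode


def first_sentence(comment: str) -> str:
    """Return the comment text up to its first '.', '!' or '?' (naive split)."""
    text = " ".join(comment.split())  # collapse newlines and repeated spaces
    match = re.search(r"[.!?](?:\s|$)", text)
    return text[: match.end()].strip() if match else text


def exact_match_search_url(comment: str) -> str:
    """Build a Google search URL for the first sentence, wrapped in quotes."""
    phrase = first_sentence(comment).replace('"', "")  # avoid breaking the quoting
    return "https://www.google.com/search?" + urlencode({"q": f'"{phrase}"'})


# Hypothetical suspicious comment, not the actual one discussed above.
suspicious = "Hey buddy! Great post, I will bookmark it."
print(exact_match_search_url(suspicious))
```

If the first sentence is very short, as in this sample, it is probably better to pass a longer excerpt of the comment, since a short phrase will match many unrelated pages.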



4 comments found on “Google mining the web for blog spam comments”

  1. Dear Sandro Saitta
    Hi, how are you?
    I am a PhD student at Ankara University, Turkey. I want to apply data mining (decision trees) to water reservoir control and operation. I read your blog, and I hope I can benefit from your experience in this topic.

  2. Hello,

    If you ask specific questions on the blog, there is a chance that one of the readers (or I) may be able to answer you.

    Kind regards.

  3. Hi, Mr Sandro
    I have just started in data mining.
    I would like some simple, practical examples of clustering and decision trees.

Comments are closed.