Google mining the web for blog spam comments

August 2, 2007 by
Filed under: comments, Google, moderation, spam, word verification 

It is always a pleasure to come back from holidays and read comments on your blog. However, not all comments are worth spending time. An example of undesirable comment can be found here. After a first read, it already sounds like a strange comment. Expression such as Hey buddy! make it feel that it is spam. If you go deeper in the text, you can see that there is no personal information about my blog or the topics I cover. This is typical of spam comments.

However, deleting a normal comment may be annoying, especially for the guy who posted it. If you want to avoid the extremity of word verification process or comment moderation, a simple solution is to use Google. Just copy/paste the first line of text and Google will mine the web for other similar comments.

In the case described above, simply put the first sentence (using quotes to get the exact match) and Google will link you to this site. You can easily see that the first comment is exactly the same and can therefore be safely considered as spam.

Share

Comments

4 Comments on Google mining the web for blog spam comments

  1. Mohammad Taghi on Sun, 5th Aug 2007 10:11 am
  2. Dear Sandro Saitta
    Hi, Are you fine?
    I am PhD Student in Ankara University, Turkey. I want to apply Data Mining(Decision Tree)in water reservoirs control and operation. I read your blog and i hope that, i can used from your experiences in this topic.
    Thanks

  3. Sandro Saitta on Tue, 7th Aug 2007 8:32 am
  4. Hello,

    If you ask specific questions on the blog, there is a chance that one of the reader (or me) may be able to answer you.

    Kind regards.

  5. Mohammad Taghi on Tue, 14th Aug 2007 8:48 am
  6. Hi, Mr Sandro
    I new started in Data Mining.
    I want any practical and simple examples about clustering and decision tree.

  7. Sandro Saitta on Tue, 14th Aug 2007 10:14 am
  8. Hi,

    In my personal opinion, Introduction to Data Mining (Tan et al, 2006) is the best introduction to data mining (especially if you’re interested in clustering)

Tell me what you're thinking...





  • Swiss Association for Analytics

  • Most Popular Posts

  • T-shirts, Mugs & Mousepads


    All benefits given to a charity association
  • Data Mining Search Engine

    Supported by AnalyticBridge

  • Archives

  • Reading Recommandations