Handling missing values
As you may know, one of the most important, or at least most time-consuming, parts of the whole data mining process is data preprocessing. One common task concerns missing values. In most databases, they are denoted NaN (Not a Number) or simply ?. Before normalizing
Google mining the web for blog spam comments
August 2, 2007 by Sandro Saitta · 4 Comments
Filed under: Google, comments, moderation, spam, word verification
It is always a pleasure to come back from holidays and read comments on your blog. However, not all comments are worth spending time on. An example of an undesirable comment can be found here. Even on a first read, it sounds like a strange comment. Expressions such as "Hey buddy!" make it feel…














