Do you feel lonely? Do you want more colleagues, especially ones who understand expressions such as validation error, overfitting and SVM? Then you need to find data miners. Here is some advice on how to find them:
I would like to welcome Cristian among the few data mining bloggers. His blog, Text and Data Mining by practical means, will be of interest to several people in the field. Here are a few topics that have already appeared on Cristian's blog:
- Issues with K-means
- Examples of SVM using libsvm
- Search engines

The blog started in June this year. We will certainly find…
Usually data miners don't cheat. The reason is simple: you cannot cheat with the future. In reality, it's a bit more complicated. A data miner may be cheating without knowing it. Here are a few examples.
First, one may cheat by learning the training set by heart. If you cheat (in any way) on your training set, it will show up on the test set as overfitting: performance that looks excellent on the training data drops on data the model has never seen.
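To make "learning the training set by heart" concrete, here is a small sketch. Everything in it is an assumption made for illustration and is not taken from the original post: the synthetic one-dimensional data, the 30% label noise, and the choice of a 1-nearest-neighbour classifier as the "memorizer". The names `make_data`, `predict_1nn` and `accuracy` are hypothetical helpers.

```python
import random


def make_data(n, rng, noise=0.3):
    """Draw n points x in [0, 1); the true label is x > 0.5, flipped with probability `noise`."""
    data = []
    for _ in range(n):
        x = rng.random()
        y = int(x > 0.5)
        if rng.random() < noise:
            y = 1 - y  # label noise: the part that is not worth memorizing
        data.append((x, y))
    return data


def predict_1nn(train, x):
    """Return the label of the training point closest to x (pure memorization)."""
    nearest = min(train, key=lambda point: abs(point[0] - x))
    return nearest[1]


def accuracy(train, data):
    """Fraction of points in `data` whose label the 1-NN memorizer predicts correctly."""
    hits = sum(predict_1nn(train, x) == y for x, y in data)
    return hits / len(data)


rng = random.Random(0)  # fixed seed so the run is repeatable
train = make_data(200, rng)
test = make_data(200, rng)

train_acc = accuracy(train, train)  # each point is its own nearest neighbour
test_acc = accuracy(train, test)    # unseen points: memorized noise no longer helps

print(f"training accuracy: {train_acc:.2f}")
print(f"test accuracy:     {test_acc:.2f}")
```

The memorizer scores perfectly on the data it has already seen, because every training point is its own nearest neighbour. On fresh points drawn the same way, the noise it memorized works against it, and the gap between the two numbers is the overfitting mentioned above.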
Another way of cheating is…