Machine learning according to Mitchell
January 30, 2007 by Sandro Saitta · Leave a Comment
Filed under: machine learning, mitchell, research questions
Filed under: machine learning, mitchell, research questions
I recently read a white paper about machine learning written by Tom Mitchell. The article can be found here. This recent paper (July 2006) deals about the discipline of machine learning, its state and position in computer science. Mitchell also writes about current research areas and possible directions for future work. He has a… Continue reading...
Around the world
I'm really excited about the abundance and quality of other blogs related to data mining. These blogs are often good sources of ideas for my everyday work in data mining. Here are recent posts around the world:- Abbott's Analytic has an interesting post entitled Do and Do Not, about issues related to the collective knowledge in data mining.
- As written by Kevin Hillstrom, if "you work for a company that
Data mining is trendy
The expression data mining is more and more "trendy" these days. As an example a recent post on Engadget about explosive data mining robots. After reading the news, it happens that it has nothing to do with data mining but rather with data gathering. More astonishing is the first comment. According to non practitioners, credit card is the only successful data mining application around…Stealing data mining books
Google is great in so many aspects. You can look for nearly anything and find relevant information in less than a second (I should advertise for Google :-) The dark side of such a powerful tool? It is often referencing illegal content. You were all aware of the possibility of downloading mp3, divx and so on. But did you know about data mining books? If like me, you cannot believe… Continue reading...What Google can’t mine
January 18, 2007 by Sandro Saitta · 1 Comment
Filed under: deep web, hidden web, information, search engine
Filed under: deep web, hidden web, information, search engine
While I was reading a book about search and information, I found a particular chapter about the hidden web very interesting. Basically, the hidden web is the part of the Internet that is accessible to people but not to bots (such as Google bots). In other words, these pages exist, but they are… Continue reading... | 1 Comment
Data mining book recommendation
January 17, 2007 by Sandro Saitta · 3 Comments
Filed under: beginner, book, data mining field, literature
Filed under: beginner, book, data mining field, literature
As in several fields, lots of books exist in the data mining literature. People new to any domain usually appreciate suggestions from someone in the field. With my present experience in data mining books, I would suggest Introduction to Data Mining (Tan et al., 2005). It is a readable, although not comprehensive, book about… Continue reading... | 3 Comments
Unique cluster
Estimating the correct, or most reliable number of clusters, namely cluster validity, is of importance in clustering. For more details about clustering and cluster validity, you can read these three posts: part1, part2 and part3.In the recent literature, a lot of work has been done on clustering and on developing indices that… Continue reading...














