Favorite data mining quote
February 28, 2007 by Sandro Saitta · 2 Comments
Filed under: Moore's law, data storage, fayyad, quote
Filed under: Moore's law, data storage, fayyad, quote
Below is my favorite data mining related quote:"The capacity of digital data storage worldwide has doubled every nine months for at least a decade, at twice the rate predicted by Moore's Law for the growth of computing power during the same period." (Fayyad et al., 2002)It can be mentioned to justify the overall success… Continue reading... | 2 Comments
Small book review: Statistical Pattern Recognition
In the present post I would like to write a few words about a book I recently read. The book written by Andrew Webb, Statistical Pattern Recognition (2nd ed.) is certainly the most comprehensive book related to machine learning. Although the title does not contain the "data mining" expression, this book covers all theoretical aspects behind data mining. I have not been able to find any machine learning topic… Continue reading... | 1 CommentAround the world
It is time again to have a look around the world. If you feel bad in the plane, I have some pills for you. Here are the destinations:- Thanks to an anonymous blogger, you can yet have more details about data mining and terrorism.
- Will has a new poll about your use of Matlab toolboxes here.
- Dean is opening an interesting discussion on his blog about the complexity of data
Data mining for terrorism: is it working? (cont’d)
To continue on the post of yesterday, I want to write a few words about Jonas and Harper paper. In this paper, the authors argue that data mining is definitely not useful against terrorism. The first sentence which avoid all confusion about their position is: "Though data mining has many valuable uses, it is not well suited to the terrorist discovery problem". According to the authors, data mining… Continue reading... | 4 CommentsData mining for terrorism: is it working?
Some people think that data mining can be used for terrorism and other not. A team of researchers at Queen's University think that the first case is the most likely to be true. An interesting article about their work can be found here. Although not technical, this article explains that the lab uses data mining for terrorism among others. The Prof. of the lab even states that "sophisticatedNew data mining blog
February 9, 2007 by Sandro Saitta · 6 Comments
Filed under: crime, data mining blog, predictive analytics
Today I would like to introduce you to a new data mining blog. Crime Analysis and Data Mining is a weblog maintained by Shyam Varan Nath. It started in 2007 and deals with data mining used for crime analysis. In his first posts, he writes about biometric as well as news related to crime and security. This blog opens a door on possible applications of data mining that might… Continue reading... | 6 Comments
Filed under: crime, data mining blog, predictive analytics
Feature selection
February 7, 2007 by Sandro Saitta · 4 Comments
Filed under: dimensionality reduction, feature extraction, feature selection, variable, visualization
One of the most interesting and well written paper I have read regarding data mining is certainly "An Introduction to Variable and Feature Selection" (Guyon and Elisseeff, 2003). It is freely available on the Journal of Machine Learning Research website. After reading this paper, you should have a good view of what feature selection really is about. Although not popularized, the paper is written in a very readable way.Feature… Continue reading... | 4 Comments
Filed under: dimensionality reduction, feature extraction, feature selection, variable, visualization














