Data, Information, Knowledge and Wisdom
October 24, 2007 by Sandro Saitta · Leave a Comment
Filed under: DIKW hierarchy, data, information, knowledge, wisdom
The aim of data mining is to draw understandable knowledge from raw data. Behind these notions of data and knowledge, a more complex hierarchy exists. This hierarchy originates independently from knowledge management, design and information science (1). In knowledge management, the Data Information Knowledge Wisdom (DIKW) hierarchy or pyramid has been initiated by Cleveland in 1982, Zeleny (2) in 1987 and Ackoff in 1989 separately.
Filed under: DIKW hierarchy, data, information, knowledge, wisdom
WIRED point of view on AI
October 19, 2007 by Sandro Saitta · 1 Comment
Filed under: WIRED, artificial intelligence, vulgarization
Filed under: WIRED, artificial intelligence, vulgarization
In its October 2007 issue, WIRED has special section named "Geekipedia". In this supplement, WIRED summarizes 149 people, facts or concepts that they think are important. Among the list, one can find "Artificial Intelligence". The description is quite negative and focus on different aims that AI hasn't been able to achieve. I agree with… Continue reading... | 1 Comment
One year of Data Mining Research
October 15, 2007 by Sandro Saitta · 1 Comment
Filed under: data mining research, visitor sources, visitor trends
I have started blogging in June 2006. However, I consider that the real start of Data Mining Research (DMR) was in October 2006. It is at this date that I have started posting on a regular basis. Since October 2006, DMR has also been subject to several changes. First, I think that posts have gone from a passive view of data mining related news and applications to more active opinions… Continue reading... | 1 Comment
Filed under: data mining research, visitor sources, visitor trends
EPFL forum: leitmotif “business intelligence”
October 12, 2007 by Sandro Saitta · Leave a Comment
Filed under: EPFL forum, cognos, companies, data mining application
Filed under: EPFL forum, cognos, companies, data mining application
I recently went to the company forum of my campus at EPFL. I have a post about a similar event last year. This year, several companies were present including Microsoft and Google. Google was giving a presentation but no interview. If you want to apply for a job at Google, you have to… Continue reading...
Data Mining et al.
Today, I would like to introduce you to a new blog (started in September this year) written by Georg Russ: Data Mining et al. Up to now, most of his posts focus on data mining applications. Among others, he writes about data mining for sports science as well as agriculture data. I hope we will soon get some details about his future experiments and challenges he encounters. So, keep… Continue reading...Stratification for data mining
October 4, 2007 by Sandro Saitta · 5 Comments
Filed under: class distribution, cross-validation, stratification
One common issue in data mining is the size of the data set. It is often limited. When this is the case, the test of the model is an issue. Usually, 2/3 of the data are used for training and validation and 1/3 for final testing. By chance, the training or the test set may not be representative of the overall data set. Consider for example a data set of… Continue reading... | 5 Comments
Filed under: class distribution, cross-validation, stratification
Black sheep poster
October 1, 2007 by Sandro Saitta · 4 Comments
Filed under: black sheep, data interpretation, udc poster
Filed under: black sheep, data interpretation, udc poster
Have you heard of the black sheep issue in Switzerland? If not, then go to BBC News website to have a quick overview. The biggest political party in Switzerland, UDC, has put some controversial ads showing white sheep (Swiss citizens) and black sheep (foreigners who commit crimes). I think you can imagine the confusion… Continue reading... | 4 Comments














