Math Stats and Data Mining

November 29, 2007 by Sandro Saitta · Leave a Comment
Filed under: maths, new blog, statistics 
I recently found the new data mining blog named "Math Stats and Data Mining" written by Rachel Graham. It is a very nice blog with a particular focus on statistics and making sense of data. I really like the way posts are written: readable and entertaining with a personal viewpoint. Certain posts are particularly interesting, such as the one on the Pythagorean Theorem or the one entitled "Why is Statistics… Continue reading...
  • Share/Bookmark

Poll on DIKW hierarchy

November 27, 2007 by Sandro Saitta · Leave a Comment
Filed under: DIKW hierarchy, poll 
If you work in data mining, you are every day confronted to terms such as data, information and knowledge. As explained in a previous post on Data Mining Research, there exists a hierarchy on these terms. It is usually represented as shown in the following picture.My question, regarding this terminology is: What do
  • Share/Bookmark

The two cultures according to Breiman

November 23, 2007 by Sandro Saitta · 1 Comment
Filed under: data miners, statisticians, statistics 
In a recent post on Data Mining Research, Will mentioned a paper entitled Statistical Modeling: The Two Cultures. This paper, written by Leo Breiman (the father of decision trees) and published in 2001 in Statistical Science is intended to both statisticians and data miners. As indicated in the title, Breiman compares two different cultures: the statistical culture assuming data models and the data mining culture using algorithmic models.The
  • Share/Bookmark

Small book review: Web Dragons

November 16, 2007 by Sandro Saitta · Leave a Comment
Filed under: data mining books, search engine, web dragons 
Data mining is a field which is closely related to information extraction and search engines. Web Dragons: Inside the Myths of Search Engine Technology explains everything you want to know about search engines (the so called "web dragons") and how they work. Before reading the book, you perhaps wonder why Witten and co-authors called… Continue reading...
  • Share/Bookmark

RSS Feed of Data Mining Research

November 13, 2007 by Sandro Saitta · Leave a Comment
Filed under: Feedburner, RSS feed 
Some readers reported that the RSS feed of Data Mining Research is sometimes giving feedburner error reports. This may happen if you use the old RSS feed from blogger:http://dataminingresearch.blogspot.com/atom.xml (old feed)This feed is no more valid. For those who are still using it, please update to the following one:http://feeds.feedburner.com/dataminingblog (new feed)Thanks to Shane for noticing the problem.[End of post]
  • Share/Bookmark

Data mining and statistics

I have recently found an interesting paper about the connection between data mining and statistics. It is written by Diego Kuonen, who is now working at Statoo Consulting in Switzerland. The basic question that leads his paper is whether data mining is statistical déjà vu.After explaining what is statistics and why it is needed, he explains data mining using several definitions. He points out an interesting fact by… Continue reading...
  • Share/Bookmark

Data mining interview

November 1, 2007 by Sandro Saitta · Leave a Comment
Filed under: Interview, data miner, practitioner 
Will Dwinnell is a data mining practitioner with a long experience as well as a blogger on Data Mining in MATLAB and Abbott Analytics. He kindly accepted to answer the questions of Data Mining Research (DMR) about his every day work. DMR: Who are you and what is your job?Will Dwinnell (WD): I am Will Dwinnell and I build predictive mathematical models. At the moment, I work… Continue reading...
  • Share/Bookmark

  • Data Mining Search Engine

  • Reading Recommandations

  • T-shirts, Mugs & Mousepads

  • Archives

  • Pages

  • Disclaimer

    The opinions discussed on Data Mining Research are my own and do not reflect the position of my current employer, FinScore. The views and opinions expressed by visitors to this blog are theirs and do not necessarily reflect mine.
  • Meta