In certain situations, the data miner has to perform sampling on the dataset before applying any algorithm. The main reason being too many data to mine. In such a case, a possible technique is random sampling. If classes are uniformly distributed, one may use random sampling before supervised learning.
But what about association rule mining? If you use random sampling before an association rule algorithm, you may end up finding no…
Continue reading... | 7 Comments

I was at PAW Gov in Washington D.C. on September 12th and 13th and it was just great! Let's start with the people. It was a pleasure for me to meet so many data mining experts. That was one great aspect of this PAW conference: experts are very accessible compared to other events. I had the opportunity to meet great…
Continue reading... | 2 Comments
BAQMaR, a network of analytic people, is organizing its annual event on December 8th in Ghent, Belgium. I have been invited to give a talk during the data mining session. I will present the work I did when I was consultant for FinScore. The talk is entitled "Personalized online advertising using data mining". If you are interested, feel free to
register for the BAQMaR conference. For more details, look…
Continue reading...