Why not to use Wikipedia as a reference
I recently had in interesting discussion with my director about references. The goal of references is twofold. The writer, can refer some texts positively. It is the case if he uses an existing algorithm, for example. A reference can also be used negatively. This happens when the writer want to highlight lack in the literature. He can thus justify the originality of his work. As already mentioned in an… Continue reading... | 11 CommentsData mining jobs in Europe
KDnuggets is certainly the best place to find a data mining job in US. Several opportunities are often updated. In addition, external links are given to find data mining jobs using other websites. But what about jobs in Europe? To my knowledge, there is no website specialized on data mining jobs in Europe. Of course, websites such as DataShaping can be used, since they contain such offers. However… Continue reading... | 5 Comments10 years of data mining
In a recent paper in the Data Mining and Knowledge Discovery Journal, Gregory Piatetsky-Shapiro has an interesting paper about recent tendency in data mining. He wrote a survey based on the point of view of KDnuggets, a must-known company sharing news and more for the data mining community. Below is the abstract of his paper:I survey the transformation of the data mining and knowledge discovery fieldData Mining Research updated
As you have probably noticed, DMR has been updated recently. On the top of the blog, you can find the three most read topics. Between parentheses, you can find the number of times the corresponding articles have been accessed. On the sidebar (right), you can find recent news related to data mining. I hope you find this blog convenient to read. Feel free to comment or suggest any improvement.Kind regards.Sandro… Continue reading...MLDM 2007: A brief overview
Here is the last post about the MLDM 2007 conference in Leipzig. As mentioned in an earlier post, several different topics were covered in this meeting. In my opinion, there were no trendy topics such as SVM, ANN, GA, etc. that flood other methods. Below, you can find a list of interesting papers.- "Kernel MDL to Determine the Number of Clusters" by Kyrgyzov et al. where they combine Minimum Description
MLDM 2007: Anil K. Jain’s presentation on clustering
August 16, 2007 by Sandro Saitta · 1 Comment
Filed under: Jain, MLDM, clustering, clustering algorithms, k-means
As written in the previous post, Anil K. Jain was the invited speaker of MLDM 2007. He gave an interesting presentation about clustering, focusing on the user's dilemma. He started with a comprehensive introduction on clustering and then showed some of the future work he is involved in: semi-supervised clustering and clustering with co-association. Below is the abstract of his presentation:Data clustering is a long standing research problem
Filed under: Jain, MLDM, clustering, clustering algorithms, k-means
MLDM 2007: Clustering in Leipzig
August 9, 2007 by Sandro Saitta · 2 Comments
Filed under: MLDM, clustering, conference, data mining application
Filed under: MLDM, clustering, conference, data mining application
I recently came back from the Machine Learning and Data Mining (MLDM) conference in Leipzig, Germany. This was an interesting meeting with various subjects. Out of the usual subjects such as classification (SVM, etc.), feature selection and clustering, a lot of papers were dedicated to applications of data mining.Examples of application domains are:- Intrusion














