Top Five Articles in Data Mining

April 19, 2011 by

Over the past few years, I’ve read many data mining articles. Here are my top five. For each article, I give the title, the authors and part of the abstract. Feel free to suggest your own favorites.

An Introduction to Variable and Feature Selection

Isabelle Guyon and André Elisseeff

Variable and feature selection have become the focus of much research in areas of application for which datasets with tens or hundreds of thousands of variables are available. These areas include text processing of internet documents, gene expression array analysis, and combinatorial chemistry. The objective of variable selection is three-fold: improving the prediction performance of the predictors, providing faster and more cost-effective predictors, and providing a better understanding of the underlying process that generated the data.

Data Clustering: A Review

A.K. Jain, M.N. Murty and P.J. Flynn

Clustering is the unsupervised classification of patterns (observations, data items, or feature vectors) into groups (clusters). The clustering problem has been addressed in many contexts and by researchers in many disciplines; this reflects its broad appeal and usefulness as one of the steps in exploratory data analysis. However, clustering is a difficult problem combinatorially, and differences in assumptions and contexts in different communities has made the transfer of useful generic concepts and methodologies slow to occur. This paper presents an overview of pattern clustering methods from a statistical pattern recognition perspective, with a goal of providing useful advice and references to fundamental concepts accessible to the broad community of clustering practitioners.

From Data Mining to Knowledge Discovery in Databases

Usama Fayyad, Gregory Piatetsky-Shapiro and Padhraic Smyth

Data mining and knowledge discovery in databases have been attracting a significant amount of research, industry, and media attention of late. What is all the excitement about? This article provides an overview of this emerging field, clarifying how data mining and knowledge discovery in databases are related both to each other and to related fields, such as machine learning, statistics, and databases.

Nine Laws of Data Mining

Tom Khabaza

In its current form, data mining as a field of practise came into existence in the 1990s, aided by the emergence of data mining algorithms packaged within workbenches so as to be suitable for business analysts.  Perhaps because of its origins in practice rather than in theory, relatively little attention has been paid to understanding the nature of the data mining process.  The development of the CRISP-DM methodology in the late 1990s was a substantial step towards a standardised description of the process that had already been found successful and was (and is) followed by most practising data miners.

Statistical Modeling: The Two Cultures

Leo Breiman

There are two cultures in the use of statistical modeling to reach conclusions from data. One assumes that the data are generated by a given stochastic data model. The other uses algorithmic models and treats the data mechanism as unknown. The statistical community has been committed to the almost exclusive use of data models. This commitment has led to irrelevant theory, questionable conclusions, and has kept statisticians from working on a large range of interesting current problems. Algorithmic modeling, both in theory and practice, has developed rapidly in fields outside statistics.

