Data mining “one click” applications

I recently read an interesting article from Analytics Magazine entitled “Managing fundamental tradeoffs” by Mu Zhu. This article is interesting since it explains what prevent easy automation of data mining tools. According to the author:

Algorithms used to uncover such a relationship – for example, neural networks and support vector machines – must be sufficiently flexible. This is because only flexible algorithms can be adapted to the vastly different situations that we encounter in practice.

He illustrates his ideas using K-nearest neighbors algorithm. He explains that any flexible algorithm have “knobs” that must be tuned:

Blind applications of predictive algorithms without carefully turning these “knobs” are sure to produce bad or even disastrous results.

He gives another example of “knobs” with decision trees:

The size of the decision tree is an important “knob” and, like the KNN, it is necessary to control this parameter carefully in order for the decision tree to be effective.

Finally, a very important quote according to me:

Predictive analytics and data mining are about finding information from data. They are search operations. As with all search operations, there are always two questions: where do we search, and how do we search? The algorithms are concerned with how to search, but we must tell them where to search, that is, we must feed the algorithms with data.

One of the main conclusion of the author is that “one click” applications cannot solve all problems. What’s your opinion about that? If this is correct, then the next question would be in which situation can we use “one-click applications? Feel free to give your mind about this issue.

Read the full article.


Recommended Reading

Comments Icon5 comments found on “Data mining “one click” applications

  1. i am working as an associate professor,and presently i hav registered in the rayalaseema university ,karnool,AP for Ph.D. .i want information about ‘what are the problems are available in the data mining &its applications.

  2. I think that one-click data mining cannot be done without using domain knowledge. Many decisions required for data mining (or any other real-world activity) require deep knowledge which machines do not have … yet
    But watch out for IBM Watson, Wolfram Alpha, Google AI – general AI is getting closer

  3. @Gregory: Yes, I agree. But imagine that a data mining tool is developed for a given application field (by using the available domain knowledge). Then, what about the “one click” issue? Would it be possible to build this application so that it is automated?

Comments are closed.