Stock Picking using Data Mining: Parameter Tuning

It is known that in data mining projects, one can spend 80% of the time for data preprocessing and the remaining 20% for the data mining task itself. However, when data mining is integrated in an overall system (such as a stock picking system), an important task is to tune the parameters of the overall system.

For example, in the above mentioned system, there are several parameters to fix in order to obtain satisfying results. Here is a list of these parameters:

  • Number of stocks to analyze (depends on the computational resources)
  • Number of stocks to select as the best ones (fixed number or with a threshold on the validation accuracy and the minimum number of trades)
  • Short or long term prediction (predict increase/decrease of given stocks in X days)
  • Confusion matrix for the classifier (how to penalize the errors of the classifier)
  • Size of the shifting window (i.e. size of the training/validation set)

These parameters will vary according to each project. For example, you can have a look at the parameters mentioned in a post by Themos Kalafatis. Feel free to comment and give examples of parameters that you have to tune.

Share

Comments

4 Comments on Stock Picking using Data Mining: Parameter Tuning

  1. Themos Kalafatis on Tue, 16th Dec 2008 4:45 pm
  2. Sandro,

    Nice Post…the usage of confusion matrix (and thus a cost-sensitive classifier) on such a predictive application is a must so it is good that you have pointed it out as one of “must do” steps.

  3. Sandro Saitta on Thu, 18th Dec 2008 4:39 pm
  4. Thanks for the comment. However, finding the best confusion matrix is not a straightforward task…

  5. Suresh Babu on Fri, 11th Dec 2009 2:56 am
  6. I want more information on Stock market prediction and data mining tools used to predict crisis in stock market.

  7. Sandro Saitta on Sun, 13th Dec 2009 6:00 pm
  8. @Suresh: you should try the book “Data Mining in Finance” that you can find on Amazon.

Tell me what you're thinking...





  • Swiss Association for Analytics

  • Most Popular Posts

  • T-shirts, Mugs & Mousepads


    All benefits given to a charity association
  • Data Mining Search Engine

    Supported by AnalyticBridge

  • Archives

  • Reading Recommandations