Data Mining Interview: Rob Hyndman

October 21, 2011 by Sandro Saitta
Filed under: Uncategorized 

If you're new here, you may want to subscribe to my RSS feed. Thanks for visiting!

I recently discovered Cross Validated, a Q&A platform for statisticians and data miners. With Seth Rogers, Community Developer of Cross Validated, we have conducted an interview of Rob Hyndman, Professor of Statistics. Rob proposed forming this particular community. Thanks to both Seth and Rob for your collaborations.

cv

Data Mining Research: Could you introduce yourself and explain your relationship to Analytics?

Rob Hyndman: I am Professor of Statistics at Monash University, Australia, and Editor-in-Chief of the International Journal of Forecasting. I’m probably best known for my work in statistical forecasting — I am author of the forecast package for R, and I’ve written a couple of books on forecasting.

DMR: What is Cross Validated and how does it help in your work?

RH: Cross Validated is a website where people can ask and answer questions about statistical topics. The site is part of the Stack Exchange network of sites, all of which are free community-driven expert Q&A sites around particular topics. We [community members] interpret statistics quite broadly- the site is intended for statisticians, data miners, and anyone else doing data analysis.

When I proposed the site in April 2010 [in Stack Exchange’s new site hatchery called Area 51] I had in mind it being useful for the thousands of researchers using statistical methods, but who may not have enough statistical training to be confident that they are using the best methods and implementing them appropriately. Having spent a couple of decades as a statistical consultant, I thought it would be nice to have a good site I could recommend to people, rather than trying to answer every question myself. It’s turned into something much bigger than that, and is now a wonderful resource for everyone doing statistics, even those who have years of experience.

Having proposed the site, I was one of the first moderators when it launched in July 2010. After about six months, I decided to take a back seat and some of the most reputable users formed a new moderation panel. They are doing a great job and I’m delighted to see the site being so active and obviously meeting the needs of so many people.

One of the nice things about CV, and other Stack Exchange sites, is that the good answers get voted up and it is easy to see what the community regards as the best answer. You can also see which of the people answering has established a reputation for providing helpful advice, based on their reputation scores. It is also extremely easy to find answers to past questions. This sets it apart from email lists and forums where you have to search through badly formatted archives. Everything is tagged and searchable, and (as of 5 October 2011) there are more than 5300 questions and more than 10000 answers that provide a repository of useful knowledge which is freely available.

DMR: Do you recommend Cross Validated to your students and colleagues?

RH: I recommend CV all the time. I’ve promoted it on my blog and I often refer people to CV when they send me questions by email. So it has helped in filtering out some of the questions that otherwise would land on my desk. I will now usually suggest that if someone has a specific question, they ask on CV first. Very often the answers are faster and better than what I would have provided. There is a wonderful community of people on CV (more than 5000 of them!) that are very helpful and willing to share their expertise.

DMR: Can you remember a particular problem or question that came up during a project that CV helped you solve?

RH: I tend to answer more questions than I ask, but occasionally I have asked a question, and I’ve always learned something from the answers. I learned a lot about causation when I asked “Under what conditions does correlation imply causation?” including several references that I was unfamiliar with. I’ve often found answers to my R questions are already available on the site in abundance.

Feel free to participate to Cross Validated.

Note: this interview will also be published on Amstatnews in December.

No TweetBacks yet. (Be the first to Tweet this post)
  • Share/Bookmark

Comments

2 Comments on Data Mining Interview: Rob Hyndman

  1. Lucian on Sun, 6th Nov 2011 12:47 pm
  2. SE does a great job. It’s area is a little broader than the one of metaoptimize.com; they two complement each other.

  3. Amy G on Mon, 29th Apr 2013 7:42 am
  4. Hello,

    My name is Amy and I am a representative of OnlineMathDegrees.org. My team and I have just published a list on our site titled 100 Savvy Sites on Statistics and Quantitative Analysis. To view our article follow the link: http://onlinemathdegrees.org/statistics/.

    Here at OnlineMathDegrees.org we are dedicated to spreading the word so to speak about numbers. So many people are afraid of charts, formulas, algorithms, multiplication and anything related to math. This is why we are here to represent those of us who adventure head first into the world of numbers. Our list is here to provide numbers junkies with great statistic sites for them to delve into.

    We are very excited about our article and we want to spread it beyond our readership. To that end, it would be greatly appreciated if you could share our article with your readership, as you know there is power in numbers. Please let me know what you think, thanks.

    Regards,
    Amy

Tell me what you're thinking...





  • Swiss Association for Analytics

  • T-shirts, Mugs & Mousepads


    All benefits given to a charity association
  • Data Mining Search Engine

    Supported by AnalyticBridge

  • Archives

  • Reading Recommandations