Five Reasons Why “Data Miner” is the Best Job in the World
Yes. I do believe that “Data Miner” is the best job on Earth. I will give you five reasons why I think so. Of course, I’m a data miner. I’m thus not objective, but who cares? Here is why everybody should be jealous about data miners:
1. We can predict the future
Yes, we can. Of course, this is not 100% sure, but still we predict the future. As long as we have enough data with some patterns inside (the famous gold nuggets), we can predict the future. What product will this customer buy next? What will be the pollen concentration in the air tomorrow? How will the stock market evolve in the following week? Yes, to some extend, and with some errors, we can predict the future. Isn’t that cool?
2. We can change our job without changing it
Sounds strange? Let me explain this. Data mining is a field where we apply machine learning and statistical techniques to some concrete problems in a certain field. Every new project may cover a different field. This gives you the opportunity to discover and learn new domains without changing your job. While working as a data miner, I have – in six years – discovered fields such as meteorology, civil engineering, finance and telco. Isn’t this great?
3. We can easily impress friends and family
This is not the case in every job. Being a data miner, it is. Of course, we could use simple terms such as “data, analysis and results” to explain what we do. However, we prefer to use terms such as “random set of points, support vector machines and score probabilities” instead. It is much more impressive, so why not use them? Yes, we are like this. Aren’t we awesome?
4. We can spy on anybody
Who has never dreamed of being a spy like James Bond? Being a data miner, we are kind of spies, at least according to the non-technical news. We steal data and we mine them to discover very personal information. We break the privacy of every human being on Earth by spying on them. Yes, we do that in our everyday job, and to be honest we love that. Of course, we could stop, but it is too much fun to play the spy. Isn’t it?
5. We never fail
Sounds impossible? It isn’t: we are data miners. Which means that if we find patterns in your data, we are good. If we don’t – it’s not our fault – it means there are some data quality issues (missing values, not enough data, etc.). But it’s not our fault. And if you insist, we can still find something for you. You bet? Give us any data (even random!) and we will find some patterns using clustering, for example. This is like finding patterns in clouds. If you look long enough, you will always see an image appear. Isn’t it beautiful?
If you think of other reasons, feel free to comment on this post and help me explain why “data miner” is the best job on Earth
Comments
33 Comments on Five Reasons Why “Data Miner” is the Best Job in the World
-
Frank Podlaha on
Sun, 13th Dec 2009 7:05 pm
-
Sandro Saitta on
Sun, 13th Dec 2009 7:35 pm
-
Michael Tweet on
Sun, 13th Dec 2009 11:14 pm
-
Sandro Saitta on
Mon, 14th Dec 2009 12:19 pm
-
Tommy on
Mon, 14th Dec 2009 7:41 pm
-
Dominic Pouzin on
Tue, 15th Dec 2009 9:16 am
-
Sandro Saitta on
Wed, 16th Dec 2009 2:57 pm
-
Dathan on
Thu, 17th Dec 2009 6:07 am
-
Sandro Saitta on
Thu, 17th Dec 2009 10:56 am
-
steffen on
Sun, 17th Jan 2010 12:18 pm
-
Sandro Saitta on
Mon, 18th Jan 2010 8:50 am
-
ernest m on
Tue, 2nd Feb 2010 8:19 pm
-
Sandro Saitta on
Wed, 3rd Feb 2010 12:15 pm
-
kalaivani on
Fri, 5th Feb 2010 12:26 pm
-
Sandro Saitta on
Sat, 6th Feb 2010 2:10 pm
-
dhirendra on
Sun, 7th Mar 2010 11:52 am
-
Sandro Saitta on
Mon, 8th Mar 2010 1:31 pm
-
Jona on
Tue, 9th Mar 2010 6:34 am
-
John Elder on
Sun, 14th Mar 2010 11:59 pm
-
Sandro Saitta on
Thu, 18th Mar 2010 8:56 pm
-
Ray on
Mon, 12th Apr 2010 2:53 am
-
Sandro Saitta on
Tue, 27th Apr 2010 10:27 pm
-
archana on
Wed, 21st Jul 2010 6:31 am
-
ashok on
Fri, 3rd Sep 2010 10:49 am
-
seo marketing on
Mon, 1st Nov 2010 6:25 pm
-
Andy Palmer on
Sat, 27th Nov 2010 8:43 am
-
The data mining effect | Data Mining Research - www.dataminingblog.com on
Tue, 21st Jun 2011 8:23 pm
-
Andy on
Sat, 24th Mar 2012 9:23 pm
-
Sandro Saitta on
Tue, 27th Mar 2012 6:39 am
-
jb on
Wed, 16th Jan 2013 3:42 pm
-
Data Mining And BI-applications. Their Use. | Wisdom Health Prosperity on
Sat, 6th Apr 2013 3:51 am
-
Oakley Canada Sunglasses on
Fri, 10th May 2013 7:58 am
-
CPR on
Sat, 18th May 2013 11:50 am
Sweet!
~a fellow data miner!
@Frank: I’m happy you liked it
NOBODY can predict the future. In the real world, everybody should be wary of geeks with formulas and models. Saying that you can predict the future is simply ridiculous.
“We can impress family and friends”, why do you have to nerdify data mining? no wonder why data mining hasn’t gone mainstream. And no, the fact that you may have many important clients doesn’t mean that is being adopted heavily!
@Michael: Of course we can predict the future… we are data miners!
Well, I thought my smiley at the end of the post was clear enough, but it seems not to everybody…
#5 is the best. I see it constantly in my work. Even when I can’t find what I’m looking for, I find something (ie. data quality problems that need fixing).
Unfortunately, #3 doesn’t work for me. because I’m required to spend my time defining the terms I’m using before they’ll be impressed. Even then most of the time they end up confused.
About impressing people / predicting the future, unfortunately there seems to be an emerging trend consisting of blaming quants and their formulas for the economic situation.
For example see Wired’s “Recipe for Disaster: The Formula That Killed Wall Street”: The article claims that David X. Li’s Gaussian copula function (among others) is one of the root causes behind the crisis.
Article: http://www.wired.com/techbiz/it/magazine/17-03/wp_quant?currentPage=all
At least, “Data Miner” is not a dirty word yet (unlike “Banker”)
@Tommy: I’m sorry for you man, #3 is maybe the best one
@Dominic: I’m quite confident that in a few decades, the term “data miner” will be as dirty as “banker”
Thanks for the link!
Okay, I’m jealous!
@Dathan: I know
Priceless
I should visit your blog more often (or using rss). I may miss something.
All the best for 2010.
Steffen
@steffen: Thanks for your kind words. I think the easiest is RSS or e-mail subscription.
That explain why everybody wants to be a data miner. I had a boss who would go ” have you tried the K nearest neighbor of the Bayesian Probabilities..” he had no clue of what data mining is but he couldn’t help spewing out all sorts of gibberish stuff.
@Ernest: “That explain why everybody wants to be a data miner” -> Definitely yes
Hi sandro
i was searching a blog like yours for a long time.
Finally i gotta gud one..
your posts are very good and informative.
since am doing my masters degree,i took a course
in data mining as well as data warehousing.
since am interested in data mining
I want to do atleast 2 month internship in data mining area.
Am beginner in this arena,i donot know much about
it but have a craze and very eager to delve into
any research related to mining.. so could u guide me?
@Kalaivani: Thanks for your comment. Data mining can be applied to a very wide range of domains so it is quite hard to propose areas of research. I think the best would be to buy a data mining book with examples of applications and see what you like best. Hope this helps.
hey Sandro Saitta,well while searching about data mining i gone through ur 5 pts.i impressed by these.i want to get some white papers regarding these.can u help me from where should i start for reserch.
@dhirendra: The easiest would be to put your question on KDnuggets forums.
I say #2 is just it. i am creating a clearcut path for myself, trying to get the real deal with data mining and I have done research in 2 distinct areas so far, but I must say seeking out the domain knowledge can be challenging, but can soon make you an “almost expert” in something you never actually thought of in the first place. I’m so loving data mining for that.
Ha! Great list.
You’ll like my “Top 10 Data Mining Mistakes”. It’s got real-world ways we analysts overdo or over-interpret data. (”Seeing things in clouds”, like your #5 is there, too.)
It’s chapter 20 in Nisbet, Elder, Miner: “Handbook of Statistical Analysis & Data Mining Applications”. (Or, ask SAS, one of the vendors, along with SPSS and STATISTICA, that includes free, limited-time mining software, as they are printing that chapter and giving it away free.)
-John Elder
@John: Thanks a lot for your comment. I have already started to read your data mining book, which is excellent. I will soon post a review about it on Data Mining Research.
Excellent post. I want to become data miner. What do you recommend if in my region there are not courses about data mining? Dou you know about a roadmap to study? I have a computer science degree with some certifications in warehousing.
Thanks
@Ray: I would suggest the tutorial at http://www.thearling.com/
yes its realy intresting wht u wrote abut datamining.m continueing my mtech so i wana to a thesis in datamining..could u guide me the posible way to do so.
thank you
can take is as thesis
Great post!!! I love it and sharing with social networks. Thanks
This is truly a great read for me!! Once again, thanks for the quality articles you post on your blog!!
[...] data mining can have an effect. And I’m not thinking about the effect you feel when you do the most exciting job in the world. In this post, I want to discuss the effect of applying data mining in an iterative way (call it [...]
Nice piece of reading ![]()
Right now I’m not sure which career to pursue but I definitely considering data mining as an option. I have to choose between a MS in Data Mining at a mediocre university and MS or PhD in Statistics at a relatively solid one – how do you think which is better for job opportunities in the future? Thanks in advance.
@Andy: thanks for your comment. I would suggest the better university. If you do a master in stats, the field of data mining is open to you. If you choose a phd in stats, then you can maybe focus on data mining research. This way, you will be ready for data mining in industry. What is sure is that with a background in stats, data mining is not an issue. Hope this helps.
Great!
~ IT student, hoping to become a data-miner.
[...] engines, social networks, blogs and forums – all this will help you make a decision on many issues.A corporate database of the enterprise usually contains a set of tables that store records of some f…the patterns, trends or relationship between some data. This kind of information is commonly used in [...]
Oakley Game Sun glasses, Oakley Productive Sun glasses, Oakley Lifestyle Sun glasses, Manufacturer One on one, No cost Transport.
Exactly correct !!!!!
Tell me what you're thinking...

















