Data mining people: Heikki Mannila

November 30, 2006 by Sandro Saitta · Leave a Comment
Filed under: Mannila, people 
Here is a new post about data mining people. Today, Heikki Mannila is introduced. He has Ph.D. in computer science from the University of Helsinki. He worked for companies such as Microsoft and Nokia. He also was a research director in Helsinki Institute for Information Technology. He is currently an academy professor.Heikki Mannila is well… Continue reading...
  • Share/Bookmark

Data mining explained

November 29, 2006 by Sandro Saitta · Leave a Comment
Filed under: Uncategorized 
On the blog of Devipriya, there is a very interesting and complete introduction to data mining named "Who is mining your data?". This clearly written introduction is mainly intended to people who wants to know what motivates data mining and what are the possible applications. Only the minimum technical terms are used so that any reader can understand what data mining is about…
  • Share/Bookmark

Juice Analytics’ Blog

November 27, 2006 by Sandro Saitta · Leave a Comment
Filed under: blog, juice analytics 
It's always a pleasure for me to find interesting blogs about data mining and to present them here. Juice Analytics is company that... well, let's them define what they do with their own words: "Juice Analytics helps small and mid-market companies develop deep prospect and customer understanding through visualization and analytics of existing
  • Share/Bookmark

Now boarding!

November 24, 2006 by Sandro Saitta · 2 Comments
Filed under: Google, blog, mahalanobis 
Here is some food for the week-end:
  • Will is explaining a good alternative to the standard Euclidean distance by introducing the Mahalanobis distance on his blog
  • Andy is writing about the fact that Google seems to start integrating blog post in its results (pointed by Matthew)
By the way, I would like to thank Joël Arnold for the nice drawing he made for me (picture on the right)…
  • Share/Bookmark

Cluster validity: Existing indices

November 23, 2006 by Sandro Saitta · 9 Comments
Filed under: clustering 
The third - and final - post on cluster validity is about existing validity indices. As written in (1), the two fundamentals issues in cluster validity are 1) the number of clusters present in the data and 2) how good is the clustering itself.Several indices have been proposed in the literature. The main idea with these indices is to plot them with regard to the number of clusters and then… Continue reading... | 9 Comments
  • Share/Bookmark

Cluster validity: Clustering algorithms

November 22, 2006 by Sandro Saitta · 2 Comments
Filed under: clustering algorithms, k-means 
Now that the clustering ideas have been introduced, let's look at existing clustering strategies. Several clustering techniques can be found in the literature. They can be divided in four main categories (1): partitional clustering (K-means, etc.), hierarchical clustering (BIRCH, etc.), density-based clustering (DBSCAN, etc.) and grid-based clustering (STING, etc.). In the literature, clustering can be found under different expression such as unsupervised learning, numerical taxonomy and partition (2). One… Continue reading... | 2 Comments
  • Share/Bookmark

Cluster validity: Introduction to clustering

November 21, 2006 by Sandro Saitta · 2 Comments
Filed under: clustering, unsupervised learning, validity index 
In the near future, I will use this blog to write about recent research I'm involved in. I start today (and the following days) by an introduction on the topic I'm interested in: cluster validity.Clustering is certainly the best known example of unsupervised learning. The goal of clustering is to group data points that are similar according to a given similarity metric (by default Euclidean distance is used). As Jain… Continue reading... | 2 Comments
  • Share/Bookmark

Next Page »

  • Data Mining Search Engine

  • Reading Recommandations

  • T-shirts, Mugs & Mousepads

  • Archives

  • Pages

  • Disclaimer

    The opinions discussed on Data Mining Research are my own and do not reflect the position of my current employer, FinScore. The views and opinions expressed by visitors to this blog are theirs and do not necessarily reflect mine.
  • Meta