<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Finding Interests of Visitors through Data Mining</title>
	<atom:link href="http://www.dataminingblog.com/find-interests-of-visitors-through-data-mining/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dataminingblog.com/find-interests-of-visitors-through-data-mining/</link>
	<description>Data mining crossroads - research, applications, news, list of blogs and customized search engine about data mining.</description>
	<lastBuildDate>Tue, 07 Feb 2012 08:14:55 +0100</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.5</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Sandro Saitta</title>
		<link>http://www.dataminingblog.com/find-interests-of-visitors-through-data-mining/comment-page-1/#comment-41815</link>
		<dc:creator>Sandro Saitta</dc:creator>
		<pubDate>Fri, 30 Oct 2009 10:21:41 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataminingblog.com/?p=778#comment-41815</guid>
		<description>@Steffen: Thanks for your comment. Your explanations really correspond to what we are doing (I&#039;m happy that you got it). 

Just a few points. First, there is only one website where we gather the user behavior. And yes, we predict interests for people that are clients AND that we can identify (either by logging or other means). This identification allows us to make the link between online data (web profile) and offline data (CRM).

For us, the hardest part is to aggregate the web raw data from the web logs (several Gb per day) in a web profile for each user.

When confronted to unidentified visitors, we find interest groups after they have visited 1 or 2 pages on the websites using the human-defined rules. But of course, there may be other means of doing.

We have the following levels: URL -&gt; Categories -&gt; Interest Groups. Even if categories may have hierarchies, we don&#039;t explicitely use this concept in our solution for the moment.

I&#039;m planning to write more about these topics in the following weeks :-)</description>
		<content:encoded><![CDATA[<p>@Steffen: Thanks for your comment. Your explanations really correspond to what we are doing (I&#8217;m happy that you got it). </p>
<p>Just a few points. First, there is only one website where we gather the user behavior. And yes, we predict interests for people that are clients AND that we can identify (either by logging or other means). This identification allows us to make the link between online data (web profile) and offline data (CRM).</p>
<p>For us, the hardest part is to aggregate the web raw data from the web logs (several Gb per day) in a web profile for each user.</p>
<p>When confronted to unidentified visitors, we find interest groups after they have visited 1 or 2 pages on the websites using the human-defined rules. But of course, there may be other means of doing.</p>
<p>We have the following levels: URL -> Categories -> Interest Groups. Even if categories may have hierarchies, we don&#8217;t explicitely use this concept in our solution for the moment.</p>
<p>I&#8217;m planning to write more about these topics in the following weeks <img src='http://www.dataminingblog.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steffen</title>
		<link>http://www.dataminingblog.com/find-interests-of-visitors-through-data-mining/comment-page-1/#comment-41118</link>
		<dc:creator>Steffen</dc:creator>
		<pubDate>Wed, 28 Oct 2009 17:30:55 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataminingblog.com/?p=778#comment-41118</guid>
		<description>Hello Sandro

Interesting project. If I understand you correct, then you build a flat static model based on features independent of the behavior on the website . The target variable then is deduced dependent on the sites visited, so that you can predict the interest of other visitors as long as they a) are logged in and b) are already clients. As you said, they do not even have to visit the site to get scored.

So far, this strategy sounds good to me :)

I guess the hardest part is to calculate the label. Merging visited sites, time spent on page and maybe the multiple categories of a site into one crisp value is, well, hard.

Even harder is to identify how interested a visitor, who is not logged in, for certain categories. Now we are leaving the area of flat models, now it is getting interesting ...Any plans in this direction ? 

Another point is, that the categories may have a hierarchy (do they ?). I recently experienced that building models for hierarchical models is really non trivial. Any thoughts on this issue ?

kind regards,

Steffen

PS: More posts like that. I like them technical :D</description>
		<content:encoded><![CDATA[<p>Hello Sandro</p>
<p>Interesting project. If I understand you correct, then you build a flat static model based on features independent of the behavior on the website . The target variable then is deduced dependent on the sites visited, so that you can predict the interest of other visitors as long as they a) are logged in and b) are already clients. As you said, they do not even have to visit the site to get scored.</p>
<p>So far, this strategy sounds good to me <img src='http://www.dataminingblog.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>I guess the hardest part is to calculate the label. Merging visited sites, time spent on page and maybe the multiple categories of a site into one crisp value is, well, hard.</p>
<p>Even harder is to identify how interested a visitor, who is not logged in, for certain categories. Now we are leaving the area of flat models, now it is getting interesting &#8230;Any plans in this direction ? </p>
<p>Another point is, that the categories may have a hierarchy (do they ?). I recently experienced that building models for hierarchical models is really non trivial. Any thoughts on this issue ?</p>
<p>kind regards,</p>
<p>Steffen</p>
<p>PS: More posts like that. I like them technical <img src='http://www.dataminingblog.com/wp-includes/images/smilies/icon_biggrin.gif' alt=':D' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
</channel>
</rss>

