<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Less Talk, More Do &#187; Digg Statistical Data</title>
	<atom:link href="http://www.chrisfinke.com/category/social-media/digg/digg-statistical-data/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.chrisfinke.com</link>
	<description>Christopher Finke is a software engineer at Mahalo. He is available for birthday parties and bar mitzvahs.</description>
	<lastBuildDate>Thu, 18 Jun 2009 21:41:21 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Tracking the top Diggers</title>
		<link>http://www.chrisfinke.com/2008/05/23/tracking-the-top-diggers/</link>
		<comments>http://www.chrisfinke.com/2008/05/23/tracking-the-top-diggers/#comments</comments>
		<pubDate>Sat, 24 May 2008 04:02:54 +0000</pubDate>
		<dc:creator>Christopher Finke</dc:creator>
				<category><![CDATA[Digg]]></category>
		<category><![CDATA[Digg Statistical Data]]></category>
		<category><![CDATA[Python]]></category>

		<guid isPermaLink="false">http://www.chrisfinke.com/?p=744</guid>
		<description><![CDATA[For the past 15 months, I've been maintaining a list of the top 100 (and top 1000) Digg users.  As Digg has become less and less relevant to my interests (and the interests of the greater tech community), I've decided to stop updating these lists.  However, I have chosen to provide the tools [...]]]></description>
			<content:encoded><![CDATA[<p>For the past 15 months, I've been maintaining a list of the <a href="http://www.chrisfinke.com/digg/topusers.html">top 100</a> (and <a href="http://www.chrisfinke.com/digg/top1000users.html">top 1000</a>) Digg users.  As <a href="http://digg.com/">Digg</a> has become less and less relevant to my interests (and the interests of the greater tech community), I've decided to stop updating these lists.  However, I have chosen to provide the tools I have used in case someone else has an interest in tracking Digg statistics.</p>
<blockquote><p>Caveat: This whole thing was thrown together without any regard for coding or design conventions. Also, I can't guarantee that you won't be banned from Digg for using any of this information - I was, a few times.  Proceed with caution.</p></blockquote>
<p>First, the <b>data structures</b>.  The Top Diggers lists are supported by two database tables: `digg` and `digg_users`.  `digg` holds the set of front-page stories and their submitters, while `digg_users` stores aggregated information about each user in the system.</p>
<pre>CREATE TABLE `digg` (
  `user` varchar(128) NOT NULL default '',
  `submission` varchar(255) NOT NULL default '',
           `date` datetime NOT NULL default '0000-00-00 00:00:00',
  PRIMARY KEY  (`submission`),
  KEY `user` (`user`)
);

CREATE TABLE `digg_users` (
  `user` varchar(128) NOT NULL default '',
  `frontpage` int(11) NOT NULL default '0',
  `dugg` int(11) NOT NULL default '0',
  `submitted` int(11) NOT NULL default '0',
  `profileViews` int(11) NOT NULL default '0',
  `frontpagestatic` int(11) NOT NULL default '0',
  `frontpagetotal` int(11) NOT NULL default '0',
  `submittedstatic` int(11) NOT NULL default '0',
  `image` varchar(128) NOT NULL default '',
  UNIQUE KEY `username` (`user`)
);
</pre>
<p>Next, <b>some data to get you started</b>.  <a href="http://www.chrisfinke.com/files/2008/05/digg_data.sql.gz">Here is a SQL dump of the two tables described above</a>, including information on about 13000 Digg users (for `digg_users`), as well as the last 4 stories on the Digg homepage at the time of the last update (for `digg`).  Import this data into your database in preparation for the next step.</p>
<p>The next step: <b>data retrieval</b>.  The main work of updating the top 100 list is done by a Python script:</p>
<pre>
import digg

# Grab the last X pages of popular stories
digg.update_news(100)

# Update the top X profiles
digg.update_profiles(110)
</pre>
<p>Of course, this code makes no sense without the <a href="http://www.chrisfinke.com/files/2008/05/digg.py"><code>digg.*</code> methods, downloadable here</a>.  This script also requires the excellent <a href="http://www.crummy.com/software/BeautifulSoup/">BeautifulSoup Python HTML parser.</a>  You will have to modify digg.py to change the database connection parameters.</p>
<p>For those who don't care to read through the code, it achieves two main objectives: Find out who submitted any frontpage stories since the last update (it stops when it hits a story already in the `digg` table), and using that information, determine the new top 100 users and update their profile information.</p>
<p>The last step is <b>data presentation</b>.  The information in the database tables needs to be transformed into a readable HTML file.  I'll leave this step as an exercise for the reader, but to get you started, this SQL query will get you the data you want in an easy-to-read format:</p>
<pre>
SELECT
   user `Username`,
   frontpagetotal `Frontpage Stories`,
   submitted `Stories Submitted`,
   dugg `Stories Dugg`,
   profileViews `Profile Views`
FROM digg_users
WHERE
   frontpagetotal <= submitted
ORDER BY
   `Frontpage Stories` DESC,
   `Stories Submitted` ASC,
   `Stories Dugg` DESC
LIMIT 100
</pre>
<p>So to sum up, if you want to manage your own "Top 100 Diggers" list, take the following steps:</p>
<p>1. Import the dump of digg data linked above.<br />2. Set up and run your scripts<br />3. Create a readable version of the data.</p>
<p>Have fun, and beware the Digg ban-hammer.</p>
 <img src="http://www.chrisfinke.com/wp-content/plugins/feed-statistics.php?view=1&post_id=744" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.chrisfinke.com/2008/05/23/tracking-the-top-diggers/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Kevin Rose removed from Top Diggers Lists</title>
		<link>http://www.chrisfinke.com/2007/09/26/kevin-rose-removed-from-top-diggers-lists/</link>
		<comments>http://www.chrisfinke.com/2007/09/26/kevin-rose-removed-from-top-diggers-lists/#comments</comments>
		<pubDate>Wed, 26 Sep 2007 21:27:34 +0000</pubDate>
		<dc:creator>Christopher Finke</dc:creator>
				<category><![CDATA[Digg]]></category>
		<category><![CDATA[Digg Statistical Data]]></category>
		<category><![CDATA[Kevin Rose]]></category>
		<category><![CDATA[Social Media]]></category>

		<guid isPermaLink="false">http://www.chrisfinke.com/2007/09/26/kevin-rose-removed-from-top-diggers-lists/</guid>
		<description><![CDATA[Some of you may have noticed that Kevin Rose (Digg user kevinrose) no longer appears in both the Digg's Top 100 and Top 1,000 Users lists.  It was recently brought to my attention that Kevin's profile page shows that he has submitted 295 stories to Digg, but somehow, 298 of his submissions have reached [...]]]></description>
			<content:encoded><![CDATA[<p>Some of you may have noticed that Kevin Rose (Digg user <a href="http://digg.com/users/kevinrose/">kevinrose</a>) no longer appears in both the Digg's <a href="http://www.chrisfinke.com/digg/topusers.html">Top 100</a> and <a href="http://www.chrisfinke.com/digg/top1000users.html">Top 1,000</a> Users lists.  It was recently brought to my attention that Kevin's profile page shows that he has submitted 295 stories to Digg, but somehow, 298 of his submissions have reached the front page.</p>
<p><img src='http://www.chrisfinke.com/files/2007/09/kevin-rose-is-more-powerful-than-math.png' alt='Kevin Rose is more powerful than math' /></p>
<div class="digg-this" style="float: right;"><script>digg_url = "http://digg.com/tech_news/Kevin_Rose_Removed_From_Top_Diggers_Lists_For_Gaming_the_System";</script><script src="http://digg.com/api/diggthis.js"></script></div>
<p>Gaming the system like this will not be tolerated, and any Digg users caught breaking the laws of mathematics will immediately be banned from both Top Users lists.</p>
<p><br style="clear: both;" /></p>
 <img src="http://www.chrisfinke.com/wp-content/plugins/feed-statistics.php?view=1&post_id=487" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.chrisfinke.com/2007/09/26/kevin-rose-removed-from-top-diggers-lists/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Top 1000 Diggers as of 2007/09/26</title>
		<link>http://www.chrisfinke.com/2007/09/26/top-1000-diggers-as-of-20070926/</link>
		<comments>http://www.chrisfinke.com/2007/09/26/top-1000-diggers-as-of-20070926/#comments</comments>
		<pubDate>Wed, 26 Sep 2007 21:09:56 +0000</pubDate>
		<dc:creator>Christopher Finke</dc:creator>
				<category><![CDATA[Digg]]></category>
		<category><![CDATA[Digg Statistical Data]]></category>
		<category><![CDATA[Social Media]]></category>

		<guid isPermaLink="false">http://www.chrisfinke.com/2007/09/26/top-1000-diggers-as-of-20070926/</guid>
		<description><![CDATA[This CSV files contains the username, number of frontpage stories, number of submitted stories, number of stories dugg, and number of profile views for the top 1,000 users on Digg.
Top 1000 Diggers as of 2007/09/26
If you'd like to be notified whenever I release a new dataset, you can subscribe to the Digg Statisical Data RSS [...]]]></description>
			<content:encoded><![CDATA[<p>This CSV files contains the username, number of frontpage stories, number of submitted stories, number of stories dugg, and number of profile views for the top 1,000 users on Digg.</p>
<p><a href='http://www.chrisfinke.com/files/2007/09/digg_top_1000_20070926.csv' title='Top 1000 Diggers as of 2007/09/26'>Top 1000 Diggers as of 2007/09/26</a></p>
<p class="key-point">If you'd like to be notified whenever I release a new dataset, you can subscribe to <a href="http://www.chrisfinke.com/category/digg-statistical-data/feed/">the <em>Digg Statisical Data</em> RSS feed</a>, which will include only the dataset posts, or <a href="http://www.chrisfinke.com/feed/">my main RSS feed</a>, which is updated with all of my blog posts.</p>
 <img src="http://www.chrisfinke.com/wp-content/plugins/feed-statistics.php?view=1&post_id=486" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.chrisfinke.com/2007/09/26/top-1000-diggers-as-of-20070926/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Top 1000 Diggers as of 2007/09/12</title>
		<link>http://www.chrisfinke.com/2007/09/12/top-1000-diggers-as-of-20070912/</link>
		<comments>http://www.chrisfinke.com/2007/09/12/top-1000-diggers-as-of-20070912/#comments</comments>
		<pubDate>Wed, 12 Sep 2007 20:00:17 +0000</pubDate>
		<dc:creator>Christopher Finke</dc:creator>
				<category><![CDATA[Digg]]></category>
		<category><![CDATA[Digg Statistical Data]]></category>
		<category><![CDATA[Social Media]]></category>

		<guid isPermaLink="false">http://www.chrisfinke.com/2007/09/12/top-1000-diggers-as-of-20070912/</guid>
		<description><![CDATA[This CSV files contains the username, number of frontpage stories, number of submitted stories, number of stories dugg, and number of profile views for the top 1,000 users on Digg.
Top 1000 Diggers as of 2007/09/12
If you'd like to be notified whenever I release a new dataset, you can subscribe to the Digg Statisical Data RSS [...]]]></description>
			<content:encoded><![CDATA[<p>This CSV files contains the username, number of frontpage stories, number of submitted stories, number of stories dugg, and number of profile views for the top 1,000 users on Digg.</p>
<p><a href='http://www.chrisfinke.com/files/2007/09/digg_top_1000_20070912.csv' title='Top 1000 Diggers as of 2007/09/12'>Top 1000 Diggers as of 2007/09/12</a></p>
<p class="key-point">If you'd like to be notified whenever I release a new dataset, you can subscribe to <a href="http://www.chrisfinke.com/category/digg-statistical-data/feed/">the <em>Digg Statisical Data</em> RSS feed</a>, which will include only the dataset posts, or <a href="http://www.chrisfinke.com/feed/">my main RSS feed</a>, which is updated with all of my blog posts.</p>
 <img src="http://www.chrisfinke.com/wp-content/plugins/feed-statistics.php?view=1&post_id=477" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.chrisfinke.com/2007/09/12/top-1000-diggers-as-of-20070912/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Top 1000 Diggers as of 2007/09/05</title>
		<link>http://www.chrisfinke.com/2007/09/05/top-1000-diggers-as-of-20070905/</link>
		<comments>http://www.chrisfinke.com/2007/09/05/top-1000-diggers-as-of-20070905/#comments</comments>
		<pubDate>Wed, 05 Sep 2007 21:04:44 +0000</pubDate>
		<dc:creator>Christopher Finke</dc:creator>
				<category><![CDATA[Digg]]></category>
		<category><![CDATA[Digg Statistical Data]]></category>
		<category><![CDATA[Social Media]]></category>

		<guid isPermaLink="false">http://www.chrisfinke.com/2007/09/05/top-1000-diggers-as-of-20070905/</guid>
		<description><![CDATA[This CSV files contains the username, number of frontpage stories, number of submitted stories, number of stories dugg, and number of profile views for the top 1,000 users on Digg.
Top 1000 Diggers as of 2007/09/05
If you'd like to be notified whenever I release a new dataset, you can subscribe to the Digg Statisical Data RSS [...]]]></description>
			<content:encoded><![CDATA[<p>This CSV files contains the username, number of frontpage stories, number of submitted stories, number of stories dugg, and number of profile views for the top 1,000 users on Digg.</p>
<p><a href='http://www.chrisfinke.com/files/2007/09/digg_top_1000_20070905.csv' title='Top 1000 Diggers as of 2007/09/05'>Top 1000 Diggers as of 2007/09/05</a></p>
<p class="key-point">If you'd like to be notified whenever I release a new dataset, you can subscribe to <a href="http://www.chrisfinke.com/category/social-media/digg/digg-statistical-data/feed/">the <em>Digg Statisical Data</em> RSS feed</a>, which will include only the dataset posts, or <a href="http://www.chrisfinke.com/feed/">my main RSS feed</a>, which is updated with all of my blog posts.</p>
 <img src="http://www.chrisfinke.com/wp-content/plugins/feed-statistics.php?view=1&post_id=457" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.chrisfinke.com/2007/09/05/top-1000-diggers-as-of-20070905/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Top 1000 Diggers as of 2007/08/08</title>
		<link>http://www.chrisfinke.com/2007/08/08/top-1000-diggers-as-of-20070808/</link>
		<comments>http://www.chrisfinke.com/2007/08/08/top-1000-diggers-as-of-20070808/#comments</comments>
		<pubDate>Wed, 08 Aug 2007 11:20:05 +0000</pubDate>
		<dc:creator>Christopher Finke</dc:creator>
				<category><![CDATA[Digg]]></category>
		<category><![CDATA[Digg Statistical Data]]></category>
		<category><![CDATA[Social Media]]></category>

		<guid isPermaLink="false">http://www.chrisfinke.com/2007/08/08/top-1000-diggers-as-of-20070808/</guid>
		<description><![CDATA[This CSV files contains the username, number of frontpage stories, number of submitted stories, number of stories dugg, and number of profile views for the top 1,000 users on Digg.
Top 1000 Diggers as of 2007/08/08
If you'd like to be notified whenever I release a new dataset, you can subscribe to the Digg Statisical Data RSS [...]]]></description>
			<content:encoded><![CDATA[<p>This CSV files contains the username, number of frontpage stories, number of submitted stories, number of stories dugg, and number of profile views for the top 1,000 users on Digg.</p>
<p><a href='http://www.chrisfinke.com/files/2007/08/digg_top_1000_20070808.csv' title='Top 1000 Diggers as of 2007/08/08'>Top 1000 Diggers as of 2007/08/08</a></p>
<p class="key-point">If you'd like to be notified whenever I release a new dataset, you can subscribe to <a href="http://www.chrisfinke.com/category/social-media/digg/digg-statistical-data/feed/">the <em>Digg Statisical Data</em> RSS feed</a>, which will include only the dataset posts, or <a href="http://www.chrisfinke.com/feed/">my main RSS feed</a>, which is updated with all of my blog posts.</p>
 <img src="http://www.chrisfinke.com/wp-content/plugins/feed-statistics.php?view=1&post_id=438" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.chrisfinke.com/2007/08/08/top-1000-diggers-as-of-20070808/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Top 1000 Diggers as of 2007/08/01</title>
		<link>http://www.chrisfinke.com/2007/08/01/top-1000-diggers-as-of-20070801/</link>
		<comments>http://www.chrisfinke.com/2007/08/01/top-1000-diggers-as-of-20070801/#comments</comments>
		<pubDate>Wed, 01 Aug 2007 15:55:44 +0000</pubDate>
		<dc:creator>Christopher Finke</dc:creator>
				<category><![CDATA[Digg]]></category>
		<category><![CDATA[Digg Statistical Data]]></category>
		<category><![CDATA[Social Media]]></category>

		<guid isPermaLink="false">http://www.chrisfinke.com/2007/08/01/top-1000-diggers-as-of-20070801/</guid>
		<description><![CDATA[This CSV files contains the username, number of frontpage stories, number of submitted stories, number of stories dugg, and number of profile views for the top 1,000 users on Digg.
Top 1000 Diggers as of 2007/08/01
If you'd like to be notified whenever I release a new dataset, you can subscribe to the Digg Statisical Data RSS [...]]]></description>
			<content:encoded><![CDATA[<p>This CSV files contains the username, number of frontpage stories, number of submitted stories, number of stories dugg, and number of profile views for the top 1,000 users on Digg.</p>
<p><a href='http://www.chrisfinke.com/files/2007/08/digg_top_1000_20070801.csv' title='Top 1000 Diggers as of 2007/08/01'>Top 1000 Diggers as of 2007/08/01</a></p>
<p class="key-point">If you'd like to be notified whenever I release a new dataset, you can subscribe to <a href="http://www.chrisfinke.com/category/social-media/digg/digg-statistical-data/feed/">the <em>Digg Statisical Data</em> RSS feed</a>, which will include only the dataset posts, or <a href="http://www.chrisfinke.com/feed/">my main RSS feed</a>, which is updated with all of my blog posts.</p>
 <img src="http://www.chrisfinke.com/wp-content/plugins/feed-statistics.php?view=1&post_id=431" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.chrisfinke.com/2007/08/01/top-1000-diggers-as-of-20070801/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Top 1,000 Diggers: 2007/07/18</title>
		<link>http://www.chrisfinke.com/2007/07/18/top-1000-diggers-20070718/</link>
		<comments>http://www.chrisfinke.com/2007/07/18/top-1000-diggers-20070718/#comments</comments>
		<pubDate>Wed, 18 Jul 2007 17:27:29 +0000</pubDate>
		<dc:creator>Christopher Finke</dc:creator>
				<category><![CDATA[Digg]]></category>
		<category><![CDATA[Digg Statistical Data]]></category>
		<category><![CDATA[Social Media]]></category>

		<guid isPermaLink="false">http://www.chrisfinke.com/2007/07/18/top-1000-diggers-20070718/</guid>
		<description><![CDATA[This CSV files contains the username, number of frontpage stories, number of submitted stories, number of stories dugg, and number of profile views for the top 1,000 users on Digg.
Top 1000 Diggers as of 2007/07/18
If you'd like to be notified whenever I release a new dataset, you can subscribe to the Digg Statisical Data RSS [...]]]></description>
			<content:encoded><![CDATA[<p>This CSV files contains the username, number of frontpage stories, number of submitted stories, number of stories dugg, and number of profile views for the top 1,000 users on Digg.</p>
<p><a href='http://www.chrisfinke.com/files/2007/07/digg_top_1000_20070718.csv' title='Top 1000 Diggers as of 2007/07/18'>Top 1000 Diggers as of 2007/07/18</a></p>
<p class="key-point">If you'd like to be notified whenever I release a new dataset, you can subscribe to <a href="http://www.chrisfinke.com/category/social-media/digg/digg-statistical-data/feed/">the <em>Digg Statisical Data</em> RSS feed</a>, which will include only the dataset posts, or <a href="http://www.chrisfinke.com/feed/">my main RSS feed</a>, which is updated with all of my blog posts.</p>
 <img src="http://www.chrisfinke.com/wp-content/plugins/feed-statistics.php?view=1&post_id=421" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.chrisfinke.com/2007/07/18/top-1000-diggers-20070718/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
