<?xml version="1.0" encoding="UTF-8" ?>
<rss version="2.0">
	<channel>
		<title>gdp's Comments</title>
		<language>en-us</language>
		<link>https://www.intensedebate.com/users/41961</link>
		<description>Comments by Pete Warden</description>
<item>
<title>PeteSearch : Five short links</title>
<link>http://petewarden.typepad.com/searchbrowser/2013/05/five--1.html#IDComment647376876</link>
<description>That is an interesting idea! From my limited exposure to research code, it would take a long time to turn most of it into something other people could run, but maybe if it was a requirement that would encourage more re-usable code? </description>
<pubDate>Tue, 21 May 2013 17:27:46 +0000</pubDate>
<guid>http://petewarden.typepad.com/searchbrowser/2013/05/five--1.html#IDComment647376876</guid>
</item><item>
<title>PeteSearch : Security by silo</title>
<link>http://petewarden.typepad.com/searchbrowser/2013/01/security-by-silo.html#IDComment551038811</link>
<description>Good point, I did jump over that a bit! Most people imported their address book from their webmail when they signed up to Facebook, and while a bit different from auto-analysing emails, has been a lot less controversial. </description>
<pubDate>Tue, 22 Jan 2013 18:29:18 +0000</pubDate>
<guid>http://petewarden.typepad.com/searchbrowser/2013/01/security-by-silo.html#IDComment551038811</guid>
</item><item>
<title>PeteSearch : Five short links</title>
<link>http://petewarden.typepad.com/searchbrowser/2012/10/five-short-links.html#IDComment467808115</link>
<description>Sweet, looks like they&amp;#039;ve fixed that over the last few months! It required headless x-windows and QT the last time I used it. </description>
<pubDate>Fri, 19 Oct 2012 17:55:59 +0000</pubDate>
<guid>http://petewarden.typepad.com/searchbrowser/2012/10/five-short-links.html#IDComment467808115</guid>
</item><item>
<title>PeteSearch : How I ended up using S3 as my database</title>
<link>http://petewarden.typepad.com/searchbrowser/2010/10/how-i-ended-up-using-s3-as-my-database.html#IDComment438758225</link>
<description>The documents are given a unique, extremely-hard-to-guess ID, and that&amp;#039;s referenced as part of the URL for the main page. This makes it impossible to browse for documents unless somebody has shared the URL with you. My application doesn&amp;#039;t need any searching capabilities luckily! </description>
<pubDate>Tue, 11 Sep 2012 18:27:30 +0000</pubDate>
<guid>http://petewarden.typepad.com/searchbrowser/2010/10/how-i-ended-up-using-s3-as-my-database.html#IDComment438758225</guid>
</item><item>
<title>PeteSearch : How I got sued by Facebook</title>
<link>http://petewarden.typepad.com/searchbrowser/2010/04/how-i-got-sued-by-facebook.html#IDComment331672220</link>
<description>I almost feel bad for all the lawyers who obviously struggle with this new-fangled technology. Almost. </description>
<pubDate>Wed, 4 Apr 2012 18:10:15 +0000</pubDate>
<guid>http://petewarden.typepad.com/searchbrowser/2010/04/how-i-got-sued-by-facebook.html#IDComment331672220</guid>
</item><item>
<title>PeteSearch : Lessons from a Cassandra disaster</title>
<link>http://petewarden.typepad.com/searchbrowser/2011/12/lessons-from-a-cassandra-disaster.html#IDComment234349196</link>
<description>Thanks as always Joaquin! You and the Datastax team are a big part of why I chose Cassandra, you&amp;#039;re the backbone of the community. </description>
<pubDate>Wed, 7 Dec 2011 21:54:08 +0000</pubDate>
<guid>http://petewarden.typepad.com/searchbrowser/2011/12/lessons-from-a-cassandra-disaster.html#IDComment234349196</guid>
</item><item>
<title>PeteSearch : Five short links</title>
<link>http://petewarden.typepad.com/searchbrowser/2011/11/five-short-links.html#IDComment222986026</link>
<description>Thanks Matthieu, that is fascinating stuff, especially the Mount Everest coverage issue with SRTM3. It looks like ASTER would be essential for any kind of open-source Google Earth, I&amp;#039;ll include it in my next roundup. </description>
<pubDate>Wed, 16 Nov 2011 18:38:48 +0000</pubDate>
<guid>http://petewarden.typepad.com/searchbrowser/2011/11/five-short-links.html#IDComment222986026</guid>
</item><item>
<title>PeteSearch : Why we need an open-source geocoding alternative to Google</title>
<link>http://petewarden.typepad.com/searchbrowser/2011/10/what-can-you-use-for-geocoding-instead-of-google-maps.html#IDComment213521308</link>
<description>It was failing to locate addresses like &amp;quot;Benn&amp;auml;sv&amp;auml;gen 5, 68600 Jakobstad, Finland&amp;quot; that Google could handle, and that data was available for in OpenStreeMap. </description>
<pubDate>Fri, 28 Oct 2011 18:17:40 +0000</pubDate>
<guid>http://petewarden.typepad.com/searchbrowser/2011/10/what-can-you-use-for-geocoding-instead-of-google-maps.html#IDComment213521308</guid>
</item><item>
<title>PeteSearch : Why we need an open-source geocoding alternative to Google</title>
<link>http://petewarden.typepad.com/searchbrowser/2011/10/what-can-you-use-for-geocoding-instead-of-google-maps.html#IDComment213520416</link>
<description>There are multiple providers out there, but almost all of them have terms and conditions like Google that prohibit you from general geocoding, if you&amp;#039;re not going to be displaying the results on one of their maps. </description>
<pubDate>Fri, 28 Oct 2011 18:15:24 +0000</pubDate>
<guid>http://petewarden.typepad.com/searchbrowser/2011/10/what-can-you-use-for-geocoding-instead-of-google-maps.html#IDComment213520416</guid>
</item><item>
<title>PeteSearch : Free bulk geocoding for US addresses</title>
<link>http://petewarden.typepad.com/searchbrowser/2010/07/free-bulk-geocoding-for-us-addresses.html#IDComment187798293</link>
<description>It should be up and running again now, sorry about that. </description>
<pubDate>Mon, 29 Aug 2011 18:51:31 +0000</pubDate>
<guid>http://petewarden.typepad.com/searchbrowser/2010/07/free-bulk-geocoding-for-us-addresses.html#IDComment187798293</guid>
</item><item>
<title>PeteSearch : Five short links</title>
<link>http://petewarden.typepad.com/searchbrowser/2011/08/five-short-links-1.html#IDComment186203467</link>
<description>Thanks, I&amp;#039;ll check it out and add it to my next links post. </description>
<pubDate>Wed, 24 Aug 2011 17:51:16 +0000</pubDate>
<guid>http://petewarden.typepad.com/searchbrowser/2011/08/five-short-links-1.html#IDComment186203467</guid>
</item><item>
<title>PeteSearch : Green Tea Kit Kats</title>
<link>http://petewarden.typepad.com/searchbrowser/2011/06/green-tea-kit-kats.html#IDComment185555339</link>
<description>I&amp;#039;ll be looking in Japantown here in SF next time I&amp;#039;m up that end, it was quite something. I checked out your blog by the way, loved the writing, and the puppy rescuing! </description>
<pubDate>Mon, 22 Aug 2011 18:23:48 +0000</pubDate>
<guid>http://petewarden.typepad.com/searchbrowser/2011/06/green-tea-kit-kats.html#IDComment185555339</guid>
</item><item>
<title>PeteSearch : Using Hadoop with external API calls</title>
<link>http://petewarden.typepad.com/searchbrowser/2011/05/using-hadoop-with-external-api-calls.html#IDComment175275164</link>
<description>I don&amp;#039;t have any public examples I can point you to unfortunately. One way to approach it is to pass the access keys as part of the input data, or to have a central proxy server that figures out how to allocate user keys to maximize your use of the rate limits. </description>
<pubDate>Tue, 19 Jul 2011 19:53:24 +0000</pubDate>
<guid>http://petewarden.typepad.com/searchbrowser/2011/05/using-hadoop-with-external-api-calls.html#IDComment175275164</guid>
</item><item>
<title>PeteSearch : My San Francisco food highlights</title>
<link>http://petewarden.typepad.com/searchbrowser/2011/07/my-san-francisco-food-highlights.html#IDComment175273789</link>
<description>I will have to check that out next time I&amp;#039;m in Berkley for the theatre, thanks for the tip! </description>
<pubDate>Tue, 19 Jul 2011 19:48:37 +0000</pubDate>
<guid>http://petewarden.typepad.com/searchbrowser/2011/07/my-san-francisco-food-highlights.html#IDComment175273789</guid>
</item><item>
<title>PeteSearch : http://petewarden.typepad.com/searchbrowser/2011/06/am-i-wrong-about-queues-being-satans-little-hel</title>
<link>http://petewarden.typepad.com/searchbrowser/2011/06/am-i-wrong-about-queues-being-satans-little-helpers.html#IDComment175273407</link>
<description>That is very interesting, thanks. I&amp;#039;ll include that in my next roundup post. </description>
<pubDate>Tue, 19 Jul 2011 19:47:14 +0000</pubDate>
<guid>http://petewarden.typepad.com/searchbrowser/2011/06/am-i-wrong-about-queues-being-satans-little-helpers.html#IDComment175273407</guid>
</item><item>
<title>PeteSearch : Five belated links</title>
<link>http://petewarden.typepad.com/searchbrowser/2011/07/five-belated-links.html#IDComment174856719</link>
<description>Thanks Javier, I was unaware there was actually a proposed standard behind the hash-bang convention. That does make it a lot more palatable. </description>
<pubDate>Mon, 18 Jul 2011 17:27:05 +0000</pubDate>
<guid>http://petewarden.typepad.com/searchbrowser/2011/07/five-belated-links.html#IDComment174856719</guid>
</item><item>
<title>PeteSearch : &lt;a href=&quot;http://petewarden.typepad.com/searchbrowser/search_tips/&quot;&gt;Search Tips</title>
<link>http://petewarden.typepad.com/searchbrowser/2008/07/try-out-opencal.html#IDComment168441904</link>
<description>Sorry, I haven&amp;#039;t looked at that for a while so I&amp;#039;m not sure what&amp;#039;s going on. I think that the OpenCalais site might have some better demos now. </description>
<pubDate>Thu, 30 Jun 2011 20:53:37 +0000</pubDate>
<guid>http://petewarden.typepad.com/searchbrowser/2008/07/try-out-opencal.html#IDComment168441904</guid>
</item><item>
<title>PeteSearch : http://petewarden.typepad.com/searchbrowser/2011/06/my-introduction-to-mapreduce-video-is-now-avail</title>
<link>http://petewarden.typepad.com/searchbrowser/2011/06/my-introduction-to-mapreduce-video-is-now-available.html#IDComment166245422</link>
<description>I go through one example in the video, where I analyze public Facebook information, and I&amp;#039;ve used it extensively in my work on Twitter and public web crawls.  I chose Python because it&amp;#039;s the most common teaching language, and a lot of people know it. The great thing about the Hadoop streaming model for MapReduce is that you can use any language that can read from stdin and write to stdout, so in fact I&amp;#039;ve used all sorts of different languages in production, including PHP! </description>
<pubDate>Fri, 24 Jun 2011 19:08:54 +0000</pubDate>
<guid>http://petewarden.typepad.com/searchbrowser/2011/06/my-introduction-to-mapreduce-video-is-now-available.html#IDComment166245422</guid>
</item><item>
<title>PeteSearch : Shutting down Wordlin.gs</title>
<link>http://petewarden.typepad.com/searchbrowser/2011/06/shutting-down-wordlings.html#IDComment162685384</link>
<description>That&amp;#039;s a good point, I really didn&amp;#039;t experiment enough with different placement to figure out what would work. Part of the reason I decided to shut it down was when I realized I didn&amp;#039;t have time to do that experimentation, with the other projects I&amp;#039;m running. I have had a sponsorship offer come through though, so the site may yet still live! </description>
<pubDate>Tue, 14 Jun 2011 22:11:26 +0000</pubDate>
<guid>http://petewarden.typepad.com/searchbrowser/2011/06/shutting-down-wordlings.html#IDComment162685384</guid>
</item><item>
<title>PeteSearch : The missing tool for data scientists?</title>
<link>http://petewarden.typepad.com/searchbrowser/2010/09/the-missing-tool-for-data-scientists.html#IDComment155791565</link>
<description>Thanks, that does look pretty fascinating. I&amp;#039;ll be including it in an upcoming five short links. </description>
<pubDate>Tue, 24 May 2011 23:31:57 +0000</pubDate>
<guid>http://petewarden.typepad.com/searchbrowser/2010/09/the-missing-tool-for-data-scientists.html#IDComment155791565</guid>
</item>	</channel>
</rss>