<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Educating Silicon &#187; Thoughts</title>
	<atom:link href="http://www.educatingsilicon.com/category/thoughts/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.educatingsilicon.com</link>
	<description>Listening for the pitter-patter of tiny metal feet.</description>
	<lastBuildDate>Wed, 10 Mar 2010 13:07:47 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Dinosaurs and Tail Risk</title>
		<link>http://www.educatingsilicon.com/2009/04/08/dinosaurs-and-tail-risk/</link>
		<comments>http://www.educatingsilicon.com/2009/04/08/dinosaurs-and-tail-risk/#comments</comments>
		<pubDate>Wed, 08 Apr 2009 10:32:27 +0000</pubDate>
		<dc:creator>mark</dc:creator>
				<category><![CDATA[Random Interesting]]></category>
		<category><![CDATA[Thoughts]]></category>

		<guid isPermaLink="false">http://www.educatingsilicon.com/2009/04/08/dinosaurs-and-tail-risk/</guid>
		<description><![CDATA[Writing in this morning&#8217;s FT, Nassim Nicholas Taleb proposes Ten principles for a Black Swan-proof world:
1. What is fragile should break early while it is still small. Nothing should ever become too big to fail. Evolution in economic life helps those with the maximum amount of hidden risks – and hence the most fragile – [...]]]></description>
			<content:encoded><![CDATA[<p>Writing in this morning&#8217;s FT, Nassim Nicholas Taleb proposes <a href="http://www.ft.com/cms/s/0/5d5aa24e-23a4-11de-996a-00144feabdc0.html">Ten principles for a Black Swan-proof world</a>:</p>
<blockquote><p>1. <em>What is fragile should break early while it is still small</em>. Nothing should ever become too big to fail. Evolution in economic life helps those with the maximum amount of hidden risks – and hence the most fragile – become the biggest.<br />
&#8230;<br />
Then we will see an economic life closer to our biological environment: smaller companies, richer ecology, no leverage.</p></blockquote>
<p>A sensible plan, but unfortunately Mr. Taleb&#8217;s faith in biology is misplaced.</p>
<blockquote><p><a href="http://www.newscientist.com/article/mg20127001.400-how-did-the-largest-dinosaurs-get-so-big.html">Why the Dinosaurs got so Large</a><br />
<em><br />
</em>19th-century palaeontologist Edward Drinker Cope noticed that animal lineages tend to get bigger over evolutionary time, starting out small and leaving ever bigger descendants. This process came to be known as Cope&#8217;s rule.</p>
<p>Getting bigger has evolutionary advantages, explains David Hone, an<br />
expert on Cope&#8217;s rule at the Institute of Vertebrate Paleontology and<br />
Paleoanthropology in Beijing, China. &#8220;You are harder to predate and it<br />
is easier for you to fight off competitors for food or for mates.&#8221; But<br />
eventually it catches up with you. &#8220;We also know that big animals are<br />
generally more vulnerable to extinction,&#8221; he says. Larger animals eat<br />
more and breed more slowly than smaller ones, so their problems are<br />
greater when times are tough and food is scarce. &#8220;Many of the very<br />
large mammals, such as Paraceratherium, had a short tenure in the<br />
fossil record, while smaller species often tend to be more<br />
persistent,&#8221; says mammal palaeobiologist Christine Janis of Brown<br />
University in Providence, Rhode Island. So on one hand natural<br />
selection encourages animals to grow larger, but on the other it<br />
eventually punishes them for doing so. This equilibrium between<br />
opposing forces has prevented most land animals from exceeding about 10 tonnes.</p></blockquote>
<p>Dinosaurs had skewed incentives and took on too much tail risk! If even evolution falls into this trap, God help the bank regulators&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.educatingsilicon.com/2009/04/08/dinosaurs-and-tail-risk/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Computer Vision in the Elastic Compute Cloud</title>
		<link>http://www.educatingsilicon.com/2008/11/11/computer-vision-in-the-elastic-compute-cloud/</link>
		<comments>http://www.educatingsilicon.com/2008/11/11/computer-vision-in-the-elastic-compute-cloud/#comments</comments>
		<pubDate>Tue, 11 Nov 2008 19:48:06 +0000</pubDate>
		<dc:creator>mark</dc:creator>
				<category><![CDATA[Misc]]></category>
		<category><![CDATA[Thoughts]]></category>

		<guid isPermaLink="false">http://www.educatingsilicon.com/2008/11/11/computer-vision-in-the-elastic-compute-cloud/</guid>
		<description><![CDATA[In a datacenter somewhere on the other side of the planet, a rack-mounted computer is busy hunting for patterns in photographs of Oxford.  It is doing this for 10 cents an hour, with more RAM and more horsepower than I can muster on my local machine. This delightful arrangement is made possible by Amazon&#8217;s Elastic [...]]]></description>
			<content:encoded><![CDATA[<p>In a datacenter somewhere on the other side of the planet, a rack-mounted computer is busy hunting for patterns in photographs of Oxford.  It is doing this for 10 cents an hour, with more RAM and more horsepower than I can muster on my local machine. This delightful arrangement is made possible by <a href="http://aws.amazon.com/ec2/">Amazon&#8217;s Elastic Compute Cloud</a>.</p>
<p>For the decreasing number of people who haven&#8217;t heard of EC2, it&#8217;s a pretty simple idea. Via a simple command line interface you can &#8220;create&#8221; a server running in Amazon&#8217;s datacenter. You pick a hardware configuration and OS image, send the request and voilà &#8211; about 30 seconds later you get back a response with the IP address of the machine, to which you now have root access and sole use.  You can customize the software environment to your heart&#8217;s content and then save the disk image for future use. Of course, now that you can create one instance you can create twenty. Cluster computing on tap.</p>
<p>This is an absolutely fantastic resource for research. I&#8217;ve been using it for about six months now, and have very little bad to say about it. Computer vision has an endless appetite for computation. Most groups, including our own, have their own computing cluster but demand for CPU cycles typically spikes around paper deadlines, so having the ability to instantly double or triple the size of your cluster is very nice indeed.</p>
<p>Amazon also have some <a href="http://docs.amazonwebservices.com/AWSEC2/2008-02-01/DeveloperGuide/index.html?instance-types.html">hi-spec machines</a> available. I recently ran into trouble where I needed about 10GB of RAM for a large learning job. Our cluster is 32-bit, so 4GB RAM is the limit. What might have been a serious headache was solved with a few hours and $10 on Amazon EC2.</p>
<p>The one limitation I&#8217;ve found is that disk access on EC2 is a shared resource, so bandwidth to disk tends to be about 10MB/s, as opposed to say 70MB/sec on a local SATA hard drive. Disk bandwidth tends to be a major factor in running time for very big out-of-core learning jobs. Happily, Amazon very recently released a new service called Elastic Block Store which offers dedicated disks, though the pricing is a little hard to figure out.</p>
<p>I should mention that for UK academics there is a free service called <a href="http://www.grid-support.ac.uk/">National Grid</a>, though personally I&#8217;d rather work with Amazon.</p>
<p>Frankly, the possibilities opened up by EC2 just blow my mind. Every coder in a garage now potentially has access to Google-level computation. For tech startups this is a dream. <a href="http://www.techcrunch.com/2008/04/21/who-are-the-biggest-users-of-amazon-web-services-its-not-startups/">More</a> <a href="http://open.nytimes.com/2007/11/01/self-service-prorated-super-computing-fun/">traditional</a> <a href="http://anand.typepad.com/datawocky/2008/04/a-herald-of-rev.html">companies</a> are <a href="http://www.wired.com/techbiz/it/magazine/16-05/mf_amazon">playing</a> too. People have been talking about this idea for a long time, but it&#8217;s finally here, and it rocks!</p>
<p><font color="#999999"><em>Update:</em> Amazon are <a href="http://aws.typepad.com/aws/2008/12/paging-researchers-analysts-and-developers.html">keen to help</a> their scientific users. Great!</font></p>
]]></content:encoded>
			<wfw:commentRss>http://www.educatingsilicon.com/2008/11/11/computer-vision-in-the-elastic-compute-cloud/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Big Data to the Rescue?</title>
		<link>http://www.educatingsilicon.com/2008/09/03/big-data-to-the-rescue/</link>
		<comments>http://www.educatingsilicon.com/2008/09/03/big-data-to-the-rescue/#comments</comments>
		<pubDate>Wed, 03 Sep 2008 20:52:18 +0000</pubDate>
		<dc:creator>mark</dc:creator>
				<category><![CDATA[Thoughts]]></category>

		<guid isPermaLink="false">http://www.educatingsilicon.com/2008/09/03/big-data-to-the-rescue/</guid>
		<description><![CDATA[Peter Norvig of Google likes to say that for machine learning, you should &#8220;worry about the data before you worry about the algorithm&#8221;.
Rather than argue about whether this algorithm is better than that algorithm, all you have to do is get ten times more training data. And now all of a sudden, the worst algorithm [...]]]></description>
			<content:encoded><![CDATA[<p>Peter Norvig of Google likes to say that for machine learning, you should &#8220;worry about the data before you worry about the algorithm&#8221;.</p>
<blockquote><p><em>Rather than argue about whether this algorithm is better than that algorithm, all you have to do is get ten times more training data. And now all of a sudden, the worst algorithm &#8230; is performing better than the best algorithm on less training data.</em></p></blockquote>
<p>It&#8217;s a rallying cry <a href="http://anand.typepad.com/datawocky/2008/03/more-data-usual.html">taken up</a> <a href="http://glinden.blogspot.com/2006/10/advantages-of-big-data-and-big.html">by many</a>, and there&#8217;s a lot of truth to it.  Peter&#8217;s <a href="http://www.youtube.com/watch?v=nU8DcBF-qo4">talk</a> here has some nice examples (beginning at 4:30). The maxim about more data holds over several orders of magnitude. For some examples of the power of big-data-simple-algorithm for computer vision, <a href="http://graphics.cs.cmu.edu/projects/im2gps/">check out</a> <a href="http://graphics.cs.cmu.edu/projects/scene-completion/">the work</a> of Alyosha Efros&#8217; group at CMU.  This is all pretty convincing evidence that scale helps. The data tide lifts all boats.</p>
<p>What I find more interesting, though, is the fact that we already seem to have reached the limits of where data scale alone can take us. For example, as discussed in the talk, Google&#8217;s statistical machine translation system incorporates a language model consisting of length 7 <a href="http://en.wikipedia.org/wiki/N-gram">N-grams</a> trained from a 10^12 word dataset. This is an astonishingly large amount of data. To put that in perspective, a human will hear less than 10^9 words in an entire lifetime. It&#8217;s pretty clear that there must be huge gains to be made on the algorithmic side of the equation, and indeed some graphs in the talk show that, for machine translation at least, the performance gain from adding more data has already started to level off. The news from the frontiers of the <a href="http://en.wikipedia.org/wiki/Netflix_Prize">Netflix Prize</a> is the same &#8211; the top teams report that the Netflix dataset is so big that adding more data from sources like IMDB <a href="http://pragmatictheory.blogspot.com/2008/08/you-want-truth-you-cant-handle-truth.html">makes no difference at all</a>! (Though this is more an indictment of ontologies than big data.)</p>
<p>So, the future, like the past, will be about the algorithms. The sudden explosion of available data has given us a significant bump in performance, but has already begun to reach its limits. There&#8217;s still lots of easy progress to be made as the ability to handle massive data spreads beyond mega-players like Google to more average research groups, but fundamentally we know where the limits of the approach lie. The hard problems won&#8217;t be solved just by lots of data and nearest neighbour search. For researchers this is great news &#8211; still lots of fun to be had!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.educatingsilicon.com/2008/09/03/big-data-to-the-rescue/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Spot the Sensor</title>
		<link>http://www.educatingsilicon.com/2007/11/03/spot-the-sensor/</link>
		<comments>http://www.educatingsilicon.com/2007/11/03/spot-the-sensor/#comments</comments>
		<pubDate>Sun, 04 Nov 2007 02:21:49 +0000</pubDate>
		<dc:creator>mark</dc:creator>
				<category><![CDATA[Robot News]]></category>
		<category><![CDATA[Thoughts]]></category>

		<guid isPermaLink="false">http://www.educatingsilicon.com/2007/11/03/spot-the-sensor/</guid>
		<description><![CDATA[Watching all the videos and photography coming out of the Urban Challenge, the thing I pay most attention to are the racks of sensors sitting on top of the robots.  The sensor choices the teams have made are quite different to the previous 2005 Grand Challenge. There has been a big move to more [...]]]></description>
			<content:encoded><![CDATA[<p>Watching all the videos and photography coming out of the Urban Challenge, the thing I pay most attention to are the racks of sensors sitting on top of the robots.  The sensor choices the teams have made are quite different to the previous 2005 Grand Challenge. There has been a big move to more sophisticated laser systems, and some notable absences in the use of cameras. Here&#8217;s a short sensor-spotter&#8217;s guide:</p>
<p><strong>Lasers</strong></p>
<p>For almost all the competitors, the primary sensor is the <a href="http://web.mit.edu/kvogt/www/lidar.html">time-of-flight lidar</a>, often called simply a laser scanner. These are very popular in robotics, because they provide accurate distances to obstacles with higher robustness and less complexity than alternatives such as stereo vision. Some models to look out for:</p>
<blockquote><p><strong>SICK</strong></p>
<p><img src="http://www.educatingsilicon.com/wp-content/uploads/2007/11/sicklms200.jpg" alt="SICK Lidar" /></p>
<p>Used by 26 of the 36 semi-finalists, these blue boxes are ubiquitous in robotics research labs around the world because they&#8217;re the cheapest decent lidar available and are more than accurate enough for most applications. Typically they are operated with a maximum range of 25m, with distances accurate to a few centimetres . They&#8217;re 2D scanners, so they only see a slice through the world. This is normally fine for dealing with obstacles like walls or trees which extend vertically from the ground, but can <a href="http://blog.wired.com/defense/2007/10/safety-last-for.html">land you in trouble</a> for overhanging obstacles that aren&#8217;t at the same height as the laser. In the previous Challenge, these were the primary laser sensors for many teams. This time around they seem to be mostly relegated to providing some  extra sensing in blind-spots.<br />
SICK scanners have a list price of around $6,000, but there is a low-price deal for Grand Challenge entries. Indeed, the SICK corporation has had so much business and publicity from the Grand Challenge, that this year they decided to enter a <a href="http://www.team-lux.com/">team of their own</a>.</p>
<p><strong>Velodyne</strong></p>
<p><img src="http://www.educatingsilicon.com/wp-content/uploads/2007/11/velodyne-hdl-64e-small.jpg" alt="Velodyne Lidar" /></p>
<p>New kid on the block for the Urban Challenge, the Velodyne scanner is conspicuously popular this year. It&#8217;s used by 12 of the 36 semi-finalists, including most of the top teams. With a list price of $75,000, the Velodyne is quite a bit more pricey than the common SICK. However, instead of just containing a single laser, the Velodyne has a fan of 64 lasers, giving a genuine 3D picture of the surroundings.</p>
<p>There&#8217;s an interesting story behind the Velodyne sensor. Up until two years ago Velodyne was a company that made subwoofers. It&#8217;s founders decided to enter the 2005 Grand Challenge as a hobby project. Back then, the SICK scanner was about the best available, but it didn&#8217;t provide enough data, so many teams were loading up their vehicles with racks of SICKs. <a href="http://www.darpa.mil/grandchallenge05/TechPapers/TeamDAD.pdf">Team DAD</a> instead produced a custom laser scanner that was a big improvement on what was available at the time. Their website illustrates the change <a href="http://www.velodyne.com/lidar/vision/default.aspx">quite nicely</a>. For the Urban Challenge, they decided to concentrate on selling their new scanner to other teams instead of entering themselves. I&#8217;m sure this is exactly the kind of ecosystem of technology companies DARPA dreams about creating with these challenges.</p>
<p>I understand that the Velodyne data is a bit nosier than a typical SICK because of cross-talk between the lasers, but it&#8217;s obviously more than good enough to do the job. These sensors produce an absolute flood of data &#8211; more than a million points a second &#8211; and dealing with that is driving a lot of teams&#8217; computing requirements.</p>
<p>Teams who couldn&#8217;t afford the hefty price tag of this sensor have improvised Velodyne-like scanners by putting SICKs on turntables or pan-tilt units, but the SICK wasn&#8217;t designed for applications like this, so the data is quite sparse and it&#8217;s tricky to synchronize the laser data with the pan-tilt position.</p>
<p><strong>Riegl</strong></p>
<p><a href="http://www.educatingsilicon.com/wp-content/uploads/2007/11/riegl-lms-q120.jpg" title="Riegl Lidar"><img src="http://www.educatingsilicon.com/wp-content/uploads/2007/11/riegl-lms-q120.jpg" alt="Riegl Lidar" /></a></p>
<p>Some of the more well-funded competitors are using these high-end lidar systems from Riegl. These are 2D scanners similar to the SICK, but have longer range and more sophisticated processing to deal with confusing multiple returns. However, they will set you back a hefty $28,000.</p>
<p><strong>Ibeo</strong><br />
Ibeo is a subsidiary of SICK that makes sensors for the automotive market. They produce several models of laser scanner, such as the flying-saucer like attachments <a href="http://zodiac.ibr.cs.tu-bs.de/joomla/images/stories/img_6652.jpg">seen here</a> on the front of team CarOLO. I&#8217;m not too familiar with these sensors, but I believe they are rotating laser fans &#8211; something like a scaled-down Velodyne.</p></blockquote>
<p><strong>Vision</strong></p>
<p>Vision is less prevalent this year than I was expecting. As far as I can gather, none of the teams have gone in for a computer-vision based approach to recognising other cars. I suppose with a good laser sensor it&#8217;s mostly unnecessary, plus you have the advantage of being immune to illumination problems which can foil vision techniques.  Many teams have cameras for detecting lane markings, but that appears to be the extent of it.<br />
Some teams, such as  Stanford&#8217;s Junior, are all-laser systems with no cameras at all. Given that vision was the core of the secret sauce that helped Stanford win the 2005 Grand Challenge, and their <a href="http://www.educatingsilicon.com/wp-content/uploads/2007/11/stanford_junior_press_photo.jpg">early press photos</a> prominently showed a Ladybug2 camera, I was pretty surprised by this. The reason is revealed in this <a href="http://www.tgdaily.com/content/view/34644/113/">interview with Mike Montemerlo</a> where he shows plenty of results using the Ladybug2, but explains that they had to abandon the sensor after their lead vision programmer left for a job with Google (who have <a href="http://gizmodo.com/gadgets/eye-on-you/google-streetview-camera-car-fleet-set-to-invade-america-279222.php">some interest</a> in Ladybugs).  The final version of Junior uses laser reflectance information to find the road markings, and judging by the results so far, seems to be getting on just fine without vision.</p>
<p>Cameras come in all shapes and sizes, but a few to look out for:</p>
<blockquote><p><strong>PointGrey Bumblebee</strong></p>
<p><a href="http://www.educatingsilicon.com/wp-content/uploads/2007/11/bumblebee.jpg" title="Bumblebee Stereo Camera"><img src="http://www.educatingsilicon.com/wp-content/uploads/2007/11/bumblebee.jpg" alt="Bumblebee Stereo Camera" /></a></p>
<p><a href="http://www.ptgrey.com/">Point Grey</a> are a popular supplier of stereo vision systems, and you can see these cameras attached to a number of vehicles. Princeton&#8217;s Team Prowler has a system based entirely around these stereo cameras &#8211; a choice they made for budget reasons.</p>
<p><strong>Ladybug2</strong></p>
<p><a href="http://www.educatingsilicon.com/wp-content/uploads/2007/11/ladybug2.jpg" title="Point Grey Ladybug2"><img src="http://www.educatingsilicon.com/wp-content/uploads/2007/11/ladybug2.jpg" alt="Point Grey Ladybug2" /></a></p>
<p>This is a spherical vision system composed of 6 tightly packed cameras, also produced by Point Grey. After Stanford abandoned their vision system, I don&#8217;t think any entries are using this camera &#8211; but there&#8217;s one sitting on my desk as I type this, so I&#8217;m including the picture anyway.</p></blockquote>
<p><strong>Radar</strong></p>
<p>Though not very visible, several cars are sporting radar units. The MIT vehicle has 16! Radar is good for long range, out to hundreds of meters, but  it&#8217;s noisy and has poor resolution. However, when you&#8217;re travelling fast and just want to know if there&#8217;s a major obstacle up ahead, it does the job. It&#8217;s already used in several commercial automotive safety systems.</p>
<p><strong>GPS</strong></p>
<p>GPS is obviously a core sensor for every entrant. Most vehicles have several redundant GPS units on their roofs, popular suppliers being companies like <a href="http://www.trimble.com/">Trimble</a> who sell rugged, high-accuracy units developed for applications in precision agriculture.</p>
<p><strong>IMU</strong></p>
<p>Though not visible on the outside, many of the entrants have inertial measurement units tucked inside. These little packages of gyroscopes and accelerometers help the vehicles keep track of position during GPS outages. High-end IMUs can be amazingly precise, but have a price tag to match.</p>
<p>This post has become something of a beast. If you still can&#8217;t get enough of sensors, there are some interesting videos <a href="http://www.tgdaily.com/content/view/34686/113/">here</a> and <a href="http://www.tgdaily.com/content/view/34685/113/">here</a> where Virgina Tech and Ben Franklin discuss their sensor suites.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.educatingsilicon.com/2007/11/03/spot-the-sensor/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

<!-- Dynamic Page Served (once) in 0.245 seconds -->
