<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Big Data Toolkit</title>
	<atom:link href="http://bigdatatoolkit.org/feed/" rel="self" type="application/rss+xml" />
	<link>http://bigdatatoolkit.org</link>
	<description>Just another CASA Blogs site</description>
	<lastBuildDate>Sat, 27 Apr 2013 11:15:31 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>The iPad Video Wall</title>
		<link>http://bigdatatoolkit.org/2013/04/18/the-ipad-video-wall/</link>
		<comments>http://bigdatatoolkit.org/2013/04/18/the-ipad-video-wall/#comments</comments>
		<pubDate>Thu, 18 Apr 2013 12:51:41 +0000</pubDate>
		<dc:creator>Steven Gray</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Development]]></category>
		<category><![CDATA[Exhibitions]]></category>
		<category><![CDATA[Featured]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[interactive]]></category>
		<category><![CDATA[iPad]]></category>
		<category><![CDATA[iPad Video Wall]]></category>
		<category><![CDATA[real-time data]]></category>
		<category><![CDATA[Video]]></category>
		<category><![CDATA[video wall]]></category>
		<category><![CDATA[visualisation]]></category>
		<category><![CDATA[Wall]]></category>

		<guid isPermaLink="false">http://bigdata.blogweb.casa.ucl.ac.uk/?p=518</guid>
		<description><![CDATA[I am happy to report that the iPad Video wall has grown up from a prototype to a fully fledged finished project. If you have been following the blog then you would have saw the prototype video of the wall&#8217;s proof of concept and watched a single movie playing over all 8 iPads. Well I&#8217;ve [...]]]></description>
				<content:encoded><![CDATA[<p>I am happy to report that the iPad Video wall has grown up from a prototype to a fully fledged finished project.  If you have been following the blog then you would have saw the <a href="https://vimeo.com/50516464" title="iPad Wall Video Test">prototype video</a> of the <a href="http://bigdatatoolkit.org/2012/10/01/ipad-video-wall/" title="iPad Video Wall">wall&#8217;s proof of concept</a> and <a href="https://vimeo.com/50516464" title="iPad Wall Video Test">watched a single movie playing over all 8 iPads</a>.  Well I&#8217;ve been hard at work in the workshop with laser cutters, hammers, routers, and vices and we have a finished product which is ready to go.  </p>
<p><span id="more-518"></span></p>
<p><a href="http://bigdatatoolkit.org/files/2013/04/IMG_0104.jpg"><img src="http://bigdatatoolkit.org/files/2013/04/IMG_0104-1024x682.jpg" alt="iPad Wall" width="560" class="aligncenter size-large wp-image-520" /></a></p>
<p>The iPad Wall consists of 12 iPads in a 4&#215;3 configuration housed in a custom designed wooden frame which is only 23mm thick.  All the iPads are powered from a single power source again housed inside the frame which charges the iPads when needed.  The wall, in its current configuration, displays data from <a href="http://www.citydashboard.org">City-Dashboard</a> and allows users to interact with the data.  By touching some of the iPad&#8217;s, the view flips to a historic graph of data over the last 24 hours. We can show individual videos on any of the iPads and we usually show the live stream of BBC News 24 on iPad number 11 which allows us to keep up with all the latest news happening around the world.  </p>
<p><a href="http://bigdatatoolkit.org/files/2013/04/IMG_20130117_143439.jpg"><img src="http://bigdatatoolkit.org/files/2013/04/IMG_20130117_143439-768x1024.jpg" alt="iPad Wall Side on" width="560" class="aligncenter size-large wp-image-523" /></a> </p>
<p>When we have visitors in the office we can grab our mobile phones and open up the iPad Video Wall mobile application and play one of the six visualization videos we have created for the wall.  These videos play over all 12 iPads creating a virtual video wall that gives the illusion that you are looking through a window.  The effect is fantastic when you view it in person as you can see in the video below.</p>
<p style="text-align: center;"><iframe width="560" height="315" src="http://www.youtube.com/embed/BRCxNtx5_i0?vq=hd720" frameborder="0" allowfullscreen></iframe></p>
<p>I envision that a wall like this can be used in decision making, giving an insight into what is happening in real-time.  Unlike traditional walls, which are heavy bulky, and fixed to walls, the iPad Video Wall is very portable and can be moved around the office where-ever it is needed. It also opens up new interaction designs for the traditional video wall.  By having 12 touch screens we can come up with new methods to interact with real-time data.  The wall was funded through a generous grant by <a href="http://www.ucl.ac.uk/enterprise" title="UCL Enterprise">UCL Enterprise</a> and forms an output to the <a href="http://geotalisman.org" title="Talisman Blog">Talisman project</a></p>
<p>In the following days I&#8217;ll be posting up a few blog entries on how I built the wall and how the software works so that you can get a real behind the scenes look at the current state of the wall. </p>
<p>I&#8217;m <a href="http://twitter.com/frogo" title="Me on Twitter">@frogo</a> on Twitter. If you get a chance then you should <a href="http://twitter.com/frogo" title="Follow Me">follow me</a>.</p>
<g:plusone href="http://bigdatatoolkit.org/2013/04/18/the-ipad-video-wall/"></g:plusone><div id="tweetbutton518" class="tw_button" style="float:left;margin-right:10px;"><a href="http://twitter.com/share?url=http%3A%2F%2Fbigdatatoolkit.org%2F2013%2F04%2F18%2Fthe-ipad-video-wall%2F&amp;text=The%20Interactive%20iPad%20Video%20Wall%3A%20&amp;related=frogo:Follow%20me&amp;lang=en&amp;count=horizontal&amp;counturl=http%3A%2F%2Fbigdatatoolkit.org%2F2013%2F04%2F18%2Fthe-ipad-video-wall%2F" class="twitter-share-button"  style="width:55px;height:22px;background:transparent url('http://bigdata.blogweb.casa.ucl.ac.uk/wp-content/plugins/wp-tweet-button/tweetn.png') no-repeat  0 0;text-align:left;text-indent:-9999px;display:block;">Tweet</a></div>]]></content:encoded>
			<wfw:commentRss>http://bigdatatoolkit.org/2013/04/18/the-ipad-video-wall/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Telefonica (02) plans to explore Big Data</title>
		<link>http://bigdatatoolkit.org/2012/10/11/exploring-big-data/</link>
		<comments>http://bigdatatoolkit.org/2012/10/11/exploring-big-data/#comments</comments>
		<pubDate>Thu, 11 Oct 2012 11:12:01 +0000</pubDate>
		<dc:creator>Steven Gray</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[data processing]]></category>
		<category><![CDATA[publications]]></category>
		<category><![CDATA[research]]></category>

		<guid isPermaLink="false">http://bigdata.blogweb.casa.ucl.ac.uk/?p=512</guid>
		<description><![CDATA[Telecommunication companies are sitting on a gold mine. With the prevalence of mobile devices in our every day life, for example reports from Google IO are that 400 million Android devices have been activated at a rate of 1,000,000 activations per day, the data that we generate as a collective group is phenomenal. Phone companies [...]]]></description>
				<content:encoded><![CDATA[<p>Telecommunication companies are sitting on a gold mine.  With the prevalence of mobile devices in our every day life, for example reports from <a href="http://www.engadget.com/2012/06/27/google-400-million-android-devices-one-million-activations-a-d/" target="_blank">Google IO</a> are that 400 million Android devices have been activated at a rate of 1,000,000 activations per day, the data that we generate as a collective group is phenomenal.  Phone companies are collecting all sorts of data from location to browsing history to who you phone and message.</p>
<p>Telefonia plans to set up a group called &#8220;Telefonica Digital Insights&#8221; to explore this data and link the data together to create anonymous profiles based on demographics and location.  Using the data they will be able to sell fine grain locational data (up to hundredths of meters) to retailers, for example, of certain demographics groups frequenting the area. This data combined with footfall data that retailers already have would allow them to see where demographics go when they are outside. The one controversial decision is that there will be no opt-out of this service.  If your&#8217;e one of the 23 million UK subscribers then you&#8217;re already collecting the data for them. But they are not the only company to use this sort of data to understand the demographics using their services.</p>
<p>Verizon, a US telecommunications provider, are also selling anonymised personal data. One astute <a href="https://alpha.app.net/bryanjclark/post/843872" target="_blank">user</a> noticed a clause in the terms and conditions which allows then to sell the data derived and collected through normal use of service but unlike Telefonica, customers have 30 days to opt-out of the service. </p>
<p>It remains unclear what the data will show and how these companies will start to make value from the analysis of data but it is clear that our normal day to day activities can be linked and we can be used as sensors.  The original story can be read on <a href="http://www.bbc.co.uk/news/technology-19882647" target="_blank">BBC News</a>.</p>
<g:plusone href="http://bigdatatoolkit.org/2012/10/11/exploring-big-data/"></g:plusone><div id="tweetbutton512" class="tw_button" style="float:left;margin-right:10px;"><a href="http://twitter.com/share?url=http%3A%2F%2Fbigdatatoolkit.org%2F2012%2F10%2F11%2Fexploring-big-data%2F&amp;text=Telefonica%20%2802%29%20plans%20to%20explore%20Big%20Data&amp;related=&amp;lang=en&amp;count=horizontal&amp;counturl=http%3A%2F%2Fbigdatatoolkit.org%2F2012%2F10%2F11%2Fexploring-big-data%2F" class="twitter-share-button"  style="width:55px;height:22px;background:transparent url('http://bigdata.blogweb.casa.ucl.ac.uk/wp-content/plugins/wp-tweet-button/tweetn.png') no-repeat  0 0;text-align:left;text-indent:-9999px;display:block;">Tweet</a></div>]]></content:encoded>
			<wfw:commentRss>http://bigdatatoolkit.org/2012/10/11/exploring-big-data/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Big Data Problems have been around longer than you think</title>
		<link>http://bigdatatoolkit.org/2012/10/03/big-data-problems/</link>
		<comments>http://bigdatatoolkit.org/2012/10/03/big-data-problems/#comments</comments>
		<pubDate>Wed, 03 Oct 2012 11:43:51 +0000</pubDate>
		<dc:creator>Steven Gray</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[data processing]]></category>
		<category><![CDATA[problems]]></category>
		<category><![CDATA[shortest path]]></category>

		<guid isPermaLink="false">http://bigdata.blogweb.casa.ucl.ac.uk/?p=499</guid>
		<description><![CDATA[The Strata Conference is in town and one presentation that caught my eye was titled The Great Railway Caper: Big Data in 1955. John Graham-Cumming from CloudFlare gives a great overview on why some Big Data problems have been around since the early days of computing when computer filled entire rooms. Back in 1955 the [...]]]></description>
				<content:encoded><![CDATA[<p>The <a href="http://strataconf.com/strataeu/public/content/about" target="_blank">Strata Conference</a> is in town and one presentation that caught my eye was titled <a href="http://strataconf.com/strataeu/public/schedule/detail/26214" target="_blank">The Great Railway Caper: Big Data in 1955</a>.  <a href="http://jgc.org/" target="_blank">John Graham-Cumming</a> from <a href="http://www.cloudflare.com/" target="_blank">CloudFlare</a> gives a great overview on why some Big Data problems have been around since the early days of computing when computer filled entire rooms. </p>
<p align="center"><a href="http://bigdatatoolkit.org/files/2012/10/1951_leo_large.jpg"><img src="http://bigdatatoolkit.org/files/2012/10/1951_leo_large-300x181.jpg" alt="" title="1951_leo_large" width="300" height="181" class="aligncenter size-medium wp-image-506" /></a></p>
<p>Back in 1955 the Government tasked a team of people, including Roger Coleman, to calculate the distances between nodes on the British Rail system using the first computer created for a commercial company called <a href="http://en.wikipedia.org/wiki/LEO_(computer)" target="_blank">LEO</a> owned by the <a href="http://en.wikipedia.org/wiki/J._Lyons_and_Co." target="_blank">Lyons company</a>.  They had to compute distances over 12 million nodes which represent the connection between the 5,550 stations in the United Kingdom using a computer that had the equivalent of a 500Hz CPU and  2 Kilobytes of RAM.   To give you an idea of the size of the RAM on this mainframe, 2K is about the equivalent to storing 2048 characters in memory and that&#8217;s not including the space you would need to store the program while running. The problems don&#8217;t stop there.  This before graph theory and the shortest path algorithms exsisted and pre-dates some of the standard, accepted algorithmic solutions by 5 years</p>
<p><iframe width="560" height="315" src="http://www.youtube.com/embed/pcBJfkE5UwU" frameborder="0" allowfullscreen></iframe></p>
<p>John bridges some parallels between today&#8217;s big data challenges and the challenges these pioneers had to deal with from renting out time from the Lyons company (renting out time on Amazon&#8217;s EC2), challenges with storing output (reams of punchcards vs. hard drive space) and memory contraints (2K was the limit due to there being not enough mercury in the earth to build larger medium) </p>
<p>The talk is a very entertaining and provides insight into what appears to be a very old problem in big data processing.  A very interesting watch indeed. </p>
<g:plusone href="http://bigdatatoolkit.org/2012/10/03/big-data-problems/"></g:plusone><div id="tweetbutton499" class="tw_button" style="float:left;margin-right:10px;"><a href="http://twitter.com/share?url=http%3A%2F%2Fbigdatatoolkit.org%2F2012%2F10%2F03%2Fbig-data-problems%2F&amp;text=Big%20Data%20Problems%20have%20been%20around%20longer%20than%20you%20think&amp;related=&amp;lang=en&amp;count=horizontal&amp;counturl=http%3A%2F%2Fbigdatatoolkit.org%2F2012%2F10%2F03%2Fbig-data-problems%2F" class="twitter-share-button"  style="width:55px;height:22px;background:transparent url('http://bigdata.blogweb.casa.ucl.ac.uk/wp-content/plugins/wp-tweet-button/tweetn.png') no-repeat  0 0;text-align:left;text-indent:-9999px;display:block;">Tweet</a></div>]]></content:encoded>
			<wfw:commentRss>http://bigdatatoolkit.org/2012/10/03/big-data-problems/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>IBM&#8217;s Mira will simulate the universe</title>
		<link>http://bigdatatoolkit.org/2012/10/02/ibms-mira-will-simulate-the-universe/</link>
		<comments>http://bigdatatoolkit.org/2012/10/02/ibms-mira-will-simulate-the-universe/#comments</comments>
		<pubDate>Tue, 02 Oct 2012 08:35:21 +0000</pubDate>
		<dc:creator>Steven Gray</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[supercomputers]]></category>
		<category><![CDATA[universe]]></category>

		<guid isPermaLink="false">http://bigdatatoolkit.org/?p=494</guid>
		<description><![CDATA[IBM have been on the forefront of Big Data research for the last few years. Last year Watson beat 2 human competitors in a game of Jeopardy by making links between data and reasoning what the correct answer would be in fractions of a second. Now the hardware scientists are at it again.  Mira, the [...]]]></description>
				<content:encoded><![CDATA[<p>IBM have been on the forefront of Big Data research for the last few years. Last year <a href="http://www.youtube.com/watch?v=DywO4zksfXw" title="Watson Video" target="_blank">Watson </a>beat 2 human competitors in a game of Jeopardy by making links between data and reasoning what the correct answer would be in fractions of a second. Now the hardware scientists are at it again.  </p>
<p><a href="http://bigdatatoolkit.org/files/2012/10/Mira.jpg"><img src="http://bigdatatoolkit.org/files/2012/10/Mira-300x198.jpg" alt="" title="Mira" width="300" height="198" class="aligncenter size-medium wp-image-496" /></a></p>
<p><a href="http://i.top500.org/system/177718" target="_blank">Mira</a>, the<a href="http://en.wikipedia.org/wiki/IBM_Mira" target="_blank"> 3rd largest super computer</a> in the world, is gearing up to simulate the first 13 billion years of our universe from the very start of the big bang. Physicists from the <a href="http://www.anl.gov/" target="_blank">Argonne National Laboratory</a> will use the cluster to simulate the trillions of interactions between particles, hopefully creating giant masses of planets, stars and galaxies. The simulation will hopefully give scientists an understanding of how these elemental interactions create a time lapse of expanding bodies in the universe.</p>
<p>To give a sense of the power of IBM&#8217;s Mira, the whole task is scheduled to complete in just 2 weeks.  With the advances in hardware, distributed computing and CPU design, engineers can cram more cores on a single chip and if Moore&#8217;s Law holds then clusters by the end of the decade will be 1000 times more powerful than Mira.</p>
<p>If you want to read more about the experiment then the <a href="http://www.theatlantic.com/technology/archive/2012/09/meet-mira-the-supercomputer-that-makes-universes/262639/" target="_blank">full article</a> is over at the <a href="http://www.theatlantic.com/technology/archive/2012/09/meet-mira-the-supercomputer-that-makes-universes/262639/" target="_blank">Alantic</a>.</p>
<g:plusone href="http://bigdatatoolkit.org/2012/10/02/ibms-mira-will-simulate-the-universe/"></g:plusone><div id="tweetbutton494" class="tw_button" style="float:left;margin-right:10px;"><a href="http://twitter.com/share?url=http%3A%2F%2Fbigdatatoolkit.org%2F2012%2F10%2F02%2Fibms-mira-will-simulate-the-universe%2F&amp;text=IBM%26%238217%3Bs%20Mira%20will%20simulate%20the%20universe&amp;related=&amp;lang=en&amp;count=horizontal&amp;counturl=http%3A%2F%2Fbigdatatoolkit.org%2F2012%2F10%2F02%2Fibms-mira-will-simulate-the-universe%2F" class="twitter-share-button"  style="width:55px;height:22px;background:transparent url('http://bigdata.blogweb.casa.ucl.ac.uk/wp-content/plugins/wp-tweet-button/tweetn.png') no-repeat  0 0;text-align:left;text-indent:-9999px;display:block;">Tweet</a></div>]]></content:encoded>
			<wfw:commentRss>http://bigdatatoolkit.org/2012/10/02/ibms-mira-will-simulate-the-universe/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>iPad Video Wall</title>
		<link>http://bigdatatoolkit.org/2012/10/01/ipad-video-wall/</link>
		<comments>http://bigdatatoolkit.org/2012/10/01/ipad-video-wall/#comments</comments>
		<pubDate>Mon, 01 Oct 2012 08:21:37 +0000</pubDate>
		<dc:creator>Steven Gray</dc:creator>
				<category><![CDATA[Development]]></category>

		<guid isPermaLink="false">http://bigdatatoolkit.org/?p=472</guid>
		<description><![CDATA[It seems like my favourite device of the moment is the iPad.  First I built the QRator app which has been quite popular and well received by the UCL Grant Museum.  We even won an award for the system. After a discussion with a few of my colleagues about new exhibition pieces for upcoming events [...]]]></description>
				<content:encoded><![CDATA[<p>It seems like my favourite device of the moment is the iPad.  First I built the <a href="http://www.qrator.org/about-the-project/ipads/" title="QRator Website" target="_blank">QRator</a> app which has been quite popular and well received by the <a href="http://www.ucl.ac.uk/museums/zoology" title="UCL Grant Museum" target="_blank">UCL Grant Museum</a>.  We even won an <a href="http://www.museumsandheritage.com/awards/award-winners-2012/innovations" title="QRator Award" target="_blank">award for the system</a>. </p>
<p>After a discussion with a few of my colleagues about new exhibition pieces for upcoming events we thought it would great to have a video wall to showcase some of our visualisations created at CASA.  Using 8 iPads and a custom application to connect them together via a centralised server, we now have a basic video wall.</p>
<p style="text-align: center;"><iframe src="http://player.vimeo.com/video/50516464" width="500" height="281" frameborder="0" webkitAllowFullScreen mozallowfullscreen allowFullScreen></iframe> </p>
<p>The iPads connect to a server (my iMac) which in turn notifies them which video segment they have to play, then once connected the server will tell each device to start buffering the video.  After a few seconds all iPads are told to play simultaneously and the result is what you see in the video above, a silly movie of myself. </p>
<p>The next steps for this project is to add some interactive screens so users can interact with the wall and select content that should be displayed on the wall. Also  I&#8217;m also looking into different ways to charge multiple iPads while retaining the ability to sync them all with 1 cable. I&#8217;m really happy with the results, especially being a proof of concept, so once I&#8217;ve ironed out the few synchronisation bugs and added more functionality wall then maybe we will build a full size wall in the near future.</p>
<g:plusone href="http://bigdatatoolkit.org/2012/10/01/ipad-video-wall/"></g:plusone><div id="tweetbutton472" class="tw_button" style="float:left;margin-right:10px;"><a href="http://twitter.com/share?url=http%3A%2F%2Fbigdatatoolkit.org%2F2012%2F10%2F01%2Fipad-video-wall%2F&amp;text=iPad%20Video%20Wall&amp;related=&amp;lang=en&amp;count=horizontal&amp;counturl=http%3A%2F%2Fbigdatatoolkit.org%2F2012%2F10%2F01%2Fipad-video-wall%2F" class="twitter-share-button"  style="width:55px;height:22px;background:transparent url('http://bigdata.blogweb.casa.ucl.ac.uk/wp-content/plugins/wp-tweet-button/tweetn.png') no-repeat  0 0;text-align:left;text-indent:-9999px;display:block;">Tweet</a></div>]]></content:encoded>
			<wfw:commentRss>http://bigdatatoolkit.org/2012/10/01/ipad-video-wall/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Wolfram Alpha&#8217;s Personal Analytics</title>
		<link>http://bigdatatoolkit.org/2012/09/03/wolfram-alphas-personal-analytics/</link>
		<comments>http://bigdatatoolkit.org/2012/09/03/wolfram-alphas-personal-analytics/#comments</comments>
		<pubDate>Mon, 03 Sep 2012 10:15:07 +0000</pubDate>
		<dc:creator>Steven Gray</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://bigdata.blogweb.casa.ucl.ac.uk/?p=416</guid>
		<description><![CDATA[Wolfram Alpha has just launched their new take on social media analysis, building personalised reports for Facebook users. The computational engine builds various metrics and visualisation based on usage over a period of time, number of friends, geographical distribution of friends and even a network graph showing connections between friends. If you head to the [...]]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.wolframalpha.com/">Wolfram Alpha</a> has just launched their new take on social media analysis, building personalised reports for Facebook users.  The computational engine builds various metrics and visualisation based on usage over a period of time, number of friends, geographical distribution of friends and even a network graph showing connections between friends.   If you head to the <a href="http://www.wolframalpha.com/input/?i=facebook+report">Wolfram Alpha</a> website, type in &#8220;facebook report&#8221; and then link your facebook profile to the site, Wolfram Alpha will collect stats via Facebook&#8217;s Open Graph platform and then starts aggregating the data.  The real value of a service like this is giving users overview of the actual value of the data hidden inside your Facebook profile. </p>
<p><a href="http://bigdatatoolkit.org/files/2012/09/gender-stats.png"><img src="http://bigdatatoolkit.org/files/2012/09/gender-stats-300x212.png" alt="" title="gender stats" width="300" height="212" class="aligncenter size-medium wp-image-423" /></a>There were some little gems of information I found in my data.  For example, I have a near 50/50 split of male to female friends and around 36% of them are married.  My network graph shows all the distinct groups of my friends including UCL friends, Glasgow University friends and my disconnect groups are my family.  The biggest insight into my data was my own personal use of facebook mainly involves posting lots of photographs and not that many textual update posts.</p>
<p><img src="http://bigdatatoolkit.org/files/2012/09/network-graph-300x221.png" alt="" title="network-graph" width="300" height="221" class="aligncenter size-medium wp-image-421" /></p>
<p>It&#8217;s really great to see that Wolfram Alpha is joining data like your birthday and historical weather reports or moon phases to give even more insight into your personal data.  I think that the most impressive feature is that the speed that the analytics are created.  Some of these quires are computationally expensive yet Wolfram Alpha seems to correlate the data extremely quickly and have the interactive graphs ready to view in less than a minute.  </p>
<p><img src="http://bigdatatoolkit.org/files/2012/09/geostats-300x289.png" alt="" title="geostats" width="300" height="289" class="aligncenter size-medium wp-image-424" /></p>
<p>If you&#8217;re curious about your own facebook analytics then <a href="http://www.wolframalpha.com/input/?i=facebook+report">head over to the site</a> and give it a try.</p>
<g:plusone href="http://bigdatatoolkit.org/2012/09/03/wolfram-alphas-personal-analytics/"></g:plusone><div id="tweetbutton416" class="tw_button" style="float:left;margin-right:10px;"><a href="http://twitter.com/share?url=http%3A%2F%2Fbigdatatoolkit.org%2F2012%2F09%2F03%2Fwolfram-alphas-personal-analytics%2F&amp;text=Wolfram%20Alpha%26%238217%3Bs%20Personal%20Analytics&amp;related=&amp;lang=en&amp;count=horizontal&amp;counturl=http%3A%2F%2Fbigdatatoolkit.org%2F2012%2F09%2F03%2Fwolfram-alphas-personal-analytics%2F" class="twitter-share-button"  style="width:55px;height:22px;background:transparent url('http://bigdata.blogweb.casa.ucl.ac.uk/wp-content/plugins/wp-tweet-button/tweetn.png') no-repeat  0 0;text-align:left;text-indent:-9999px;display:block;">Tweet</a></div>]]></content:encoded>
			<wfw:commentRss>http://bigdatatoolkit.org/2012/09/03/wolfram-alphas-personal-analytics/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Olympic Twitter Collectors</title>
		<link>http://bigdatatoolkit.org/2012/08/10/olympic-twitter-collectors/</link>
		<comments>http://bigdatatoolkit.org/2012/08/10/olympic-twitter-collectors/#comments</comments>
		<pubDate>Fri, 10 Aug 2012 10:30:24 +0000</pubDate>
		<dc:creator>Steven Gray</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://bigdata.blogweb.casa.ucl.ac.uk/?p=394</guid>
		<description><![CDATA[As the athletes have been training for the London 2012 Olympic Games so has been our Twitter Collectors. You may have saw the maps we created from data collected by the very first iteration of the Big Data Toolkit&#8217;s Twitter collector which produced some great visualisations. Over the past few weeks I re-wrote some of [...]]]></description>
				<content:encoded><![CDATA[<p>As the athletes have been training for the London 2012 Olympic Games so has been our Twitter Collectors.   You may have saw the maps we created from data collected by the very first iteration of the Big Data Toolkit&#8217;s Twitter collector which produced some great visualisations.  Over the past few weeks I re-wrote some of the major components of the system to allow multiple machines to connect together and form a swarm of collectors.  This allows the swarm to collect data from different locations over the same period of time.  We thought that with a certain international event happening on our doorstep what better way to test the system out.  At CASA we are lucky enough to have our own private cloud resources that allow us to spawn machines when we need extra infrastructure.  So for the Olympic collectors we have 22 machines each collecting Tweets from each of the Olympic venues all over London and we have managed to collect over 1.4 million tweets from the last 14 days of the Olympics (Each has been sent from the vicinity of each venue hence why the individual numbers are low).    </p>
<p><a href="http://bigdatatoolkit.org/files/2012/08/Olympics-Collection-2.png"><img src="http://bigdatatoolkit.org/files/2012/08/Olympics-Collection-2.png" alt="" title="Olympics Collection 2" width="570" height="496" class="aligncenter size-full wp-image-408" /></a></p>
<p>One central server manages the swarm and asks each collector every 5 seconds to send back statistics giving us a live view of each server just incase a machine stalls. This is a big step towards a completed system allowing users to initiate collectors on services such as Amazon&#8217;s EC2 to takle large scale data capture and still get responsive, live statistics of what each individual machine is collecting.</p>
<p>We have already incorporated this data back into City DashBoard which you can read more about at <a href="http://oliverobrien.co.uk/2012/08/olympic-venue-tweets-on-citydashboard/" title="CityDB">Oliver O&#8217;Brien&#8217;s blog</a></p>
<p><strong>Update:</strong>  I&#8217;ve now enabled the relevant web sockets ports on our servers so you can see the stats from the live collectors <a href="http://bigdata.casa.ucl.ac.uk/olympics">here</a></p>
<g:plusone href="http://bigdatatoolkit.org/2012/08/10/olympic-twitter-collectors/"></g:plusone><div id="tweetbutton394" class="tw_button" style="float:left;margin-right:10px;"><a href="http://twitter.com/share?url=http%3A%2F%2Fbigdatatoolkit.org%2F2012%2F08%2F10%2Folympic-twitter-collectors%2F&amp;text=Olympic%20Twitter%20Collectors&amp;related=&amp;lang=en&amp;count=horizontal&amp;counturl=http%3A%2F%2Fbigdatatoolkit.org%2F2012%2F08%2F10%2Folympic-twitter-collectors%2F" class="twitter-share-button"  style="width:55px;height:22px;background:transparent url('http://bigdata.blogweb.casa.ucl.ac.uk/wp-content/plugins/wp-tweet-button/tweetn.png') no-repeat  0 0;text-align:left;text-indent:-9999px;display:block;">Tweet</a></div>]]></content:encoded>
			<wfw:commentRss>http://bigdatatoolkit.org/2012/08/10/olympic-twitter-collectors/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>How are the big boys using big data?</title>
		<link>http://bigdatatoolkit.org/2012/06/07/how-are-the-big-boys-using-big-data/</link>
		<comments>http://bigdatatoolkit.org/2012/06/07/how-are-the-big-boys-using-big-data/#comments</comments>
		<pubDate>Thu, 07 Jun 2012 09:43:38 +0000</pubDate>
		<dc:creator>Steven Gray</dc:creator>
				<category><![CDATA[Informative]]></category>

		<guid isPermaLink="false">http://bigdata.blogweb.casa.ucl.ac.uk/?p=377</guid>
		<description><![CDATA[Last night Google held a press event unveiling the recent additions to the Google Map/Earth arsenal, automatic 3D mapping from aerial imagery and a mobile StreetView capture backpack. While we&#8217;ve seen this kind of 3D mapping from companies like C3, which Apple acquired a few years back (expect to see that in iOS 6 next [...]]]></description>
				<content:encoded><![CDATA[<p>Last night Google held a press event unveiling the recent additions to the Google Map/Earth arsenal, automatic 3D mapping from aerial imagery and a mobile StreetView capture backpack.  While we&#8217;ve seen this kind of 3D mapping from companies like <a href="http://9to5mac.com/2011/10/29/apple-acquired-mind-blowing-3d-mapping-company-c3-technologies-looking-to-take-ios-maps-to-the-next-level/" title="C3 Tech" target="_blank">C3</a>, which Apple acquired a few years back (expect to see that in iOS 6 next week), this wasn&#8217;t the highlight of the event for me. </p>
<p><img src="http://bigdatatoolkit.org/files/2012/06/google-maps-steetview.jpeg" alt="" title="google-maps-steetview" width="560" height="418" class="aligncenter size-full wp-image-378" /></p>
<p>During the &#8220;<a href="https://www.youtube.com/watch?feature=player_embedded&#038;v=HMBJ2Hu0NLw" target="_blank">Next Dimension of Maps</a>&#8221; event Google announced that the StreetView cars have driven over 5 million unique miles of roads and collected 20 petabytes of data on their travels.  Let&#8217;s stop and think about how much data that is. 20 petabytes is equivalent to 20,480 terabytes (TB), if you buy a standard PC today it usually comes with a 1TB hard drive. It&#8217;s like you have sat in 1 seat of Pittodrie Stadium in Aberdeen and Google has bought out the rest of the stadium.  Although putting size aside it wasn&#8217;t really clear to me what they were doing with all that data until I read a<a href="http://www.holovaty.com/writing/streetview/" target="_blank"> recent article</a> that made me really start to think. </p>
<div id="attachment_379" class="wp-caption aligncenter" style="width: 570px"><a href="http://bigdatatoolkit.org/files/2012/06/pit-stadium.jpg"><img src="http://bigdatatoolkit.org/files/2012/06/pit-stadium.jpg" alt="" title="pit-stadium" width="560" height="377" class="size-full wp-image-379" /></a><p class="wp-caption-text">Pretty lonely up there in Pittodrie Stadium</p></div>
<p>As Google drives around the world taking images of the streets they collect Lidar data, a high detail 360-degree laser scan of the surrounding area. This gives an accurate point cloud of buildings, signs, surfaces etc. around the car. You may have seen the rectangle box in StreetView showing you the front of buildings.  So why are they collecting this data?  Google has never said. </p>
<div id="attachment_382" class="wp-caption aligncenter" style="width: 590px"><a href="http://bigdatatoolkit.org/files/2012/06/Self-driving-car.jpg"><img src="http://bigdatatoolkit.org/files/2012/06/Self-driving-car.jpg" alt="" title="Self-driving car" width="580" height="349" class="size-full wp-image-382" /></a><p class="wp-caption-text">Google&#039;s self-driving car which I saw in April 2012</p></div>
<p>Well another little project that hit the headlines was Google&#8217;s automatous, self-driving cars.  These cars also use Lidar to detect the surrounding traffic, people crossing the street etc., to give the car&#8217;s software an accurate image of the cars current location. This is where it gets interesting.  Have Google used the StreetView data they collected to train the self-driving cars software?</p>
<div id="attachment_380" class="wp-caption aligncenter" style="width: 590px"><a href="http://bigdatatoolkit.org/files/2012/06/lidardata.png"><img src="http://bigdatatoolkit.org/files/2012/06/lidardata.png" alt="" title="lidardata" width="580" height="324" class="size-full wp-image-380" /></a><p class="wp-caption-text">What the self-driving can see.</p></div>
<p>It makes perfect sense!  Drive around all the roads in the world and gather the data to train the AI in the car to all the possibilities of junctions, road types, crossings, signs, international differences, if you can see it on the road &#8211; chances are a StreetView car has saw it as well.  The big question is what came first?  Was it Google&#8217;s intension to use StreetView data to train a car? Or did some sharp engineer say, “I know what to do with this data&#8221; in his 20% time? We will never know the answer but we can all learn a lesson from Google.</p>
<p>When you plan a project to collect data then collect as much data as possible. Don&#8217;t throw away data that you collect because you think it’s not valuable now. Chances are that somewhere down the line the data you collect may be useful to someone somewhere.</p>
<g:plusone href="http://bigdatatoolkit.org/2012/06/07/how-are-the-big-boys-using-big-data/"></g:plusone><div id="tweetbutton377" class="tw_button" style="float:left;margin-right:10px;"><a href="http://twitter.com/share?url=http%3A%2F%2Fbigdatatoolkit.org%2F2012%2F06%2F07%2Fhow-are-the-big-boys-using-big-data%2F&amp;text=How%20are%20the%20big%20boys%20using%20big%20data%3F&amp;related=&amp;lang=en&amp;count=horizontal&amp;counturl=http%3A%2F%2Fbigdatatoolkit.org%2F2012%2F06%2F07%2Fhow-are-the-big-boys-using-big-data%2F" class="twitter-share-button"  style="width:55px;height:22px;background:transparent url('http://bigdata.blogweb.casa.ucl.ac.uk/wp-content/plugins/wp-tweet-button/tweetn.png') no-repeat  0 0;text-align:left;text-indent:-9999px;display:block;">Tweet</a></div>]]></content:encoded>
			<wfw:commentRss>http://bigdatatoolkit.org/2012/06/07/how-are-the-big-boys-using-big-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Visualising Social Media: Facebook Checkins</title>
		<link>http://bigdatatoolkit.org/2012/06/06/visualising-social-media/</link>
		<comments>http://bigdatatoolkit.org/2012/06/06/visualising-social-media/#comments</comments>
		<pubDate>Wed, 06 Jun 2012 09:40:03 +0000</pubDate>
		<dc:creator>Steven Gray</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://bigdatatoolkit.org/?p=365</guid>
		<description><![CDATA[You may or may not realise but most social media companies have geo-location at the heart of there services. Facebook and foursquare have checkins, Twitter allows users to geotag their tweets and Google Latitude and Apple&#8217;s Find my friend, with permission, all track your location and send it back for analysis. Have you ever wondered [...]]]></description>
				<content:encoded><![CDATA[<p>You may or may not realise but most social media companies have geo-location at the heart of there services. Facebook and foursquare have checkins, Twitter allows users to geotag their tweets and Google Latitude and Apple&#8217;s Find my friend, with permission, all track your location and send it back for analysis. Have you ever wondered what happens to all that data or considered what the bigger picture looks like? </p>
<p><a href="https://www.facebook.com/notes/facebook-data-team/visualizing-activity-on-facebook/10150884743158859" target="_blank"><img src="http://bigdatatoolkit.org/files/2012/06/582255_10150856803877036_100061655_n.jpeg" alt="" title="582255_10150856803877036_100061655_n" width="560" height="350" class="aligncenter size-full wp-image-369" /></a></p>
<p>Well the researchers at Facebook&#8217;s Data Team have released s<a href="https://www.facebook.com/notes/facebook-data-team/visualizing-activity-on-facebook/10150884743158859" target="_blank">ome fantastic visualisations of global checkins</a> collected by their 900 million users. By joining this locational data with other information such as age, political affiliation or even distances between checkins, the researchers can start to understand our travel patterns or even the distrubtion of certain demographic groups. </p>
<p>At CASA we have looked at the <a href="http://bigdatatoolkit.org/2011/05/19/london-twitter-map/" title="London Tweets">geo-located tweets of London from Twitter data</a> and also James Cheshire, over at <a href="http://spatialanalysis.co.uk/" title="Spatial Analysis" target="_blank">spatialanalysis.co.uk</a>, and I have looked at the <a href="http://spatialanalysis.co.uk/2012/04/twitter-languages-london/" title="Language of Twitter" target="_blank">language distribution of London based on Twitter data</a>. This certainly is a hot topic at the moment and proves just what can be done with this type of crowd sourced data.</p>
<p>You can read more about the process of making these image and even have a look at higher quality versions of the maps over at the <a href="https://www.facebook.com/notes/facebook-data-team/visualizing-activity-on-facebook/10150884743158859" title="Facebook Data Team" target="_blank">Facebook&#8217;s Data Team blog</a>.</p>
<g:plusone href="http://bigdatatoolkit.org/2012/06/06/visualising-social-media/"></g:plusone><div id="tweetbutton365" class="tw_button" style="float:left;margin-right:10px;"><a href="http://twitter.com/share?url=http%3A%2F%2Fbigdatatoolkit.org%2F2012%2F06%2F06%2Fvisualising-social-media%2F&amp;text=Visualising%20Social%20Media%3A%20Facebook%20Checkins&amp;related=&amp;lang=en&amp;count=horizontal&amp;counturl=http%3A%2F%2Fbigdatatoolkit.org%2F2012%2F06%2F06%2Fvisualising-social-media%2F" class="twitter-share-button"  style="width:55px;height:22px;background:transparent url('http://bigdata.blogweb.casa.ucl.ac.uk/wp-content/plugins/wp-tweet-button/tweetn.png') no-repeat  0 0;text-align:left;text-indent:-9999px;display:block;">Tweet</a></div>]]></content:encoded>
			<wfw:commentRss>http://bigdatatoolkit.org/2012/06/06/visualising-social-media/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How to build a London Data Table</title>
		<link>http://bigdatatoolkit.org/2012/04/26/london-data-table/</link>
		<comments>http://bigdatatoolkit.org/2012/04/26/london-data-table/#comments</comments>
		<pubDate>Thu, 26 Apr 2012 21:00:33 +0000</pubDate>
		<dc:creator>Steven Gray</dc:creator>
				<category><![CDATA[Development]]></category>
		<category><![CDATA[Featured]]></category>

		<guid isPermaLink="false">http://bigdata.blogweb.casa.ucl.ac.uk/?p=333</guid>
		<description><![CDATA[We recently held a one day conference here at CASA called Smart Cities. For the conference we built various exhibition pieces and my contribution to the conference was the London Data Table, a projection table the shape of Greater London. The table had various visualisation projected onto the surface; from live aircraft positions, live traffic [...]]]></description>
				<content:encoded><![CDATA[<p><img src="http://bigdatatoolkit.org/files/2012/04/table_final.jpg" alt="Table Final" /></p>
<p>We recently held a one day conference here at CASA called Smart Cities.  For the conference we built various exhibition pieces and my contribution to the conference was the London Data Table, a projection table the shape of Greater London. The table had various visualisation projected onto the surface; from live aircraft positions, live traffic and bike hire usage to movies of public transport over 24 hours.  We got some great feedback from the attendees and I thought I would share my documentation of the whole process of build the table, from start to finish. </p>
<p><strong>Step 1: The Planning</strong><br />
<img src="http://bigdatatoolkit.org/files/2012/04/LondonSmall.jpg" alt="London Small Calculations" /><br />
The biggest issue in this project was size.  How big do we make the table? Can we find a projector with a short enough throw to project to the table?  How were we going to mount the projector.   I downloaded the outline of London from OS OpenData (<a href="http://os.openstreetmap.org/data/">http://os.openstreetmap.org/data/</a>) and created a pdf of the London outline, this would serve as our master vector throughout the process. I&#8217;d be lying if I said we knew all the answers the questions at the start of the project but with a few calculations and a lot of paper we mocked up the outline on paper (to scale) and stuck it to the wall of the office.  </p>
<p><img src="http://bigdatatoolkit.org/files/2012/04/LondonWall-1024x768.jpg" alt="London on the Wall" /></p>
<p>Planning the project this way allowed us to see that a normal projector would be impractical (due to the distance required to project an image of that size) and that we would have to buy a short throw projector, not ideal. Time was against us so we ordered the projector, took the leap and started to build the table. </p>
<p><strong>Step 2: Building the Table</strong><br />
I wanted this table to become a piece of furniture!  There was no point putting in all this effort to build a table which would only last the length of the conference and then started to sag, as MDF eventually does. So after some advice from the staff in the Bartlett workshop I decided to order a 10ft x 5ft (3m x 1.5m) piece of birch plywood. Birch plywood has excellent strength with a beautiful finish but is some what more expensive that MDF. </p>
<p>Here at the Bartlett @ UCL we have an excellent workshop that has a wealth of tools including laser cutters, a full wood and metal workshop, 3D prototyping and a CNC milling machine.  This would cut the outline from our sheet.  Armed with an Adobe Illustrator file of Greater London, I exported to DXF (a CAD standard file format) and loaded it into the milling machine.  Watch the video to see what happened next.</p>
<p><iframe src="http://player.vimeo.com/video/41109644?title=0&amp;byline=0&amp;portrait=0" width="550" height="309" frameborder="0" webkitAllowFullScreen mozallowfullscreen allowFullScreen></iframe></p>
<p>Within 10 minutes we had the outline of London cut out.  A quick chisel to remove the frame and we were ready to go into the preparation stage.</p>
<p><img src="http://bigdatatoolkit.org/files/2012/04/cutout-1024x768.jpg" alt="London Cut out" /></p>
<p><strong>Step 3: Preparation </strong><br />
Out of the whole project this was the most boring part.  Sanding the table down for preparation then painting a layer, then sanding again.  Repeat this 4 times until I got an even finish over the whole surface and the table top was complete.  I used a standard white emulation, the same paint you would use for walls with 10% water added to ease application.  After a whole day of painting and drying and this was what the table looked like from above.</p>
<p><img src="http://bigdatatoolkit.org/files/2012/04/tablefromabove-1024x768.jpg" alt="" title="tablefromabove" class="aligncenter" /></p>
<p><strong>Step 4: Table Legs and Projector </strong><br />
I had the idea straight from the start that I was going to use a set of legs from Ikea to quickly make a table.  A trip to Ikea later we had a set of T-Legs from a <a href="http://www.ikea.com/gb/en/catalog/products/S19852113/" target="_blank">Galant</a> desk.  Screwed them on and we have an instant table. </p>
<p><strong>Step 5: Align the projector </strong><br />
This was the most time consuming part of the project, aligning the projected image to the table.  We mounted the projector on a basketball stand, which was a great idea from <a href="http://mackerron.com/home/">George MacKerron</a>, which was custom mount screwed into the projector mounts made of the same birch as the table top and filled the base with water for stability.  We created another vector to align the river Thames to the table and adjusted the height of the stand to fit.  The results were fantastic!</p>
<p><img src="http://bigdatatoolkit.org/files/2012/04/londonprojected-1024x768.jpg" alt="London Projected onto the Table" /></p>
<p>So there you go, that&#8217;s how you build a table the shape of London and create an exciting new exhibit to show off the vast number of open data visualisations we create at CASA. In the next few posts, I&#8217;ll go into the in&#8217;s and out of the software and how I created the base software and the Aircraft Visualisation that we displayed on the table. </p>
<p><img src="http://bigdatatoolkit.org/files/2012/04/final-1024x768.jpg" alt="" title="final" class="aligncenter" /></p>
<g:plusone href="http://bigdatatoolkit.org/2012/04/26/london-data-table/"></g:plusone><div id="tweetbutton333" class="tw_button" style="float:left;margin-right:10px;"><a href="http://twitter.com/share?url=http%3A%2F%2Fbigdatatoolkit.org%2F2012%2F04%2F26%2Flondon-data-table%2F&amp;text=How%20to%20build%20a%20London%20Data%20Table&amp;related=&amp;lang=en&amp;count=horizontal&amp;counturl=http%3A%2F%2Fbigdatatoolkit.org%2F2012%2F04%2F26%2Flondon-data-table%2F" class="twitter-share-button"  style="width:55px;height:22px;background:transparent url('http://bigdata.blogweb.casa.ucl.ac.uk/wp-content/plugins/wp-tweet-button/tweetn.png') no-repeat  0 0;text-align:left;text-indent:-9999px;display:block;">Tweet</a></div>]]></content:encoded>
			<wfw:commentRss>http://bigdatatoolkit.org/2012/04/26/london-data-table/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>
