<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet href="https://shkspr.mobi/blog/wp-content/themes/edent-wordpress-theme/rss-style.xsl" type="text/xsl"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	    xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	     xmlns:dc="http://purl.org/dc/elements/1.1/"
	   xmlns:atom="http://www.w3.org/2005/Atom"
	     xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	  xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>
<channel>
	<title>Why &quot;Raw Data Now&quot; Could Fail&#8230; &#8211; Terence Eden’s Blog</title>
	<atom:link href="https://shkspr.mobi/blog/2009/06/why-raw-data-now-could-fail/feed/" rel="self" type="application/rss+xml" />
	<link>https://shkspr.mobi/blog</link>
	<description>Regular nonsense about tech and its effects 🙃</description>
	<lastBuildDate>Wed, 04 Oct 2023 18:46:14 +0000</lastBuildDate>
	<language>en-GB</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=7.0</generator>

<image>
	<url>https://shkspr.mobi/blog/wp-content/uploads/2023/07/cropped-avatar-32x32.jpeg</url>
	<title>Why &quot;Raw Data Now&quot; Could Fail&#8230; &#8211; Terence Eden’s Blog</title>
	<link>https://shkspr.mobi/blog</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title><![CDATA[Why "Raw Data Now" Could Fail...]]></title>
		<link>https://shkspr.mobi/blog/2009/06/why-raw-data-now-could-fail/</link>
					<comments>https://shkspr.mobi/blog/2009/06/why-raw-data-now-could-fail/#comments</comments>
				<dc:creator><![CDATA[@edent]]></dc:creator>
		<pubDate>Sun, 14 Jun 2009 22:29:10 +0000</pubDate>
				<category><![CDATA[/etc/]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[democracy]]></category>
		<category><![CDATA[government]]></category>
		<guid isPermaLink="false">http://shkspr.mobi/blog/?p=163</guid>

					<description><![CDATA[Tim Berners-Lee has made a call for governments to open up their data. Indeed, Tim&#039;s been appointed by the UK government to do just that.  His central thesis is that we, the taxpayers, have paid for government research and data - we should be able to access it. Easy, free and unfettered access to raw, unadulterated data will allow us to do wonderful things.  Take a look at his recent TED Talk,…]]></description>
										<content:encoded><![CDATA[<p>Tim Berners-Lee has made a call for governments to open up their data. Indeed, <a href="http://news.bbc.co.uk/1/hi/technology/8096273.stm">Tim's been appointed by the UK government to do just that</a>.</p>

<p>His central thesis is that we, the taxpayers, have paid for government research and data - we should be able to access it. <a href="http://www.w3.org/2009/Talks/0204-ted-tbl/">Easy, free and unfettered access to raw, unadulterated data will allow us to do wonderful things</a>.</p>

<p>Take a look at his recent TED Talk, it's inspiring stuff.</p>

<div style="max-width:854px"><div style="position:relative;height:0;padding-bottom:56.25%"><iframe src="https://embed.ted.com/talks/tim_berners_lee_the_next_web" width="854" height="480" style="position:absolute;left:0;top:0;width:100%;height:100%" frameborder="0" scrolling="no" allowfullscreen=""></iframe></div></div>

<p>I think there's a fatal flaw in his plan.</p>

<p>Data, in its raw form is hard to come by. Data in databases is, in my experience, a rarity.</p>

<p>Data is usually held in Excel workbooks or Word documents or, more likely, random emails.</p>

<p>Let's take, as a perfect example, the Post Office.</p>

<p>Tom Taylor wanted to know <a href="http://www.whatdotheyknow.com/request/location_of_every_post_box_that">the location of every postbox in the UK</a>.</p>

<p>This is the sort of information which could be very useful to all sorts of projects. A widget to tell you where the nearest postbox was which hadn't missed the last pickup. If you were looking for a new place to live, knowing where the postboxes were would be helpful. Perhaps there is a public health implication that none of us are aware of yet.</p>

<p>It's the sort of small, bespoke manipulation of information which having free access of data makes possible.</p>

<p>So Tom makes a Freedom of Information request to the Post Office.</p>

<p>You can read the whole story yourself, but in synopsis, <a href="http://www.whatdotheyknow.com/request/errors_in_list_of_postbox_locati">the Post office doesn't hold these data</a>!</p>

<p>Local Post offices may do - but it's probably on scraps of paper, old print outs, a list in an obscure data format on an old PC that's never backed up.</p>

<p>So, some bright spark creates a database in Microsoft Access. Not only is Access a barely credible alternative to a database but (and here's the punchline) the database they've created doesn't record post codes properly!</p>

<p>Now, this isn't a crappily designed products built by EDS or Captia to an ever changing specification - this is a in house design. Probably specced and built by someone with a day's training in Access.  People who only think they know what they're doing are <em>dangerous</em>.</p>

<p>Because it's only designed for internal use - and light use at that - the data and its structure are of extremely low quality.</p>

<p>At best, huge tranches of data are held in barely functional, imperfect databases. The rest as flat files on individual computers in multiple inconsistent revisions.</p>

<p>Now, it's been several years since I've worked for a local government, but I can't believe too much has changed since then. Especially given what I see in day-to-day business. For some of the companies that I do business with, the very idea of having a database is akin to science fiction. <em>Everyone knows</em> that best practice is to keep data centralised location in a well maintained database. But everyone knows that's <em>it's easier</em>, in the short term, to keep the data in a spreadsheet on your desktop.</p>

<p>So, the challenge is 3-fold.
1. Convince people that placing information is a good thing to do.
2. Designing databases which are both correct &amp; useful.
3. Freeing the data from their hellish-Excel bondage.</p>

<p>It won't be easy. However, the end result will be worthwhile.</p>

<p>But it's up to all of us - whether we're in public or private service - to make sure the data we're creating is rational, well formed and accessible.</p>

<p><a href="http://www.ted.com/index.php/talks/tim_berners_lee_on_the_next_web">Raw Data Now</a>.</p>

<p><em>Edit: 29/06/2009</em>
Tim has posted about <a href="http://www.w3.org/DesignIssues/GovData.html">Putting Government Data Online</a>.</p>

<blockquote>The chances are quite high that the data your department/agency runs off will be largely in relational databases, often with a large amount in spreadsheets.</blockquote>
<img src="https://shkspr.mobi/blog/wp-content/themes/edent-wordpress-theme/info/okgo.php?ID=163&HTTP_REFERER=RSS" alt="" width="1" height="1" loading="eager">]]></content:encoded>
					
					<wfw:commentRss>https://shkspr.mobi/blog/2009/06/why-raw-data-now-could-fail/feed/</wfw:commentRss>
			<slash:comments>2</slash:comments>
		
		
			</item>
	</channel>
</rss>
