<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet href="https://shkspr.mobi/blog/wp-content/themes/edent-wordpress-theme/rss-style.xsl" type="text/xsl"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	    xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	     xmlns:dc="http://purl.org/dc/elements/1.1/"
	   xmlns:atom="http://www.w3.org/2005/Atom"
	     xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	  xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>
<channel>
	<title>murdoch &#8211; Terence Eden’s Blog</title>
	<atom:link href="https://shkspr.mobi/blog/tag/murdoch/feed/" rel="self" type="application/rss+xml" />
	<link>https://shkspr.mobi/blog</link>
	<description>Regular nonsense about tech and its effects 🙃</description>
	<lastBuildDate>Fri, 25 Apr 2025 07:58:43 +0000</lastBuildDate>
	<language>en-GB</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.9.4</generator>

<image>
	<url>https://shkspr.mobi/blog/wp-content/uploads/2023/07/cropped-avatar-32x32.jpeg</url>
	<title>murdoch &#8211; Terence Eden’s Blog</title>
	<link>https://shkspr.mobi/blog</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title><![CDATA[Leveson - Death By A Thousand (Paper) Cuts]]></title>
		<link>https://shkspr.mobi/blog/2012/04/leveson-death-by-a-thousand-paper-cuts/</link>
					<comments>https://shkspr.mobi/blog/2012/04/leveson-death-by-a-thousand-paper-cuts/#respond</comments>
				<dc:creator><![CDATA[@edent]]></dc:creator>
		<pubDate>Wed, 25 Apr 2012 11:06:38 +0000</pubDate>
				<category><![CDATA[politics]]></category>
		<category><![CDATA[usability]]></category>
		<category><![CDATA[leveson]]></category>
		<category><![CDATA[murdoch]]></category>
		<category><![CDATA[ocr]]></category>
		<category><![CDATA[paper]]></category>
		<guid isPermaLink="false">http://shkspr.mobi/blog/?p=5619</guid>

					<description><![CDATA[I&#039;ve been listening to the Leveson inquiry. A large part of the exchanges seem to go like this:  Jay: Turning to page 51. Witness: Which bundle? Jay: 1606. Witness: 1660? Leveson: No, the page after. Jay: Paragraph 7. Witness: I don&#039;t have a paragraph 7. Jay: Ah, I have an earlier print out. Leveson: You&#039;ll find it in tab 15. Witness: Is this Volume 2?   And so on, ad nauseum.  Surely there&#039;s no…]]></description>
										<content:encoded><![CDATA[<p>I've been listening to the Leveson inquiry. A large part of the exchanges seem to go like this:</p>

<blockquote><p>Jay: Turning to page 51.
</p><p>Witness: Which bundle?
</p><p>Jay: 1606.
</p><p>Witness: 1660?
</p><p>Leveson: No, the page after.
</p><p>Jay: Paragraph 7.
</p><p>Witness: I don't have a paragraph 7.
</p><p>Jay: Ah, I have an earlier print out.
</p><p>Leveson: You'll find it in tab 15.
</p><p>Witness: Is this Volume 2?
</p></blockquote>

<p>And so on, <i lang="la">ad nauseum</i>.</p>

<p>Surely there's no reason to have so much paper wastefully printed and then discarded?  Why not a single reference electronic document which can be supplied to each participant? Allowing them to increase the font size, annotate, cross reference, and search?</p>

<h2 id="search"><a href="https://shkspr.mobi/blog/2012/04/leveson-death-by-a-thousand-paper-cuts/#search">Search</a></h2>

<p>Ah, search.  Searching text is something computers are really good at.  Within a fraction of a second, even a modest computer can extract every sentence which contains the word "Clegg" from hundreds of thousands of pages.  Brilliant! Makes life really easy. Until humans come along and bugger about with it.</p>

<p>Let's take a look at the "smoking gun" <a href="https://web.archive.org/web/20120428223514/http://www.levesoninquiry.org.uk/evidence/?day=2012-04-24">emails which have been submitted from News International to Leveson</a>. Specifically <a href="https://web.archive.org/web/20120428084720/http://www.levesoninquiry.org.uk/wp-content/uploads/2012/04/Exhibit-KRM-18.pdf">KRM18</a>.</p>

<p>I have no idea how these emails were supplied to Leveson. I <strong>hope</strong> that they were submitted electronically - with all headers intact. What's supplied to the pubic, however, is this:</p>

<p><img src="https://shkspr.mobi/blog/wp-content/uploads/2012/04/Leveson-Email-Printed.jpg" alt="Leveson Email Printed" title="Leveson Email Printed" width="623" height="378" class="aligncenter size-full wp-image-5621">
The emails have been...</p>

<ul>
    <li>Printed out.</li>
    <li>Redacted with marker pen.</li>
    <li>Scanned in as a PDF.</li>
    <li>Then subject to an uncorrected OCR process.</li>
</ul>

<p>Computers are <em>really</em> bad at recognising text. OCR (Optical Character Recognition) is a very error-prone process.  Take a look at how the computer has translated the above document.</p>

<img src="https://shkspr.mobi/blog/wp-content/uploads/2012/04/Leveson-Email-Printed-OCR.jpg" alt="Leveson Email Printed OCR" title="Leveson Email Printed OCR" width="626" height="382" class="aligncenter size-full wp-image-5620">

<p>It's <em>partly</em> there. But enough of the characters are mangled, and words distorted that searching through the text is near impossible.</p>

<p>I get that PDF is a reasonably popular file format for sharing documents. It preserves the document structure faithfully - but at the expense of readability, fluidity, and usefulness.  But distributing <em>images</em> is the least useful way of distributing information to people who want to use it.</p>

<p>It's simply bad civic responsibility to do this.  These emails, if they are important enough to be made public, should be made public in their original form. I understand that some redactions should be made - but that's about the limit.</p>

<p>How on Earth is anyone supposed to make sense of this extract?
<img src="https://shkspr.mobi/blog/wp-content/uploads/2012/04/OCR.jpg" alt="OCR" title="OCR" width="603" height="229" class="aligncenter size-full wp-image-5625"></p>

<p>We need to shake off the tyranny of printed paper. It is wasteful, non-useful, and - in this context - damaging to justice.</p>

<p>I leave you with an entirely random extract from the emails...
<img src="https://shkspr.mobi/blog/wp-content/uploads/2012/04/Please-Consider-The-Environment-Before-Printing-This-Email.jpg" alt="Please Consider The Environment Before Printing This Email" title="Please Consider The Environment Before Printing This Email" width="602" height="574" class="aligncenter size-full wp-image-5623"></p>
<img src="https://shkspr.mobi/blog/wp-content/themes/edent-wordpress-theme/info/okgo.php?ID=5619&HTTP_REFERER=RSS" alt="" width="1" height="1" loading="eager">]]></content:encoded>
					
					<wfw:commentRss>https://shkspr.mobi/blog/2012/04/leveson-death-by-a-thousand-paper-cuts/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
	</channel>
</rss>
