<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet href="https://shkspr.mobi/blog/wp-content/themes/edent-wordpress-theme/rss-style.xsl" type="text/xsl"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	    xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	     xmlns:dc="http://purl.org/dc/elements/1.1/"
	   xmlns:atom="http://www.w3.org/2005/Atom"
	     xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	  xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>
<channel>
	<title>text &#8211; Terence Eden’s Blog</title>
	<atom:link href="https://shkspr.mobi/blog/tag/text/feed/" rel="self" type="application/rss+xml" />
	<link>https://shkspr.mobi/blog</link>
	<description>Regular nonsense about tech and its effects 🙃</description>
	<lastBuildDate>Thu, 29 Jan 2026 09:19:14 +0000</lastBuildDate>
	<language>en-GB</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.9.4</generator>

<image>
	<url>https://shkspr.mobi/blog/wp-content/uploads/2023/07/cropped-avatar-32x32.jpeg</url>
	<title>text &#8211; Terence Eden’s Blog</title>
	<link>https://shkspr.mobi/blog</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title><![CDATA[A small collection of text-only websites]]></title>
		<link>https://shkspr.mobi/blog/2025/12/a-small-collection-of-text-only-websites/</link>
					<comments>https://shkspr.mobi/blog/2025/12/a-small-collection-of-text-only-websites/#comments</comments>
				<dc:creator><![CDATA[@edent]]></dc:creator>
		<pubDate>Tue, 30 Dec 2025 12:34:20 +0000</pubDate>
				<category><![CDATA[/etc/]]></category>
		<category><![CDATA[blogging]]></category>
		<category><![CDATA[blogs]]></category>
		<category><![CDATA[text]]></category>
		<category><![CDATA[unicode]]></category>
		<category><![CDATA[utf-8]]></category>
		<guid isPermaLink="false">https://shkspr.mobi/blog/?p=64224</guid>

					<description><![CDATA[A couple of years ago, I started serving my blog posts as plain text.  Add .txt to the end of any URl and get a deliciously lo-fi, UTF-8, mono[chrome&#124;space] alternative.  Here&#039;s this post in plain text - https://shkspr.mobi/blog/2025/12/a-small-collection-of-text-only-websites.txt  Obviously a webpage without links is like a fish without a bicycle, but the joy of the web is that there are no…]]></description>
										<content:encoded><![CDATA[<p>A couple of years ago, I started <a href="https://shkspr.mobi/blog/2024/05/link-relalternate-typetext-plain/">serving my blog posts as plain text</a>.  Add <code>.txt</code> to the end of any URl and get a deliciously lo-fi, UTF-8, mono[chrome|space] alternative.</p>

<p>Here's this post in plain text - <a href="https://shkspr.mobi/blog/2025/12/a-small-collection-of-text-only-websites.txt">https://shkspr.mobi/blog/2025/12/a-small-collection-of-text-only-websites.txt</a></p>

<p>Obviously a webpage without links is like a fish without a bicycle, but the joy of the web is that there are no gatekeepers. People can try new concepts and, if enough people join in, it becomes normal.  I'm not saying the plain-text is the <em>best</em> web experience. But it is <em>an</em> experience. Perfect if you like your browsing fast, simple, and readable. There are no cookie banners, pop-ups, permission prompts, autoplaying videos, or garish colour schemes.</p>

<p>I'm certainly not the first person to do this, so I thought it might be fun to gather a list of websites which you browse in text-only mode.  If you know of any more - including your own site - please drop a comment in the box!</p>

<ul>
<li><a href="https://shkspr.mobi/blog/2024/05/link-relalternate-typetext-plain/">Terence Eden's blog</a> - add <code>.txt</code> to any URl.</li>
<li><a href="https://daringfireball.net/2025/10/apple_uk_lawsuit_app_store_commissions.text">Daring Fireball</a> - add <code>.text</code> to any URl.</li>
<li><a href="https://flower.codes/2025/10/23/onion-mirror.txt">Zach Flowers</a> - replace <code>.html</code> with <code>.txt</code>.</li>
<li><a href="https://fabien.benetou.fr/Content/SwappingPartsOfTheRestrictionStack?action=source">Fabien Benetou's PIM</a> - add <code>?action=source</code> to any URl.</li>
<li><a href="https://m0yng.uk/2025/03/Tracking-the-benefits-of-Solar-and-Battery.txt">M0YNG</a> - add <code>.txt</code> to any URl.</li>
<li><a href="https://gwern.net/speedrunning.md">Gwern</a> - add <code>.md</code> to any URl or send an HTTP Accept for Markdown.</li>
<li><a href="https://textplain.blog/">Dan Q's textplain.blog</a> - the <em>entire</em> blog is plain text!</li>
<li><a href="https://nooshu.com/feed/feed.txt">Matt Hobbs</a> - there is a <em>feed</em> of plaintext which allows you to read recent posts.</li>
<li><a href="https://www.bananas-playground.net/projekt/portagefilelist/index.txt">Bananas Playground</a> - add <code>index.txt</code> to any post. Also works with <code>index.md</code>.</li>
<li><a href="https://www.jorsys.org/index.md">Jorvik Systems</a> - change <code>.html</code> to <code>.md</code> for Markdown.</li>
<li><a href="https://blog.omgmog.net/post/moving-to-github-actions-and-adding-txt-posts.txt">Max Glenister's blog</a> - add <code>.txt</code> to any post's URl.</li>
<li><a href="https://notes.philippdubach.com/0003.txt">Philipp Dubach's notes</a> - add <code>.txt</code> to any post's URl.</li>
<li><a href="https://derickrethans.nl/php-500.txt">Derick Rethans' blog</a> - add <code>.txt</code> to any post's URl.</li>
<li><a href="https://4c6e.xyz/TEXT-MANIFEST.txt">Ricardson's blog</a> - add <code>.txt</code> to any post's URl.</li>
<li><a href="https://elle.sh/blog">elle's blog</a> - text mode only available via <code>curl elle.sh/blog</code>.</li>
<li><a href="https://www.benji.dog/notes/1761683274.txt">Benji.dog</a> - add <code>.txt</code> to any note's URl.</li>
</ul>

<p>If you'd like to add a site, please get in touch. The rules are simple - content which has the MIME type of <code>text/plain</code>. No HTML, no multimedia, no RTF, no XML, no ANSI colour escape sequences.</p>

<p>Emoji are fine though; emoji are cool.</p>
<img src="https://shkspr.mobi/blog/wp-content/themes/edent-wordpress-theme/info/okgo.php?ID=64224&HTTP_REFERER=RSS" alt="" width="1" height="1" loading="eager">]]></content:encoded>
					
					<wfw:commentRss>https://shkspr.mobi/blog/2025/12/a-small-collection-of-text-only-websites/feed/</wfw:commentRss>
			<slash:comments>22</slash:comments>
		
		
			</item>
		<item>
		<title><![CDATA[Is it faster to read or to listen?]]></title>
		<link>https://shkspr.mobi/blog/2022/01/is-it-faster-to-read-or-to-listen/</link>
					<comments>https://shkspr.mobi/blog/2022/01/is-it-faster-to-read-or-to-listen/#comments</comments>
				<dc:creator><![CDATA[@edent]]></dc:creator>
		<pubDate>Sat, 29 Jan 2022 12:34:39 +0000</pubDate>
				<category><![CDATA[/etc/]]></category>
		<category><![CDATA[education]]></category>
		<category><![CDATA[MSc]]></category>
		<category><![CDATA[reading]]></category>
		<category><![CDATA[text]]></category>
		<category><![CDATA[tts]]></category>
		<category><![CDATA[voice]]></category>
		<guid isPermaLink="false">https://shkspr.mobi/blog/?p=41715</guid>

					<description><![CDATA[Fourteen years ago, I blogged about the future of voice. In the post, I asked these two questions - which I&#039;d nicked from someone else:   Are you faster at speaking or typing? Are you faster at reading or listening?   Lots of us now use Siri, Alexa, Bixby, and the like because it is quicker to speak than type. For long-form wordsmithing - it&#039;s still probably easier to type-and-edit than it is to…]]></description>
										<content:encoded><![CDATA[<p>Fourteen years ago, I blogged about <a href="https://shkspr.mobi/blog/2008/05/the-future-of-voice/">the future of voice</a>. In the post, I asked these two questions - which I'd nicked from someone else:</p>

<ol>
<li>Are you faster at speaking or typing?</li>
<li>Are you faster at reading or listening?</li>
</ol>

<p>Lots of us now use Siri, Alexa, Bixby, and the like because it is quicker to speak than type. For long-form wordsmithing - it's still probably easier to type-and-edit than it is to speak-then-edit. And the way humans speak is markedly different from how they write.</p>

<p>But the bottleneck has always been that <em>listening</em> to speech is slower than <em>reading</em> text.</p>

<p>The <a href="https://www.bps.org.uk/research-digest/most-comprehensive-review-date-finds-average-persons-reading-speed-slower">average reading speed is around 238 words per minute</a>. Obviously there are a lot of caveats around the age of the reader, the difficulty of the material, whether one is reading for leisure or work. But it will do as a comparator.</p>

<p>The <a href="https://www.sciencedirect.com/science/article/pii/S0885230819300518">average speaking speed is around 150 words per minute</a>. Again, that depends on the <a href="https://www.sciencedirect.com/science/article/pii/S0892199706000889">age of the speaker</a>, urgency of their talk, <a href="https://www.sciencedirect.com/science/article/pii/S0095447019300543">familiarity with the language</a>, and so on.</p>

<p>Therefore it is faster to read academic papers rather than to listen to academic lectures. Case closed!</p>

<p>Except…</p>

<p>There's a fascinating new paper out - <q><cite itemprop="headline"><a href="https://onlinelibrary.wiley.com/doi/abs/10.1002/acp.3899">Learning in double time: The effect of lecture video speed on immediate and delayed comprehension</a></cite></q>.</p>

<p>Here's the quote I found most interesting - with emphasis added:</p>

<blockquote><p>Collectively, the present experiments indicate that increased video speed (up to 2x) does not negatively impact learning outcomes and watching at faster speeds can be a more efficient use of study time. 

</p><p>Thus, as long as to-be-remembered information can be effectively perceived and encoded, <strong>learning outcomes may not be affected by playback speed</strong>. 

</p><p>However, previous work has indicated that speech comprehension begins to decline at around 275 words per minute (Foulke &amp; Sticht,&nbsp;<span><a href="https://onlinelibrary.wiley.com/doi/full/10.1002/acp.3899#acp3899-bib-0019" id="#acp3899-bib-0019R" class="bibLink tab-link" data-tab="pane-pcw-references">1969</a></span>; see also Goldhaber,&nbsp;<span><a href="https://onlinelibrary.wiley.com/doi/full/10.1002/acp.3899#acp3899-bib-0021" id="#acp3899-bib-0021R" class="bibLink tab-link" data-tab="pane-pcw-references">1970</a></span>; Pastore &amp; Ritzhaupt,&nbsp;<span><a href="https://onlinelibrary.wiley.com/doi/full/10.1002/acp.3899#acp3899-bib-0042" id="#acp3899-bib-0042R" class="bibLink tab-link" data-tab="pane-pcw-references">2015</a></span>; Vemuri et al.,&nbsp;<span><a href="https://onlinelibrary.wiley.com/doi/full/10.1002/acp.3899#acp3899-bib-0055" id="#acp3899-bib-0055R" class="bibLink tab-link" data-tab="pane-pcw-references">2004</a></span>) and the videos in the current study exceeded this threshold when played at 2x speed. 

</p><p>Although the elevated speech rates at 2x speed may initially be less comprehensible to students, researchers have been able to train participants to <strong>understand speech at rates up to 475 WPM</strong> (Orr et al.,&nbsp;<span><a href="https://onlinelibrary.wiley.com/doi/full/10.1002/acp.3899#acp3899-bib-0038" id="#acp3899-bib-0038R" class="bibLink tab-link" data-tab="pane-pcw-references">1965</a></span>). 

</p><p>Therefore, with practice, higher rates of speech may not be completely incomprehensible and since <strong>85% of students reported watching lecture videos at quicker than normal speeds</strong> (see Figure&nbsp;<a href="https://onlinelibrary.wiley.com/doi/full/10.1002/acp.3899#acp3899-fig-0003">3a</a>), they may be better able to process the material as a result of experience.</p></blockquote>

<p>I guess this shouldn't come as a surprise to me. I tend to watch my MSc lectures at 1.75x with subtitles - and have been doing the same with podcasts and tutorial videos for years. Looks like I am in the majority.</p>

<p>If the average person speaks at ~150 Words Per Minute, increasing playback speed to 1.5x gives a listening rate of ~225 WPM. That's about the same as reading speed.</p>

<p>Going to 475 WPM means listening at 3x normal speed.</p>

<p>My mate Léonie Watson is blind and has written extensively about <a href="https://tink.uk/notes-on-synthetic-speech/">the use of text-to-speech technology</a>.  Because she listens to a synthetic voice, with predictable and consistent pronunciation, she's able to listen at about <strong>520 WPM</strong>! That's 3.5x faster than the speech of a  biological human.</p>

<p>I'm not suggesting that you can speed-listen your way through any complicated topic and retain perfect understanding of subject and nuance. But it is becoming clear that <em>synchronous</em> teaching has limitations when it comes to efficiently teaching people. There's no substitute for being able to stop an expert mid-lecture and saying "sorry Prof, I don't get that - could you please help me understand?"  But the reality is, most people never stick their hand up in class. So listening to lectures on playback - at double speed - is simply a better "user experience" for the student.</p>

<p>Learning, of course, isn't just listening to people drone on in front of a blackboard. The student still needs to do the exercises, <a href="https://shkspr.mobi/blog/2022/01/an-algorithm-to-write-an-assignment/">write their essays</a>, consolidate their knowledge, reflect on what they've learned, and so on.</p>

<p>But the ability to "speed" your way through a (well edited and professionally recorded) lecture is something to be welcomed. It gives students more time to spend on their studies with, apparently, no ill effects.</p>
<img src="https://shkspr.mobi/blog/wp-content/themes/edent-wordpress-theme/info/okgo.php?ID=41715&HTTP_REFERER=RSS" alt="" width="1" height="1" loading="eager">]]></content:encoded>
					
					<wfw:commentRss>https://shkspr.mobi/blog/2022/01/is-it-faster-to-read-or-to-listen/feed/</wfw:commentRss>
			<slash:comments>4</slash:comments>
		
		
			</item>
		<item>
		<title><![CDATA[Text Adventures Via Text Message]]></title>
		<link>https://shkspr.mobi/blog/2013/06/text-adventures-via-text-message/</link>
					<comments>https://shkspr.mobi/blog/2013/06/text-adventures-via-text-message/#comments</comments>
				<dc:creator><![CDATA[@edent]]></dc:creator>
		<pubDate>Fri, 14 Jun 2013 10:55:20 +0000</pubDate>
				<category><![CDATA[mobile]]></category>
		<category><![CDATA[games]]></category>
		<category><![CDATA[hack]]></category>
		<category><![CDATA[hacks]]></category>
		<category><![CDATA[interactive fiction]]></category>
		<category><![CDATA[sms]]></category>
		<category><![CDATA[text]]></category>
		<category><![CDATA[twilio]]></category>
		<category><![CDATA[zork]]></category>
		<guid isPermaLink="false">http://shkspr.mobi/blog/?p=8386</guid>

					<description><![CDATA[Remember text adventures, eh?  They were pretty nifty!  &#34;You are in a maze of twisty little passages, all alike&#34; &#62;Go East &#34;You have been eaten by a Grue. A dwarf starts singing about gold&#34;  Smashing! Just like the pictures are better on the radio, so the graphics are immeasurably superior when they&#039;re in your head. Don&#039;t get me wrong, I love the 5.1 surround sound snarl of a rabid beast rendered…]]></description>
										<content:encoded><![CDATA[<p>Remember text adventures, eh?  They were pretty nifty!</p>

<pre>"You are in a maze of twisty little passages, all alike"
&gt;Go East
"You have been eaten by a Grue. A dwarf starts singing about gold"</pre>

<p>Smashing! Just like the pictures are better on the radio, so the graphics are immeasurably superior when they're in your head. Don't get me wrong, I love the 5.1 surround sound snarl of a rabid beast rendered in 1080p - but nothing is <em>quite</em> as good as using your imagination.</p>

<p>Text Adventures - or, more properly, <em>Interactive Fiction</em> - is enjoying a mini-renaissance at the moment.  There are emulators for smart phones, computers, and consoles.</p>

<p>But for SMS? None that I could find.  Think about it; SMS is close to the perfect medium for Interactive Fiction.  Your commands are concise enough to fit into a single message, you don't have to worry about the speed of your response, everything is text based.</p>

<p>So, I present to you, Zork via SMS (with a little help from <a href="http://twilio.com/">Twilio</a>).</p>

<img src="https://shkspr.mobi/blog/wp-content/uploads/2013/06/Twilio-Zork-SMS-fs8.png" alt="Twilio Zork SMS" width="600" height="1033" class="aligncenter size-full wp-image-8391">

<p>Brilliant!</p>

<h2 id="howto"><a href="https://shkspr.mobi/blog/2013/06/text-adventures-via-text-message/#howto">HOWTO</a></h2>

<p>Although superficially a simple project, there are a few gotchas along the way.  This documentation is mostly for me - but may be of use to you :-)</p>

<p>Let's start at the end.... Using <a href="https://www.twilio.com/">Twilio</a> you can send an SMS to a phone number, that is then POSTed to a webserver of your choice.</p>

<p>Twilio will send an SMS back if it receives a response such as</p>

<pre><code class="language-xml">&lt;?xml version="1.0" encoding="UTF-8"?&gt;
&lt;response&gt;
  &lt;sms&gt;You are in an open field west of a big white house with a boarded
front door.
There is a small mailbox here.
&lt;/sms&gt;
&lt;/response&gt;
</code></pre>

<p>Easy enough.  (NB the free account prepends your messages with a trial message. Once you pay, that goes away.)</p>

<p>So, how do we get a text adventure like Zork running server side?  There are plenty of client side emulators out there - but I only found one which was purely server side.</p>

<p><a href="https://web.archive.org/web/20141212015043/http://devel.thcnet.net/thcnetdev/index.php?section=article,29588">THCnet</a> have a PHP interpreter for Zork.  It's slightly cludgy - but it works.  I'm checking to see if I can distribute it on GitHub.</p>

<p>The game library "dtextc.dat" must be installed in "/usr/games/lib/".  The source code is available if you want to recompile it and place it somewhere more accessible.</p>

<p>A MySQL database needs to be created to keep track of users and sessions.</p>

<p>A temporary directory (which must be readable and writeable) will also be needed to store the state of play for each session.  I've used "tmp/404/" but you can alter config.php to point wherever is convenient.  You will also need to adjust index.php and functions.php with the new location.</p>

<p>Finally, the executable "zork" must be runable.</p>

<p>That's pretty much it.</p>

<p>When a user (or, in this case, Twilio) first makes a request to the URL, we check to see if they already have a session cookie.
If they don't, create one, and start the game by returning the opening text.</p>

<p>For every subsequent connection, we check the state of the game based on their cookie, then apply the body of their SMS to the game engine.  Whatever Zork spits out, we return as an SMS.</p>

<p>By default, Twilio kills unused sessions after 4 hours, and all cookies will be lost.</p>

<h2 id="where-next"><a href="https://shkspr.mobi/blog/2013/06/text-adventures-via-text-message/#where-next">Where Next?</a></h2>

<p>It strikes me that the various forms of text adventures are excellent as a sort of mini-IVR.  A nice little finite-state-machine which can quite happily run for years.  Services like <a href="https://web.archive.org/web/20130603122624/https://www.frontlinesms.com/">Frontline SMS</a> are great for information sharing, but they're really not designed for this sort of work.</p>

<p>Could we use interactive fiction in places where SMS is cheap but other forms of entertainment are expensive?  Could IF be used to augment existing media events?  SMS is lowest common denominator - even iPhones can use it.</p>

<p>But, for now, all I have to worry about is someone texting me "Go west'); DROP TABLE ..." ;-)</p>
<img src="https://shkspr.mobi/blog/wp-content/themes/edent-wordpress-theme/info/okgo.php?ID=8386&HTTP_REFERER=RSS" alt="" width="1" height="1" loading="eager">]]></content:encoded>
					
					<wfw:commentRss>https://shkspr.mobi/blog/2013/06/text-adventures-via-text-message/feed/</wfw:commentRss>
			<slash:comments>3</slash:comments>
		
		
			</item>
		<item>
		<title><![CDATA[Usability of mixing LTR and RTL text?]]></title>
		<link>https://shkspr.mobi/blog/2013/01/usability-of-mixing-ltr-and-rtl-text/</link>
					<comments>https://shkspr.mobi/blog/2013/01/usability-of-mixing-ltr-and-rtl-text/#respond</comments>
				<dc:creator><![CDATA[@edent]]></dc:creator>
		<pubDate>Sat, 19 Jan 2013 10:18:44 +0000</pubDate>
				<category><![CDATA[usability]]></category>
		<category><![CDATA[bidirectional]]></category>
		<category><![CDATA[ltr]]></category>
		<category><![CDATA[rtl]]></category>
		<category><![CDATA[text]]></category>
		<category><![CDATA[unicode]]></category>
		<guid isPermaLink="false">http://shkspr.mobi/blog/?p=7417</guid>

					<description><![CDATA[Annoyingly, FourSquare has started be be a source of spam for me.  I get friend request from people who only like certain brands of stores, from recruitment consultants trying to work out who I&#039;m visiting, and from cultists who are desperate for me to visit Scientology centres.  I also get friend requests from people I&#039;ve never met, including from Ahmed al-Najjar (احمد النجار).  I&#039;ve never met Ahm…]]></description>
										<content:encoded><![CDATA[<p>Annoyingly, FourSquare has started be be a source of spam for me.  I get friend request from people who only like certain brands of stores, from recruitment consultants trying to work out who I'm visiting, and from cultists who are desperate for me to visit <a href="http://www.xenu.net/">Scientology centres</a>.</p>

<p>I also get friend requests from people I've never met, including from Ahmed al-Najjar (احمد النجار).  I've never met Ahmed and I've no wish to taint him as a spammer - I'm sure he's just misclicked in his friend request - but because Arabic is written right-to-left (RTL) it shows up some interesting design and usability imitations of FourSquare.</p>

<p>On the Android app, there's a curious problem.
<img src="https://shkspr.mobi/blog/wp-content/uploads/2013/01/LTR-RTL.png" alt="LTR RTL" width="600" height="565" class="aligncenter size-full wp-image-7419"></p>

<p>In layman's terms, I'm expecting to see "[name] [action]".  What the computer is seeing as its instruction is "Place [name] at the start - then place [action]" - in this case, the [name] is placed and, because it is RTL, the [action] is placed after it - which means on space to the left.</p>

<p>Incidentally, their website doesn't suffer from this problem:
<img src="https://shkspr.mobi/blog/wp-content/uploads/2013/01/FourSquare-RTL-Web.png" alt="FourSquare RTL Web" width="551" height="408" class="aligncenter size-full wp-image-7418"></p>

<p>In an average design environment, we may only encounter one text layout.  In the west, that's usually Left To Right (LTR).  If we're feeling exotic, we might internationalise all our assets and strings so that they support a fully RTL experience. (Of course, <a href="http://www.nngroup.com/articles/international-usability-details-differ/">internationalisation and localisation is slightly more complex than that</a>.)</p>

<p>So, the experience becomes:
"أحمد النجار يريد أن يكون صديقك."</p>

<p>That's fine, but what happens when we want to mix the scripts? Should we say
"أحمد النجار wants to be your friend."</p>

<p>Somehow that feels wrong.  It's the lexical equivalent of writing "nedE ecnereT wants to be your friend". Yuck!</p>

<p>Do we accept that - whether one can or cannot read the script - a block of foreign text is read as a separate unit?</p>

<p><a href="https://en.wikipedia.org/wiki/Bi-directional_text">Bidirectional text support</a> is present in most computers.  Even if it does lead to some complex problems.</p>

<p><a href="http://xkcd.com/1137/"><img src="https://imgs.xkcd.com/comics/rtl.png" width="238" height="363" class="aligncenter"></a> Combining the text into one layout is a <a href="http://www.unicode.org/reports/tr9/">solved problem</a> - but it doesn't address the usability of mixing and matching.  What is it that <strong>the user</strong> expects to happen?  I think we can agree that their must be consistency between all UIs - in this case Web and Mobile.</p>

<p>As I see it, there are five approaches</p>

<ol>
    <li>Keep original UI ordering - place the name where the user <em>expects</em> it to be.</li>
    <li>Take the ordering from the first element of the sentence.</li>
    <li>Whenever the text direction changes, start a new line.</li>
    <li>Translate the foreign text into the user's native language.</li>
    <li>Rotate or reverse the foreign text.</li>
</ol>

<p>The common consensus is the first option.  Take the overall direction of the text and place the reversed text in place.</p>

<p>However, of the above items, I think that starting a new line is the most satisfactory.</p>

<pre>أحمد
wants to be your friend.</pre>

<p>But, of course, that leads to</p>

<pre>أحمد
is now friends with
تيرينس
</pre>

<p>Both of which could lead to vertical layout problems, as well as introducing alignment confusion.</p>

<pre style="direction:rtl;">أحمد
<p style="direction:ltr;">is now friends with</p>
تيرينس</pre>

<p>Gah!</p>

<p>But then, adjacency also provides problems. The <a href="http:s//www.w3.org/TR/i18n-html-tech-bidi/">W3C maintain some excellent guidelines for mixing bidirectional text</a>.  In this adjacency example, can you quickly determine which item should be checked to select "English"?</p>

<div id="displaybox">
     <p style="direction:rtl;">لغتي: <input type="checkbox"> العربية <input type="checkbox">   English‏ <input type="checkbox"> Français <input type="checkbox"> فارسی <input type="checkbox"> اردو</p>
</div>

<p>( Incidentally, number 5 isn't as strange as it sounds. In languages which are written Top-To-Bottom sometimes foreign characters are inserted in place but rotated!)
<a title="By Article by Shizhao (《zh:河北文安今發生地震 北京有震感》, zh.wikinews.org) [CC-BY-2.5 (http://creativecommons.org/licenses/by/2.5)], via Wikimedia Commons" href="http://commons.wikimedia.org/wiki/File%3AHebei_Wen'an_jin_fasheng_dizhen_-_Beijing_you_zhengan.png"><img width="256" alt="Hebei Wen'an jin fasheng dizhen - Beijing you zhengan" src="//upload.wikimedia.org/wikipedia/commons/e/e4/Hebei_Wen%27an_jin_fasheng_dizhen_-_Beijing_you_zhengan.png"></a></p>

<h2 id="a-conclusion-of-sorts"><a href="https://shkspr.mobi/blog/2013/01/usability-of-mixing-ltr-and-rtl-text/#a-conclusion-of-sorts">A Conclusion - of sorts</a></h2>

<p>I think it's obvious that in this example, the FourSquare mobile app is wrong from my point of view.  As the user, I expect the starting element to be at the left.  From Ahmed's point-of-view, he as a user expects the starting element to be on the right.</p>

<p>Yet, for both of us, it's an unsatisfactory and inconsistent experience.</p>

<p><a href="http://www.iamcal.com/understanding-bidirectional-text/">Bidirectional support in Unicode</a> solves the layout problem.  Yet I'm not sure if it solves the <em>semantic</em> problem.  Indeed, I wonder if that can be solved at all.</p>
<img src="https://shkspr.mobi/blog/wp-content/themes/edent-wordpress-theme/info/okgo.php?ID=7417&HTTP_REFERER=RSS" alt="" width="1" height="1" loading="eager">]]></content:encoded>
					
					<wfw:commentRss>https://shkspr.mobi/blog/2013/01/usability-of-mixing-ltr-and-rtl-text/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title><![CDATA[Crowdsourcing Leveson]]></title>
		<link>https://shkspr.mobi/blog/2012/05/crowdsourcing-leveson/</link>
					<comments>https://shkspr.mobi/blog/2012/05/crowdsourcing-leveson/#comments</comments>
				<dc:creator><![CDATA[@edent]]></dc:creator>
		<pubDate>Fri, 11 May 2012 11:40:48 +0000</pubDate>
				<category><![CDATA[politics]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[justice]]></category>
		<category><![CDATA[leveson]]></category>
		<category><![CDATA[ocr]]></category>
		<category><![CDATA[text]]></category>
		<guid isPermaLink="false">http://shkspr.mobi/blog/?p=5702</guid>

					<description><![CDATA[I&#039;ve already blogged about the Leveson Inquiry&#039;s disturbing habit of releasing evidence as scanned in PDFs.  I had a suggestion from digital journalist Kevin Anderson  Terence Eden is on Mastodon@edentGah! The #leveson witness statements are photocopied &#38; scanned in levesoninquiry.org.uk/evidence/?witn…Disastrous for open justice - shkspr.mobi/blog/index.php…❤️ 0💬 0🔁 110:12 - Fri 11 May 2012Mr And…]]></description>
										<content:encoded><![CDATA[<p>I've already blogged about the <a href="https://shkspr.mobi/blog/2012/04/leveson-death-by-a-thousand-paper-cuts/">Leveson Inquiry's disturbing habit of releasing evidence as scanned in PDFs</a>.</p>

<p>I had a <a href="https://twitter.com/kevglobal/status/200898240965644289">suggestion from digital journalist Kevin Anderson</a></p>

<blockquote class="social-embed" id="social-embed-200898240965644289" lang="en" itemscope="" itemtype="https://schema.org/SocialMediaPosting"><blockquote class="social-embed" id="social-embed-200891119947620352" lang="en" itemscope="" itemtype="https://schema.org/SocialMediaPosting"><header class="social-embed-header" itemprop="author" itemscope="" itemtype="https://schema.org/Person"><a href="https://twitter.com/edent" class="social-embed-user" itemprop="url"><img class="social-embed-avatar social-embed-avatar-circle" src="data:image/webp;base64,UklGRkgBAABXRUJQVlA4IDwBAACQCACdASowADAAPrVQn0ynJCKiJyto4BaJaQAIIsx4Au9dhDqVA1i1RoRTO7nbdyy03nM5FhvV62goUj37tuxqpfpPeTBZvrJ78w0qAAD+/hVyFHvYXIrMCjny0z7wqsB9/QE08xls/AQdXJFX0adG9lISsm6kV96J5FINBFXzHwfzMCr4N6r3z5/Aa/wfEoVGX3H976she3jyS8RqJv7Jw7bOxoTSPlu4gNbfXYZ9TnbdQ0MNnMObyaRQLIu556jIj03zfJrVgqRM8GPwRoWb1M9AfzFe6Mtg13uEIqrTHmiuBpH+bTVB5EEQ3uby0C//XOAPJOFv4QV8RZDPQd517Khyba8Jlr97j2kIBJD9K3mbOHSHiQDasj6Y3forATbIg4QZHxWnCeqqMkVYfUAivuL0L/68mMnagAAA" alt="" itemprop="image"><div class="social-embed-user-names"><p class="social-embed-user-names-name" itemprop="name">Terence Eden is on Mastodon</p>@edent</div></a><img class="social-embed-logo" alt="Twitter" src="data:image/svg+xml,%3Csvg%20xmlns%3D%22http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%22%0Aaria-label%3D%22Twitter%22%20role%3D%22img%22%0AviewBox%3D%220%200%20512%20512%22%3E%3Cpath%0Ad%3D%22m0%200H512V512H0%22%0Afill%3D%22%23fff%22%2F%3E%3Cpath%20fill%3D%22%231d9bf0%22%20d%3D%22m458%20140q-23%2010-45%2012%2025-15%2034-43-24%2014-50%2019a79%2079%200%2000-135%2072q-101-7-163-83a80%2080%200%200024%20106q-17%200-36-10s-3%2062%2064%2079q-19%205-36%201s15%2053%2074%2055q-50%2040-117%2033a224%20224%200%2000346-200q23-16%2040-41%22%2F%3E%3C%2Fsvg%3E"></header><section class="social-embed-text" itemprop="articleBody">Gah! The <a href="https://twitter.com/hashtag/leveson">#leveson</a> witness statements are photocopied &amp; scanned in <a href="http://www.levesoninquiry.org.uk/evidence/?witness=rebekah-brooks">levesoninquiry.org.uk/evidence/?witn…</a><br>Disastrous for open justice - <a href="http://shkspr.mobi/blog/index.php/2012/04/leveson-death-by-a-thousand-paper-cuts/">shkspr.mobi/blog/index.php…</a></section><hr class="social-embed-hr"><footer class="social-embed-footer"><a href="https://twitter.com/edent/status/200891119947620352"><span aria-label="0 likes" class="social-embed-meta">❤️ 0</span><span aria-label="0 replies" class="social-embed-meta">💬 0</span><span aria-label="1 reposts" class="social-embed-meta">🔁 1</span><time datetime="2012-05-11T10:12:30.000Z" itemprop="datePublished">10:12 - Fri 11 May 2012</time></a></footer></blockquote><header class="social-embed-header" itemprop="author" itemscope="" itemtype="https://schema.org/Person"><a href="https://twitter.com/kevglobal" class="social-embed-user" itemprop="url"><img class="social-embed-avatar social-embed-avatar-circle" src="data:image/webp;base64,UklGRugBAABXRUJQVlA4INwBAACwCQCdASowADAAPrVMoUynI6MiKrVaqOAWiWMAxxVPBAOTvWLa7wnDEHBsXvYybYIxOWgYdbBhqKBiKpLTCAwFzyTRJbW6ZYON7SSj1kIg/TjfH8NUGAD++5bMCvq0eeJ+m+ScbtVm1B4ju65wKNSkkytpJ9C0bISjCCd5zkIU7eOQv9mh97+F91jXhvB49YWzWoqQ/33RY13WuC2r9jTEY+UgqJSIRCGqsJibjV8tpqXkPvFvW9NLWn/+ObXnaQQNpiHvlgwKEs6qx6fhZPWbx0b9/S4pRk9CY/z+qvB/mjy/BrHyfLzqhkj14YjX18vAfIkv4N96d03RKB0kIqSoJ+S2RMvgFJr5IxyBYZEX3OUvfkakll8bau190J8MkLug2HEmhHVnAFV68Kg4mH4s3Trq75z+r51bJ9HG0CWGre/kHnEyELddmZzFB8xtD4WCR3tx0F000a5497aymDaAXSGKSiBwvhl/8QEB+9IBUf3OC8+Umh6ptAsDQMek3WfXPfnhJ0RwaTbJEOeoK2X4br776WozZ/RfH4yJlpKs3dGKrrHR79H+9wenHD1+47btxPL9vvL6zJn26d8xsASbIlk7vpk7gPedt2/xHKP6NGKKRkoOqhVw0+wAAA==" alt="" itemprop="image"><div class="social-embed-user-names"><p class="social-embed-user-names-name" itemprop="name">Mr Anderson</p>@kevglobal</div></a><img class="social-embed-logo" alt="Twitter" src="data:image/svg+xml,%3Csvg%20xmlns%3D%22http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%22%0Aaria-label%3D%22Twitter%22%20role%3D%22img%22%0AviewBox%3D%220%200%20512%20512%22%3E%3Cpath%0Ad%3D%22m0%200H512V512H0%22%0Afill%3D%22%23fff%22%2F%3E%3Cpath%20fill%3D%22%231d9bf0%22%20d%3D%22m458%20140q-23%2010-45%2012%2025-15%2034-43-24%2014-50%2019a79%2079%200%2000-135%2072q-101-7-163-83a80%2080%200%200024%20106q-17%200-36-10s-3%2062%2064%2079q-19%205-36%201s15%2053%2074%2055q-50%2040-117%2033a224%20224%200%2000346-200q23-16%2040-41%22%2F%3E%3C%2Fsvg%3E"></header><section class="social-embed-text" itemprop="articleBody"><a href="https://twitter.com/edent">@edent</a> Put the Leveson docs up on Google Docs. I'd be curious how their OCR could handle them. Then click 'make public'</section><hr class="social-embed-hr"><footer class="social-embed-footer"><a href="https://twitter.com/kevglobal/status/200898240965644289"><span aria-label="0 likes" class="social-embed-meta">❤️ 0</span><span aria-label="1 replies" class="social-embed-meta">💬 1</span><span aria-label="0 reposts" class="social-embed-meta">🔁 0</span><time datetime="2012-05-11T10:40:47.000Z" itemprop="datePublished">10:40 - Fri 11 May 2012</time></a></footer></blockquote>

<p>Google Docs has an annoying 2MB limit for uploaded PDFs.  However, I've taken the first half of <a href="https://web.archive.org/web/20120511101152/http://www.levesoninquiry.org.uk/evidence/?witness=rebekah-brooks">Rebekah Brooks' witness statement</a> and run them through the OCR process.</p>

<p>This is how <a href="https://docs.google.com/document/d/1eTss2IfnCHAZQQVIfEGLpvHcriyzshpVodq4JrRopro/edit">Google recognises the text in the document</a></p>

<blockquote><p>Leveson Inquiry into the culture, practices and ethics of the press</p><br>

<p>1 I dlT| necessarily inhibited to some extent about what I can say in reiation to some of the issues that the Inquiry has raised with me.
My background</p><br>

<p>3. ijoined News International in 1989. I began my career on the News of the Worlcfs coiour supplement, Sunday magazine, whiie simultaneousiy attending ajournalism course at the London College of Printing.</p><br>
<p>4. Since then i have been either a joumeiist or an executive on both The News of the World and The Sun. For afrnc-st a decade Iwas a nationai newspaper editor. In May 2000 I became the editor of The News of the Worid and in January 2003 I became the editor of The Sun.</p><br>

<p>5. In September 2009, I was appointed Chief Executive of News lnternationaf. My responsibilities embraced ail the newspapers and digital products of the 1.... -. -</p></blockquote>

<p>That's based on this text:</p>

<p><a href="https://docs.google.com/document/d/1eTss2IfnCHAZQQVIfEGLpvHcriyzshpVodq4JrRopro/edit"><img src="https://shkspr.mobi/blog/wp-content/uploads/2012/05/Brooks-Witness-Statement.jpg" alt="Brooks Witness Statement" title="Brooks Witness Statement" width="422" height="384" class="aligncenter size-full wp-image-5703"></a></p>

<h2 id="why-is-this-important"><a href="https://shkspr.mobi/blog/2012/05/crowdsourcing-leveson/#why-is-this-important">Why Is This Important</a></h2>

<p>The journalist <a href="https://twitter.com/newsbrooke">Heather Brooke</a> has been ranting for some time about <a href="https://www.heatherbrooke.org/the-silent-state">the closed nature of the British Courts</a>. It's close to impossible to get verbatim or accurate information about course cases.  This means as citizens, journalists, or archivists, we can't accurately search documents.  We need access to the original digital documents.</p>

<p>Poor OCR is also a huge problem.  As above, OCR gives us a misleading impression that documents are searchable.</p>

<p>Should we wish to search, say <a href="https://web.archive.org/web/20120521174431/http://www.levesoninquiry.org.uk/evidence/?day=2012-04-24">KRM-18</a>, to see whether the MP Tom Watson is mentioned; a search for "Watson" turns up zero results.  Yet he is mentioned.</p>

<p>The page shows:
<img src="https://shkspr.mobi/blog/wp-content/uploads/2012/05/Evidence-mentioning-Watson.jpg" alt="Evidence mentioning Watson" title="Evidence mentioning Watson" width="599" height="58" class="aligncenter size-full wp-image-5705">
But the scanned text reads:</p>

<blockquote><p>Had ~ debrief with 5f[ ~nd his team tm~.igl~t ttt 77~ betbre he [o~ t.o his constituency:
</p><p>l-]~e is veo’ h.,qlopY~ith d~ ~va~, today" wellt mid ~s~cci~iiJ~’ ~,it[i tae ~bsoiutely’idiotie. del)&amp;t~s led by Wtttson.urtd
</p><p>Prescott.</p></blockquote>

<p>So, it's totally impossible to rapidly search through these documents. It would be necessary to laboriously read each document manually.</p>

<h2 id="how-to-accomplish-this"><a href="https://shkspr.mobi/blog/2012/05/crowdsourcing-leveson/#how-to-accomplish-this">How To Accomplish This</a></h2>

<p>There are two ways to get this done - in the case of the Leveson Inquiry.</p>

<ol>
    <li>Petition the Inquiry to release the original documents.</li>
    <li>Crowdsource the OCR.  Taking the Google OCR as a starting point and "Wikifying" it to let anyone correct the text.  A bit like <a href="http://www.pgdp.net/c/">Distributed Proofreaders</a></li>
</ol>

<p>I will, of course, send an email to the Leveson Inquiry - but would people be interested in being part of a crowdsourcing effort to opening up these documents?</p>
<img src="https://shkspr.mobi/blog/wp-content/themes/edent-wordpress-theme/info/okgo.php?ID=5702&HTTP_REFERER=RSS" alt="" width="1" height="1" loading="eager">]]></content:encoded>
					
					<wfw:commentRss>https://shkspr.mobi/blog/2012/05/crowdsourcing-leveson/feed/</wfw:commentRss>
			<slash:comments>1</slash:comments>
		
		
			</item>
	</channel>
</rss>
