<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet href="https://shkspr.mobi/blog/wp-content/themes/edent-wordpress-theme/rss-style.xsl" type="text/xsl"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	    xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	     xmlns:dc="http://purl.org/dc/elements/1.1/"
	   xmlns:atom="http://www.w3.org/2005/Atom"
	     xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	  xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>
<channel>
	<title>publishing &#8211; Terence Eden’s Blog</title>
	<atom:link href="https://shkspr.mobi/blog/tag/publishing/feed/" rel="self" type="application/rss+xml" />
	<link>https://shkspr.mobi/blog</link>
	<description>Regular nonsense about tech and its effects 🙃</description>
	<lastBuildDate>Sat, 23 Aug 2025 22:48:11 +0000</lastBuildDate>
	<language>en-GB</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.9.4</generator>

<image>
	<url>https://shkspr.mobi/blog/wp-content/uploads/2023/07/cropped-avatar-32x32.jpeg</url>
	<title>publishing &#8211; Terence Eden’s Blog</title>
	<link>https://shkspr.mobi/blog</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title><![CDATA[How do you raise a software bug with a book publisher?]]></title>
		<link>https://shkspr.mobi/blog/2020/11/how-do-you-raise-a-software-bug-with-a-book-publisher/</link>
					<comments>https://shkspr.mobi/blog/2020/11/how-do-you-raise-a-software-bug-with-a-book-publisher/#respond</comments>
				<dc:creator><![CDATA[@edent]]></dc:creator>
		<pubDate>Mon, 23 Nov 2020 12:26:12 +0000</pubDate>
				<category><![CDATA[/etc/]]></category>
		<category><![CDATA[bug]]></category>
		<category><![CDATA[ebook]]></category>
		<category><![CDATA[epub]]></category>
		<category><![CDATA[NaBloPoMo]]></category>
		<category><![CDATA[publishing]]></category>
		<category><![CDATA[standards]]></category>
		<guid isPermaLink="false">https://shkspr.mobi/blog/?p=37325</guid>

					<description><![CDATA[Recently, I bought an eBook which has a bug. I&#039;d like to explain what the bug is, why it is a problem, and how I&#039;m trying to get it corrected.  Amazon sells eBooks in KF8 format. That is an ePub with some proprietary extras. ePub is a standard based off HTML5. You can read the ePub 3 specification but, basically, it is a .zip of HTML files.  If you unzip an eBook, you can read the source code…]]></description>
										<content:encoded><![CDATA[<p>Recently, I bought an eBook which has a bug. I'd like to explain what the bug is, why it is a problem, and how I'm trying to get it corrected.</p>

<p>Amazon sells eBooks in <a href="https://wiki.mobileread.com/wiki/KF8">KF8 format</a>. That is an ePub with some proprietary extras. ePub is a standard based off HTML5. You can <a href="https://www.w3.org/publishing/epub3/epub-overview.html">read the ePub 3 specification</a> but, basically, it is a .zip of HTML files.  If you unzip an eBook, you can read the source code behind it.</p>

<p>When trying to read a Kindle book on a non-Kindle device, I noticed a bug. Some words were not displaying.  I took a look at the underlying source code, and found this:</p>

<img src="https://shkspr.mobi/blog/wp-content/uploads/2020/11/Screenshot-from-2020-11-19-09-20-27.png" alt="HTML code - a span wraps the first letter of a word." width="931" height="65" class="aligncenter size-full wp-image-37328">

<img src="https://shkspr.mobi/blog/wp-content/uploads/2020/11/Screenshot-from-2020-11-19-09-19-56.png" alt="HTML code - a span wraps the first word of a sentence.." width="680" height="65" class="aligncenter size-full wp-image-37329">

<p>Sometimes words and letters were wrapped with a <code>pagebreak</code> span like this:</p>

<pre><code class="language-html">&lt;span id="pg5" epub:type="pagebreak"&gt;‘But&lt;/span&gt; of course!’
</code></pre>

<p>When I tried to read the book using <a href="https://koreader.rocks/">KOReader</a> the word "‘But" didn't appear. Why?  Let's take a look at the <a href="https://idpf.github.io/epub-vocabs/structure/#pagination">ePub3 specification concerning page breaks</a>:</p>

<blockquote> <strong>pagebreak</strong>
<ul><li>  A separator denoting the position before which a break occurs between two contiguous pages in a statically paginated version of the content.
</li><li>  HTML usage context: phrasing and flow content, where the value of the carrying elements title attribute takes precedence over element content for the purposes of representing the pagebreak value</li></ul></blockquote>

<p>Here's the problem - eBooks can have page numbers. Despite "<a href="https://shkspr.mobi/blog/2011/02/quoting-page-numbers-from-ebooks-considered-harmful/">Page Numbers in eBooks Considered Harmful</a>" lots of publishers still use them. I guess it is <em>kind of</em> useful if you want to refer to something on a printed page - but eReaders allow you to change font size and line spacing, so the concept of a page is somewhat nebulous.</p>

<p>The way the spec is written, means that you can write something like:</p>

<pre><code class="language-html">&lt;span epub:type="pagebreak" id="page_123_a" title="123"&gt;123&lt;/span&gt;
</code></pre>

<p>You use the <code>id</code> for internal linking and the <code>title</code> attribute for the value.</p>

<p>Because of this, most eReaders do not display the physical page number inside the span. It has no semantic content for the reader, and breaks flow. If they did display it, you might end up reading text like this:</p>

<blockquote><p>Lorem ipsum dolor sit amet, consectetur adipiscing elit. Aenean vel</p>

<p>9</p>

<p>risus at metus molestie tincidunt. Donec aliquet aliquam lorem, ...</p></blockquote>

<p>So KOReader <em>deliberately</em> ignores any text which is wrapped with <code>epub:type="pagebreak"</code>.</p>

<p>If you look at <a href="https://github.com/IDPF/epub3-samples/search?l=HTML&amp;q=pagebreak">the example ePubs provided by the International Digital Publishing Forum</a>, you'll see that the majority of their spans are self-closing.</p>

<pre><code class="language-html">&lt;span epub:type="pagebreak" id="p123-a" title="123"/&gt;
</code></pre>

<p>Very occasionally, you see something which has just a page number in it:</p>

<p><a href="https://github.com/IDPF/epub3-samples/blob/68ba020fe5c99625d45dd50a94c4b202c5f220e1/30/childrens-literature/EPUB/s04.xhtml#L11"><code>&lt;span epub:type="pagebreak" title="169" id="Page_169"&gt;169&lt;/span&gt;</code></a></p>

<p>I checked with <a href="https://github.com/koreader/koreader/issues/6894">KOReader</a> and they confirmed that they were following the spec. I agree with them. There's no reason to wrap readable content in a metadata span like that.</p>

<p>I've checked several other books from different publishers. None of them abuse <code>pagebreaks</code> this way.  I think Penguin Random House are doing it wrong and I would like to correct them.</p>

<h2 id="reporting-it"><a href="https://shkspr.mobi/blog/2020/11/how-do-you-raise-a-software-bug-with-a-book-publisher/#reporting-it">Reporting it</a></h2>

<p>I've previously <a href="https://shkspr.mobi/blog/2011/03/how-do-you-upgrade-an-ebook/">reported buggy ebooks to vendors</a>. But because the Kindle app doesn't exhibit this problem, I thought it was futile going via Amazon. So I thought I'd try going directly to the publisher.</p>

<p>Sadly, <a href="https://github.com/puk">Penguin UK's GitHub repo is dead</a>. Their dedicated digital publishing team <a href="https://mobile.twitter.com/PRHDigital">haven't tweeted in 2 years</a>.</p>

<p>Their <a href="https://www.penguin.co.uk/company/about-us/notices/faqs.html">contact page</a> has a suggestion for what to do if there is an error in an eBook:</p>

<blockquote><p>It is best if you return the book to the original bookshop from which it was purchased; they should be happy to exchange it for a perfect copy. If you have any difficulty with this then please return the cover and title page of the book to us.</p></blockquote>

<p>Hmmm....</p>

<p>I dropped them an email, and got back this very reasonable reply:</p>

<blockquote><p>Thank you for reaching out and bringing this to our attention. The distinction you’ve made is already a part of our specification; this was an oversight which we’re looking into as a result of your input—it bypassed checks both because it validates and also passes visual checks on all the major platforms we screen for. This isn’t reflective of our entire library and should be limited to specific titles which we’re currently investigating.</p></blockquote>

<p>That's fair enough. The rendering quirk is specification compliant - but hard to spot because of the Kindle monoculture.</p>

<h2 id="change-the-spec-change-the-world"><a href="https://shkspr.mobi/blog/2020/11/how-do-you-raise-a-software-bug-with-a-book-publisher/#change-the-spec-change-the-world">Change the spec, change the world</a></h2>

<p>I've made <a href="https://github.com/w3c/epub-specs/issues/1437">a suggestion on GitHub that the spec should be clarified</a>. I don't think it's particularly obvious that content in a <code>pagebreak</code> may not be displayed.</p>

<p>Most resources agree that the content of a <code>pagebreak</code> should either be blank, or be the page number.</p>

<blockquote><p>If you include the page numbers as text content within a <code>span</code> or <code>div</code>, the pages will be more easily accessible to both sighted users and users using assistive technologies. This method has been employed in previous DAISY standards. The potential downside, however, is that mainstream user agents will not provide equivalent functionality to turn off unwanted content, forcing users to hear and view the page numbers.</p>

<p><a href="https://kb.daisy.org/publishing/docs/navigation/pagelist.html#faq">Digital Accessible Information System (DAISY)</a></p></blockquote>

<h2 id="whose-fault-is-it-anyway"><a href="https://shkspr.mobi/blog/2020/11/how-do-you-raise-a-software-bug-with-a-book-publisher/#whose-fault-is-it-anyway">Whose fault is it anyway?</a></h2>

<p>This is a tricky one. I think Penguin have undoubtedly made a mistake with the way they publish ePubs. But, so far, KOReader is the only rendering engine I've found which suppresses the content of <code>pagebreak</code>s by default.</p>

<p>Generally speaking, a user wouldn't want to display page numbers on an eBook. Software could have a user defined toggle to switch them on or off. Luckily, KOReader has a variety of style-sheets for rendering eBooks - so I picked one which displayed <code>pagebreak</code> content.</p>

<p>Software is hard.</p>
<img src="https://shkspr.mobi/blog/wp-content/themes/edent-wordpress-theme/info/okgo.php?ID=37325&HTTP_REFERER=RSS" alt="" width="1" height="1" loading="eager">]]></content:encoded>
					
					<wfw:commentRss>https://shkspr.mobi/blog/2020/11/how-do-you-raise-a-software-bug-with-a-book-publisher/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title><![CDATA[If you don't sell it, I can't buy it]]></title>
		<link>https://shkspr.mobi/blog/2020/03/if-you-dont-sell-it-i-cant-buy-it/</link>
					<comments>https://shkspr.mobi/blog/2020/03/if-you-dont-sell-it-i-cant-buy-it/#comments</comments>
				<dc:creator><![CDATA[@edent]]></dc:creator>
		<pubDate>Tue, 03 Mar 2020 18:05:28 +0000</pubDate>
				<category><![CDATA[/etc/]]></category>
		<category><![CDATA[academia]]></category>
		<category><![CDATA[ebook]]></category>
		<category><![CDATA[piracy]]></category>
		<category><![CDATA[publishing]]></category>
		<guid isPermaLink="false">https://shkspr.mobi/blog/?p=34228</guid>

					<description><![CDATA[I don&#039;t understand the world of academic publishing. Incredibly niche books, some barely longer than a novella, are sold for ridiculously high prices. Or, worse than that, they&#039;re not sold at all.  Let me explain.  A friend of mine recommended an obscure book, published a couple of years ago. The blurb made it look right up my alley, so I tried to buy it.  I only read eBooks, but I couldn&#039;t find…]]></description>
										<content:encoded><![CDATA[<p>I don't understand the world of academic publishing. Incredibly niche books, some barely longer than a novella, are sold for ridiculously high prices. Or, worse than that, they're not sold at all.  Let me explain.</p>

<p>A friend of mine recommended an obscure book, published a couple of years ago. The blurb made it look right up my alley, so I tried to buy it.  <a href="https://shkspr.mobi/blog/2012/03/burning-all-my-books/">I only read eBooks</a>, but I couldn't find it anywhere for sale in the UK.  Amazon had a listing for a paper copy, but it was permanently out of stock.</p>

<p>In desperation, I went straight to the publisher. They <em>would</em> ship a dead-tree book across the Atlantic to me. With shipping costing even more than the book.</p>

<p>I found someone on Twitter who was posting highlighted screenshots of the book, so I Tweeted him and the publisher to find out where to get a copy. He graciously offered to send me a PDF.</p>

<p>At which point, the publisher waded in - furious at the situation.</p>

<blockquote><p>How dare you give this away for free? We're a small publisher and we have to sell books to stay in business!</p></blockquote>

<p>Fair enough. So I asked them where I could buy an eBook version. Their response was disappointing.</p>

<blockquote><p>It's only available on Proquest and Ebsco. Sorry!</p></blockquote>

<p>Both of those are academic databases, available on subscription only, with no public pricing available.</p>

<p>So, let's be clear. You don't want people giving it away, but you won't actually sell me a copy?</p>

<p>What do you want me to do? <em>Not</em> read your book?</p>

<p>I took the emailed PDF. I'm not particularly proud of that. The book is good, and I'm enjoying learning from it. I just wish I could have paid for it.</p>
<img src="https://shkspr.mobi/blog/wp-content/themes/edent-wordpress-theme/info/okgo.php?ID=34228&HTTP_REFERER=RSS" alt="" width="1" height="1" loading="eager">]]></content:encoded>
					
					<wfw:commentRss>https://shkspr.mobi/blog/2020/03/if-you-dont-sell-it-i-cant-buy-it/feed/</wfw:commentRss>
			<slash:comments>1</slash:comments>
		
		
			</item>
		<item>
		<title><![CDATA[Traditional Book Publishers Don't Own The Future]]></title>
		<link>https://shkspr.mobi/blog/2013/04/traditional-book-publishers-dont-own-the-future/</link>
					<comments>https://shkspr.mobi/blog/2013/04/traditional-book-publishers-dont-own-the-future/#comments</comments>
				<dc:creator><![CDATA[@edent]]></dc:creator>
		<pubDate>Thu, 18 Apr 2013 11:00:14 +0000</pubDate>
				<category><![CDATA[/etc/]]></category>
		<category><![CDATA[books]]></category>
		<category><![CDATA[drm]]></category>
		<category><![CDATA[ebooks]]></category>
		<category><![CDATA[publishing]]></category>
		<guid isPermaLink="false">http://shkspr.mobi/blog/?p=8086</guid>

					<description><![CDATA[One of the &#34;perks&#34; of running a moderately popular blog is that sometimes people send you stuff.  I&#039;ve been sent wine, phones, apps, and all sorts to review - usually by PR people who realise that engagement with bloggers is a little different that their regular way of doing business.  Yesterday, I received this delightful email from the publishers Simon and Schuster.  NEW book by the &#34;Prophet of …]]></description>
										<content:encoded><![CDATA[<p>One of the "perks" of running a moderately popular blog is that sometimes people send you stuff.  I've been sent wine, phones, apps, and all sorts to review - usually by PR people who realise that engagement with bloggers is a little different that their regular way of doing business.</p>

<p>Yesterday, I received this <em>delightful</em> email from the publishers Simon and Schuster.</p>

<blockquote><p>NEW book by the "Prophet of Silicon Valley", Jaron Lanier, available for your review!</p>

<p>Dear Mr. Vogels,</p>

<p>I'm writing today to let you know of a new book from Simon &amp; Schuster that I think your readers will be interested in — WHO OWNS THE FUTURE? By Jaron Lanier (available 5/7/13).</p>

<p>Jaron Lanier is the bestselling author of You Are Not a Gadget, the father of virtual reality, and one of the most influential thinkers of our time.</p>

<p>[Snip a load of marketing fluff]</p>

<p>I'd like to send you a copy of the book to review.</p></blockquote>

<p>Well, let's ignore the fact that I am not "Mr Vogels".  PR is a hard job and getting people's names right isn't a fundamental part of relationship building.</p>

<p>So, as the book sounded interesting, I asked to be emailed a copy.</p>

<blockquote><p>I'm happy to send you a hardcover copy. If you might provide a mailing address, I'll put one in the mail today.</p></blockquote>

<p>As regular readers know, last year <a href="https://shkspr.mobi/blog/2012/03/burning-all-my-books/">I burned all my paper books and switched exclusively to ebooks</a>.  I related that fact, and mentioned that it would be much quicker than international shipping - surely a priority if you want advanced reviews.</p>

<blockquote><p>Unfortunately, I'm not able to send ebooks as review copies at this time. Many thanks for your interest.</p></blockquote>

<p>So, the "Prophet of Silicon Valley" wants his words stamped on to bits of dead tree, then air shipped at great expense across the Atlantic - as opposed to emailing me a few MB?</p>

<p>How <strong>very</strong> futuristic.</p>

<h2 id="dont-copy-that-floppy"><a href="https://shkspr.mobi/blog/2013/04/traditional-book-publishers-dont-own-the-future/#dont-copy-that-floppy">Don't Copy That Floppy!</a></h2>

<p>I don't know if this is Simon and Schuster police on ebooks - or Jaron Lanier's. Given <a href="http://copyrightandtechnology.com/2010/02/03/jaron-lanier-is-not-a-gadget/">Jaron's stance on DRM</a> and his <a href="http://www.publishersweekly.com/pw/by-topic/digital/content-and-e-books/article/56042-indie-booksellers-sue-amazon-big-six-over-e-book-drm.html">publisher's ongoing DRM law-suit</a> it could be either.</p>

<p>What I do know is this:</p>

<ul>
    <li>Regional release dates are antithetical to the Internet. The UK edition of "Who Owns The Future" went on sale in the UK on 7th March - it goes on sale on May 7 in the US/Canada and May 22 in Australia/NZ</li>
    <li>1 minute of Googling found an eBook copy, without DRM, nestling on a Russian server. Along with the usual polarity of Magnet links.</li>
</ul>

<p>It just goes to show, even the most influential futurist, and his publishers, are still struggling with the reality of the Internet and the ongoing eBook revolution.</p>

<p>The book certainly sound very interesting.  To quote from the preface:</p>

<blockquote><p>The very action of reading makes you the hero of the story I am telling. Maybe you bought, or stole, a physical copy, paid to read this on your tablet, or pirated a digital copy off a share site. Whatever the prequel, here you are, living precisely the circumstances described in this book.
</p><p><a href="http://www.amazon.co.uk/gp/product/B00ADNP2ZM/ref=as_li_ss_tl?ie=UTF8&amp;camp=1634&amp;creative=19450&amp;creativeASIN=B00ADNP2ZM&amp;linkCode=as2&amp;tag=shkspr-21"><img src="https://shkspr.mobi/blog/wp-content/uploads/2013/04/Who-Owns-The-Future-Cover.jpg" alt="Who Owns The Future Cover" width="260" height="400" class="alignleft size-full wp-image-8087"></a>
</p><p>If you paid to read this, thank you! This book is a result of living my life as I do, which I hope provides value to you. The hope of this book is that someday we’ll all have more ways to grow wealth as a side effect of living our lives creatively and intelligently, with an eye to doing things of use to others.
</p><p>If you paid to read, then there has been a one-way transaction, in which you transferred money to someone else.
</p><p>If you got it for free, there has been a no-way transaction, and any value traded will be off the books, recorded not in any ledger but rather in the informal value systems of reputation, karma, or other wispy forms of barter. That doesn’t mean nothing has happened. Maybe you’ll get some positive strokes over a social network because of what you say about the book. That sort of activity might benefit us both. But it’s a kind of benefit that is unreliable and perishable.</p></blockquote>

<p>"Who Owns The Future?" is available from all good book stores, as both a hardback or eBook.  Jaron Lanier would probably prefer you buy it, but is probably equally wary of your personal details ending up in a mega data warehouse.</p>
<img src="https://shkspr.mobi/blog/wp-content/themes/edent-wordpress-theme/info/okgo.php?ID=8086&HTTP_REFERER=RSS" alt="" width="1" height="1" loading="eager">]]></content:encoded>
					
					<wfw:commentRss>https://shkspr.mobi/blog/2013/04/traditional-book-publishers-dont-own-the-future/feed/</wfw:commentRss>
			<slash:comments>2</slash:comments>
		
		
			</item>
	</channel>
</rss>
