I've been playing with Google's Cloud Vision API. It is OCR (Optical Character Recognition) - but in THE CLOUD and uses MACHINE LEARNING! When it works, it is indistinguishable from magic. When it fails, it reveals a very limited understanding of human text. Let's take a look at this quick example - a piece of evidence from Leveson Inquiry Considering that the document is a digital scan of a fax of a print out, it low resolution, blurry, and skewed - it is nothing short of incredible that …
Continue reading →
Recently, I wanted to embed an photograph of a book page. I thought it would be nifty if the text from the page could be selected. If you hover your mouse over this image, you should be able to select part of the text. Ideally, it will look something like this... It even works on Android (tried on Chrome, Opera, FireFox) and iOS 7. So, how did I do it? Originally, I was pointed to Project Naptha - it seems to do everything I want but is very JavaScript heavy and requires modern…
Continue reading →
I've already blogged about the Leveson Inquiry's disturbing habit of releasing evidence as scanned in PDFs. I had a suggestion from digital journalist Kevin Anderson Terence Eden is on Mastodon@edentGah! The #leveson witness statements are photocopied & scanned in levesoninquiry.org.uk/evidence/?witn…Disastrous for open justice - shkspr.mobi/blog/index.php…❤️ 0💬 0♻️ 110:12 - Fri 11 May 2012Mr Anderson@kevglobalReplying to @edent@edent Put the Leveson docs up on Google Docs. I'd be curious how …
Continue reading →
I've been listening to the Leveson inquiry. A large part of the exchanges seem to go like this: Jay: Turning to page 51. Witness: Which bundle? Jay: 1606. Witness: 1660? Leveson: No, the page after. Jay: Paragraph 7. Witness: I don't have a paragraph 7. Jay: Ah, I have an earlier print out. Leveson: You'll find it in tab 15. Witness: Is this Volume 2? And so on, ad nauseum. Surely there's no reason to have so much paper wastefully printed and then discarded? Why not a single reference…
Continue reading →