Terence Eden. He has a beard and is smiling.

Terence Eden’s Blog

Theme Switcher:

Amazon Prime Video's weird Unicode problems

· 1 comment · 400 words · Viewed ~426 times


Description with an error in it.

It's 2019 and high-tech devices are still plagued by text encoding bugs. I recently bought the new 4K Amazon Fire Stick. It's a little Android dongle which plays videos. It's neat - but quite often displays weird text errors. Take the kids' TV show House of Anubis, the Fire displays the description like this: Looking at the source code for the description: That's the character "private use…

Domain hacks with unusual Unicode characters

· 3 comments · 850 words · Viewed ~18,907 times


Glowing computer text showing dot com dot info etc.

Unicode contains a range of symbols which don't get much use. For example, there are separate symbols for TradeMark - ™, Service Mark - ℠, and Prescriptions - ℞. Nestling among the "Letterlike Symbols" are two curious entries. Both of these are single characters: Telephone symbol - ℡ Numero Sign - № What's interesting is both .tel and .no are Top-Level-Domains (TLD) on the Domain Name System…

Forbidden Unicode

· 5 comments · 1,150 words · Viewed ~1,485 times


A book cover in the style of a 1950's pulp sci-fi novel. An AI generated set of computers are connected by wires.

I have been receiving letters from a dear friend by the name of Ophiuchus. He has been researching some curious anomalies in the Unicode Standard. While I cannot vouch for all he has written, I thought it worth presenting his discoveries to you. My friend, I bring you a curiosity! I have been engaged in a most frustrating task. That is, trying to decipher the ancient history of our friends at…

Why Android Pie Won't Be Getting the Copyleft Symbol

· 1 comment · 650 words · Viewed ~726 times


Wikipedia Copyleft page. The icon is a blank box.

Google is a company with nearly unlimited resources. It often chooses to use its power for the greater good of the Internet. Creating amazing projects like digitizing every printed book, bringing Internet access via high-altitude balloons, and offering high-quality language translation. And sometimes it just gets bored and abandons them. Google Noto is such a project. It is an attempt to create …

Virgin Media don't understand Unicode

· 3 comments · 400 words · Viewed ~927 times


HTML code from Virgin.

More adventures with Unicode. I logged in to my Virgin Media account to see when my promotional discount would end. Here's what their billing PDF said. Let'S Ignore The Weird Capitalisation Virgin'S System Uses. What's that  doing there? Their website says: No  symbol, but also no £ sign. Ah, but let's look at the underlying code. What's that weird character? It is the control ch…

Obsolete Technology in Unicode

· 15 comments · 650 words · Viewed ~4,354 times


Screenshot of the Unicode standard. The page shows symbols for Telephone Receivers, Pagers, and Fax Machines.

A short meander through some of the more obscure miscellany within Unicode. Languages hang around far longer than there are native speakers, and symbols get reused and repurposed (🍆). Here are some of the delightfully old-fashioned symbols hidden in your thoroughly modern smartphone. Tapes Long before solid-state drives, we used to record data on long thin strips of magnetic tape. 🖭 📼 I'm sure…

Pursuit Podcast - Life, The Unicode, And Everything

· 150 words


A beautiful hand drawing showing the flow of the conversation

The inimitable Jess Rose interviewed me for her Pursuit Podcast - talking about the Unicode Power Symbol proposal. We talked about how to subvert bureaucracy, building a team of supporters, adding new stuff to Unicode, and recognising that you're a background character in most people's lives. Bit of a ramble, but jolly good fun. Sketchnotes by the exquisitely talented Kate Holden. Find out…

únicode is hard

· 15 comments · 800 words · Viewed ~30,133 times


In the last couple of months, I've been seeing the ú symbol on British receipts. Why? 1963 - ASCII In the beginning* was ASCII. A standard way for computers to exchange text. ASCII was originally designed with 7 bits - that means 128 possible symbols. That ought to be enough for everyone, right? Wrong! ASCII is the American Code for Information Interchange. It contains a $ symbol, but …

Where do these arrows point?

· 7 comments · 800 words · Viewed ~386 times


This is a blog post about user interfaces. I was wandering along the beach one day, when I noticed some clever chap had drawn some arrows in the sand. Can you guess where they led? The more astute of you will have realised that these are not human drawn arrows. They are, of course, footprints left by birds. A bird's foot is a "backwards" arrow. The apex points to the bird's rear. It is …

How Do You Sort Chinese Numbers?

· 1 comment · 700 words · Viewed ~1,584 times


Imagine you have a series of number you wish to sort. Sorting is a well known computer science problem - generally speaking you compare one value to the next and then move the item either up or down a list. With "English" characters, that's fairly easy. When a computer sees the character 1 it's really seeing the Unicode character U+0031. When it sees 2 it's really seeing the character U+0032…

Why can't you send email to a Chinese address?

· 4 comments · 450 words · Viewed ~5,121 times


We all know what an email address looks like and how to validate them, right? A few years ago I got the Chinese domain name 莎士比亚.org. You can browse to it, link to it, and send email to it. Or can you? When I tried two years ago, none of the major email providers supported sending to non-ASCII email addresses. Today, I tried again with six of the big "Western" webmail providers. How did they…

Counting Invisible Strings

· 300 words


The PHP logo.

When is a string not a string? When it's a series of control characters! Not a particularly funny riddle, but one I've been wrestling with recently. Imagine we want to write a program which displays a Twitter user's name. Not their @ handle, but their "real" name. For example, instead of @POTUS, display "President Obama". Easy, right? Not quite. What happens when a user is named "️"? N…