Why are QR Codes with capital letters smaller than QR codes with lower-case letters?

@edentqr QR Codes · 19 comments · 550 words · Viewed ~28,655 times.

Take a look at these two QR codes. Scan them if you like, I promise there's nothing dodgy in them.

Left is upper-case HTTPS://EDENT.TEL/ and right is lower-case https://edent.tel/

You can clearly see that the one on the left is a "smaller" QR as it has fewer bits of data in it. Both go to the same URl, the only difference is the casing.

What's going on?

Your first thought might be that there's a different level of error-correction. QR codes can have increasing levels of redundancy in order to make sure they can be scanned when damaged. But, in this case, they both have Low error correction.

The smaller code is "Type 1" - it is 21px * 21px. The larger is "Type 2" with 25px * 25px.

The official specification describes the versions in more details. The smaller code should be able to hold 25 alphanumeric character. But https://edent.tel/ is only 18 characters long. So why is it bumped into a larger code?

Using a decoder like ZXING it is possible to see the raw bytes of each code.

UPPER

20 93 1a a6 54 63 dd 28   

35 1b 50 e9 3b dc 00 ec

11 ec 11

lower:

41 26 87 47 47 07 33 a2   

f2 f6 56 46 56 e7 42 e7

46 56 c2 f0 ec 11 ec 11   

ec 11 ec 11 ec 11 ec 11

ec 11

You might have noticed that they both end with the same sequence: ec 11 Those are "padding bytes" because the data needs to completely fill the QR code. But - hang on! - not only does the UPPER one safely contain the text, it also has some spare padding?

The answer lies in the first couple of bytes.

Once the raw bytes have been read, a QR scanner needs to know exactly what sort of code it is dealing with. The first four bits tell it the mode. Let's convert the hex to binary and then split after the first four bits:

Type	HEX	BIN	Split
UPPER	`20 93`	`00100000 10010011`	`0010 000010010011`
lower	`41 26`	`01000001 00100110`	`0100 000100100110`

The UPPER code is 0010 which indicates it is Alphanumeric - the standard says the next 9 bits show the length of data.

The lower code is 0100 which indicates it is Byte mode - the standard says the next 8 bits show the length of data.

Type	HEX	BIN	Split
UPPER	`20 93`	`00100000 10010011`	`0010 0000 10010`
lower	`41 26`	`01000001 00100110`	`0100 000 10010`

Look at that! They both have a length of 10010 which, converted to binary, is 18 - the exact length of the text.

Alphanumeric users 11 bits for every two characters, Byte mode uses (you guessed it!) 8 bits per single character.

But why is the lower-case code pushed into Byte mode? Isn't it using letters and number?

Well, yes. But in order to store data efficiently, Alphanumeric mode only has a limited subset of characters available. Upper-case letters, and a handful of punctuation symbols: space $ % * + - . / :

Luckily, that's enough for a protocol, domain, and path. Sadly, no GET parameters.

So, there you have it. If you want the smallest possible physical size for a QR code which contains a URl, make sure the text is all in capital letters.

This blog post was exhibited at QR Show, NYC

19 thoughts on “Why are QR Codes with capital letters smaller than QR codes with lower-case letters?”

2025-02-23 13:06

Gadgetoid said on bsky.app:

Should be the reverse damn it, ruins any jokes about uppercase QR codes being bigger…

Reply | Reply to original comment on bsky.app
2025-02-23 13:26

m_eiman says:

@blog I used this a while ago to make a QR code that needed to be tiny, saved a noticeable bit of space 👍🏻

| Reply to original comment on mastodon.sdf.org
2025-02-23 13:38

RevK :verified_r: says:

@blog yeh, so many people have no clue about details like this. But I have always done this for URLs in QR codes for this very reason.

| Reply to original comment on toot.me.uk
2025-02-23 13:53

cccac,jn.yaml says:

@blog 11 bits for a character encoding with 45 symbols, that's clever. (45*45 = 2025 possible two-character symbols, out of 2048 representable by 11 bits)

| Reply to original comment on boopsnoot.de
2025-02-23 14:11

Lsamuelson57 said on mastodon.social:

@Edent
Very cool observation.

Probably a subtle effect of compressibility. Looking at the ASCII table, uppercase letters up to "m" have one less "1" bit than lowercase.

Combined with statistical frequency of the first 16 letters vs the last 10 could make a difference.

Reply | Reply to original comment on mastodon.social
2025-02-23 14:40

Anton says:

Cool! Will remember to do this!

Reply
2025-02-23 14:50

ww 🩶 said on xyzzy.link:

@Edent this is one of the reasons bitcoin switched to a new address format. even though base58 mixed-case addresses were shorter, their qr codes were bigger than the new case-insensitive ones based on base32.

Reply | Reply to original comment on xyzzy.link
2025-02-23 16:25

Kakurady 🛫 FC2025 said on fursuits.online:

@Edent of course, this only works if the URL has no path after the origin, or if the server treats paths as case-insensitive. So test the code before printing it on a card/poster/something else.

Reply | Reply to original comment on fursuits.online
2025-02-23 18:47

Tanguy ⧓ Herrmann says:

@blog that's amazing. I didn't know about that.
As I like my QRCode tiny so they can be scanned further away, I already created an URL shortener to redirect to it. But now, I'm gonna make it uppercase for this reason.

Thank you very much for this detailed explanation!

| Reply to original comment on hachyderm.io
2025-02-23 19:21

tiger tiger tiger says:

@blog OMG; this is extremely useful. Thank you!!!

(why in the world didn't they settle for lowercase, though... I guess ASCII order)

| Reply to original comment on tiggi.es
2025-02-23 22:17

news.ycombinator.com said on news.ycombinator.com:

Why are QR Codes with capital letters smaller than QR codes with lower-case let | Hacker News

Reply | Reply to original comment on news.ycombinator.com
2025-02-23 23:26

Adam Langley said on infosec.exchange:

@Edent imperialviolet.org/2021/08/26/qrencoding.html if you want more tricks for densely encoding information in QR codes.

ImperialViolet - Efficient QR codes

Reply | Reply to original comment on infosec.exchange
2025-02-24 12:05

Matt Hardy 3.11 for Workgroups says:

@blog Many years ago I got hold of the specification and learned as much as I could about QR codes. Decoding by hand on the train instead of my regular sudoko. However I never twigged the all caps trick. Thanks

| Reply to original comment on techhub.social
2025-02-25 12:47

Kenny said on bsky.app:

I saw this on Hacker News. It was really interesting. 🙂

Reply | Reply to original comment on bsky.app
2025-02-26 00:00

Jenny List said on hackaday.com:

QR codes have been with us for a long time now, and after passing through their Gardenesque hype cycle of inappropriate usage, have now settled…
Reply | Reply to original comment on hackaday.com
2025-02-28 02:50

Webto Wang said on webtoart.com:

题图：通关了动物井，真他妈是个好游戏，好玩极了！【影视】左右翼都是我的翅膀！本季最强日剧怎么就拍成了缝合怪? 最近很闲就补了一下去年的大热日剧，上面图里这个 VIVANT，虽然之后口碑崩盘（但是收视率还挺高），但是看了两集之后仍然感到：这破剧本什么破玩意。潘22老师这个吐槽视频比剧还好看，但是建议快进看完前四集再看吐槽视频 😂 【道理】 The More You Own, The More You Maintain If every new feature just meant one more thing to maintain, things might not be that bad. But ten design components don't create ten relationships,
Reply | Reply to original comment on webtoart.com
2025-03-17 07:59

Likely Jan Lukas says:

@blog

I have not dived into QR codes now for some years, but I did enjoy manually designing them even though I was not able to figure out the proper math to do the masking for the final step.

However, I did not run across this little titbit at all, so thank you for sharing it! ❤️

| Reply to original comment on mstdn.ca
2025-03-17 09:57

Phil Archer says:

@blog We have a tool for estimating the size of QR Codes that are increasingly being used to replace our old friend the 1D barcode https://ref.gs1.org/tools/module-count. Setting the scheme and domain name to upper case is one of the options for the reasons you've set out thoroughly.

| Reply to original comment on mastodon.social
More comments on Mastodon.

Trackbacks and Pingbacks

2025-02-25 22:15

Podcast Tehnocultura ep 201 – Europa tehnologică – Tehnocultura:

[…] – latest: Perplexity AI / Hotel back login – video / Generations / tablet anti-glare / NASA, no recursion – 10 rules / interviuri fizice – test coding / greu de făcut un calculator / cod QR mai mic […]

Share this post on…

19 thoughts on “Why are QR Codes with capital letters smaller than QR codes with lower-case letters?”

Gadgetoid said on bsky.app:

m_eiman says:

RevK :verified_r: says:

cccac,jn.yaml says:

Lsamuelson57 said on mastodon.social:

Anton says:

ww 🩶 said on xyzzy.link:

Kakurady 🛫 FC2025 said on fursuits.online:

Tanguy ⧓ Herrmann says:

tiger tiger tiger says:

news.ycombinator.com said on news.ycombinator.com:

Adam Langley said on infosec.exchange:

Matt Hardy 3.11 for Workgroups says:

Kenny said on bsky.app:

Jenny List said on hackaday.com:

Webto Wang said on webtoart.com:

Likely Jan Lukas says:

Phil Archer says:

More comments on Mastodon.

Trackbacks and Pingbacks

Podcast Tehnocultura ep 201 – Europa tehnologică – Tehnocultura:

What are your reckons? Cancel reply