<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet href="https://shkspr.mobi/blog/wp-content/themes/edent-wordpress-theme/rss-style.xsl" type="text/xsl"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	    xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	     xmlns:dc="http://purl.org/dc/elements/1.1/"
	   xmlns:atom="http://www.w3.org/2005/Atom"
	     xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	  xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>
<channel>
	<title>Hashtag Standards &#8211; Terence Eden’s Blog</title>
	<atom:link href="https://shkspr.mobi/blog/2010/02/hashtag-standards/feed/" rel="self" type="application/rss+xml" />
	<link>https://shkspr.mobi/blog</link>
	<description>Regular nonsense about tech and its effects 🙃</description>
	<lastBuildDate>Wed, 25 Feb 2026 08:49:27 +0000</lastBuildDate>
	<language>en-GB</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=7.0</generator>

<image>
	<url>https://shkspr.mobi/blog/wp-content/uploads/2023/07/cropped-avatar-32x32.jpeg</url>
	<title>Hashtag Standards &#8211; Terence Eden’s Blog</title>
	<link>https://shkspr.mobi/blog</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title><![CDATA[Hashtag Standards]]></title>
		<link>https://shkspr.mobi/blog/2010/02/hashtag-standards/</link>
					<comments>https://shkspr.mobi/blog/2010/02/hashtag-standards/#comments</comments>
				<dc:creator><![CDATA[@edent]]></dc:creator>
		<pubDate>Thu, 25 Feb 2010 17:35:38 +0000</pubDate>
				<category><![CDATA[/etc/]]></category>
		<category><![CDATA[usability]]></category>
		<category><![CDATA[api]]></category>
		<category><![CDATA[geek]]></category>
		<category><![CDATA[hashtags]]></category>
		<category><![CDATA[standards]]></category>
		<category><![CDATA[twitter]]></category>
		<guid isPermaLink="false">http://shkspr.mobi/blog/?p=1730</guid>

					<description><![CDATA[This is one of the longest and geekiest posts I&#039;ve done. It&#039;s a work in progress. All comments and abuse welcome.  #hashtag – As long has there has been a way to search Tweets* people have been adding information to make the easy to find. The #hashtag syntax has become the standard for attaching a succinct tag to Tweets.  &#60;  p&#62;The Twitter Engineering Blog  That&#039;s all well and good, but as I d…]]></description>
										<content:encoded><![CDATA[<pre>This is one of the longest and geekiest posts I've done.
It's a work in progress.
All comments and abuse welcome.</pre>

<blockquote><p>#hashtag – As long has there has been a way to search Tweets* people have been adding information to make the easy to find. The #hashtag syntax has become the standard for attaching a succinct tag to Tweets.</p></blockquote>

<p>&lt;</p>

<p>p&gt;<a href="https://blog.twitter.com/engineering/en_us/a/2010/introducing-the-open-source-twitter-text-libraries">The Twitter Engineering Blog</a></p>

<p>That's all well and good, but <a href="https://shkspr.mobi/blog/2010/02/hashtags-and-implicit-knowledge/">as I discovered yesterday</a>, without standardisation the ability to search falls apart.</p>

<p>I'm not talking about whether you should use the #<span style="text-decoration: underline;">LondonFire</span><strong> </strong>tag rather than #<span style="text-decoration: underline;">FireOfLondon</span><strong> </strong>or #<span style="text-decoration: underline;">LDNfire</span>. Rather; how does a computer recognise what a <em>valid</em> tag is?</p>

<h2 id="why-does-this-matter"><a href="https://shkspr.mobi/blog/2010/02/hashtag-standards/#why-does-this-matter">Why Does This Matter?</a></h2>

<p>Search and tracking quickly break down if they are inconsistent.
For example, if you are using #<span style="text-decoration: underline;">Romeo&amp;Juliet</span> to mark all your conversations about the play you are watching, different Twitter clients will link through to either #<span style="text-decoration: underline;">Romeo</span>, #<span style="text-decoration: underline;">Romeo&amp;</span>, or #<span style="text-decoration: underline;">Romeo&amp;Juliet</span>.&nbsp; Each search returning potentially different conversations.</p>

<h2 id="whats-the-convention"><a href="https://shkspr.mobi/blog/2010/02/hashtag-standards/#whats-the-convention">What's The Convention?</a></h2>

<p>Twitter's website <em>ought</em> to be the definitive source of how hashtags work.  This is their main site.</p>

<p><a href="https://twitter.com/edent/status/9594091070"><img class="size-full wp-image-1738" title="Twitter Website Hashtag" src="https://shkspr.mobi/blog/wp-content/uploads/2010/02/Twitter-Website-Hashtag.png" alt="Twitter Website Hashtag" width="407" height="214"></a></p>

<p>Yet, when we visit their mobile site - we get a completely different experience.</p>

<p><a href="https://mobile.twitter.com/edent/status/9594091070"><img class="size-full wp-image-1731" title="Mobile.Twitter's hashtags" src="https://shkspr.mobi/blog/wp-content/uploads/2010/02/snap20100225_151129.png" alt="Mobile.Twitter's hashtags" width="320" height="480"></a></p>

<h2 id="application-confusion"><a href="https://shkspr.mobi/blog/2010/02/hashtag-standards/#application-confusion">Application Confusion</a></h2>

<p>Because there aren't any widely publicised definitions for what hashtags are, some applications have a significantly different attitude to hashtags</p>

<p></p><div id="attachment_1732" style="width: 490px" class="wp-caption aligncenter"><img aria-describedby="caption-attachment-1732" class="size-full wp-image-1732" title="SocialScope Hashtags" src="https://shkspr.mobi/blog/wp-content/uploads/2010/02/Capture8_1_53.jpg" alt="SocialScope Hashtags" width="480" height="320"><p id="caption-attachment-1732" class="wp-caption-text">SocialScope Hashtags</p></div><p></p>

<h2 id="ubertwitters-hashtag-support"><a href="https://shkspr.mobi/blog/2010/02/hashtag-standards/#ubertwitters-hashtag-support">

<div id="attachment_1771" style="width: 490px" class="wp-caption aligncenter"><img aria-describedby="caption-attachment-1771" class="size-full wp-image-1771" title="UberTwitter's Hashtag Support" src="https://shkspr.mobi/blog/wp-content/uploads/2010/02/x2_c38906.jpeg" alt="UberTwitter's Hashtag Support" width="480" height="360"><p id="caption-attachment-1771" class="wp-caption-text">UberTwitter's Hashtag Support</p></div></a></h2>

<h2 id="standardisation"><a href="https://shkspr.mobi/blog/2010/02/hashtag-standards/#standardisation">Standardisation</a></h2>

<p>To be fair, the Twitter team do have a standard.&nbsp; Even if they don't use it themselves.</p>

<p>They even have some <a href="http://github.com/mzsanford/twitter-text-conformance">limited test cases</a> and libraries in Ruby and Java.</p>

<p>So, given that Twitter, their implementation and apps all disagree on what a hashtag is, let's try to work our what they <em>should</em> be.</p>

<h2 id="anatomy-of-a-tag"><a href="https://shkspr.mobi/blog/2010/02/hashtag-standards/#anatomy-of-a-tag">Anatomy of a Tag</a></h2>

<p>To begin at the beginning.&nbsp; A hashtag starts with a hash. #.&nbsp; Simple, no? No.</p>

<p>There are two different hash symbols! There's the # we all know and love, and there's ＃.&nbsp; Looks pretty similar, but in fact it's the unicode symbol [U+FF03]</p>

<p>Actually, that's not the beginning.&nbsp; What comes before the # of the hashtag?</p>

<p>Consider the following examples - which should be hashtags?</p>

<ul>
    <li>#tag - the # starts off the Tweet</li>
    <li>This is my tweet #test - the # comes after a space.</li>
    <li>This is it.#tag - the # is pushed against some punctuation, perhaps for reasons of space.</li>
    <li>Here we go-#LiftOff - the # is pushed against a -</li>
    <li>I've run out of space#OhNo - the # is pushed against some text</li>
    <li>&amp;#nbsp; - the # is part of an HTML entity</li>
    <li>text　#hashtag - the # comes after a "wide space" (U+3000)</li>
    <li>Should I use #tag/#hashtag? The # comes after a /</li>
    <li>Is this valid ##tag - there are two #s</li>
</ul>

<p>So, we can see it's a little more complicated than we first thought.</p>

<h2 id="the-end"><a href="https://shkspr.mobi/blog/2010/02/hashtag-standards/#the-end">The End</a></h2>

<p>Let's skip over what's <em>in</em> a hashtag and as "how do we know that a tag has finished?"</p>

<p>Consider the following examples -</p>

<ul>
    <li>New album #OMG! - should the ! be part of the hashtag?</li>
    <li>#BreakingNews: dog bites man - should the : be part of the hashtag?</li>
    <li>(is this a #tag) - should the ) be part of the hashtag?</li>
    <li>I like #tags#</li>
</ul>

<p>We probably don't want to have any punctuation at the end of our tag.&nbsp; Can you think of any counter examples?</p>

<h2 id="yummy-filling"><a href="https://shkspr.mobi/blog/2010/02/hashtag-standards/#yummy-filling">Yummy Filling</a></h2>

<p>Our language is more than just the letters A-Z. We've got punctuation, numbers, symbols and all manner of other glyphs.&nbsp; Which of them count as part of a hashtag?</p>

<p>Take a look at these examples</p>

<ul>
    <li>Vote Bush! #Don't</li>
    <li>My dog died #:-(</li>
    <li>Einstein #e=mc^2</li>
    <li>I'm on bus #123</li>
    <li>I'm giving #110%</li>
</ul>

<p>Using Twitter's standards, <strong>none</strong> of the above render as complete tags.</p>

<h2 id="foreign-languages"><a href="https://shkspr.mobi/blog/2010/02/hashtag-standards/#foreign-languages">Foreign Languages</a></h2>

<p>We've mentioned accents above.&nbsp; As we can see in the first example, "funny" characters can cause problems.&nbsp; Broadly speaking, there are three issues.</p>

<ol>
    <li>Accents.&nbsp; Should the é on #Café be linked?</li>
    <li>Accents.&nbsp; Is #Romeo the same as #Ŕöméø?</li>
    <li>Japanese, and some other languages, don't use spaces.&nbsp; Is #tagの valid? What about # 会議中 ?</li>
</ol>

<h2 id="exhausted"><a href="https://shkspr.mobi/blog/2010/02/hashtag-standards/#exhausted">Exhausted</a></h2>

<p>These are a fraction of the possible problems.&nbsp; It's exhausting trying to find all the possible textual combinations and permutations which could lead into a hashtag.&nbsp; No wonder there is confusion!</p>

<p>Search is a complex, profitable, and useful business.&nbsp; It's of vital importance that there is a legitimate, comprehensive standard which <strong>all</strong> sites and applications can follow.</p>
<img src="https://shkspr.mobi/blog/wp-content/themes/edent-wordpress-theme/info/okgo.php?ID=1730&HTTP_REFERER=RSS" alt="" width="1" height="1" loading="eager">]]></content:encoded>
					
					<wfw:commentRss>https://shkspr.mobi/blog/2010/02/hashtag-standards/feed/</wfw:commentRss>
			<slash:comments>2</slash:comments>
		
		
			</item>
	</channel>
</rss>
