<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet href="https://shkspr.mobi/blog/wp-content/themes/edent-wordpress-theme/rss-style.xsl" type="text/xsl"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	    xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	     xmlns:dc="http://purl.org/dc/elements/1.1/"
	   xmlns:atom="http://www.w3.org/2005/Atom"
	     xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	  xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>
<channel>
	<title>homoglyphs &#8211; Terence Eden’s Blog</title>
	<atom:link href="https://shkspr.mobi/blog/tag/homoglyphs/feed/" rel="self" type="application/rss+xml" />
	<link>https://shkspr.mobi/blog</link>
	<description>Regular nonsense about tech and its effects 🙃</description>
	<lastBuildDate>Sat, 07 Oct 2023 15:13:51 +0000</lastBuildDate>
	<language>en-GB</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.9.4</generator>

<image>
	<url>https://shkspr.mobi/blog/wp-content/uploads/2023/07/cropped-avatar-32x32.jpeg</url>
	<title>homoglyphs &#8211; Terence Eden’s Blog</title>
	<link>https://shkspr.mobi/blog</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title><![CDATA[It is spelled "URl"]]></title>
		<link>https://shkspr.mobi/blog/2020/03/it-is-spelled-url/</link>
					<comments>https://shkspr.mobi/blog/2020/03/it-is-spelled-url/#comments</comments>
				<dc:creator><![CDATA[@edent]]></dc:creator>
		<pubDate>Wed, 04 Mar 2020 18:47:20 +0000</pubDate>
				<category><![CDATA[/etc/]]></category>
		<category><![CDATA[homoglyphs]]></category>
		<category><![CDATA[homograph]]></category>
		<category><![CDATA[troll]]></category>
		<guid isPermaLink="false">https://shkspr.mobi/blog/?p=34205</guid>

					<description><![CDATA[There are many sectarian divides in computer.   &#34;Little-Endians&#34; and &#34;Big-Endians&#34; wage bitter war against each other over the order of bits. Should line in text files end with \r\n or just \n? And both vi and emacs users fight betwixt themselves while ignoring the superior foe - nano.   Perhaps the most contentious of these is the battle between URI and URL.  Should we refer to links on the web…]]></description>
										<content:encoded><![CDATA[<p>There are many sectarian divides in computer.</p>

<ul>
<li><a href="https://www.gutenberg.org/files/17157/17157-h/17157-h.htm">"Little-Endians" and "Big-Endians"</a> wage bitter war against each other over the order of bits.</li>
<li>Should line in text files end with <code>\r\n</code> or just <code>\n</code>?</li>
<li>And both <code>vi</code> and <code>emacs</code> users fight betwixt themselves while ignoring the superior foe - <code>nano</code>.</li>
</ul>

<p>Perhaps the most contentious of these is the battle between <code>URI</code> and <code>URL</code>.  Should we refer to links on the web as Uniform Resource <em>Identifiers</em> or <em>Locators</em>?  Obviously there is a correct answer - and anyone who disagrees is a heretic.</p>

<p>So, I've come up with a compromise guaranteed to <del>annoy</del> satisfy everyone - URl.</p>

<p>That's upper-case U, upper-case R, lower case L.</p>

<p>Perfect!</p>
<img src="https://shkspr.mobi/blog/wp-content/themes/edent-wordpress-theme/info/okgo.php?ID=34205&HTTP_REFERER=RSS" alt="" width="1" height="1" loading="eager">]]></content:encoded>
					
					<wfw:commentRss>https://shkspr.mobi/blog/2020/03/it-is-spelled-url/feed/</wfw:commentRss>
			<slash:comments>15</slash:comments>
		
		
			</item>
		<item>
		<title><![CDATA[Homoglyphs for SEO]]></title>
		<link>https://shkspr.mobi/blog/2014/02/homoglyphs-for-seo/</link>
					<comments>https://shkspr.mobi/blog/2014/02/homoglyphs-for-seo/#comments</comments>
				<dc:creator><![CDATA[@edent]]></dc:creator>
		<pubDate>Sat, 22 Feb 2014 12:11:05 +0000</pubDate>
				<category><![CDATA[Security]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[homoglyphs]]></category>
		<category><![CDATA[security]]></category>
		<category><![CDATA[seo]]></category>
		<category><![CDATA[web]]></category>
		<guid isPermaLink="false">http://shkspr.mobi/blog/?p=10011</guid>

					<description><![CDATA[Search Engine Optimisation is the (dark) art of getting a site to the top of Google&#039;s ranking algorithm.  If you&#039;re in the business of selling decorations for ponds, you want your shop to be right at the top of the results when people search for &#34;bespoke synthetic frog spawn.&#34;  The problem is, there are lots of people all playing the same game.  So, what &#34;unusual&#34; tactics can be used to drive…]]></description>
										<content:encoded><![CDATA[<p>Search Engine Optimisation is the (dark) art of getting a site to the top of Google's ranking algorithm.  If you're in the business of selling decorations for ponds, you want your shop to be right at the top of the results when people search for "bespoke synthetic frog spawn."</p>

<p>The problem is, there are lots of people all playing the same game.  So, what "unusual" tactics can be used to drive sites to the top?</p>

<p><a href="https://shkspr.mobi/blog/2014/02/homoglyph-attacks/" title="Homoglyph Attacks">Yesterday, I looked at how homoglyphs</a> like Il (capital i lower L) can be used to confuse humans.  Today, let's see how they can confuse computers.</p>

<p>Shortly after publishing my blog post, I was the <a href="https://www.google.com/search?q=%22%CE%BD%E2%85%B0%EF%BD%81%EF%BD%87%EF%BD%92%D0%B0%22">number one search result for"νⅰａｇｒа"</a>.</p>

<p><a href="https://www.google.com/search?q=%22%CE%BD%E2%85%B0%EF%BD%81%EF%BD%87%EF%BD%92%D0%B0%22"><img src="https://shkspr.mobi/blog/wp-content/uploads/2014/02/Homoglyphs-for-SEO-fs8.png" alt="Homoglyphs for SEO" width="710" height="508" class="aligncenter size-full wp-image-10013"></a></p>

<p>Why? Because, apparently, I'm the <em>only</em> person on the whole Web who has used those weird characters in precisely that combination.</p>

<p>We can already see this sort of spam attack being used on the Web.  Do a search for "Ԍооｇｌе" and you'll see a bunch of sites like these:</p>

<p><a href="https://www.google.co.uk/search?q=%D4%8C%D0%BE%D0%BE%EF%BD%87%EF%BD%8C%D0%B5"><img src="https://shkspr.mobi/blog/wp-content/uploads/2014/02/Google-Homoglyph-SEO-fs8.png" alt="Google Homoglyph SEO" width="710" height="677" class="aligncenter size-full wp-image-10014"></a></p>

<h2 id="evading-spam-filters"><a href="https://shkspr.mobi/blog/2014/02/homoglyphs-for-seo/#evading-spam-filters">Evading Spam Filters</a></h2>

<p>Now, no one in their right mind is going to type those perculiar characters in to a search box.  But they can have their uses elsewhere.</p>

<p>Google has a nifty little feature - the "I'm Feeling Lucky" button.  Press that, and you're immediately taken to Google's first result.</p>

<p>Simply append <samp>&amp;btnI</samp> on to a Google search URL and you'll be taken there.</p>

<p>Very few services will block a Google URL from being shared.  This means, if you can claim the top spot for a particular search, you'll be able to craft a URL which will <em>look</em> like it takes you to Google, but will <em>actually</em> take you elsewhere.</p>

<p>Copy and paste this into your URL bar and see what happens.</p>

<pre>https://www.google.com/search?q="νⅰａｇｒа"&amp;btnI</pre>

<h2 id="what-can-be-done-to-stop-this"><a href="https://shkspr.mobi/blog/2014/02/homoglyphs-for-seo/#what-can-be-done-to-stop-this">What Can Be Done To Stop This?</a></h2>

<p>Damned if I know!</p>

<p>Search engines - and others - could look at the characters being used as see if they naturally belong together.  Should the presence of a Cyrillic "о" in the middle of Latin text set off alarm bells? How about the Greek "Τ" mixed in with the Cherokee "Ꭺ"?</p>

<p>Perhaps computers need to get better at spotting "odd" patterns of letters - deriving a better semantic understanding of the text in context rather than just looking for literal strings.</p>

<p>"SEO Experts" are already exploiting this weakness in search engines.  Do we need to find a way to nip this in the bug?</p>
<img src="https://shkspr.mobi/blog/wp-content/themes/edent-wordpress-theme/info/okgo.php?ID=10011&HTTP_REFERER=RSS" alt="" width="1" height="1" loading="eager">]]></content:encoded>
					
					<wfw:commentRss>https://shkspr.mobi/blog/2014/02/homoglyphs-for-seo/feed/</wfw:commentRss>
			<slash:comments>2</slash:comments>
		
		
			</item>
	</channel>
</rss>
