What does "挨⎒" have to do with "<html"?
I received this weird bit of mojibake in an email. Here's the raw text view:
------=_NextPart_001_009E_01D4D8BF.D0737E10
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
=E6=8C=A8=E2=8E=92tml xmlns:v=3D"urn:schemas-microsoft-com:vml" xmlns:o=3D"=
urn:schemas-microsoft-com:office:office" xmlns:w=3D"urn:schemas-microsoft-c=
om:office:word" xmlns:m=3D"http://schemas.microsoft.com/office/2004/12/omml=
" xmlns=3D"http://www.w3.org/TR/REC-html40">
What's going on?
- 挨 is a Chinese, Japanese, Korean (cjk) unified ideograph (U+6328)
- ⎒ is the passive-pull-up-output symbol (U+2392)
That's somehow replaced:
- < - less-than sign (U+003C)
- h - Latin small letter h (U+0068)
<h
in binary is 00111100 01101000
挨⎒
in binary is 11100110 10001100 10101000 11100010 10001110 10010010
It was sent from Outlook 2010 to Gmail. But I receive lots of those emails.
Try as I might, I just can't work out where this error has crept in. A glitch in the Matrix? Electronic interference? Any clues, gang?
Andrew McGlashan says:
I don't know, I was hoping that someone would post a useful comment (this probably isn't it). Perhaps it is actually an attempt to compromise your security, you know like, "Your child's name doesn't include DROP TABLE ???"