Tool to translate Unicode codes. Unicode is a character encoding standard aiming to give every character a numeric identifier.
Unicode Coding - dCode
Tag(s) : Character Encoding, Internet
dCode is free and its tools are a valuable help in games, maths, geocaching, puzzles and problems to solve every day!
A suggestion ? a feedback ? a bug ? an idea ? Write to dCode!
Unicode is a computer coding system that aims to unify text exchanges at the international level. With Unicode, each computer character is described by a name and a code (codepoint), identifying it uniquely regardless of the computer medium or the software used. Unicode has already listed over 100000 characters.
Among the first Unicode characters are the 128 ASCII codes (including the Latin alphabet), then the international phonetic alphabet, then the local alphabets (Greek, Cyrillic, etc.), then symbols and many others.
A message encoded with Unicode is composed of numbers that are automatically translated to the screen in characters that can be displayed to the user (via UTF-8 or UTF16).
Unicode encryption can be made by displaying the Unicode codes of each of the characters in the message.
Example: The message DCΦD€ (the DCODE word with a letter phi Φ and a symbol euro €)
Each character is actually encoded in the form:
Displayed Character | Unicode Code | Hexadecimal Unicode Code |
---|---|---|
D | 68 | 44 |
C | 67 | 43 |
Φ | 934 | 03A6 |
D | 68 | 44 |
€ | 8364 | 20AC |
Unicode numeric identifiers, like ASCII, are regularly displayed in hexadecimal format for a more concise writing.
In order make the translation of a Unicode message, reassociate each identifier code its Unicode character.
Example: The message 68,67,934,68,8364 is translated by each number: 68 => D, 67 => C, and so on, in order to obtain DCΦD€.
The message is composed of numbers (decimal or hexadecimal format, more rarely binary). For text composed of letters from the Latin alphabet, the numbers between 64 and 122 (corresponding to the ASCII and Unicode codes of the letters A-Z and a-z) will be the most frequent.
UTF-8 is a 100% Unicode compatible coding system, which has the advantage of being backward compatible with ASCII. UTF8 is used on more than 90% of websites.
UTF16 a 16-bit encoding variant, used by Windows.
UTF32 is another variant, still little used.
The official website of the Unicode Consortium is here
dCode retains ownership of the "Unicode Coding" source code. Except explicit open source licence (indicated Creative Commons / free), the "Unicode Coding" algorithm, the applet or snippet (converter, solver, encryption / decryption, encoding / decoding, ciphering / deciphering, breaker, translator), or the "Unicode Coding" functions (calculate, convert, solve, decrypt / encrypt, decipher / cipher, decode / encode, translate) written in any informatic language (Python, Java, PHP, C#, Javascript, Matlab, etc.) and all data download, script, or API access for "Unicode Coding" are not public, same for offline use on PC, mobile, tablet, iPhone or Android app!
Reminder : dCode is free to use.
The copy-paste of the page "Unicode Coding" or any of its results, is allowed (even for commercial purposes) as long as you credit dCode!
Exporting results as a .csv or .txt file is free by clicking on the export icon
Cite as source (bibliography):
Unicode Coding on dCode.fr [online website], retrieved on 2024-12-18,