encoding

#define encoding: \ I----------------------------------------------\ I----------------------------------------------\ I----------------------------------------------\ I _____ _ _ \ I | ___| | (_) \ I | |__ _ __ ___ ___ __| |_ _ __ __ _ \ I | __| '_ \ / __/ _ \ / _` | | '_ \ / _` | \ I | |__| | | | (_| (_) | (_| | | | | | (_| | \ I \____/_| |_|\___\___/ \__,_|_|_| |_|\__, | \ I __/ | \ I |___/ \ I----------------------------------------------\ I----------------------------------------------\ I----------------------------------------------I "Math/numeric bases" "Hardware" • technically all computer data is in binary, just true and false bits, what gives them meaning is how we interpret them, this is what encoding is BINARY: • dont get confused by the naming, in this context it refers to binary being interpreted literally • whats important is that most things are encoded in a more complicated way // @BAKE gcc -o $*.out $@ -std=c23 #include <stdio.h> signed main(void) { // variable representing a positive whole unsigned i = 0b0100; // binary notation printf("%d\n", h); // Out: 4 } BASE64: • using the numerical base 64 to represent data • the character set for digits is: A-Z, a-z, 0-9, '+', '/' • '=' is used for padding • often used to transfer a binary blob as plain text • base32 works based one the same logic, except its less commonly used • UNIX-like systems usually ship with a "base64" utility $ echo -ne '\xDE\xAD\xBE\xEF\x00' | base64 3q2+7wA= CARET_NOTATION: https://github.com/agvxov/caret_notater • designed to convert ascii control characters to printable characters • often used by commandline utilities {Bash} $ echo -ne 'This is caret notation: \x00\x01\x02\x03\x7F' | caret-notater; echo This is caret notation: ^@^A^B^C^? FLOATS: • basically scientific notation in base 2 • used for simulations {games}, where accuracy is secondary to performance • fast • versatile • notoriously imprecise — floats are not precise: • "precision loss" print(0.1 + 0.1 + 0.1 == 0.3) # Out: False • this roots from the fact the we are mapping an infinite set (fractions) to a finite set (number of values representable on ${N} bits), so for values we cannot represent, we must round • further complicates the situation that due to the nature of the encoding, the step between any two representable value is not constant, but its average grows as we move further away from zero # very rough visualization of the change in steps ( ( ( ( ( ( (0) ) ) ) ) ) ) — 3 fields: -1^<sign> * (1 + <mantissa>) * (2^<exponent> - 127) ○ sign (-/+) + - 0 — - 1 ○ exponent • we subtract 127 from the exponent so it can represent negative numbers; therefor ${n} = 127 : 0 ${n} < 127 : - ${n} > 127 : + • the values of all 0s and 1s are reserved; therefor it ranges between -126 and 127 ○ fraction/mantissa • the fractions is calculated as numbers less then two in a base of two • each position mark 1/2^N with an on-growing N as we get further from 0 sign exponent mantissa | | | +---+----------+-------------------------+ float | 1 | 8 | 23 | +---+----------+--+----------------------+-------------------------------+ double | 1 | 11 | 52 | +---+-------------+------------------------------------------------------+ { // float to binary (by hand) 11000010111110000011001100110011 // separate the fields (optionally insert dots to make it easier on the eyes) 1 1000.0101 1111.0000.0110.0110.0110.011 | | | sign | fraction exponent // determine the sign 1 => minus // read the exponent 1000.0101 // interpret as base 10 133 // subtract the magical 127 shift 133 - 127 = 6 // read the mantissa 1111.0000.0110.0110.0110.011 // this is very important: we will perform the shift here, // because it makes our lives significantly easier // what this means is that we will isolate as many digits // as indicated by the exponent, ie. 6 11.11.00 00.01.10.01.10.01.10.011 // add the (otherwise) implicit 1 to start of the first group 11.11.10.0 00.01.10.01.10.01.10.011 | | whole-part | fractional-part // convert the whole-part from binary // a painless method is to write the two exponents above the digits // and sum the positions that contain a 1 64 32 16 8 4 2 1 1 1 1 1 1 0 0 64 + 32 + 16 + 8 + 4 = 124 // convert the fractional part from binary, // using the same strategy 1/2 1/4 1/8 1/16 1/32 1/64 1/128 1/256 1/512 1/1024 1/2048 1/4096 1/8192 1/16384 1/32768 1/65536 1/129072 0 0 0 1 1 0 0 1 1 0 0 1 1 0 0 1 1 // we get an expression 1/16 + 1/32 + 1/256 + 1/512 + 1/4096 + 1/8192 + 1/65536 + 1/129072 52867147 = ─────────── 528678912 // now, its important to realize that this number cannot be expressed in base 10 // in a sense this is a sort-of reverse-precision-loss, // tho this has little to no significance outside of our example = 0.09999847412109375 // put it all together — 1 * 124.09999847412109375 = -124.09999847412109375 ^^^^^^^^^^^^^^^^^^^^^^ } DECIMALS: • representing base 10 digits with binary • used for accounting where no imprecision is acceptable • reliable • slow • have a relatively high memory footprint • 4 bits is the minimum amount of memory that can store a base 10 digit, therefor that is the amount of memory reserved for one • there are gonna be invalid/illegal values; ie. value which cannot be represented in base 10 on a single digit { 1011 /* 11; obviously doesnt fit to a single digit */ } • to avoid invalid values, whenever one would come up in an operation, 2^sizeof(digit) - 10 is added to the digit and 1 is carried to the next { II. I. 0101 0011 // 53 + 1001 0111 // + 97 ────────────────── // ────── 0001 0101 0000 // 150 I. 0011 + 0111 ─────── 1010 // the result is larger than 10 there for we add 6 0110 + 1010 ─────── 0000 // 1 will have to be carried II. 0001 // *this is the 1 we carried as the last operation had to be rolled over 0101 + 1001 ─────── 1111 // the result is larger than 10 therefor we add 6 0110 + 1111 ─────── 0101 // 1 will have to be carried III. 0001 // *this is the 1 we carried as the last operation had to be rolled over 0000 + 0000 ─────── 0001 } ASCII: #define ascii_table // ascii table -------------------------------------------------------------------------------------------------------------------- | ASCII - Windows Codepage 1252 (probably) /*0xfffffff2*/ | -------------------------------------------------------------------------------------------------------------------- | DEC | OCT | HEX | BIN | Symbol | HTML Number | HTML Name | Description | -------------------------------------------------------------------------------------------------------------------- | 0 | 000 | 00 | 00000000 | NUL | � | n/a | Null char | | 1 | 001 | 01 | 00000001 | SOH |  | n/a | Start of Heading | | 2 | 002 | 02 | 00000010 | STX |  | n/a | Start of Text | | 3 | 003 | 03 | 00000011 | ETX |  | n/a | End of Text | | 4 | 004 | 04 | 00000100 | EOT |  | n/a | End of Transmission | | 5 | 005 | 05 | 00000101 | ENQ |  | n/a | Enquiry | | 6 | 006 | 06 | 00000110 | ACK |  | n/a | Acknowledgment | | 7 | 007 | 07 | 00000111 | BEL |  | n/a | Bell | | 8 | 010 | 08 | 00001000 | BS |  | n/a | Back Space | | 9 | 011 | 09 | 00001001 | HT | | n/a | Horizontal Tab | | 10 | 012 | 0A | 00001010 | LF | | n/a | Line Feed | | 11 | 013 | 0B | 00001011 | VT | | n/a | Vertical Tab | | 12 | 014 | 0C | 00001100 | FF | | n/a | Form Feed | | 13 | 015 | 0D | 00001101 | CR | | n/a | Carriage Return | | 14 | 016 | 0E | 00001110 | SO |  | n/a | Shift Out / X-On | | 15 | 017 | 0F | 00001111 | SI |  | n/a | Shift In / X-Off | | 16 | 020 | 10 | 00010000 | DLE |  | n/a | Data Line Escape | | 17 | 021 | 11 | 00010001 | DC1 |  | n/a | Device Control 1 (oft. XON) | | 18 | 022 | 12 | 00010010 | DC2 |  | n/a | Device Control 2 | | 19 | 023 | 13 | 00010011 | DC3 |  | n/a | Device Control 3 (oft. XOFF) | | 20 | 024 | 14 | 00010100 | DC4 |  | n/a | Device Control 4 | | 21 | 025 | 15 | 00010101 | NAK |  | n/a | Negative Acknowledgement | | 22 | 026 | 16 | 00010110 | SYN |  | n/a | Synchronous Idle | | 23 | 027 | 17 | 00010111 | ETB |  | n/a | End of Transmit Block | | 24 | 030 | 18 | 00011000 | CAN |  | n/a | Cancel | | 25 | 031 | 19 | 00011001 | EM |  | n/a | End of Medium | | 26 | 032 | 1A | 00011010 | SUB |  | n/a | Substitute | | 27 | 033 | 1B | 00011011 | ESC |  | n/a | Escape | | 28 | 034 | 1C | 00011100 | FS |  | n/a | File Separator | | 29 | 035 | 1D | 00011101 | GS |  | n/a | Group Separator | | 30 | 036 | 1E | 00011110 | RS |  | n/a | Record Separator | | 31 | 037 | 1F | 00011111 | US |  | n/a | Unit Separator | | 32 | 040 | 20 | 00100000 | | | n/a | Space | | 33 | 041 | 21 | 00100001 | ! | ! | n/a | Exclamation mark | | 34 | 042 | 22 | 00100010 | " | " | " | Double quotes (or speech marks) | | 35 | 043 | 23 | 00100011 | # | # | n/a | Number | | 36 | 044 | 24 | 00100100 | $ | $ | n/a | Dollar | | 37 | 045 | 25 | 00100101 | % | % | n/a | Per cent sign | | 38 | 046 | 26 | 00100110 | & | & | & | Ampersand | | 39 | 047 | 27 | 00100111 | ' | ' | n/a | Single quote | | 40 | 050 | 28 | 00101000 | ( | ( | n/a | Open parenthesis (or open bracket) | | 41 | 051 | 29 | 00101001 | ) | ) | n/a | Close parenthesis (or close bracket) | | 42 | 052 | 2A | 00101010 | * | * | n/a | Asterisk | | 43 | 053 | 2B | 00101011 | + | + | n/a | Plus | | 44 | 054 | 2C | 00101100 | , | , | n/a | Comma | | 45 | 055 | 2D | 00101101 | - | - | n/a | Hyphen | | 46 | 056 | 2E | 00101110 | . | . | n/a | Period, dot or full stop | | 47 | 057 | 2F | 00101111 | / | / | n/a | Slash or divide | | 48 | 060 | 30 | 00110000 | 0 | 0 | n/a | Zero | | 49 | 061 | 31 | 00110001 | 1 | 1 | n/a | One | | 50 | 062 | 32 | 00110010 | 2 | 2 | n/a | Two | | 51 | 063 | 33 | 00110011 | 3 | 3 | n/a | Three | | 52 | 064 | 34 | 00110100 | 4 | 4 | n/a | Four | | 53 | 065 | 35 | 00110101 | 5 | 5 | n/a | Five | | 54 | 066 | 36 | 00110110 | 6 | 6 | n/a | Six | | 55 | 067 | 37 | 00110111 | 7 | 7 | n/a | Seven | | 56 | 070 | 38 | 00111000 | 8 | 8 | n/a | Eight | | 57 | 071 | 39 | 00111001 | 9 | 9 | n/a | Nine | | 58 | 072 | 3A | 00111010 | : | : | n/a | Colon | | 59 | 073 | 3B | 00111011 | ; | ; | n/a | Semicolon | | 60 | 074 | 3C | 00111100 | < | < | < | Less than (or open angled bracket) | | 61 | 075 | 3D | 00111101 | = | = | n/a | Equals | | 62 | 076 | 3E | 00111110 | > | > | > | Greater than (or close angled bracket) | | 63 | 077 | 3F | 00111111 | ? | ? | n/a | Question mark | | 64 | 100 | 40 | 01000000 | @ | @ | n/a | At symbol | | 65 | 101 | 41 | 01000001 | A | A | n/a | Uppercase A | | 66 | 102 | 42 | 01000010 | B | B | n/a | Uppercase B | | 67 | 103 | 43 | 01000011 | C | C | n/a | Uppercase C | | 68 | 104 | 44 | 01000100 | D | D | n/a | Uppercase D | | 69 | 105 | 45 | 01000101 | E | E | n/a | Uppercase E | | 70 | 106 | 46 | 01000110 | F | F | n/a | Uppercase F | | 71 | 107 | 47 | 01000111 | G | G | n/a | Uppercase G | | 72 | 110 | 48 | 01001000 | H | H | n/a | Uppercase H | | 73 | 111 | 49 | 01001001 | I | I | n/a | Uppercase I | | 74 | 112 | 4A | 01001010 | J | J | n/a | Uppercase J | | 75 | 113 | 4B | 01001011 | K | K | n/a | Uppercase K | | 76 | 114 | 4C | 01001100 | L | L | n/a | Uppercase L | | 77 | 115 | 4D | 01001101 | M | M | n/a | Uppercase M | | 78 | 116 | 4E | 01001110 | N | N | n/a | Uppercase N | | 79 | 117 | 4F | 01001111 | O | O | n/a | Uppercase O | | 80 | 120 | 50 | 01010000 | P | P | n/a | Uppercase P | | 81 | 121 | 51 | 01010001 | Q | Q | n/a | Uppercase Q | | 82 | 122 | 52 | 01010010 | R | R | n/a | Uppercase R | | 83 | 123 | 53 | 01010011 | S | S | n/a | Uppercase S | | 84 | 124 | 54 | 01010100 | T | T | n/a | Uppercase T | | 85 | 125 | 55 | 01010101 | U | U | n/a | Uppercase U | | 86 | 126 | 56 | 01010110 | V | V | n/a | Uppercase V | | 87 | 127 | 57 | 01010111 | W | W | n/a | Uppercase W | | 88 | 130 | 58 | 01011000 | X | X | n/a | Uppercase X | | 89 | 131 | 59 | 01011001 | Y | Y | n/a | Uppercase Y | | 90 | 132 | 5A | 01011010 | Z | Z | n/a | Uppercase Z | | 91 | 133 | 5B | 01011011 | [ | [ | n/a | Opening bracket | | 92 | 134 | 5C | 01011100 | \ | \ | n/a | Backslash | | 93 | 135 | 5D | 01011101 | ] | ] | n/a | Closing bracket | | 94 | 136 | 5E | 01011110 | ^ | ^ | n/a | Caret - circumflex | | 95 | 137 | 5F | 01011111 | _ | _ | n/a | Underscore | | 96 | 140 | 60 | 01100000 | ` | ` | n/a | Grave accent | | 97 | 141 | 61 | 01100001 | a | a | n/a | Lowercase a | | 98 | 142 | 62 | 01100010 | b | b | n/a | Lowercase b | | 99 | 143 | 63 | 01100011 | c | c | n/a | Lowercase c | | 100 | 144 | 64 | 01100100 | d | d | n/a | Lowercase d | | 101 | 145 | 65 | 01100101 | e | e | n/a | Lowercase e | | 102 | 146 | 66 | 01100110 | f | f | n/a | Lowercase f | | 103 | 147 | 67 | 01100111 | g | g | n/a | Lowercase g | | 104 | 150 | 68 | 01101000 | h | h | n/a | Lowercase h | | 105 | 151 | 69 | 01101001 | i | i | n/a | Lowercase i | | 106 | 152 | 6A | 01101010 | j | j | n/a | Lowercase j | | 107 | 153 | 6B | 01101011 | k | k | n/a | Lowercase k | | 108 | 154 | 6C | 01101100 | l | l | n/a | Lowercase l | | 109 | 155 | 6D | 01101101 | m | m | n/a | Lowercase m | | 110 | 156 | 6E | 01101110 | n | n | n/a | Lowercase n | | 111 | 157 | 6F | 01101111 | o | o | n/a | Lowercase o | | 112 | 160 | 70 | 01110000 | p | p | n/a | Lowercase p | | 113 | 161 | 71 | 01110001 | q | q | n/a | Lowercase q | | 114 | 162 | 72 | 01110010 | r | r | n/a | Lowercase r | | 115 | 163 | 73 | 01110011 | s | s | n/a | Lowercase s | | 116 | 164 | 74 | 01110100 | t | t | n/a | Lowercase t | | 117 | 165 | 75 | 01110101 | u | u | n/a | Lowercase u | | 118 | 166 | 76 | 01110110 | v | v | n/a | Lowercase v | | 119 | 167 | 77 | 01110111 | w | w | n/a | Lowercase w | | 120 | 170 | 78 | 01111000 | x | x | n/a | Lowercase x | | 121 | 171 | 79 | 01111001 | y | y | n/a | Lowercase y | | 122 | 172 | 7A | 01111010 | z | z | n/a | Lowercase z | | 123 | 173 | 7B | 01111011 | { | { | n/a | Opening brace | | 124 | 174 | 7C | 01111100 | | | | | n/a | Vertical bar | | 125 | 175 | 7D | 01111101 | } | } | n/a | Closing brace | | 126 | 176 | 7E | 01111110 | ~ | ~ | n/a | Equivalency sign - tilde | | 127 | 177 | 7F | 01111111 | n/a |  | n/a | Delete | | 128 | 200 | 80 | 10000000 | € | € | € | Euro sign | | 129 | 201 | 81 | 10000001 | n/a | n/a | n/a | n/a | | 130 | 202 | 82 | 10000010 | ‚ | ‚ | ‚ | Single low-9 quotation mark | | 131 | 203 | 83 | 10000011 | ƒ | ƒ | ƒ | Latin small letter f with hook | | 132 | 204 | 84 | 10000100 | „ | „ | „ | Double low-9 quotation mark | | 133 | 205 | 85 | 10000101 | … | … | … | Horizontal ellipsis | | 134 | 206 | 86 | 10000110 | † | † | † | Dagger | | 135 | 207 | 87 | 10000111 | ‡ | ‡ | ‡ | Double dagger | | 136 | 210 | 88 | 10001000 | ˆ | ˆ | ˆ | Modifier letter circumflex accent | | 137 | 211 | 89 | 10001001 | ‰ | ‰ | ‰ | Per mille sign | | 138 | 212 | 8A | 10001010 | Š | Š | Š | Latin capital letter S with carrot | | 139 | 213 | 8B | 10001011 | ‹ | ‹ | ‹ | Single left-pointing angle quotation | | 140 | 214 | 8C | 10001100 | Œ | Œ | Œ | Latin capital ligature OE | | 141 | 215 | 8D | 10001101 | n/a | n/a | n/a | n/a | | 142 | 216 | 8E | 10001110 | Ž | Ž | ­n/a | Latin capital letter Z with carrot | | 143 | 217 | 8F | 10001111 | n/a | n/a | n/a | n/a | | 144 | 220 | 90 | 10010000 | n/a | n/a | n/a | n/a | | 145 | 221 | 91 | 10010001 | ‘ | ‘ | ‘ | Left single quotation mark | | 146 | 222 | 92 | 10010010 | ’ | ’ | ’ | Right single quotation mark | | 147 | 223 | 93 | 10010011 | “ | “ | “ | Left double quotation mark | | 148 | 224 | 94 | 10010100 | ” | ” | ” | Right double quotation mark | | 149 | 225 | 95 | 10010101 | • | • | • | Bullet | | 150 | 226 | 96 | 10010110 | – | – | – | En dash | | 151 | 227 | 97 | 10010111 | — | — | — | Em dash | | 152 | 230 | 98 | 10011000 | ˜ | ˜ | ˜ | Small tilde | | 153 | 231 | 99 | 10011001 | ™ | ™ | ™ | Trade mark sign | | 154 | 232 | 9A | 10011010 | š | š | š | Latin small letter S with carrot | | 155 | 233 | 9B | 10011011 | › | › | › | Single right-pointing angle quotation mark | | 156 | 234 | 9C | 10011100 | œ | œ | œ | Latin small ligature oe | | 157 | 235 | 9D | 10011101 | n/a | n/a | n/a | n/a | | 158 | 236 | 9E | 10011110 | ž | ž | | Latin small letter z with carrot | | 159 | 237 | 9F | 10011111 | Ÿ | Ÿ | Ÿ | Latin capital letter Y with diaeresis | | 160 | 240 | A0 | 10100000 | n/a |   |   | Non-breaking space | | 161 | 241 | A1 | 10100001 | ¡ | ¡ | ¡ | Inverted exclamation mark | | 162 | 242 | A2 | 10100010 | ¢ | ¢ | ¢ | Cent sign | | 163 | 243 | A3 | 10100011 | £ | £ | £ | Pound sign | | 164 | 244 | A4 | 10100100 | ¤ | ¤ | ¤ | Currency sign | | 165 | 245 | A5 | 10100101 | ¥ | ¥ | ¥ | Yen sign | | 166 | 246 | A6 | 10100110 | ¦ | ¦ | ¦ | Pipe, Broken vertical bar | | 167 | 247 | A7 | 10100111 | § | § | § | Section sign | | 168 | 250 | A8 | 10101000 | ¨ | ¨ | ¨ | Spacing diaeresis - umlaut | | 169 | 251 | A9 | 10101001 | © | © | © | Copyright sign | | 170 | 252 | AA | 10101010 | ª | ª | ª | Feminine ordinal indicator | | 171 | 253 | AB | 10101011 | « | « | « | Left double angle quotes | | 172 | 254 | AC | 10101100 | ¬ | ¬ | ¬ | Not sign | | 173 | 255 | AD | 10101101 | n/a | ­ | ­ | Soft hyphen | | 174 | 256 | AE | 10101110 | ® | ® | ® | Registered trade mark sign | | 175 | 257 | AF | 10101111 | ¯ | ¯ | ¯ | Spacing macron - overline | | 176 | 260 | B0 | 10110000 | ° | ° | ° | Degree sign | | 177 | 261 | B1 | 10110001 | ± | ± | ± | Plus-or-minus sign | | 178 | 262 | B2 | 10110010 | ² | ² | ² | Superscript two - squared | | 179 | 263 | B3 | 10110011 | ³ | ³ | ³ | Superscript three - cubed | | 180 | 264 | B4 | 10110100 | ´ | ´ | ´ | Acute accent - spacing acute | | 181 | 265 | B5 | 10110101 | µ | µ | µ | Micro sign | | 182 | 266 | B6 | 10110110 | ¶ | ¶ | ¶ | Pilcrow sign - paragraph sign | | 183 | 267 | B7 | 10110111 | · | · | · | Middle dot - Georgian comma | | 184 | 270 | B8 | 10111000 | ¸ | ¸ | ¸ | Spacing cedilla | | 185 | 271 | B9 | 10111001 | ¹ | ¹ | ¹ | Superscript one | | 186 | 272 | BA | 10111010 | º | º | º | Masculine ordinal indicator | | 187 | 273 | BB | 10111011 | » | » | » | Right double angle quotes | | 188 | 274 | BC | 10111100 | ¼ | ¼ | ¼ | Fraction one quarter | | 189 | 275 | BD | 10111101 | ½ | ½ | ½ | Fraction one half | | 190 | 276 | BE | 10111110 | ¾ | ¾ | ¾ | Fraction three quarters | | 191 | 277 | BF | 10111111 | ¿ | ¿ | ¿ | Inverted question mark | | 192 | 300 | C0 | 11000000 | À | À | À | Latin capital letter A with grave | | 193 | 301 | C1 | 11000001 | Á | Á | Á | Latin capital letter A with acute | | 194 | 302 | C2 | 11000010 |  |  |  | Latin capital letter A with circumflex | | 195 | 303 | C3 | 11000011 | à | à | à | Latin capital letter A with tilde | | 196 | 304 | C4 | 11000100 | Ä | Ä | Ä | Latin capital letter A with diaeresis | | 197 | 305 | C5 | 11000101 | Å | Å | Å | Latin capital letter A with ring above | | 198 | 306 | C6 | 11000110 | Æ | Æ | Æ | Latin capital letter AE | | 199 | 307 | C7 | 11000111 | Ç | Ç | Ç | Latin capital letter C with cedilla | | 200 | 310 | C8 | 11001000 | È | È | È | Latin capital letter E with grave | | 201 | 311 | C9 | 11001001 | É | É | É | Latin capital letter E with acute | | 202 | 312 | CA | 11001010 | Ê | Ê | Ê | Latin capital letter E with circumflex | | 203 | 313 | CB | 11001011 | Ë | Ë | Ë | Latin capital letter E with diaeresis | | 204 | 314 | CC | 11001100 | Ì | Ì | Ì | Latin capital letter I with grave | | 205 | 315 | CD | 11001101 | Í | Í | Í | Latin capital letter I with acute | | 206 | 316 | CE | 11001110 | Î | Î | Î | Latin capital letter I with circumflex | | 207 | 317 | CF | 11001111 | Ï | Ï | Ï | Latin capital letter I with diaeresis | | 208 | 320 | D0 | 11010000 | Ð | Ð | Ð | Latin capital letter ETH | | 209 | 321 | D1 | 11010001 | Ñ | Ñ | Ñ | Latin capital letter N with tilde | | 210 | 322 | D2 | 11010010 | Ò | Ò | Ò | Latin capital letter O with grave | | 211 | 323 | D3 | 11010011 | Ó | Ó | Ó | Latin capital letter O with acute | | 212 | 324 | D4 | 11010100 | Ô | Ô | Ô | Latin capital letter O with circumflex | | 213 | 325 | D5 | 11010101 | Õ | Õ | Õ | Latin capital letter O with tilde | | 214 | 326 | D6 | 11010110 | Ö | Ö | Ö | Latin capital letter O with diaeresis | | 215 | 327 | D7 | 11010111 | × | × | × | Multiplication sign | | 216 | 330 | D8 | 11011000 | Ø | Ø | Ø | Latin capital letter O with slash | | 217 | 331 | D9 | 11011001 | Ù | Ù | Ù | Latin capital letter U with grave | | 218 | 332 | DA | 11011010 | Ú | Ú | Ú | Latin capital letter U with acute | | 219 | 333 | DB | 11011011 | Û | Û | Û | Latin capital letter U with circumflex | | 220 | 334 | DC | 11011100 | Ü | Ü | Ü | Latin capital letter U with diaeresis | | 221 | 335 | DD | 11011101 | Ý | Ý | Ý | Latin capital letter Y with acute | | 222 | 336 | DE | 11011110 | Þ | Þ | Þ | Latin capital letter THORN | | 223 | 337 | DF | 11011111 | ß | ß | ß | Latin small letter sharp s - ess-zed | | 224 | 340 | E0 | 11100000 | à | à | à | Latin small letter a with grave | | 225 | 341 | E1 | 11100001 | á | á | á | Latin small letter a with acute | | 226 | 342 | E2 | 11100010 | â | â | â | Latin small letter a with circumflex | | 227 | 343 | E3 | 11100011 | ã | ã | ã | Latin small letter a with tilde | | 228 | 344 | E4 | 11100100 | ä | ä | ä | Latin small letter a with diaeresis | | 229 | 345 | E5 | 11100101 | å | å | å | Latin small letter a with ring above | | 230 | 346 | E6 | 11100110 | æ | æ | æ | Latin small letter ae | | 231 | 347 | E7 | 11100111 | ç | ç | ç | Latin small letter c with cedilla | | 232 | 350 | E8 | 11101000 | è | è | è | Latin small letter e with grave | | 233 | 351 | E9 | 11101001 | é | é | é | Latin small letter e with acute | | 234 | 352 | EA | 11101010 | ê | ê | ê | Latin small letter e with circumflex | | 235 | 353 | EB | 11101011 | ë | ë | ë | Latin small letter e with diaeresis | | 236 | 354 | EC | 11101100 | ì | ì | ì | Latin small letter i with grave | | 237 | 355 | ED | 11101101 | í | í | í | Latin small letter i with acute | | 238 | 356 | EE | 11101110 | î | î | î | Latin small letter i with circumflex | | 239 | 357 | EF | 11101111 | ï | ï | ï | Latin small letter i with diaeresis | | 240 | 360 | F0 | 11110000 | ð | ð | ð | Latin small letter eth | | 241 | 361 | F1 | 11110001 | ñ | ñ | ñ | Latin small letter n with tilde | | 242 | 362 | F2 | 11110010 | ò | ò | ò | Latin small letter o with grave | | 243 | 363 | F3 | 11110011 | ó | ó | ó | Latin small letter o with acute | | 244 | 364 | F4 | 11110100 | ô | ô | ô | Latin small letter o with circumflex | | 245 | 365 | F5 | 11110101 | õ | õ | õ | Latin small letter o with tilde | | 246 | 366 | F6 | 11110110 | ö | ö | ö | Latin small letter o with diaeresis | | 247 | 367 | F7 | 11110111 | ÷ | ÷ | ÷ | Division sign | | 248 | 370 | F8 | 11111000 | ø | ø | ø | Latin small letter o with slash | | 249 | 371 | F9 | 11111001 | ù | ù | ù | Latin small letter u with grave | | 250 | 372 | FA | 11111010 | ú | ú | ú | Latin small letter u with acute | | 251 | 373 | FB | 11111011 | û | û | û | Latin small letter u with circumflex | | 252 | 374 | FC | 11111100 | ü | ü | ü | Latin small letter u with diaeresis | | 253 | 375 | FD | 11111101 | ý | ý | ý | Latin small letter y with acute | | 254 | 376 | FE | 11111110 | þ | þ | þ | Latin small letter thorn | | 255 | 377 | FF | 11111111 | ÿ | ÿ | ÿ | Latin small letter y with diaeresis | -------------------------------------------------------------------------------------------------------------------- ======================================================================================================================== ======================================================================================================================== ======================================================================================================================== ----------------------------------------------------------------------------------------------------------------------- | ASCII - IMB PC Code page 437 /*0xfffffff1*/ | ----------------------------------------------------------------------------------------------------------------------- | DEC | OCT | HEX | BIN | Symbol | HTML Number | HTML Name | Description | ----------------------------------------------------------------------------------------------------------------------- | 0 | 000 | 00 | 00000000 | NUL | n/a | n/a | Null char | | 1 | 001 | 01 | 00000001 | ☺ | n/a | n/a | Smiley | | 2 | 002 | 02 | 00000010 | ☻ | n/a | n/a | Black Smiley | | 3 | 003 | 03 | 00000011 | ♥ | n/a | n/a | Heart | | 4 | 004 | 04 | 00000100 | ♦ | n/a | n/a | Diamond | | 5 | 005 | 05 | 00000101 | ♣ | n/a | n/a | Club | | 6 | 006 | 06 | 00000110 | ♠ | n/a | n/a | Spade | | 7 | 007 | 07 | 00000111 | • | n/a | n/a | Bullet Point | | 8 | 010 | 08 | 00001000 | ◘ | n/a | n/a | Reverse Bullet Point | | 9 | 011 | 09 | 00001001 | ○ | n/a | n/a | Circle | | 10 | 012 | 0A | 00001010 | ◙ | n/a | n/a | Black Circle | | 11 | 013 | 0B | 00001011 | ♂ | n/a | n/a | Male | | 12 | 014 | 0C | 00001100 | ♀ | n/a | n/a | Female | | 13 | 015 | 0D | 00001101 | ♪ | n/a | n/a | Ti | | 14 | 016 | 0E | 00001110 | ♫ | n/a | n/a | Titi | | 15 | 017 | 0F | 00001111 | ☼ | n/a | n/a | Solar Symbol | | 16 | 020 | 10 | 00010000 | ► | n/a | n/a | Right Pointing Triangle | | 17 | 021 | 11 | 00010001 | ◄ | n/a | n/a | Left Pointing Triangle | | 18 | 022 | 12 | 00010010 | ↕ | n/a | n/a | Up-Down Arrow | | 19 | 023 | 13 | 00010011 | ‼ | n/a | n/a | Double Exclamation Mark | | 20 | 024 | 14 | 00010100 | ¶ | n/a | n/a | Pilcrow sign - paragraph sign | | 21 | 025 | 15 | 00010101 | § | n/a | n/a | Section Sign | | 22 | 026 | 16 | 00010110 | ▬ | n/a | n/a | Black Rectangle | | 23 | 027 | 17 | 00010111 | ↨ | n/a | n/a | Up-Down Arrow with Foundation | | 24 | 030 | 18 | 00011000 | ↑ | n/a | n/a | Up Arrow | | 25 | 031 | 19 | 00011001 | ↓ | n/a | n/a | Down Arrow | | 26 | 032 | 1A | 00011010 | → | n/a | n/a | Right Arrow | | 27 | 033 | 1B | 00011011 | ← | n/a | n/a | Left Arrow | | 28 | 034 | 1C | 00011100 | ∟ | n/a | n/a | 90 Degree Angle | | 29 | 035 | 1D | 00011101 | ↔ | n/a | n/a | Left-right Pointing Arrow | | 30 | 036 | 1E | 00011110 | ▲ | n/a | n/a | Up Pointing Arrow | | 31 | 037 | 1F | 00011111 | ▼ | n/a | n/a | Down Pointing Arrow | | 32 | 040 | 20 | 00100000 | n/a | | n/a | Space | | 33 | 041 | 21 | 00100001 | ! | ! | n/a | Exclamation mark | | 34 | 042 | 22 | 00100010 | " | " | n/a | Double quotes (or speech marks) | | 35 | 043 | 23 | 00100011 | # | # | " | Number | | 36 | 044 | 24 | 00100100 | $ | $ | n/a | Dollar | | 37 | 045 | 25 | 00100101 | % | % | n/a | Per cent sign | | 38 | 046 | 26 | 00100110 | & | & | n/a | Ampersand | | 39 | 047 | 27 | 00100111 | ' | ' | & | Single quote | | 40 | 050 | 28 | 00101000 | ( | ( | n/a | Open parenthesis (or open bracket) | | 41 | 051 | 29 | 00101001 | ) | ) | n/a | Close parenthesis (or close bracket) | | 42 | 052 | 2A | 00101010 | * | * | n/a | Asterisk | | 43 | 053 | 2B | 00101011 | + | + | n/a | Plus | | 44 | 054 | 2C | 00101100 | , | , | n/a | Comma | | 45 | 055 | 2D | 00101101 | - | - | n/a | Hyphen | | 46 | 056 | 2E | 00101110 | . | . | n/a | Period, dot or full stop | | 47 | 057 | 2F | 00101111 | / | / | n/a | Slash or divide | | 48 | 060 | 30 | 00110000 | 0 | 0 | n/a | Zero | | 49 | 061 | 31 | 00110001 | 1 | 1 | n/a | One | | 50 | 062 | 32 | 00110010 | 2 | 2 | n/a | Two | | 51 | 063 | 33 | 00110011 | 3 | 3 | n/a | Three | | 52 | 064 | 34 | 00110100 | 4 | 4 | n/a | Four | | 53 | 065 | 35 | 00110101 | 5 | 5 | n/a | Five | | 54 | 066 | 36 | 00110110 | 6 | 6 | n/a | Six | | 55 | 067 | 37 | 00110111 | 7 | 7 | n/a | Seven | | 56 | 070 | 38 | 00111000 | 8 | 8 | n/a | Eight | | 57 | 071 | 39 | 00111001 | 9 | 9 | n/a | Nine | | 58 | 072 | 3A | 00111010 | : | : | n/a | Colon | | 59 | 073 | 3B | 00111011 | ; | ; | n/a | Semicolon | | 60 | 074 | 3C | 00111100 | < | < | n/a | Less than (or open angled bracket) | | 61 | 075 | 3D | 00111101 | = | = | < | Equals | | 62 | 076 | 3E | 00111110 | > | > | n/a | Greater than (or close angled bracket) | | 63 | 077 | 3F | 00111111 | ? | ? | > | Question mark | | 64 | 100 | 40 | 01000000 | @ | @ | n/a | At symbol | | 65 | 101 | 41 | 01000001 | A | A | n/a | Uppercase A | | 66 | 102 | 42 | 01000010 | B | B | n/a | Uppercase B | | 67 | 103 | 43 | 01000011 | C | C | n/a | Uppercase C | | 68 | 104 | 44 | 01000100 | D | D | n/a | Uppercase D | | 69 | 105 | 45 | 01000101 | E | E | n/a | Uppercase E | | 70 | 106 | 46 | 01000110 | F | F | n/a | Uppercase F | | 71 | 107 | 47 | 01000111 | G | G | n/a | Uppercase G | | 72 | 110 | 48 | 01001000 | H | H | n/a | Uppercase H | | 73 | 111 | 49 | 01001001 | I | I | n/a | Uppercase I | | 74 | 112 | 4A | 01001010 | J | J | n/a | Uppercase J | | 75 | 113 | 4B | 01001011 | K | K | n/a | Uppercase K | | 76 | 114 | 4C | 01001100 | L | L | n/a | Uppercase L | | 77 | 115 | 4D | 01001101 | M | M | n/a | Uppercase M | | 78 | 116 | 4E | 01001110 | N | N | n/a | Uppercase N | | 79 | 117 | 4F | 01001111 | O | O | n/a | Uppercase O | | 80 | 120 | 50 | 01010000 | P | P | n/a | Uppercase P | | 81 | 121 | 51 | 01010001 | Q | Q | n/a | Uppercase Q | | 82 | 122 | 52 | 01010010 | R | R | n/a | Uppercase R | | 83 | 123 | 53 | 01010011 | S | S | n/a | Uppercase S | | 84 | 124 | 54 | 01010100 | T | T | n/a | Uppercase T | | 85 | 125 | 55 | 01010101 | U | U | n/a | Uppercase U | | 86 | 126 | 56 | 01010110 | V | V | n/a | Uppercase V | | 87 | 127 | 57 | 01010111 | W | W | n/a | Uppercase W | | 88 | 130 | 58 | 01011000 | X | X | n/a | Uppercase X | | 89 | 131 | 59 | 01011001 | Y | Y | n/a | Uppercase Y | | 90 | 132 | 5A | 01011010 | Z | Z | n/a | Uppercase Z | | 91 | 133 | 5B | 01011011 | [ | [ | n/a | Opening bracket | | 92 | 134 | 5C | 01011100 | \ | \ | n/a | Backslash | | 93 | 135 | 5D | 01011101 | ] | ] | n/a | Closing bracket | | 94 | 136 | 5E | 01011110 | ^ | ^ | n/a | Caret - circumflex | | 95 | 137 | 5F | 01011111 | _ | _ | n/a | Underscore | | 96 | 140 | 60 | 01100000 | ` | ` | n/a | Grave accent | | 97 | 141 | 61 | 01100001 | a | a | n/a | Lowercase a | | 98 | 142 | 62 | 01100010 | b | b | n/a | Lowercase b | | 99 | 143 | 63 | 01100011 | c | c | n/a | Lowercase c | | 100 | 144 | 64 | 01100100 | d | d | n/a | Lowercase d | | 101 | 145 | 65 | 01100101 | e | e | n/a | Lowercase e | | 102 | 146 | 66 | 01100110 | f | f | n/a | Lowercase f | | 103 | 147 | 67 | 01100111 | g | g | n/a | Lowercase g | | 104 | 150 | 68 | 01101000 | h | h | n/a | Lowercase h | | 105 | 151 | 69 | 01101001 | i | i | n/a | Lowercase i | | 106 | 152 | 6A | 01101010 | j | j | n/a | Lowercase j | | 107 | 153 | 6B | 01101011 | k | k | n/a | Lowercase k | | 108 | 154 | 6C | 01101100 | l | l | n/a | Lowercase l | | 109 | 155 | 6D | 01101101 | m | m | n/a | Lowercase m | | 110 | 156 | 6E | 01101110 | n | n | n/a | Lowercase n | | 111 | 157 | 6F | 01101111 | o | o | n/a | Lowercase o | | 112 | 160 | 70 | 01110000 | p | p | n/a | Lowercase p | | 113 | 161 | 71 | 01110001 | q | q | n/a | Lowercase q | | 114 | 162 | 72 | 01110010 | r | r | n/a | Lowercase r | | 115 | 163 | 73 | 01110011 | s | s | n/a | Lowercase s | | 116 | 164 | 74 | 01110100 | t | t | n/a | Lowercase t | | 117 | 165 | 75 | 01110101 | u | u | n/a | Lowercase u | | 118 | 166 | 76 | 01110110 | v | v | n/a | Lowercase v | | 119 | 167 | 77 | 01110111 | w | w | n/a | Lowercase w | | 120 | 170 | 78 | 01111000 | x | x | n/a | Lowercase x | | 121 | 171 | 79 | 01111001 | y | y | n/a | Lowercase y | | 122 | 172 | 7A | 01111010 | z | z | n/a | Lowercase z | | 123 | 173 | 7B | 01111011 | { | { | n/a | Opening brace | | 124 | 174 | 7C | 01111100 | | | | | n/a | Vertical bar | | 125 | 175 | 7D | 01111101 | } | } | n/a | Closing brace | | 126 | 176 | 7E | 01111110 | ~ | ~ | n/a | Equivalency sign - tilde | | 127 | 177 | 7F | 01111111 | ⌂ |  | n/a | Miscellaneous Technical | | 128 | 200 | 80 | 10000000 | Ç | € | n/a | Latin capital letter c with cedilla | | 129 | 201 | 81 | 10000001 | ü | n/a | n/a | Latin small letter u with diaeresis | | 130 | 202 | 82 | 10000010 | é | ‚ | n/a | Latin small letter e with acute | | 131 | 203 | 83 | 10000011 | â | ƒ | n/a | Latin small letter a with circumflex | | 132 | 204 | 84 | 10000100 | ä | „ | n/a | Latin small letter a with diaeresis | | 133 | 205 | 85 | 10000101 | à | … | n/a | Latin small letter a with grave | | 134 | 206 | 86 | 10000110 | å | † | n/a | Latin small letter a with ring above | | 135 | 207 | 87 | 10000111 | ç | ‡ | n/a | Latin small letter c with cedilla | | 136 | 210 | 88 | 10001000 | ê | ˆ | n/a | Latin small letter e with circumflex | | 137 | 211 | 89 | 10001001 | ë | ‰ | n/a | Latin small letter e with diaeresis | | 138 | 212 | 8A | 10001010 | è | Š | n/a | Latin small letter e with grave | | 139 | 213 | 8B | 10001011 | ï | ‹ | n/a | Latin small letter i with diaeresis | | 140 | 214 | 8C | 10001100 | î | Œ | n/a | Latin small letter i with circumflex | | 141 | 215 | 8D | 10001101 | ì | n/a | n/a | Latin small letter i with grave | | 142 | 216 | 8E | 10001110 | Ä | Ž | n/a | Latin capital letter a with diaeresiS | | 143 | 217 | 8F | 10001111 | Å | n/a | n/a | Latin capital letter a with ring above | | 144 | 220 | 90 | 10010000 | É | n/a | n/a | Latin capital letter e with acute | | 145 | 221 | 91 | 10010001 | æ | ‘ | n/a | Latin small letter ae | | 146 | 222 | 92 | 10010010 | Æ | ’ | n/a | Latin capital letter ae | | 147 | 223 | 93 | 10010011 | ô | “ | n/a | Latin small letter o with circumflex | | 148 | 224 | 94 | 10010100 | ö | ” | n/a | Latin small letter o with diaeresis | | 149 | 225 | 95 | 10010101 | ò | • | n/a | Latin small letter o with grave | | 150 | 226 | 96 | 10010110 | û | – | n/a | Latin small letter u with circumflex | | 151 | 227 | 97 | 10010111 | ù | — | n/a | Latin small letter u with grave | | 152 | 230 | 98 | 10011000 | ÿ | ˜ | n/a | Latin small letter y with diaeresis | | 153 | 231 | 99 | 10011001 | Ö | ™ | n/a | Latin capital letter o with diaeresiS | | 154 | 232 | 9A | 10011010 | Ü | š | n/a | Latin capital letter u with diaeresiS | | 155 | 233 | 9B | 10011011 | ¢ | › | n/a | Cent sign | | 156 | 234 | 9C | 10011100 | £ | œ | n/a | Pound sign | | 157 | 235 | 9D | 10011101 | ¥ | n/a | n/a | Yen sign | | 158 | 236 | 9E | 10011110 | ₧ | ž | n/a | PEseta sign | | 159 | 237 | 9F | 10011111 | ƒ | Ÿ | n/a | Latin small letter f with hook | | 160 | 240 | A0 | 10100000 | á |   | n/a | Latin small letter a with acute | | 161 | 241 | A1 | 10100001 | í | ¡ | n/a | Latin small letter i with acute | | 162 | 242 | A2 | 10100010 | ó | ¢ | n/a | Latin small letter o with acute | | 163 | 243 | A3 | 10100011 | ú | £ | n/a | Latin small letter u with acute | | 164 | 244 | A4 | 10100100 | ñ | ¤ | n/a | Latin small letter n with tilde | | 165 | 245 | A5 | 10100101 | Ñ | ¥ | n/a | Latin capital letter n with tilde | | 166 | 246 | A6 | 10100110 | ª | ¦ | n/a | Feminine ordinal indicator | | 167 | 247 | A7 | 10100111 | º | § | n/a | Masculine ordinal indicator | | 168 | 250 | A8 | 10101000 | ¿ | ¨ | n/a | Inverted question mark | | 169 | 251 | A9 | 10101001 | ⌐ | © | n/a | Reversed not sign | | 170 | 252 | AA | 10101010 | ¬ | ª | n/a | Not sign | | 171 | 253 | AB | 10101011 | ½ | « | n/a | Vulgar fraction one half | | 172 | 254 | AC | 10101100 | ¼ | ¬ | n/a | Vulgar fraction one quarter | | 173 | 255 | AD | 10101101 | ¡ | ­ | n/a | Inverted exclamation mark | | 174 | 256 | AE | 10101110 | « | ® | n/a | Left-pointing double angle quotation mark | | 175 | 257 | AF | 10101111 | » | ¯ | n/a | Right-pointing double angle quotation mark | | 176 | 260 | B0 | 10110000 | ░ | ° | n/a | Light shade | | 177 | 261 | B1 | 10110001 | ▒ | ± | n/a | Medium shade | | 178 | 262 | B2 | 10110010 | ▓ | ² | n/a | Dark shade | | 179 | 263 | B3 | 10110011 | │ | ³ | n/a | Box drawings light vertical | | 180 | 264 | B4 | 10110100 | ┤ | ´ | n/a | Box drawings light vertical and left | | 181 | 265 | B5 | 10110101 | ╡ | µ | n/a | Box drawings vertical single and left double | | 182 | 266 | B6 | 10110110 | ╢ | ¶ | n/a | Box drawings vertical double and left single | | 183 | 267 | B7 | 10110111 | ╖ | · | n/a | Box drawings down double and left single | | 184 | 270 | B8 | 10111000 | ╕ | ¸ | n/a | Box drawings down single and left double | | 185 | 271 | B9 | 10111001 | ╣ | ¹ | n/a | Box drawings double vertical and left | | 186 | 272 | BA | 10111010 | ║ | º | n/a | Box drawings double vertical | | 187 | 273 | BB | 10111011 | ╗ | » | n/a | Box drawings double down and left | | 188 | 274 | BC | 10111100 | ╝ | ¼ | n/a | Box drawings double up and left | | 189 | 275 | BD | 10111101 | ╜ | ½ | n/a | Box drawings up double and left single | | 190 | 276 | BE | 10111110 | ╛ | ¾ | n/a | Box drawings up single and left double | | 191 | 277 | BF | 10111111 | ┐ | ¿ | n/a | Box drawings light down and left | | 192 | 300 | C0 | 11000000 | └ | À | n/a | Box drawings light up and right | | 193 | 301 | C1 | 11000001 | ┴ | Á | n/a | Box drawings light up and horizontal | | 194 | 302 | C2 | 11000010 | ┬ |  | n/a | Box drawings light down and horizontal | | 195 | 303 | C3 | 11000011 | ├ | à | n/a | Box drawings light vertical and right | | 196 | 304 | C4 | 11000100 | ─ | Ä | n/a | Box drawings light horizontal | | 197 | 305 | C5 | 11000101 | ┼ | Å | n/a | Box drawings light vertical and horizontal | | 198 | 306 | C6 | 11000110 | ╞ | Æ | n/a | Box drawings vertical single and right double | | 199 | 307 | C7 | 11000111 | ╟ | Ç | n/a | Box drawings vertical double and right single | | 200 | 310 | C8 | 11001000 | ╚ | È | n/a | Box drawings double up and right | | 201 | 311 | C9 | 11001001 | ╔ | É | n/a | Box drawings double down and right | | 202 | 312 | CA | 11001010 | ╩ | Ê | n/a | Box drawings double up and horizontal | | 203 | 313 | CB | 11001011 | ╦ | Ë | n/a | Box drawings double down and horizontal | | 204 | 314 | CC | 11001100 | ╠ | Ì | n/a | Box drawings double vertical and right | | 205 | 315 | CD | 11001101 | ═ | Í | n/a | Box drawings double horizontal | | 206 | 316 | CE | 11001110 | ╬ | Î | n/a | Box drawings double vertical and horizontal | | 207 | 317 | CF | 11001111 | ╧ | Ï | n/a | Box drawings up single and horizontal double | | 208 | 320 | D0 | 11010000 | ╨ | Ð | n/a | Box drawings up double and horizontal single | | 209 | 321 | D1 | 11010001 | ╤ | Ñ | n/a | Box drawings down single and horizontal double| | 210 | 322 | D2 | 11010010 | ╥ | Ò | n/a | Box drawings down double and horizontal single| | 211 | 323 | D3 | 11010011 | ╙ | Ó | n/a | Box drawings up double and right single | | 212 | 324 | D4 | 11010100 | ╘ | Ô | n/a | Box drawings up single and right double | | 213 | 325 | D5 | 11010101 | ╒ | Õ | n/a | Box drawings down single and right double | | 214 | 326 | D6 | 11010110 | ╓ | Ö | n/a | Box drawings down double and right single | | 215 | 327 | D7 | 11010111 | ╫ | × | n/a | Box drawings vertical and horizontal | | 216 | 330 | D8 | 11011000 | ╪ | Ø | n/a | Box drawings vertical and horizontal | | 217 | 331 | D9 | 11011001 | ┘ | Ù | n/a | Box drawings light up and left | | 218 | 332 | DA | 11011010 | ┌ | Ú | n/a | Box drawings light down and right | | 219 | 333 | DB | 11011011 | █ | Û | n/a | Full block | | 220 | 334 | DC | 11011100 | ▄ | Ü | n/a | Lower half block | | 221 | 335 | DD | 11011101 | ▌ | Ý | n/a | Left half block | | 222 | 336 | DE | 11011110 | ▐ | Þ | n/a | Right half block | | 223 | 337 | DF | 11011111 | ▀ | ß | n/a | Upper half block | | 224 | 340 | E0 | 11100000 | α | à | n/a | Greek small letter alpha | | 225 | 341 | E1 | 11100001 | ß | á | n/a | Latin small letter sharp | | 226 | 342 | E2 | 11100010 | Γ | â | n/a | Greek capital letter gamma | | 227 | 343 | E3 | 11100011 | π | ã | n/a | Greek small letter pi | | 228 | 344 | E4 | 11100100 | Σ | ä | n/a | Greek capital letter sigma | | 229 | 345 | E5 | 11100101 | σ | å | n/a | Greek small letter sigma | | 230 | 346 | E6 | 11100110 | µ | æ | n/a | Micro sign | | 231 | 347 | E7 | 11100111 | τ | ç | n/a | Greek small letter tau | | 232 | 350 | E8 | 11101000 | Φ | è | n/a | Greek capital letter phi | | 233 | 351 | E9 | 11101001 | Θ | é | n/a | Greek capital letter theta | | 234 | 352 | EA | 11101010 | Ω | ê | n/a | Greek capital letter omega | | 235 | 353 | EB | 11101011 | δ | ë | n/a | Greek small letter delta | | 236 | 354 | EC | 11101100 | ∞ | ì | n/a | Infinity | | 237 | 355 | ED | 11101101 | φ | í | n/a | Greek small letter phi | | 238 | 356 | EE | 11101110 | ε | î | n/a | Greek small letter epsilon | | 239 | 357 | EF | 11101111 | ∩ | ï | n/a | Intersection | | 240 | 360 | F0 | 11110000 | ≡ | ð | n/a | Identical to | | 241 | 361 | F1 | 11110001 | ± | ñ | n/a | Plus-minus sign | | 242 | 362 | F2 | 11110010 | ≥ | ò | n/a | Greater-than or equal to | | 243 | 363 | F3 | 11110011 | ≤ | ó | n/a | Less-than or equal to | | 244 | 364 | F4 | 11110100 | ⌠ | ô | n/a | Top half integral | | 245 | 365 | F5 | 11110101 | ⌡ | õ | n/a | Bottom half integral | | 246 | 366 | F6 | 11110110 | ÷ | ö | n/a | Division sign | | 247 | 367 | F7 | 11110111 | ≈ | ÷ | n/a | Almost equal to | | 248 | 370 | F8 | 11111000 | ° | ø | n/a | Degree sign | | 249 | 371 | F9 | 11111001 | ∙ | ù | n/a | Bullet operator | | 250 | 372 | FA | 11111010 | · | ú | n/a | Middle dot | | 251 | 373 | FB | 11111011 | √ | û | n/a | Square root | | 252 | 374 | FC | 11111100 | ⁿ | ü | n/a | Superscript Latin small letter n | | 253 | 375 | FD | 11111101 | ² | ý | n/a | Superscript two | | 254 | 376 | FE | 11111110 | ■ | þ | n/a | Black square | | 255 | 377 | FF | 11111111 | n/a | ÿ | n/a | No-break space | ----------------------------------------------------------------------------------------------------------------------- UNICODE:"Universal Coded Character Set" • completely ASCII compatible on systems that align to bytes (ie. all) Standards: • UTF == "Unicode Transformation Format" • the number represents the code unit length in bits • all are capable of representing all unicode chars utf_8: • variable length • completely backwards compatible with ascii • a single char is 1-4 bytes long • a symbol can technically be longer than 4 bytes, if its created from multiple unicode chars (usually 2) ○ formal notation U+[hex-1][hex-2][hex-3][hex-4] +------------------------------------------------------------------------------------+ | Code point <-> UTF-8 conversion | +------------------+------------------+-----------+-----------+-----------+----------+ | First code point | Last code point | Byte 1 | Byte 2 | Byte 3 | Byte 4 | +------------------+------------------+-----------+-----------+-----------+----------+ | U+0000 | U+007F | 0xxxxxxx | | | | | U+0080 | U+07FF | 110xxxxx | 10xxxxxx | | | | U+0800 | U+FFFF | 1110xxxx | 10xxxxxx | 10xxxxxx | | | U+10000 | U+10FFFF | 11110xxx | 10xxxxxx | 10xxxxxx | 10xxxxxx | +------------------+------------------+-----------+-----------+-----------+----------+ utf_16: • variable length • uses one or two 16 bit "code units" • had some adoption, but is being slowly replaced in favour of utf8 utf_32: • fixed length • every character is stored on 32 bits and is simply map-ed to a symbol • basically a very large ascii table • never had any substantial adoption and never will have