. "UTF-8" . . . . "UTF-8 is a character encoding capable of encoding all possible characters, or code points, defined by Unicode and originally designed by Ken Thompson and Rob Pike. The encoding is variable-length and uses 8-bit code units. It was designed for backward compatibility with ASCII and to avoid the complications of endianness and byte order marks in the alternative UTF-16 and UTF-32 encodings. The name is derived from Unicode (or Universal Coded Character Set) Transformation Format \u2013 8-bit." . . . . . . 
"UTF-8 \u0647\u064A \u0627\u062E\u062A\u0635\u0627\u0631 \u0644\u0644\u062C\u0645\u0644\u0629 (8-bit Unicode Transformation Format) \u0648\u062A\u0631\u062C\u0645\u062A\u0647\u0627 (\u0635\u064A\u063A\u0629 \u062A\u062D\u0648\u064A\u0644 \u0646\u0638\u0627\u0645 \u0627\u0644\u062D\u0631\u0648\u0641 \u0627\u0644\u062F\u0648\u0644\u064A \u0627\u0644\u0645\u0648\u062D\u062F \u0628\u0642\u0648\u0629 8 \u0628\u062A)\u060C \u0647\u0630\u0627 \u0627\u0644\u062A\u0631\u0645\u064A\u0632 \u0648\u0636\u0639 \u0645\u0646 \u0642\u0628\u0644 \u0643\u0644 \u0645\u0646 \u0631\u0648\u0628 \u0628\u0627\u064A\u0643 \u0648\u0643\u064A\u0646 \u062A\u0648\u0645\u0633\u0646 \u0644\u062A\u0645\u062B\u064A\u0644 \u0645\u0639\u064A\u0627\u0631 \u0646\u0638\u0627\u0645 \u0627\u0644\u062D\u0631\u0648\u0641 \u0627\u0644\u062F\u0648\u0644\u064A \u0627\u0644\u0645\u0648\u062D\u062F \u0644\u0644\u062D\u0631\u0648\u0641 \u0627\u0644\u0623\u0628\u062C\u062F\u064A\u0629 \u0644\u0623\u063A\u0644\u0628 \u0644\u063A\u0627\u062A \u0627\u0644\u0639\u0627\u0644\u0645\u060C \u0648\u064A\u062A\u0645 \u062A\u0634\u0641\u064A\u0631 \u0627\u0644\u0631\u0645\u0648\u0632 \u0641\u064A\u0647\u0627 \u0641\u064A \u062D\u062C\u0645 \u064A\u062A\u0631\u0627\u0648\u062D \u0628\u064A\u0646 \u0628\u0627\u064A\u062A \u0648\u0627\u062D\u062F \u06484 \u0628\u0627\u064A\u062A \u0644\u0644\u0631\u0645\u0632 \u0627\u0644\u0648\u0627\u062D\u062F. \u064A\u062A\u0645 \u062A\u062D\u062F\u064A\u062F \u0637\u0648\u0644 \u062A\u0634\u0641\u064A\u0631 \u0627\u0644\u0631\u0645\u0632 \u0628\u062D\u0633\u0628 \u0627\u0644\u0634\u0643\u0644 \u0627\u0644\u0622\u062A\u064A:"@ar . "32188"^^ . . . "UTF-8" . . "UTF-8 (8-bit Unicode Transformation Format) is een manier om Unicode/ISO 10646-tekens op te slaan als een stroom van bytes, een zogenaamde tekencodering. Alternatieven zijn UTF-16 en UTF-32. UTF-8 is een tekencodering met variabele lengte: niet elk teken gebruikt evenveel bytes. 
Afhankelijk van het teken worden 1 tot 4 bytes gebruikt. Voor het vastleggen van elk van de 128 Basic ASCII-tekens (0--127) zijn slechts 7 bits nodig. De functie van de oorspronkelijke 8e parity-/strobe- bit werd al snel overbodig. Basic ASCII bestaat uit slechts \u00E9\u00E9n byte waarvan het hoogste bit altijd een nul is. De Extended ASCII-tekenset(128-255) is een aanvulling op Basic-ASCII, en bevat beeldschermtype (EGA/VGA) en land-afhankelijke tekens. Extended ASCII bestaat uit dezelfde byte met het hoogste bit altijd een \u00E9\u00E9n." . "UTF-8 (Unicode Transformation Format, 8 bit) \u00E8 una codifica dei caratteri Unicode in sequenze di lunghezza variabile di byte, creata da Rob Pike e Ken Thompson. UTF-8 usa gruppi di byte per rappresentare i caratteri Unicode, ed \u00E8 particolarmente utile per il trasferimento tramite sistemi di posta elettronica a 8-bit." . . "UTF-8" . . "UTF-8\uFF088-bit Unicode Transformation Format\uFF09\u662F\u4E00\u7A2E\u91DD\u5C0DUnicode\u7684\u53EF\u8B8A\u9577\u5EA6\u5B57\u5143\u7DE8\u78BC\uFF0C\u4E5F\u662F\u4E00\u79CD\u524D\u7F00\u7801\u3002\u5B83\u53EF\u4EE5\u7528\u4F86\u8868\u793AUnicode\u6A19\u6E96\u4E2D\u7684\u4EFB\u4F55\u5B57\u5143\uFF0C\u4E14\u5176\u7DE8\u78BC\u4E2D\u7684\u7B2C\u4E00\u500B\u4F4D\u5143\u7D44\u4ECD\u8207ASCII\u76F8\u5BB9\uFF0C\u9019\u4F7F\u5F97\u539F\u4F86\u8655\u7406ASCII\u5B57\u5143\u7684\u8EDF\u9AD4\u7121\u9808\u6216\u53EA\u9808\u505A\u5C11\u90E8\u4EFD\u4FEE\u6539\uFF0C\u5373\u53EF\u7E7C\u7E8C\u4F7F\u7528\u3002\u56E0\u6B64\uFF0C\u5B83\u9010\u6F38\u6210\u70BA\u96FB\u5B50\u90F5\u4EF6\u3001\u7DB2\u9801\u53CA\u5176\u4ED6\u5132\u5B58\u6216\u50B3\u9001\u6587\u5B57\u7684\u61C9\u7528\u4E2D\uFF0C\u512A\u5148\u63A1\u7528\u7684\u7DE8\u78BC\u3002 UTF-8\u4F7F\u7528\u4E00\u81F3\u516D\u500B\u4F4D\u5143\u7D44\u70BA\u6BCF\u500B\u5B57\u7B26\u7DE8\u78BC\uFF08\u5C3D\u7BA1\u5982\u6B64\uFF0C2003\u5E7411\u6708UTF-8\u88ABRFC 
3629\u91CD\u65B0\u89C4\u8303\uFF0C\u53EA\u80FD\u4F7F\u7528\u539F\u6765Unicode\u5B9A\u4E49\u7684\u533A\u57DF\uFF0CU+0000\u5230U+10FFFF\uFF0C\u4E5F\u5C31\u662F\u8BF4\u6700\u591A\u56DB\u500B\u5B57\u8282\uFF09\uFF1A \u5C0D\u4E0A\u8FF0\u63D0\u53CA\u7684\u7B2C\u56DB\u7A2E\u5B57\u5143\u800C\u8A00\uFF0CUTF-8\u4F7F\u7528\u56DB\u81F3\u516D\u500B\u4F4D\u5143\u7D44\u4F86\u7DE8\u78BC\u4F3C\u4E4E\u592A\u8017\u8CBB\u8CC7\u6E90\u4E86\u3002\u4F46UTF-8\u5C0D\u6240\u6709\u5E38\u7528\u7684\u5B57\u5143\u90FD\u53EF\u4EE5\u7528\u4E09\u500B\u4F4D\u5143\u7D44\u8868\u793A\uFF0C\u800C\u4E14\u5B83\u7684\u53E6\u4E00\u7A2E\u9078\u64C7\uFF0CUTF-16\u7DE8\u78BC\uFF0C\u5C0D\u524D\u8FF0\u7684\u7B2C\u56DB\u7A2E\u5B57\u7B26\u540C\u6A23\u9700\u8981\u56DB\u500B\u4F4D\u5143\u7D44\u4F86\u7DE8\u78BC\uFF0C\u6240\u4EE5\u8981\u6C7A\u5B9AUTF-8\u6216UTF-16\u54EA\u7A2E\u7DE8\u78BC\u6BD4\u8F03\u6709\u6548\u7387\uFF0C\u9084\u8981\u8996\u6240\u4F7F\u7528\u7684\u5B57\u5143\u7684\u5206\u4F48\u7BC4\u570D\u800C\u5B9A\u3002\u4E0D\u904E\uFF0C\u5982\u679C\u4F7F\u7528\u4E00\u4E9B\u50B3\u7D71\u7684\u58D3\u7E2E\u7CFB\u7D71\uFF0C\u6BD4\u5982DEFLATE\uFF0C\u5247\u9019\u4E9B\u4E0D\u540C\u7DE8\u78BC\u7CFB\u7D71\u9593\u7684\u7684\u5DEE\u7570\u5C31\u8B8A\u5F97\u5FAE\u4E0D\u8DB3\u9053\u4E86\u3002\u82E5\u9867\u53CA\u50B3\u7D71\u58D3\u7E2E\u7B97\u6CD5\u5728\u58D3\u7E2E\u8F03\u77ED\u6587\u5B57\u4E0A\u7684\u6548\u679C\u4E0D\u5927\uFF0C\u53EF\u4EE5\u8003\u616E\u4F7F\u7528Unicode\u6A19\u6E96\u58D3\u7E2E\u683C\u5F0F\uFF08SCSU\uFF09\u3002" . "UTF-8 (Abk. f\u00FCr 8-Bit UCS Transformation Format, wobei UCS wiederum Universal Character Set abk\u00FCrzt) ist die am weitesten verbreitete Kodierung f\u00FCr Unicode-Zeichen (Unicode und UCS sind praktisch identisch). Die Kodierung wurde im September 1992 von Ken Thompson und Rob Pike bei Arbeiten am Plan-9-Betriebssystem festgelegt. 
Die Kodierung wurde zun\u00E4chst im Rahmen von X/Open als FSS-UTF (filesystem safe UTF in Abgrenzung zu UTF-1, das diese Eigenschaft nicht hat) bezeichnet, in den Folgejahren erfolgte im Rahmen der Standardisierung die Umbenennung auf die heute \u00FCbliche Bezeichnung UTF-8. UTF-8 ist in den ersten 128 Zeichen (Indizes 0\u2013127) deckungsgleich mit ASCII und eignet sich mit in der Regel nur einem Byte Speicherbedarf f\u00FCr Zeichen vieler westlicher Sprachen besonders f\u00FCr die Kodierung englischsprachiger Texte, die sich im Regelfall ohne Modifikation daher sogar mit nicht-UTF-8-f\u00E4higen Texteditoren ohne Beeintr\u00E4chtigung bearbeiten lassen, was einen der Gr\u00FCnde f\u00FCr den Status als De-facto-Standard-Zeichenkodierung des Internets und damit verbundener Dokumenttypen darstellt. Im Oktober 2016 verwendeten 87,7 % aller Websites UTF-8. In anderen Sprachen ist der Speicherbedarf in Byte pro Zeichen gr\u00F6\u00DFer, wenn diese vom ASCII-Zeichensatz abweichen: Bereits die deutschen Umlaute erfordern zwei Byte; kyrillische, fern\u00F6stliche und Sprachen aus dem afrikanischen Raum belegen bis zu 4 Byte je Zeichen. Da die Verarbeitung von UTF-8 als Multibyte-Zeichenfolge wegen der notwendigen Analyse jedes Bytes im Vergleich zu Zeichenkodierungen mit fester Byteanzahl je Zeichen mehr Rechenaufwand und f\u00FCr bestimmte Sprachen auch mehr Speicherplatz erfordert, werden abh\u00E4ngig vom Einsatzszenario auch andere UTF-Kodierungen zur Abbildung von UNICODE-Zeichens\u00E4tzen verwendet: Microsoft Windows als meistgenutztes Desktop-Betriebssystem verwendet intern als Kompromiss zwischen UTF-8 und UTF-32 etwa UTF-16 Little Endian." . . "UTF-8" . "UTF-8 (8-bit Unicode Transformation Format) es un formato de codificaci\u00F3n de caracteres Unicode e ISO 10646 utilizando s\u00EDmbolos de longitud variable. UTF-8 fue creado por Robert C. Pike y Kenneth L. Thompson. Est\u00E1 definido como est\u00E1ndar por la de la Internet Engineering Task Force (IETF). 
Actualmente es una de las tres posibilidades de codificaci\u00F3n reconocidas por Unicode y lenguajes web, o cuatro en ISO 10646. Sus caracter\u00EDsticas principales son:" . "UTF-8\uFF08\u30E6\u30FC\u30C6\u30A3\u30FC\u30A8\u30D5\u306F\u3061\u3001\u30E6\u30FC\u30C6\u30A3\u30FC\u30A8\u30D5\u30A8\u30A4\u30C8\uFF09\u306FISO/IEC 10646 (UCS) \u3068Unicode\u3067\u4F7F\u3048\u308B8\u30D3\u30C3\u30C8\u7B26\u53F7\u5358\u4F4D\u306E\u6587\u5B57\u7B26\u53F7\u5316\u5F62\u5F0F\u53CA\u3073\u6587\u5B57\u7B26\u53F7\u5316\u30B9\u30AD\u30FC\u30E0\u3002 \u6B63\u5F0F\u540D\u79F0\u306F\u3001ISO/IEC 10646\u3067\u306F \u201CUCS Transformation Format 8\u201D\u3001Unicode\u3067\u306F \u201CUnicode Transformation Format-8\u201D \u3068\u3044\u3046\u3002\u4E21\u8005\u306FISO/IEC 10646\u3068Unicode\u306E\u30B3\u30FC\u30C9\u91CD\u8907\u7BC4\u56F2\u3067\u4E92\u63DB\u6027\u304C\u3042\u308B\u3002RFC\u306B\u3082\u4ED5\u69D8\u304C\u3042\u308B\u3002 2\u30D0\u30A4\u30C8\u76EE\u4EE5\u964D\u306B\u300C/\u300D\u306A\u3069\u306EASCII\u6587\u5B57\u304C\u73FE\u308C\u306A\u3044\u3088\u3046\u306B\u5DE5\u592B\u3055\u308C\u3066\u3044\u308B\u3053\u3068\u304B\u3089\u3001UTF-FSS (File System Safe) \u3068\u3082\u3044\u308F\u308C\u308B\u3002\u65E7\u540D\u79F0\u306FUTF-2\u3002 \u30C7\u30FC\u30BF\u4EA4\u63DB\u65B9\u5F0F\u3001\u30D5\u30A1\u30A4\u30EB\u5F62\u5F0F\u3068\u3057\u3066\u3001\u4E00\u822C\u7684\u306BUTF-8\u306F\u4F7F\u308F\u308C\u308B\u50BE\u5411\u306B\u3042\u308B\u3002 \u5F53\u521D\u306F\u3001\u30D9\u30EB\u7814\u7A76\u6240\u306B\u304A\u3044\u3066Plan 9\u3067\u7528\u3044\u308B\u30A8\u30F3\u30B3\u30FC\u30C9\u3068\u3057\u3066\u3001\u30ED\u30D6\u30FB\u30D1\u30A4\u30AF\u306B\u3088\u308B\u8A2D\u8A08\u6307\u91DD\u306E\u3082\u3068\u3001\u30B1\u30F3\u30FB\u30C8\u30F3\u30D7\u30BD\u30F3\u306B\u3088\u3063\u3066\u8003\u6848\u3055\u308C\u305F\u3002" . . "UTF-8" . "UTF-8 (8-bit Unicode Transformation Format) \u00E9 um tipo de codifica\u00E7\u00E3o Unicode de comprimento vari\u00E1vel criado por Ken Thompson e Rob Pike. 
Pode representar qualquer caracter universal padr\u00E3o do Unicode, sendo tamb\u00E9m compat\u00EDvel com o ASCII. Por esta raz\u00E3o, est\u00E1 lentamente a ser adaptado como tipo de codifica\u00E7\u00E3o padr\u00E3o para email, p\u00E1ginas web, e outros locais onde os caracteres s\u00E3o armazenados." . "\u0421\u0442\u0430\u043D\u0434\u0430\u0440\u0442 UTF-8 \u043E\u0444\u0438\u0446\u0438\u0430\u043B\u044C\u043D\u043E \u0437\u0430\u043A\u0440\u0435\u043F\u043B\u0451\u043D \u0432 \u0434\u043E\u043A\u0443\u043C\u0435\u043D\u0442\u0430\u0445 \u0438 ISO/IEC 10646 Annex D. \u041A\u043E\u0434\u0438\u0440\u043E\u0432\u043A\u0430 \u043D\u0430\u0448\u043B\u0430 \u0448\u0438\u0440\u043E\u043A\u043E\u0435 \u043F\u0440\u0438\u043C\u0435\u043D\u0435\u043D\u0438\u0435 \u0432 UNIX-\u043F\u043E\u0434\u043E\u0431\u043D\u044B\u0445 \u043E\u043F\u0435\u0440\u0430\u0446\u0438\u043E\u043D\u043D\u044B\u0445 \u0441\u0438\u0441\u0442\u0435\u043C\u0430\u0445 \u0438 \u0432\u0435\u0431-\u043F\u0440\u043E\u0441\u0442\u0440\u0430\u043D\u0441\u0442\u0432\u0435. \u0421\u0430\u043C \u0436\u0435 \u0444\u043E\u0440\u043C\u0430\u0442 UTF-8 \u0431\u044B\u043B \u0438\u0437\u043E\u0431\u0440\u0435\u0442\u0451\u043D 2 \u0441\u0435\u043D\u0442\u044F\u0431\u0440\u044F 1992 \u0433\u043E\u0434\u0430 \u041A\u0435\u043D\u043E\u043C \u0422\u043E\u043C\u043F\u0441\u043E\u043D\u043E\u043C \u0438 \u0420\u043E\u0431\u043E\u043C \u041F\u0430\u0439\u043A\u043E\u043C \u0438 \u0440\u0435\u0430\u043B\u0438\u0437\u043E\u0432\u0430\u043D \u0432 Plan 9. \u0412 \u043A\u0430\u0447\u0435\u0441\u0442\u0432\u0435 BOM \u0438\u0441\u043F\u043E\u043B\u044C\u0437\u0443\u0435\u0442 \u043F\u043E\u0441\u043B\u0435\u0434\u043E\u0432\u0430\u0442\u0435\u043B\u044C\u043D\u043E\u0441\u0442\u044C \u0431\u0430\u0439\u0442 0xEF, 0xBB, 0xBF (\u0447\u0442\u043E \u0443 \u043D\u0435\u0451 \u0441\u0430\u043C\u043E\u0439 \u044F\u0432\u043B\u044F\u0435\u0442\u0441\u044F \u0442\u0440\u0451\u0445\u0431\u0430\u0439\u0442\u043E\u0432\u043E\u0439 
\u0440\u0435\u0430\u043B\u0438\u0437\u0430\u0446\u0438\u0435\u0439 \u0441\u0438\u043C\u0432\u043E\u043B\u0430 U+FEFF). \u041E\u0434\u043D\u0438\u043C \u0438\u0437 \u043F\u0440\u0435\u0438\u043C\u0443\u0449\u0435\u0441\u0442\u0432 \u044F\u0432\u043B\u044F\u0435\u0442\u0441\u044F \u0441\u043E\u0432\u043C\u0435\u0441\u0442\u0438\u043C\u043E\u0441\u0442\u044C \u0441 ASCII \u2014 \u043B\u044E\u0431\u044B\u0435 \u0438\u0445 7-\u0431\u0438\u0442\u043D\u044B\u0435 \u0441\u0438\u043C\u0432\u043E\u043B\u044B \u043E\u0442\u043E\u0431\u0440\u0430\u0436\u0430\u044E\u0442\u0441\u044F \u043A\u0430\u043A \u0435\u0441\u0442\u044C, \u0430 \u043E\u0441\u0442\u0430\u043B\u044C\u043D\u044B\u0435 \u0432\u044B\u0434\u0430\u044E\u0442 \u043F\u043E\u043B\u044C\u0437\u043E\u0432\u0430\u0442\u0435\u043B\u044E \u043C\u0443\u0441\u043E\u0440 (\u0448\u0443\u043C). \u041F\u043E\u044D\u0442\u043E\u043C\u0443 \u0432 \u0441\u043B\u0443\u0447\u0430\u0435, \u0435\u0441\u043B\u0438 \u043B\u0430\u0442\u0438\u043D\u0441\u043A\u0438\u0435 \u0431\u0443\u043A\u0432\u044B \u0438 \u043F\u0440\u043E\u0441\u0442\u0435\u0439\u0448\u0438\u0435 \u0437\u043D\u0430\u043A\u0438 \u043F\u0440\u0435\u043F\u0438\u043D\u0430\u043D\u0438\u044F (\u0432\u043A\u043B\u044E\u0447\u0430\u044F \u043F\u0440\u043E\u0431\u0435\u043B) \u0437\u0430\u043D\u0438\u043C\u0430\u044E\u0442 \u0441\u0443\u0449\u0435\u0441\u0442\u0432\u0435\u043D\u043D\u044B\u0439 \u043E\u0431\u044A\u0451\u043C \u0442\u0435\u043A\u0441\u0442\u0430, UTF-8 \u0434\u0430\u0451\u0442 \u0432\u044B\u0438\u0433\u0440\u044B\u0448 \u043F\u043E \u043E\u0431\u044A\u0451\u043C\u0443 \u043F\u043E \u0441\u0440\u0430\u0432\u043D\u0435\u043D\u0438\u044E \u0441 UTF-16." . "UTF-8 (8-bit Unicode Transformation Format) \u00E9 um tipo de codifica\u00E7\u00E3o Unicode de comprimento vari\u00E1vel criado por Ken Thompson e Rob Pike. Pode representar qualquer caracter universal padr\u00E3o do Unicode, sendo tamb\u00E9m compat\u00EDvel com o ASCII. 
Por esta raz\u00E3o, est\u00E1 lentamente a ser adaptado como tipo de codifica\u00E7\u00E3o padr\u00E3o para email, p\u00E1ginas web, e outros locais onde os caracteres s\u00E3o armazenados. UTF-8 usa de um a quatro bytes (estritamente, octetos) por car\u00E1cter, dependendo do s\u00EDmbolo Unicode que representa. \u00C9 necess\u00E1rio apenas um byte para codificar os 128 caracteres ASCII (Unicode U+0000 a U+007F). S\u00E3o necess\u00E1rios dois bytes para caracteres Latinos com diacr\u00EDticos. S\u00E3o tamb\u00E9m usados dois bytes para representar caracteres dos alfabetos Grego, Cir\u00EDlico, Arm\u00EAnio, Hebraico, S\u00EDrio e Thaana (Unicode U+0080 a U+07FF). S\u00E3o necess\u00E1rios tr\u00EAs bytes para o resto do Plano Multilingual B\u00E1sico (que cont\u00E9m praticamente todos os caracteres comuns utilizados). Existem ainda outros caracteres que necessitam de quatro bytes. Quatro bytes pode parecer muito para um car\u00E1cter (\"code point\"), mas muito raramente s\u00E3o utilizados. Al\u00E9m disso, UTF-16 (a principal alternativa ao UTF-8) necessita tamb\u00E9m de quatro bytes para estes \"code points\". A defini\u00E7\u00E3o de qual dos dois \u00E9 mais eficiente (UTF-8 ou UTF-16) depende da variedade de \"code points\" usados. Contudo, as diferen\u00E7as entre os v\u00E1rios tipos de codifica\u00E7\u00E3o tornam-se irrelevantes com o uso de sistemas de compress\u00E3o como o DEFLATE. Para textos curtos nos quais os tradicionais algoritmos n\u00E3o funcionam bem e se faz necess\u00E1rio ter o tamanho em considera\u00E7\u00E3o, \u00E9 geralmente usado o Esquema Padr\u00E3o de Compress\u00E3o para Unicode (Standard Compression Scheme for Unicode). O \"Internet Engineering Task Force\" (IETF) requer que todos os protocolos utilizados na Internet suportem, pelo menos, o UTF-8. O \"Internet Mail Consortium\" (IMC) recomenda que todos os clientes de email consigam ler e criar mails usando o UTF-8." . . . . . "UTF-8" . . . "UTF-8" . . . . . . . 
"744956649"^^ . . . . . . . . . . "UTF-8 \u0647\u064A \u0627\u062E\u062A\u0635\u0627\u0631 \u0644\u0644\u062C\u0645\u0644\u0629 (8-bit Unicode Transformation Format) \u0648\u062A\u0631\u062C\u0645\u062A\u0647\u0627 (\u0635\u064A\u063A\u0629 \u062A\u062D\u0648\u064A\u0644 \u0646\u0638\u0627\u0645 \u0627\u0644\u062D\u0631\u0648\u0641 \u0627\u0644\u062F\u0648\u0644\u064A \u0627\u0644\u0645\u0648\u062D\u062F \u0628\u0642\u0648\u0629 8 \u0628\u062A)\u060C \u0647\u0630\u0627 \u0627\u0644\u062A\u0631\u0645\u064A\u0632 \u0648\u0636\u0639 \u0645\u0646 \u0642\u0628\u0644 \u0643\u0644 \u0645\u0646 \u0631\u0648\u0628 \u0628\u0627\u064A\u0643 \u0648\u0643\u064A\u0646 \u062A\u0648\u0645\u0633\u0646 \u0644\u062A\u0645\u062B\u064A\u0644 \u0645\u0639\u064A\u0627\u0631 \u0646\u0638\u0627\u0645 \u0627\u0644\u062D\u0631\u0648\u0641 \u0627\u0644\u062F\u0648\u0644\u064A \u0627\u0644\u0645\u0648\u062D\u062F \u0644\u0644\u062D\u0631\u0648\u0641 \u0627\u0644\u0623\u0628\u062C\u062F\u064A\u0629 \u0644\u0623\u063A\u0644\u0628 \u0644\u063A\u0627\u062A \u0627\u0644\u0639\u0627\u0644\u0645\u060C \u0648\u064A\u062A\u0645 \u062A\u0634\u0641\u064A\u0631 \u0627\u0644\u0631\u0645\u0648\u0632 \u0641\u064A\u0647\u0627 \u0641\u064A \u062D\u062C\u0645 \u064A\u062A\u0631\u0627\u0648\u062D \u0628\u064A\u0646 \u0628\u0627\u064A\u062A \u0648\u0627\u062D\u062F \u06484 
\u0628\u0627\u064A\u062A \u0644\u0644\u0631\u0645\u0632 \u0627\u0644\u0648\u0627\u062D\u062F. \u064A\u062A\u0645 \u062A\u062D\u062F\u064A\u062F \u0637\u0648\u0644 \u062A\u0634\u0641\u064A\u0631 \u0627\u0644\u0631\u0645\u0632 \u0628\u062D\u0633\u0628 \u0627\u0644\u0634\u0643\u0644 \u0627\u0644\u0622\u062A\u064A: \n* \u0625\u0630\u0627 \u0643\u0627\u0646 \u0642\u064A\u0645\u0629 \u0627\u0644\u0628\u0627\u064A\u062A \u0627\u0644\u0623\u0648\u0644 \u0623\u0642\u0644 \u0645\u0646 128\u060C \u0623\u064A \u0623\u0646 \u0627\u0644\u0628\u062A \u0627\u0644\u062B\u0627\u0645\u0646 \u064A\u0633\u0627\u0648\u064A \u0635\u0641\u0631\u060C \u0641\u0625\u0646 \u0647\u0630\u0627 \u0627\u0644\u0628\u0627\u064A\u062A \u0647\u0648 \u0643\u0627\u0645\u0644 \u062A\u0634\u0641\u064A\u0631 \u0627\u0644\u0631\u0645\u0632\u060C \u0648\u0628\u0627\u0644\u062A\u0627\u0644\u064A \u0637\u0648\u0644\u0647 \u0648\u0627\u062D\u062F \u0628\u0627\u064A\u062A\u060C \u062A\u0642\u0639 \u0642\u064A\u0645 ASCII \u0641\u064A \u0647\u0630\u0627 \u0627\u0644\u0645\u062C\u0627\u0644. 
\n* \u0625\u0630\u0627 \u0643\u0627\u0646 \u0642\u064A\u0645\u0629 \u0627\u0644\u0628\u0627\u064A\u062A \u0627\u0644\u0623\u0648\u0644 \u0623\u0643\u0628\u0631 \u0645\u0646 127\u060C \u0623\u064A \u0623\u0646 \u0642\u064A\u0645\u0629 \u0627\u0644\u0628\u062A \u0627\u0644\u062B\u0627\u0645\u0646 \u064A\u0633\u0627\u0648\u064A \u0648\u0627\u062D\u062F\u060C \u0641\u0625\u0646 \u062A\u0634\u0641\u064A\u0631 \u0627\u0644\u0631\u0645\u0632 \u0645\u062A\u0639\u062F\u062F \u0627\u0644\u0628\u0627\u064A\u062A\u0627\u062A \u062D\u0633\u0628 \u0627\u0644\u0623\u062A\u064A: \n* \u0644\u0627 \u064A\u062C\u0648\u0632 \u0623\u0646 \u064A\u0643\u0648\u0646 \u0627\u0644\u0628\u062A \u0627\u0644\u062B\u0627\u0645\u0646 \u0645\u0646 \u0627\u0644\u0628\u0627\u064A\u062A \u0627\u0644\u0623\u0648\u0644 \u0645\u0633\u0627\u0648\u064A\u0627 \u0644\u0648\u0627\u062D\u062F \u0648\u0627\u0644\u0628\u062A \u0627\u0644\u0633\u0627\u0628\u0639 \u064A\u0633\u0627\u0648\u064A \u0635\u0641\u0631\u060C \u0648\u0648\u0642\u0648\u0639 \u0645\u062B\u0644 \u0647\u0630\u0647 \u0627\u0644\u062D\u0627\u0644\u0629 \u0641\u064A \u0627\u0644\u0628\u0627\u064A\u062A \u0627\u0644\u0623\u0648\u0644 \u0645\u0646 \u0627\u0644\u062A\u0634\u0641\u064A\u0631 \u062A\u0639\u0646\u064A \u0623\u0646 \u0647\u0646\u0627\u0643 \u062E\u0637\u0623 \u0625\u0645\u0627 \u0641\u064A \u0627\u0644\u062A\u0634\u0641\u064A\u0631 \u0623\u0648 \u0641\u064A \u0637\u0631\u064A\u0642\u0629 \u0627\u0644\u0642\u0631\u0627\u0621\u0629\u060C \u0641\u0647\u0630\u0647 \u0627\u0644\u0642\u064A\u0645 \u0645\u0633\u0645\u0648\u062D\u0629 \u0641\u064A \u0627\u0644\u0628\u0627\u064A\u062A \u0627\u0644\u062B\u0627\u0646\u064A \u0648\u0627\u0644\u062B\u0627\u0644\u062B \u0648\u0627\u0644\u0631\u0627\u0628\u0639 \u0648\u0644\u0643\u0646 \u0644\u064A\u0633 \u0627\u0644\u0623\u0648\u0644. 
\n* \u0625\u0630\u0627 \u0643\u0627\u0646 \u0627\u0644\u0628\u062A \u0627\u0644\u062B\u0627\u0645\u0646 \u0645\u0646 \u0627\u0644\u0628\u0627\u064A\u062A \u0627\u0644\u0623\u0648\u0644 \u0645\u0633\u0627\u0648\u064A\u0627 \u0644\u0648\u0627\u062D\u062F \u0648\u0643\u0630\u0644\u0643 \u0627\u0644\u0628\u062A \u0627\u0644\u0633\u0627\u0628\u0639 \u0645\u0633\u0627\u0648\u064A\u0627 \u0644\u0648\u0627\u062D\u062F \u0648\u0627\u0644\u0628\u062A \u0627\u0644\u0633\u0627\u062F\u0633 \u064A\u0633\u0627\u0648\u064A \u0635\u0641\u0631\u060C \u0641\u0625\u0646 \u0637\u0648\u0644 \u0627\u0644\u062A\u0634\u0641\u064A\u0631 \u0647\u0648 2 \u0628\u0627\u064A\u062A. \n* \u0625\u0630\u0627 \u0643\u0627\u0646 \u0627\u0644\u0628\u062A \u0627\u0644\u062B\u0627\u0645\u0646 \u0645\u0646 \u0627\u0644\u0628\u0627\u064A\u062A \u0627\u0644\u0623\u0648\u0644 \u0645\u0633\u0627\u0648\u064A\u0627 \u0644\u0648\u0627\u062D\u062F \u0648\u0643\u0630\u0644\u0643 \u0627\u0644\u0628\u062A \u0627\u0644\u0633\u0627\u0628\u0639 \u0645\u0633\u0627\u0648\u064A\u0627 \u0644\u0648\u0627\u062D\u062F \u0648\u0627\u0644\u0628\u062A \u0627\u0644\u0633\u0627\u062F\u0633 \u064A\u0633\u0627\u0648\u064A \u0648\u0627\u062D\u062F \u0648\u0627\u0644\u062E\u0627\u0645\u0633 \u064A\u0633\u0627\u0648\u064A \u0635\u0641\u0631\u060C \u0641\u0625\u0646 \u0637\u0648\u0644 \u0627\u0644\u062A\u0634\u0641\u064A\u0631 \u0647\u0648 3 \u0628\u0627\u064A\u062A. 
\n* \u0625\u0630\u0627 \u0643\u0627\u0646 \u0627\u0644\u0628\u062A \u0627\u0644\u062B\u0627\u0645\u0646 \u0645\u0646 \u0627\u0644\u0628\u0627\u064A\u062A \u0627\u0644\u0623\u0648\u0644 \u0645\u0633\u0627\u0648\u064A\u0627 \u0644\u0648\u0627\u062D\u062F \u0648\u0643\u0630\u0644\u0643 \u0627\u0644\u0628\u062A \u0627\u0644\u0633\u0627\u0628\u0639 \u0645\u0633\u0627\u0648\u064A\u0627 \u0644\u0648\u0627\u062D\u062F \u0648\u0627\u0644\u0628\u062A \u0627\u0644\u0633\u0627\u062F\u0633 \u064A\u0633\u0627\u0648\u064A \u0648\u0627\u062D\u062F \u0648\u0627\u0644\u062E\u0627\u0645\u0633 \u064A\u0633\u0627\u0648\u064A \u0648\u0627\u062D\u062F \u0648\u0627\u0644\u0631\u0627\u0628\u0639 \u064A\u0633\u0627\u0648\u064A \u0635\u0641\u0631\u060C \u0641\u0625\u0646 \u0637\u0648\u0644 \u0627\u0644\u062A\u0634\u0641\u064A\u0631 \u0647\u0648 4 \u0628\u0627\u064A\u062A."@ar . . . . . . . "UTF-8" . . . "UTF-8 (abr\u00E9viation de l\u2019anglais Universal Character Set Transformation Format - 8 bits) est un codage de caract\u00E8res informatiques con\u00E7u pour coder l\u2019ensemble des caract\u00E8res du \u00AB r\u00E9pertoire universel de caract\u00E8res cod\u00E9s \u00BB, initialement d\u00E9velopp\u00E9 par l\u2019ISO dans la norme internationale ISO/CEI 10646, aujourd\u2019hui totalement compatible avec le standard Unicode, en restant compatible avec la norme ASCII limit\u00E9e \u00E0 l\u2019anglais de base (et quelques autres langues beaucoup moins fr\u00E9quentes), mais tr\u00E8s largement r\u00E9pandue depuis des d\u00E9cennies." . . . . . . . . . . . . . . . . . . "UTF-8 (8-bit Unicode Transformation Format) es un formato de codificaci\u00F3n de caracteres Unicode e ISO 10646 utilizando s\u00EDmbolos de longitud variable. UTF-8 fue creado por Robert C. Pike y Kenneth L. Thompson. Est\u00E1 definido como est\u00E1ndar por la de la Internet Engineering Task Force (IETF). 
Actualmente es una de las tres posibilidades de codificaci\u00F3n reconocidas por Unicode y lenguajes web, o cuatro en ISO 10646. Sus caracter\u00EDsticas principales son: \n* Es capaz de representar cualquier car\u00E1cter Unicode. \n* Usa s\u00EDmbolos de longitud variable (de 1 a 4 bytes por car\u00E1cter Unicode). \n* Incluye la especificaci\u00F3n US-ASCII de 7 bits, por lo que cualquier mensaje ASCII se representa sin cambios. \n* Incluye sincron\u00EDa. Es posible determinar el inicio de cada s\u00EDmbolo sin reiniciar la lectura desde el principio de la comunicaci\u00F3n. \n* No superposici\u00F3n. Los conjuntos de valores que puede tomar cada byte de un car\u00E1cter multibyte, son disjuntos, por lo que no es posible confundirlos entre s\u00ED. Estas caracter\u00EDsticas lo hacen atractivo en la codificaci\u00F3n de correos electr\u00F3nicos y p\u00E1ginas web. El IETF requiere que todos los protocolos de Internet indiquen qu\u00E9 codificaci\u00F3n utilizan para los textos y que UTF-8 sea una de las codificaciones contempladas. El Internet Mail Consortium (IMC) recomienda que todos los programas de correo electr\u00F3nico sean capaces de crear y mostrar mensajes codificados utilizando UTF-8." . . . . . . . . . . . "MijmeoH9LT4"^^ . . . "\u0635\u064A\u063A\u0629 \u0627\u0644\u062A\u062D\u0648\u064A\u0644 \u0627\u0644\u0645\u0648\u062D\u062F-8"@ar . . . . . . . . . . . . . . . . . 
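The bit rules in the Arabic abstract (the leading bits of the first byte give the sequence length) and the synchronization property in the Spanish list can be sketched in Python. This is a minimal illustration, not a validating decoder: the names `sequence_length`, `is_continuation` and `next_boundary` are hypothetical helpers, and the code assumes the four-byte limit stated in the abstracts.

```python
def sequence_length(lead: int) -> int:
    """Length in bytes of the UTF-8 sequence that starts with this lead byte.

    Only the leading bit pattern is inspected. Lead bytes that a strict
    decoder would also reject (0xC0, 0xC1, 0xF5..0xF7) are not filtered here.
    """
    if lead & 0x80 == 0x00:   # 0xxxxxxx: single byte, the ASCII range
        return 1
    if lead & 0xE0 == 0xC0:   # 110xxxxx: two-byte sequence
        return 2
    if lead & 0xF0 == 0xE0:   # 1110xxxx: three-byte sequence
        return 3
    if lead & 0xF8 == 0xF0:   # 11110xxx: four-byte sequence
        return 4
    # 10xxxxxx is a continuation byte and 11111xxx is never valid.
    raise ValueError(f"0x{lead:02X} cannot start a UTF-8 sequence")


def is_continuation(byte: int) -> bool:
    """True for bytes of the form 10xxxxxx (second to fourth byte of a sequence)."""
    return byte & 0xC0 == 0x80


def next_boundary(data: bytes, pos: int) -> int:
    """From an arbitrary offset, skip forward to the start of the next character."""
    while pos < len(data) and is_continuation(data[pos]):
        pos += 1
    return pos


if __name__ == "__main__":
    # 'a', e-acute, euro sign and an emoji: 1 + 2 + 3 + 4 bytes.
    data = "a\u00e9\u20ac\U0001F600".encode("utf-8")
    pos = 0
    while pos < len(data):
        n = sequence_length(data[pos])
        print(data[pos:pos + n].hex(" "), "->", data[pos:pos + n].decode("utf-8"))
        pos += n
```

Because lead bytes and continuation bytes use disjoint bit patterns, a reader that starts in the middle of a stream only has to skip continuation bytes to find the next character boundary.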
"UTF-8\uFF088-bit Unicode Transformation Format\uFF09\u662F\u4E00\u7A2E\u91DD\u5C0DUnicode\u7684\u53EF\u8B8A\u9577\u5EA6\u5B57\u5143\u7DE8\u78BC\uFF0C\u4E5F\u662F\u4E00\u79CD\u524D\u7F00\u7801\u3002\u5B83\u53EF\u4EE5\u7528\u4F86\u8868\u793AUnicode\u6A19\u6E96\u4E2D\u7684\u4EFB\u4F55\u5B57\u5143\uFF0C\u4E14\u5176\u7DE8\u78BC\u4E2D\u7684\u7B2C\u4E00\u500B\u4F4D\u5143\u7D44\u4ECD\u8207ASCII\u76F8\u5BB9\uFF0C\u9019\u4F7F\u5F97\u539F\u4F86\u8655\u7406ASCII\u5B57\u5143\u7684\u8EDF\u9AD4\u7121\u9808\u6216\u53EA\u9808\u505A\u5C11\u90E8\u4EFD\u4FEE\u6539\uFF0C\u5373\u53EF\u7E7C\u7E8C\u4F7F\u7528\u3002\u56E0\u6B64\uFF0C\u5B83\u9010\u6F38\u6210\u70BA\u96FB\u5B50\u90F5\u4EF6\u3001\u7DB2\u9801\u53CA\u5176\u4ED6\u5132\u5B58\u6216\u50B3\u9001\u6587\u5B57\u7684\u61C9\u7528\u4E2D\uFF0C\u512A\u5148\u63A1\u7528\u7684\u7DE8\u78BC\u3002 UTF-8\u4F7F\u7528\u4E00\u81F3\u516D\u500B\u4F4D\u5143\u7D44\u70BA\u6BCF\u500B\u5B57\u7B26\u7DE8\u78BC\uFF08\u5C3D\u7BA1\u5982\u6B64\uFF0C2003\u5E7411\u6708UTF-8\u88ABRFC 3629\u91CD\u65B0\u89C4\u8303\uFF0C\u53EA\u80FD\u4F7F\u7528\u539F\u6765Unicode\u5B9A\u4E49\u7684\u533A\u57DF\uFF0CU+0000\u5230U+10FFFF\uFF0C\u4E5F\u5C31\u662F\u8BF4\u6700\u591A\u56DB\u500B\u5B57\u8282\uFF09\uFF1A 1. \n* 128\u500BUS-ASCII\u5B57\u7B26\u53EA\u9700\u4E00\u500B\u4F4D\u5143\u7D44\u7DE8\u78BC\uFF08Unicode\u7BC4\u570D\u7531U+0000\u81F3U+007F\uFF09\u3002 2. \n* \u5E36\u6709\u9644\u52A0\u7B26\u53F7\u7684\u62C9\u4E01\u6587\u3001\u5E0C\u81D8\u6587\u3001\u897F\u91CC\u723E\u5B57\u6BCD\u3001\u4E9E\u7F8E\u5C3C\u4E9E\u8A9E\u3001\u5E0C\u4F2F\u4F86\u6587\u3001\u963F\u62C9\u4F2F\u6587\u3001\u6558\u5229\u4E9E\u6587\u53CA\u5B83\u62FF\u5B57\u6BCD\u5247\u9700\u8981\u4E24\u500B\u4F4D\u5143\u7D44\u7DE8\u78BC\uFF08Unicode\u7BC4\u570D\u7531U+0080\u81F3U+07FF\uFF09\u3002 3. 
\n* \u5176\u4ED6\u57FA\u672C\u591A\u6587\u7A2E\u5E73\u9762\uFF08BMP\uFF09\u4E2D\u7684\u5B57\u5143\uFF08\u9019\u5305\u542B\u4E86\u5927\u90E8\u5206\u5E38\u7528\u5B57\uFF0C\u5982\u5927\u90E8\u5206\u7684\u6C49\u5B57\uFF09\u4F7F\u7528\u4E09\u500B\u4F4D\u5143\u7D44\u7DE8\u78BC\uFF08Unicode\u8303\u56F4\u7531U+0800\u81F3U+FFFF\uFF09\u3002 4. \n* \u5176\u4ED6\u6975\u5C11\u4F7F\u7528\u7684Unicode \u8F14\u52A9\u5E73\u9762\u7684\u5B57\u5143\u4F7F\u7528\u56DB\u81F3\u516D\u4F4D\u5143\u7D44\u7DE8\u78BC\uFF08Unicode\u8303\u56F4\u7531U+10000\u81F3U+1FFFFF\u4F7F\u7528\u56DB\u5B57\u8282\uFF0CUnicode\u8303\u56F4\u7531U+200000\u81F3U+3FFFFFF\u4F7F\u7528\u4E94\u5B57\u8282\uFF0CUnicode\u8303\u56F4\u7531U+4000000\u81F3U+7FFFFFFF\u4F7F\u7528\u516D\u5B57\u8282\uFF09\u3002 \u5C0D\u4E0A\u8FF0\u63D0\u53CA\u7684\u7B2C\u56DB\u7A2E\u5B57\u5143\u800C\u8A00\uFF0CUTF-8\u4F7F\u7528\u56DB\u81F3\u516D\u500B\u4F4D\u5143\u7D44\u4F86\u7DE8\u78BC\u4F3C\u4E4E\u592A\u8017\u8CBB\u8CC7\u6E90\u4E86\u3002\u4F46UTF-8\u5C0D\u6240\u6709\u5E38\u7528\u7684\u5B57\u5143\u90FD\u53EF\u4EE5\u7528\u4E09\u500B\u4F4D\u5143\u7D44\u8868\u793A\uFF0C\u800C\u4E14\u5B83\u7684\u53E6\u4E00\u7A2E\u9078\u64C7\uFF0CUTF-16\u7DE8\u78BC\uFF0C\u5C0D\u524D\u8FF0\u7684\u7B2C\u56DB\u7A2E\u5B57\u7B26\u540C\u6A23\u9700\u8981\u56DB\u500B\u4F4D\u5143\u7D44\u4F86\u7DE8\u78BC\uFF0C\u6240\u4EE5\u8981\u6C7A\u5B9AUTF-8\u6216UTF-16\u54EA\u7A2E\u7DE8\u78BC\u6BD4\u8F03\u6709\u6548\u7387\uFF0C\u9084\u8981\u8996\u6240\u4F7F\u7528\u7684\u5B57\u5143\u7684\u5206\u4F48\u7BC4\u570D\u800C\u5B9A\u3002\u4E0D\u904E\uFF0C\u5982\u679C\u4F7F\u7528\u4E00\u4E9B\u50B3\u7D71\u7684\u58D3\u7E2E\u7CFB\u7D71\uFF0C\u6BD4\u5982DEFLATE\uFF0C\u5247\u9019\u4E9B\u4E0D\u540C\u7DE8\u78BC\u7CFB\u7D71\u9593\u7684\u7684\u5DEE\u7570\u5C31\u8B8A\u5F97\u5FAE\u4E0D\u8DB3\u9053\u4E86\u3002\u82E5\u9867\u53CA\u50B3\u7D71\u58D3\u7E2E\u7B97\u6CD5\u5728\u58D3\u7E2E\u8F03\u77ED\u6587\u5B57\u4E0A\u7684\u6548\u679C\u4E0D\u5927\uFF0C\u53EF\u4EE5\u8003\u616E\u4F7F\u7528Unicode\u6A19\u6E96\u58D3\u7E2E
\u683C\u5F0F\uFF08SCSU\uFF09\u3002 \u7DB2\u969B\u7DB2\u8DEF\u5DE5\u7A0B\u5DE5\u4F5C\u5C0F\u7D44\uFF08IETF\uFF09\u8981\u6C42\u6240\u6709\u7DB2\u969B\u7DB2\u8DEF\u5354\u8B70\u90FD\u5FC5\u9808\u652F\u6301UTF-8\u7DE8\u78BC\u3002\u4E92\u806F\u7DB2\u90F5\u4EF6\u806F\u76DF\uFF08IMC\uFF09\u5EFA\u8B70\u6240\u6709\u96FB\u5B50\u90F5\u4EF6\u8EDF\u4EF6\u90FD\u652F\u6301UTF-8\u7DE8\u78BC\u3002" . "UTF-8 (abr\u00E9viation de l\u2019anglais Universal Character Set Transformation Format - 8 bits) est un codage de caract\u00E8res informatiques con\u00E7u pour coder l\u2019ensemble des caract\u00E8res du \u00AB r\u00E9pertoire universel de caract\u00E8res cod\u00E9s \u00BB, initialement d\u00E9velopp\u00E9 par l\u2019ISO dans la norme internationale ISO/CEI 10646, aujourd\u2019hui totalement compatible avec le standard Unicode, en restant compatible avec la norme ASCII limit\u00E9e \u00E0 l\u2019anglais de base (et quelques autres langues beaucoup moins fr\u00E9quentes), mais tr\u00E8s largement r\u00E9pandue depuis des d\u00E9cennies. L\u2019UTF-8 est utilis\u00E9 par 82,2 % des sites web en d\u00E9cembre 2014, puis 87,6 % en 2016. Par sa nature, UTF-8 est d\u2019un usage de plus en plus courant sur Internet, et dans les syst\u00E8mes devant \u00E9changer de l'information. Il s\u2019agit \u00E9galement du codage le plus utilis\u00E9 dans les syst\u00E8mes GNU, Linux et compatibles pour g\u00E9rer le plus simplement possible des textes et leurs traductions dans tous les syst\u00E8mes d\u2019\u00E9critures et tous les alphabets du monde." . . . . "UTF-8 (Unicode Transformation Format, 8 bit) \u00E8 una codifica dei caratteri Unicode in sequenze di lunghezza variabile di byte, creata da Rob Pike e Ken Thompson. UTF-8 usa gruppi di byte per rappresentare i caratteri Unicode, ed \u00E8 particolarmente utile per il trasferimento tramite sistemi di posta elettronica a 8-bit." . . . . . . . . 
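The byte counts per code point range given in the abstracts above, and the comparison with UTF-16, can be checked with a short Python sketch. The function names `utf8_length` and `utf16_length` are illustrative only and come from no library; the limits assume the RFC 3629 restriction to U+0000 through U+10FFFF, so the historical five- and six-byte forms are not modelled.

```python
def _check_scalar(code_point: int) -> None:
    """Reject values outside U+0000..U+10FFFF and the surrogate block."""
    if not 0 <= code_point <= 0x10FFFF:
        raise ValueError("outside the Unicode code space")
    if 0xD800 <= code_point <= 0xDFFF:
        raise ValueError("surrogate code points cannot be encoded")


def utf8_length(code_point: int) -> int:
    """Bytes needed to encode one code point in UTF-8."""
    _check_scalar(code_point)
    if code_point <= 0x7F:      # ASCII
        return 1
    if code_point <= 0x7FF:     # Latin with diacritics, Greek, Cyrillic, Hebrew, Arabic, ...
        return 2
    if code_point <= 0xFFFF:    # rest of the Basic Multilingual Plane
        return 3
    return 4                    # supplementary planes


def utf16_length(code_point: int) -> int:
    """Bytes needed to encode one code point in UTF-16 (no byte order mark)."""
    _check_scalar(code_point)
    return 2 if code_point <= 0xFFFF else 4


if __name__ == "__main__":
    for ch in ("A", "\u00e9", "\u6c49", "\U0001F600"):
        cp = ord(ch)
        # Cross-check the range table against Python's built-in codecs.
        assert utf8_length(cp) == len(ch.encode("utf-8"))
        assert utf16_length(cp) == len(ch.encode("utf-16-le"))
        print(f"U+{cp:04X}: UTF-8 {utf8_length(cp)} bytes, UTF-16 {utf16_length(cp)} bytes")
```

Which encoding is smaller therefore depends on the text: ASCII-heavy content favours UTF-8, while text made up mostly of code points between U+0800 and U+FFFF takes three bytes per character in UTF-8 against two in UTF-16.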
"UTF-8 (8-bit Unicode Transformation Format) is een manier om Unicode/ISO 10646-tekens op te slaan als een stroom van bytes, een zogenaamde tekencodering. Alternatieven zijn UTF-16 en UTF-32. UTF-8 is een tekencodering met variabele lengte: niet elk teken gebruikt evenveel bytes. Afhankelijk van het teken worden 1 tot 4 bytes gebruikt. Voor het vastleggen van elk van de 128 Basic ASCII-tekens (0--127) zijn slechts 7 bits nodig. De functie van de oorspronkelijke 8e parity-/strobe- bit werd al snel overbodig. Basic ASCII bestaat uit slechts \u00E9\u00E9n byte waarvan het hoogste bit altijd een nul is. De Extended ASCII-tekenset(128-255) is een aanvulling op Basic-ASCII, en bevat beeldschermtype (EGA/VGA) en land-afhankelijke tekens. Extended ASCII bestaat uit dezelfde byte met het hoogste bit altijd ee" . . . . . . "Characters, Symbols and the Unicode Miracle \u2013 Computerphile"^^ . . . "UTF-8 is a character encoding capable of encoding all possible characters, or code points, defined by Unicode and originally designed by Ken Thompson and Rob Pike. The encoding is variable-length and uses 8-bit code units. It was designed for backward compatibility with ASCII and to avoid the complications of endianness and byte order marks in the alternative UTF-16 and UTF-32 encodings. The name is derived from Unicode (or Universal Coded Character Set) Transformation Format \u2013 8-bit. UTF-8 is the dominant character encoding for the World Wide Web, accounting for 87.7% of all Web pages in October 2016 (the most popular East Asian encodings, Shift JIS and GB 2312, have 1.1% and 0.8% respectively). The Internet Mail Consortium (IMC) recommends that all e-mail programs be able to display and create mail using UTF-8, and the W3C recommends UTF-8 as the default encoding in XML and HTML. 
UTF-8 encodes each of the 1,112,064 valid code points in the Unicode code space (1,114,112 code points minus 2,048 surrogate code points) using one to four 8-bit bytes (a group of 8 bits is known as an octet in the Unicode Standard). Code points with lower numerical values (i.e., earlier code positions in the Unicode character set, which tend to occur more frequently) are encoded using fewer bytes. The first 128 characters of Unicode, which correspond one-to-one with ASCII, are encoded using a single octet with the same binary value as ASCII, making valid ASCII text valid UTF-8-encoded Unicode as well. In addition, no ASCII byte ever occurs in the encoding of a non-ASCII code point, which makes UTF-8 safe to use within most programming and document languages that interpret certain ASCII characters in a special way, such as the end-of-string marker." . "UTF-8" . . "UTF-8" . . "UTF-8\uFF08\u30E6\u30FC\u30C6\u30A3\u30FC\u30A8\u30D5\u306F\u3061\u3001\u30E6\u30FC\u30C6\u30A3\u30FC\u30A8\u30D5\u30A8\u30A4\u30C8\uFF09\u306FISO/IEC 10646 (UCS) \u3068Unicode\u3067\u4F7F\u3048\u308B8\u30D3\u30C3\u30C8\u7B26\u53F7\u5358\u4F4D\u306E\u6587\u5B57\u7B26\u53F7\u5316\u5F62\u5F0F\u53CA\u3073\u6587\u5B57\u7B26\u53F7\u5316\u30B9\u30AD\u30FC\u30E0\u3002 \u6B63\u5F0F\u540D\u79F0\u306F\u3001ISO/IEC 10646\u3067\u306F \u201CUCS Transformation Format 8\u201D\u3001Unicode\u3067\u306F \u201CUnicode Transformation Format-8\u201D \u3068\u3044\u3046\u3002\u4E21\u8005\u306FISO/IEC 10646\u3068Unicode\u306E\u30B3\u30FC\u30C9\u91CD\u8907\u7BC4\u56F2\u3067\u4E92\u63DB\u6027\u304C\u3042\u308B\u3002RFC\u306B\u3082\u4ED5\u69D8\u304C\u3042\u308B\u3002 2\u30D0\u30A4\u30C8\u76EE\u4EE5\u964D\u306B\u300C/\u300D\u306A\u3069\u306EASCII\u6587\u5B57\u304C\u73FE\u308C\u306A\u3044\u3088\u3046\u306B\u5DE5\u592B\u3055\u308C\u3066\u3044\u308B\u3053\u3068\u304B\u3089\u3001UTF-FSS (File System Safe) \u3068\u3082\u3044\u308F\u308C\u308B\u3002\u65E7\u540D\u79F0\u306FUTF-2\u3002 
\u30C7\u30FC\u30BF\u4EA4\u63DB\u65B9\u5F0F\u3001\u30D5\u30A1\u30A4\u30EB\u5F62\u5F0F\u3068\u3057\u3066\u3001\u4E00\u822C\u7684\u306BUTF-8\u306F\u4F7F\u308F\u308C\u308B\u50BE\u5411\u306B\u3042\u308B\u3002 \u5F53\u521D\u306F\u3001\u30D9\u30EB\u7814\u7A76\u6240\u306B\u304A\u3044\u3066Plan 9\u3067\u7528\u3044\u308B\u30A8\u30F3\u30B3\u30FC\u30C9\u3068\u3057\u3066\u3001\u30ED\u30D6\u30FB\u30D1\u30A4\u30AF\u306B\u3088\u308B\u8A2D\u8A08\u6307\u91DD\u306E\u3082\u3068\u3001\u30B1\u30F3\u30FB\u30C8\u30F3\u30D7\u30BD\u30F3\u306B\u3088\u3063\u3066\u8003\u6848\u3055\u308C\u305F\u3002" . . . . . . . . . . . . . . . . . . . . . . . . . . . . . "\u0421\u0442\u0430\u043D\u0434\u0430\u0440\u0442 UTF-8 \u043E\u0444\u0438\u0446\u0438\u0430\u043B\u044C\u043D\u043E \u0437\u0430\u043A\u0440\u0435\u043F\u043B\u0451\u043D \u0432 \u0434\u043E\u043A\u0443\u043C\u0435\u043D\u0442\u0430\u0445 \u0438 ISO/IEC 10646 Annex D.\u041A\u043E\u0434\u0438\u0440\u043E\u0432\u043A\u0430 \u043D\u0430\u0448\u043B\u0430 \u0448\u0438\u0440\u043E\u043A\u043E\u0435 \u043F\u0440\u0438\u043C\u0435\u043D\u0435\u043D\u0438\u0435 \u0432 UNIX-\u043F\u043E\u0434\u043E\u0431\u043D\u044B\u0445 \u043E\u043F\u0435\u0440\u0430\u0446\u0438\u043E\u043D\u043D\u044B\u0445 \u0441\u0438\u0441\u0442\u0435\u043C\u0430\u0445 \u0438 \u0432\u0435\u0431-\u043F\u0440\u043E\u0441\u0442\u0440\u0430\u043D\u0441\u0442\u0432\u0435.\u0421\u0430\u043C \u0436\u0435 \u0444\u043E\u0440\u043C\u0430\u0442 UTF-8 \u0431\u044B\u043B \u0438\u0437\u043E\u0431\u0440\u0435\u0442\u0451\u043D 2 \u0441\u0435\u043D\u0442\u044F\u0431\u0440\u044F 1992 \u0433\u043E\u0434\u0430 \u041A\u0435\u043D\u043E\u043C \u0422\u043E\u043C\u043F\u0441\u043E\u043D\u043E\u043C \u0438 \u0420\u043E\u0431\u043E\u043C \u041F\u0430\u0439\u043A\u043E\u043C \u0438 \u0440\u0435\u0430\u043B\u0438\u0437\u043E\u0432\u0430\u043D \u0432 Plan 9.\u0412 \u043A\u0430\u0447\u0435\u0441\u0442\u0432\u0435 BOM \u0438\u0441\u043F\u043E\u043B\u044C\u0437\u0443\u0435\u0442 
\u043F\u043E\u0441\u043B\u0435\u0434\u043E\u0432\u0430\u0442\u0435\u043B\u044C\u043D\u043E\u0441\u0442\u044C \u0431\u0430\u0439\u0442 EF16, BB16, BF16 (\u0447\u0442\u043E \u0443 \u043D\u0435\u0451 \u0441\u0430\u043C\u043E\u0439 \u044F\u0432\u043B\u044F\u0435\u0442\u0441\u044F \u0442\u0440\u0451\u0445\u0431\u0430\u0439\u0442\u043E\u0432\u043E\u0439 \u0440\u0435\u0430\u043B\u0438\u0437\u0430\u0446\u0438\u0435\u0439 \u0441\u0438\u043C\u0432\u043E\u043B\u0430 FEFF16)." . . . . . . "UTF-8 \u2013 system kodowania Unicode, wykorzystuj\u0105cy od 8 do 32 bit\u00F3w do zakodowania pojedynczego znaku, w pe\u0142ni kompatybilny z ASCII. Jest najcz\u0119\u015Bciej wykorzystywany do przechowywania napis\u00F3w w plikach i komunikacji sieciowej." . "UTF-8" . . "UTF-8 \u2013 system kodowania Unicode, wykorzystuj\u0105cy od 8 do 32 bit\u00F3w do zakodowania pojedynczego znaku, w pe\u0142ni kompatybilny z ASCII. Jest najcz\u0119\u015Bciej wykorzystywany do przechowywania napis\u00F3w w plikach i komunikacji sieciowej." . "UTF-8 (Abk. f\u00FCr 8-Bit UCS Transformation Format, wobei UCS wiederum Universal Character Set abk\u00FCrzt) ist die am weitesten verbreitete Kodierung f\u00FCr Unicode-Zeichen (Unicode und UCS sind praktisch identisch). Die Kodierung wurde im September 1992 von Ken Thompson und Rob Pike bei Arbeiten am Plan-9-Betriebssystem festgelegt. Die Kodierung wurde zun\u00E4chst im Rahmen von X/Open als FSS-UTF (filesystem safe UTF in Abgrenzung zu UTF-1, das diese Eigenschaft nicht hat) bezeichnet, in den Folgejahren erfolgte im Rahmen der Standardisierung die Umbenennung auf die heute \u00FCbliche Bezeichnung UTF-8." . . .