CHARSETS 7

中文man手册

目录

CHARSETS

NAME
æè¿°
ASCII
ISO 8859
KOI8-R
UNICODEï¼ç»[å]ä¸ä»£ç ,宽[å]åèå符éï¼
ISO 2022 AND ISO 4873
åè
[䏿çç»´æ¤äºº]
[ä¸æçææ°æ´æ°]
ãä¸å½linux论åmanæå页翻è¯è®¡åã:
è·

NAME

charsets - ç¨åºå对å符éåå½éåçè§ç¹

æè¿°

Linux æ¯ä¸ä¸ªå½éæ§çæä½ç³»ç»ãå®çåç§åæ ·å®ç¨ç¨åºå设 å¤é©±å¨ç¨åº (忬æ§å¶å°é©±å¨ç¨åº ) æ¯æå¤ç§è¯è¨çå符éï¼ åæ¬å¸¦æéå ç¬¦å·çæä¸åæ¯è¡¨å- 符ï¼éé³ç¬¦ï¼è¿å(忝ç»å), åå¨é¨éæä¸æåæ¯è¡¨ï¼åæ¬å¸èè¯ï¼å¤ä»£æ¯æå¤«è¯- ï¼é¿æä¼¯è¯ï¼ åå¸ä¼¯æ¥è¯ã )

è¿ä»½æå以ç¨åºåçç¼åå»çå¾ä¸åçåç¬¦éæ åï¼ä»¥åå®ä»¬æ¯å¦ä½ å¨ Linux ä¸- è°åå¨ä¸èµ·çãè®¨è®ºçæ å忬 ASCIIï¼ISO 8859ï¼KOI8-R ï¼ Unicodeï¼ISO 2022 å ISO 4873 ã

ASCII

ASCII (,ç¾å½å½å®¶ä¿¡æ¯äº¤æ¢(ç¨)æ å(代)ç ) æ¯æåç 7-bitå符é, ååæ¯ä¸ºç¾å¼è±è¯è®¾è®¡çãå½åå®è¢« ECMA-6 æ åææè¿°ã

å¨è±å½ä½¿ç¨ä¸ç§ ASCIIçåä½ï¼è¿å使¯ï¼ç¨è±å½ç£å¼ç符å·ä»£æ¿ç¾å½ç crosshatch/octothorpe/hash çç£å¼ç¬¦å·ï¼;å½éè¦æ¶ï¼ ç¾å½çï¼ç¬¦å·ï¼åè±å½çåä½ï¼ç¬¦å·ï¼å¯ä»¥ç¨"US ASCII"å"UK ASCII" ä½ä¸ºåºå«ã

å ä¸º Linux æ¯ä¸ºç¾å½è®¾è®¡ç硬件åç, å®çæ¥å°±æ¯æ US ASCII ã

ISO 8859

ISO 8859 æ¯ä¸ç³»å 10 ï¼-bit å符é,å®åå«ç¾å½ ASCII çä½ä½ (7 -bit ), 128 ï½159 èå´åçä¸å¯è§æ§å¶å符ï¼å 96 个å®å®½å¾å½¢ï¼å符ï¼å¨ 160-255 éã ãLP è¿äºå符éä¸ï¼æéè¦æ¯ ISO 8859-1 ( Latin-1 )ã å®çæ¥å°±è¢« Linux æ§å¶å°é©±å¨ç¨åºæ¯æï¼ X11R6 çæ¯æå¾ä¹å¾å¥½ï¼å¹¶ä¸æ¯ HTML çåºç¡å符éã

Linux 䏿§å¶å°ä¹æ¯æå¶ä»ç 8859 å符é ï¼éè¿ç¨æ·æ¨¡å¼å®ç¨ç¨åº( ä¾å¦ setfont(8)) æ¥ä¿®æ¹é®çç»å®å EGA å¾å½¢è¡¨æ ¼ï¼ 以åè¿è¡æ§å¶å°é©±å¨ç¨åºéçåä½è¡¨æ ¼ä¸ç“ user mapping(ç¨æ·å½±å°)”ã

ä¸é¢æ¯æ¯ä¸ªéåç®ççæè¿°ï¼
8859-1 (Latin-1)

Latin-1 è¦ç大夿°ç西欧è¯è¨ï¼æ¯å¦é¿å°å·´å°¼äº, å æ³°ç½å°¼äºè¯, 丹麦, è·å°,è±è¯,æ³ç½ç¾¤å²,è¬å°,æ³è¯,å¾·è¯- ,å å©è¥¿äº,ç±å°å°,å°å², æå¤§å©ï¼æªå¨ï¼è¡èçï¼è¥¿ççåçå¸ã缺å°è·å°ç ijè¿åï¼iä¸jååï¼ ï¼ æ³å½ç oeï¼oä¸eååï¼åæ§é£æ ¼ç’,,’ èå¾·è¯ä¸- ‘‘ï¼è¿æ ·çï¼å¼å·æ¯å¯ä»¥çã

8859-2 (Latin-2)

Latin-2 æ¯æå¤§å¤æ°çæä¸æä¹¦åçæ¯æå¤«è¯å䏿¬§çè¯è¨ï¼ åç½å°äº , æ·åè¯, å¾·è¯, åçå©, æ³¢å°ï¼ç½é©¬å°¼äºï¼æ¯æ´ä¼åï¼ åæ¯æ´æå°¼äºã

8859-3 (Latin-3)

Latin-3 æ¯ä¸çè¯,å éè¥¿äº , 马è³ä»äºº, ååè³å¶è¯ä½è忬¢è¿çï¼è¯- è¨ï¼ã

8859-4 (Latin-4)

Latin-4 ä»ç»äºç±æ²å°¼äºè¯ï¼ææç»´äºï¼åç«é¶å®çå符 ã宿¯å®è´¨ä¸è¿æ¶ç; åè§ 8859-10 (Latin-6 ) ã

8859-5

å¤ä»£æ¯æå¤«è¯åæ¯æ¯æä¿å å©äºè¯, ç½ä¿ç½æ¯è¯,马å¶é¡¿è¯, ä¿è¯, å¡å°ç»´äºè¯åä¹åå°è¯ã ä¹åå°äººè¯»å¸¦æä¸æç¬ç‘geh’为‘heh’,åï¼å½ï¼éè¦ç¨å¸¦æä¸æç¬ç ghe åæ£ç¡®çghe.åè§ä¸é¢çï¼å³äºï¼KOI8-R ç讨论ã ï¼è¯æ³¨ï¼è¿äºå¤å½äººä¹¦å乿¯æä»¬ä¹ä¸æä¹éè¦çè§£å§ï¼å¸æä¸é¢çè§£éä¸è¦ æäººæç³æ¶äºï¼

8859-6

æ¯æé¿æä¼¯è¯ã 8859-6 åå表æ¯å离å符格å¼çä¸ç§åºå®çå- ä½ï¼ä½æ¯ä¸ä¸ªåé çæ¾ç¤ºå¼æåºè¯¥èåè¿äºæ¥ä½¿ç¨åéçè¯é¦ï¼ä¸é´å- æ¯ï¼åæå表格å¼ã

8859-7

æ¯æç°ä»£çå¸èè¯ã

8859-8

æ¯æå¸ä¼¯æ¥è¯ã

8859-9 (Latin-5)

è¿æ¯Latin-1 çä¸ç§åä½ï¼å®ç¨åè³å¶è¯çä¸äºï¼å- 符ï¼ä»£æ¿å¾å°ç¨çå°å²è¯ã

8859-10 (Latin-6)

Latin 6 å¢å æ«å çº½ç¹(è¯ï¼å¯¹äºlast Inuit æä¸ç¥éæ¯å¦æ¯å¯¹ç) (æ ¼éµå°è¯) å Sami ( ææ®å°è¯ ) ï¼è¿äºæ¯ Lattin 4 ä¸- 缺å°çï¼æ¥è¦çæ´ä¸ªå欧å°åºï¼çå符éï¼ã RFC 1345 ååºäºåæ¥çåä¸åçâ latin 6 "ã Skolt Sami ä»ç¶æ¯è¿äºéè¦æ´å¤ç éé³ç¬¦å·ã

8859-13 (Latin-7)
8859-14 (Latin-8)
8859-15

å¢å äºæ¬§æ´²ç¬¦å·åæ³å½è¿åï¼å®ä»¬æ¯ Latin-1 é缺æ¼çã

KOI8-R

KOI8-R æ¯å¨ä¿å½æµè¡çä¸ä¸ªé ISO å符éãä¸åé¨åæ¯ US ASCII; ä¸é¨æ¯æ¯ ISO 8859-5 è®¾è®¡çæ´å¥½ç夿¯æå¤«å符éã

æ§å¶å°ä¸ºäºæ¯æ KOI8-R å符éï¼å¨ Linux ä¸ï¼ å¯ä»¥å©ç¨ç¨æ·æ¨¡å¼å®ç¨ç¨åºä¿®æ¹é®çç»å®å EGA å¾å½¢è¡¨æ ¼ï¼ 以å卿§å¶å°ç驱å¨ç¨åºä¸ä½¿ç¨åä½è¡¨âuser mappingï¼ç¨æ·æ å°ï¼âã

UNICODEï¼ç»[å]ä¸ä»£ç ,宽[å]åèå符éï¼

Unicodeï¼ ISO 10646 ) æ¯ä¸ä¸ªæ åï¼å®çç®æ æ¯æç½å°è¡¨ç° 卿¯ç§äººç±»è¯- è¨ä¸çæ¯ç§å·²ç¥å符ãUnicode çç¼ç æ¯ 32 ä½ç ( æ§äºççæ¬ä½¿ç¨äº 16 ä½ ) ãå¨ Unicode çä¸äºä¿¡æ¯å¯ä»¥å¨<http://www.unicode.com>è·å¾ã

Linux 使ç¨ï¼ä½ç Unicode è½¬ç§»æ ¼å¼ (UTF-8 ) 表示 Unicode ã UTF-8 æ¯å¯åé¿ç Unicode ç¼ç ã使ç¨ï¼ä¸ªåèç» 7 bit ç¼ç ï¼ä½¿ç¨ï¼ä¸ªåèç» ï¼ï¼ bit ç¼ç ï¼ 使ç¨ï¼ä¸ªåèç» ï¼ï¼ bit ç¼ç ï¼ä½¿ç¨ï¼ä¸ªåèç» ï¼ï¼ bit ç¼ç ï¼ä½¿ç¨ï¼ä¸ªåèç» ï¼ï¼ bit ç¼ç ï¼ä½¿ç¨ï¼ä¸ªåèç» ï¼ï¼ bit ç¼ç

让 0,1 , x 代表é¶ï¼ä¸ï¼æä»»æçä½ãåè0xxxxxxx 代表Unicode 00000000 0xxxxxxxï¼ è¿ä¸ªç¬¦å·å ASCII 0xxxxxxx ç¼ç çç¬¦å·æ¯ä¸æ ·ã è¿æ ·ï¼ ASCII æ²¡ææ¹ä¸º UTF-8ï¼å¹¶ä¸åªç¨ ASCII ç人ä¸ä¼æ³¨æå°ä»»ä½ååï¼ ä¸å¨ä»£ç ï¼å¹¶ä¸ä¸å¨æä»¶å¤§å°ã

åè 110xxxxx æ¯ä¸ä¸ª2 åè代ç çå¼å§ï¼ 110xxxxx 10yyyyyy ç»è£æ 00000xxx xxyyyyyy ã åè 1110xxxx æ¯ä¸ä¸ª ï¼ åè代ç çå¼å§ï¼ 1110xxxx 10yyyyyy 10zzzzzz 被ç»è£æ xxxxyyyy yyzzzzzzã ï¼å¦æ UTF-8 ä½¿ç¨ 31-bit ISO 10646 ç¼ç ï¼é£ä¹è¿ä¸ªçº§æ°å°±ä¼å»¶ä¼¸ å° 6 åèç¼ç ï¼

å¯¹äº ISO-8859-1 çç¨æ·èè¨ï¼è¿æå³ç带é«ä½çå符ç¼ç æä¸¤ä¸ªåèã è¿ä¼ä»¤æ®éçææ¬æä»¶å¢å¤§ï¼å°ï¼ä¸ªç¾åç¹ãä¸è¿æ²¡æåæ¢é®é¢, å ä¸º Unicode ISO-8859-1 符å·çå¼çäºä»ä»¬ç ISO-8859-1 å¼ (ç¨ 8 个å导é¶ååç¼) ãå¯¹äºæ¥è¯çç¨æ·ï¼è¿æå³ç忥叏ç¨ç 16 ä½ç¼ç å° å  3 个å- èï¼å¹¶ä¸è¿è¦æ±ææ©å±çæ å°è¡¨ãè®¸å¤æ¥æ¬äººå æ¤æ¯è¾å欢 ISO 2022 ã

注æ UTF-8 æ¯èªæåæ¥çï¼ 10xxxxxx æ¯ä¸æ¡å°¾å·´, ä»»ä½å¶å® çå- èæ¯ç¼ç ç头ãASCII åèåºç°å¨ UTF-8 æµä¸å¯ä¸çå¯è½æ¯ ä½ä¸ºèªå·±åºç°ãç¹å«æ¯, ä¸ä¼æ NULs æ " /’s åµå¥å¨é£äºæ¯è¾å¤§çç¼ç ä¸ã

å ä¸ºç¼ç ä¸ç ASCIIï¼ç¹å«æ¯, NUL å’/’, 没æåå, æä»¥åæ ¸ä¸ä¼æ³¨æå° å¨ä½¿ç¨ UTF-8ã宿 ¹æ¬ä¸å¨ä¹å®æ£å¨å¤ççé£åè代表ä»ä¹ä¸è¥¿ã

Unicode æ°æ®æµçåç°é常æ¯éè¿" subfont "è¡¨æ¥æä½ï¼è¿ä¸ªè¡¨æ¯ Unicode çä¸ä¸ªåéå°åç¬¦è¡¨æ ¼çæ å°ãåæ ¸åé¨ä½¿ç¨ Unicode æè¿°è£è½½å¥æ¾ç¤ºååç subfontãè¿æå³çå¨ UTF-8 ä¸çä¸ä¸ªæ¨¡å¼ è½ä½¿ç¨ 512 个ä¸åç符å·ãè¿å¯¹äºæ¥è¯ï¼æ±è¯åæé²è¯æ¥è¯´æ¯ä¸å¤çï¼ ä½æ¯å®æ»¡è¶³äºå¤§å¤æ°å¶å®ç¨éã

ISO 2022 AND ISO 4873

ISO 2022 å 4873 æ åæè¿°äºä¸ä¸ªåºäº VT100 å®ç°çå使§å¶æ¨¡åï¼ Linux åæ ¸å xterm (1) ( é¨å ) æ¯æè¿ä¸ªæ¨¡åã å®å¨æ¥æ¬åé©å½å¾æµè¡ã

宿 4 个å¾å½¢çå符éï¼ç§°ä¸º G0 ï¼ G1 ï¼ G2 å G3 ï¼å¹¶ä¸ å¶ä¸- ä¹ä¸æ¯å½åçé«ä½ä¸ºï¼ çç¼ç çå符é(æå G0 ),èä»ä»¬ä¹ 䏿¯å½åçé«ä½ä¸ºï¼çç¼ç çå符é(æå G1 )ãæ¯ç§å¾å½¢çåç¬¦éæ 94 æ 96 个å符 ï¼å¹¶ä¸æ¯å®é䏿¯ä¸ä¸ª 7-bitå符éã å®ä½¿ç¨ 040-0177 ( 041-0176 ) æ 0240-0377 ( 0241-0376 )ç¼ç ä¸çä¸ä¸ªãG0 大尿»æ¯ä¸º 94ï¼å¹¶ä¸ä½¿ç¨ 041-0176 ä¹é´çç¼ç ã

å符ä¹é´åæ¢ç¨è½¬æ¢ï¼shift functionsï¼åè½ ˆN (SO æ LS1), ˆO (SI æ LS0), ESC n (LS2), ESC o (LS3), ESC N (SS2), ESC O (SS3), ESC ˜ (LS1R), ESC } (LS2R), ESC | (LS3R). LSn æå符éGnæ è®°ä¸ºå½åå- 符éï¼ç¨äºé«ä½ä¸ºï¼çç¼ç ã LSnR æå符é Gnæ è®°ä¸ºå½åå- 符éï¼ç¨äºé«ä½ä¸ºï¼çç¼ç ã SSn æå符éGn (n=2 or 3) æ è®°ä¸ºå½åå符éï¼ åªç¨äºä¸ä¸ä¸ªåç¬¦ï¼ ä¸ç®¡å®çé«ä½ç弿¯ä»ä¹ï¼

94 å符çéåç¨å Gn åç¬¦éæ¯ç¨ä¸ä¸ªéé¸åºå ESC ( xx ï¼ç¨äº G0ï¼ï¼ESC ) xx ï¼ç¨äº G1ï¼ï¼ ESC * xx ï¼ç¨äº G2ï¼ï¼ESC + xx ï¼ç¨äº G3ï¼ï¼ç- 代表çï¼è¿éç xx æ¯ä¸ä¸ªç¬¦å· æèæ¯å¨ ISO 2375 å½é注åç¼ç å符éä¸- çä¸å¯¹ç¬¦å·ã ä¾å¦ï¼ESC ( @ éç¨ ISO 646 å符éä½ä¸ºGOï¼ ESC ( A éç¨ UK æ åå符é(ç¨ç£ä»£æ¿æ°åè®°å·), ESC ( B éæ© ASCII ( ç¨ç¾åä»£æ¿æµéè´§å¸), ESC ( M ä¸ºéæ´²è¯è¨éæ©ä¸ä¸ªå符éï¼ ESC ( ! A éæ©å¤å·´å符é, çç. çç.

94 å符çéåç¨å Gn åç¬¦éæ¯ç¨ä¸ä¸ªéé¸åºå ESC - xx ï¼å¯¹äº G1ï¼, ESC . xx ï¼å¯¹äº G2ï¼ æ ESC / xx ï¼å¯¹äº G3ï¼çè¡¨ç¤ºï¼ ä¾å¦, ESC - G éæ©å¸ä¼¯è±åæ¯è¡¨ä½ä¸º G1.

å¤åèçå符éç¨å Gn åç¬¦éæ¯ç¨ä¸ä¸ªéé¸åºå ESC $ xx æè ESC $ ( xx ï¼å¯¹äº G0ï¼ï¼ ESC $ ) xx ï¼å¯¹äº G1ï¼ï¼ESC $ * xx ï¼å¯¹äº G2ï¼ï¼ESC $ + xx ï¼å¯¹äº G3ï¼çæ¥è¡¨ç¤ºï¼ ä¾å¦, ESC $ ( C 为 G0éæ©é©å½å符é. æ¥æ¬å- 符éåç± ESC $ Béæ© æ´å¤ä¸´è¿ççæ¬ç±ESC & @ ESC $ Béæ©.

ISO 4873 è§å®äºä¸ä¸ªèå´æ¯è¾çªç使ç¨å符éï¼å®ç G0æ¯åºå®ç (æ»æ¯ ASCII), æä»¥ G1, G2 å G3åªè½è¢«è°ç¨äºé«æ¬¡åºä½ç¼ç éã 尤嶿¯ï¼ä¸åä½¿ç¨ ˆN å ˆOï¼ESC ( xx ä»ç¨äº xx=B, å ESC ) xx, ESC * xx, ESC + xx åå«çä»·äº ESC - xx, ESC . xx, ESC / xxï¼

åè

console(4), console_ioctl(4), console_codes(4), ascii(7), iso_8859_1(7), unicode(7), utf-8(7)

[䏿çç»´æ¤äºº]

Scorpio <rawk@chinese.com>

[ä¸æçææ°æ´æ°]

2000/10/23

ãä¸å½linux论åmanæå页翻è¯è®¡åã:

http://cmpp.linuxforum.net

è·

æ¬é¡µé¢ä¸æçç±ä¸æ man æå页计åæä¾ã
䏿 man æå页计åï¼https://github.com/man-pages-zh/manpages-zh