[tex-live] next encoding

James Cloos cloos at jhcloos.com
Sun Mar 1 07:24:45 CET 2009


Markus Kuhn’s uniset¹ includes this NextStep-to-Unicode mapping file from Unicode, Inc.:

,----[ NEXTSTEP.TXT ]
| #
| #       Name:             NextStep Encoding to Unicode
| #       Unicode version:  1.1
| #       Table version:    0.1
| #       Table format:     Format A
| #       Date:             1999 September 23
| #       Authors:          Rick McGowan
| #
| #       Copyright (c) 1991-1999 Unicode, Inc.  All Rights reserved.
| #
| #	This file is provided as-is by Unicode, Inc. (The Unicode Consortium).
| #	No claims are made as to fitness for any particular purpose.  No
| #	warranties of any kind are expressed or implied.  The recipient
| #	agrees to determine applicability of information provided.  If this
| #	file has been provided on optical media by Unicode, Inc., the sole
| #	remedy for any claim will be exchange of defective media within 90
| #	days of receipt.
| #
| #	Unicode, Inc. hereby grants the right to freely use the information
| #	supplied in this file in the creation of products supporting the
| #	Unicode Standard, and to make copies of this file in any form for
| #	internal or external distribution as long as this notice remains
| #	attached.
| #
| #	General notes:
| #
| #	This table contains the data the Unicode Consortium has on how
| #       NextStep Encoding characters map into Unicode.  Since the first
| #	128 characters (0x0 - 0x7f) are identical to ASCII and Unicode,
| #	this table only maps the NextStep range from 0x80 - 0xFF.
| #
| #	This file is provided for historical reference only and pertains
| #	to NextStep and OpenStep products shipped prior to the acquisition
| #	of NeXT by Apple Computer, Inc.  See http://www.apple.com for
| #	further information.
| #
| #       Format:  Three tab-separated columns
| #                Column #1 is the NextStep code (in hex as 0xXX)
| #                Column #2 is the Unicode (in hex as 0xXXXX)
| #                Column #3 NextStep name, Unicode name (follows a comment sign, '#')
| #
| #       The entries are in NextStep order
| #
| #       Any comments or problems, contact info at unicode.org
| #
| 0x80	0x00a0	# NO-BREAK SPACE
| 0x81	0x00c0	# LATIN CAPITAL LETTER A WITH GRAVE
| 0x82	0x00c1	# LATIN CAPITAL LETTER A WITH ACUTE
| 0x83	0x00c2	# LATIN CAPITAL LETTER A WITH CIRCUMFLEX
| 0x84	0x00c3	# LATIN CAPITAL LETTER A WITH TILDE
| 0x85	0x00c4	# LATIN CAPITAL LETTER A WITH DIAERESIS
| 0x86	0x00c5	# LATIN CAPITAL LETTER A WITH RING ABOVE
| 0x87	0x00c7	# LATIN CAPITAL LETTER C WITH CEDILLA
| 0x88	0x00c8	# LATIN CAPITAL LETTER E WITH GRAVE
| 0x89	0x00c9	# LATIN CAPITAL LETTER E WITH ACUTE
| 0x8a	0x00ca	# LATIN CAPITAL LETTER E WITH CIRCUMFLEX
| 0x8b	0x00cb	# LATIN CAPITAL LETTER E WITH DIAERESIS
| 0x8c	0x00cc	# LATIN CAPITAL LETTER I WITH GRAVE
| 0x8d	0x00cd	# LATIN CAPITAL LETTER I WITH ACUTE
| 0x8e	0x00ce	# LATIN CAPITAL LETTER I WITH CIRCUMFLEX
| 0x8f	0x00cf	# LATIN CAPITAL LETTER I WITH DIAERESIS
| 0x90	0x00d0	# LATIN CAPITAL LETTER ETH
| 0x91	0x00d1	# LATIN CAPITAL LETTER N WITH TILDE
| 0x92	0x00d2	# LATIN CAPITAL LETTER O WITH GRAVE
| 0x93	0x00d3	# LATIN CAPITAL LETTER O WITH ACUTE
| 0x94	0x00d4	# LATIN CAPITAL LETTER O WITH CIRCUMFLEX
| 0x95	0x00d5	# LATIN CAPITAL LETTER O WITH TILDE
| 0x96	0x00d6	# LATIN CAPITAL LETTER O WITH DIAERESIS
| 0x97	0x00d9	# LATIN CAPITAL LETTER U WITH GRAVE
| 0x98	0x00da	# LATIN CAPITAL LETTER U WITH ACUTE
| 0x99	0x00db	# LATIN CAPITAL LETTER U WITH CIRCUMFLEX
| 0x9a	0x00dc	# LATIN CAPITAL LETTER U WITH DIAERESIS
| 0x9b	0x00dd	# LATIN CAPITAL LETTER Y WITH ACUTE
| 0x9c	0x00de	# LATIN CAPITAL LETTER THORN
| 0x9d	0x00b5	# MICRO SIGN
| 0x9e	0x00d7	# MULTIPLICATION SIGN
| 0x9f	0x00f7	# DIVISION SIGN
| 0xa0	0x00a9	# COPYRIGHT SIGN
| 0xa1	0x00a1	# INVERTED EXCLAMATION MARK
| 0xa2	0x00a2	# CENT SIGN
| 0xa3	0x00a3	# POUND SIGN
| 0xa4	0x2044	# FRACTION SLASH
| 0xa5	0x00a5	# YEN SIGN
| 0xa6	0x0192	# LATIN SMALL LETTER F WITH HOOK
| 0xa7	0x00a7	# SECTION SIGN
| 0xa8	0x00a4	# CURRENCY SIGN
| 0xa9	0x2019	# RIGHT SINGLE QUOTATION MARK
| 0xaa	0x201c	# LEFT DOUBLE QUOTATION MARK
| 0xab	0x00ab	# LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
| 0xac	0x2039	# SINGLE LEFT-POINTING ANGLE QUOTATION MARK
| 0xad	0x203a	# SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
| 0xae	0xfb01	# LATIN SMALL LIGATURE FI
| 0xaf	0xfb02	# LATIN SMALL LIGATURE FL
| 0xb0	0x00ae	# REGISTERED SIGN
| 0xb1	0x2013	# EN DASH
| 0xb2	0x2020	# DAGGER
| 0xb3	0x2021	# DOUBLE DAGGER
| 0xb4	0x00b7	# MIDDLE DOT
| 0xb5	0x00a6	# BROKEN BAR
| 0xb6	0x00b6	# PILCROW SIGN
| 0xb7	0x2022	# BULLET
| 0xb8	0x201a	# SINGLE LOW-9 QUOTATION MARK
| 0xb9	0x201e	# DOUBLE LOW-9 QUOTATION MARK
| 0xba	0x201d	# RIGHT DOUBLE QUOTATION MARK
| 0xbb	0x00bb	# RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
| 0xbc	0x2026	# HORIZONTAL ELLIPSIS
| 0xbd	0x2030	# PER MILLE SIGN
| 0xbe	0x00ac	# NOT SIGN
| 0xbf	0x00bf	# INVERTED QUESTION MARK
| 0xc0	0x00b9	# SUPERSCRIPT ONE
| 0xc1	0x02cb	# MODIFIER LETTER GRAVE ACCENT
| 0xc2	0x00b4	# ACUTE ACCENT
| 0xc3	0x02c6	# MODIFIER LETTER CIRCUMFLEX ACCENT
| 0xc4	0x02dc	# SMALL TILDE
| 0xc5	0x00af	# MACRON
| 0xc6	0x02d8	# BREVE
| 0xc7	0x02d9	# DOT ABOVE
| 0xc8	0x00a8	# DIAERESIS
| 0xc9	0x00b2	# SUPERSCRIPT TWO
| 0xca	0x02da	# RING ABOVE
| 0xcb	0x00b8	# CEDILLA
| 0xcc	0x00b3	# SUPERSCRIPT THREE
| 0xcd	0x02dd	# DOUBLE ACUTE ACCENT
| 0xce	0x02db	# OGONEK
| 0xcf	0x02c7	# CARON
| 0xd0	0x2014	# EM DASH
| 0xd1	0x00b1	# PLUS-MINUS SIGN
| 0xd2	0x00bc	# VULGAR FRACTION ONE QUARTER
| 0xd3	0x00bd	# VULGAR FRACTION ONE HALF
| 0xd4	0x00be	# VULGAR FRACTION THREE QUARTERS
| 0xd5	0x00e0	# LATIN SMALL LETTER A WITH GRAVE
| 0xd6	0x00e1	# LATIN SMALL LETTER A WITH ACUTE
| 0xd7	0x00e2	# LATIN SMALL LETTER A WITH CIRCUMFLEX
| 0xd8	0x00e3	# LATIN SMALL LETTER A WITH TILDE
| 0xd9	0x00e4	# LATIN SMALL LETTER A WITH DIAERESIS
| 0xda	0x00e5	# LATIN SMALL LETTER A WITH RING ABOVE
| 0xdb	0x00e7	# LATIN SMALL LETTER C WITH CEDILLA
| 0xdc	0x00e8	# LATIN SMALL LETTER E WITH GRAVE
| 0xdd	0x00e9	# LATIN SMALL LETTER E WITH ACUTE
| 0xde	0x00ea	# LATIN SMALL LETTER E WITH CIRCUMFLEX
| 0xdf	0x00eb	# LATIN SMALL LETTER E WITH DIAERESIS
| 0xe0	0x00ec	# LATIN SMALL LETTER I WITH GRAVE
| 0xe1	0x00c6	# LATIN CAPITAL LETTER AE
| 0xe2	0x00ed	# LATIN SMALL LETTER I WITH ACUTE
| 0xe3	0x00aa	# FEMININE ORDINAL INDICATOR
| 0xe4	0x00ee	# LATIN SMALL LETTER I WITH CIRCUMFLEX
| 0xe5	0x00ef	# LATIN SMALL LETTER I WITH DIAERESIS
| 0xe6	0x00f0	# LATIN SMALL LETTER ETH
| 0xe7	0x00f1	# LATIN SMALL LETTER N WITH TILDE
| 0xe8	0x0141	# LATIN CAPITAL LETTER L WITH STROKE
| 0xe9	0x00d8	# LATIN CAPITAL LETTER O WITH STROKE
| 0xea	0x0152	# LATIN CAPITAL LIGATURE OE
| 0xeb	0x00ba	# MASCULINE ORDINAL INDICATOR
| 0xec	0x00f2	# LATIN SMALL LETTER O WITH GRAVE
| 0xed	0x00f3	# LATIN SMALL LETTER O WITH ACUTE
| 0xee	0x00f4	# LATIN SMALL LETTER O WITH CIRCUMFLEX
| 0xef	0x00f5	# LATIN SMALL LETTER O WITH TILDE
| 0xf0	0x00f6	# LATIN SMALL LETTER O WITH DIAERESIS
| 0xf1	0x00e6	# LATIN SMALL LETTER AE
| 0xf2	0x00f9	# LATIN SMALL LETTER U WITH GRAVE
| 0xf3	0x00fa	# LATIN SMALL LETTER U WITH ACUTE
| 0xf4	0x00fb	# LATIN SMALL LETTER U WITH CIRCUMFLEX
| 0xf5	0x0131	# LATIN SMALL LETTER DOTLESS I
| 0xf6	0x00fc	# LATIN SMALL LETTER U WITH DIAERESIS
| 0xf7	0x00fd	# LATIN SMALL LETTER Y WITH ACUTE
| 0xf8	0x0142	# LATIN SMALL LETTER L WITH STROKE
| 0xf9	0x00f8	# LATIN SMALL LETTER O WITH STROKE
| 0xfa	0x0153	# LATIN SMALL LIGATURE OE
| 0xfb	0x00df	# LATIN SMALL LETTER SHARP S
| 0xfc	0x00fe	# LATIN SMALL LETTER THORN
| 0xfd	0x00ff	# LATIN SMALL LETTER Y WITH DIAERESIS
| 0xfe	0xfffd	# .notdef, REPLACEMENT CHARACTER
| 0xff	0xfffd	# .notdef, REPLACEMENT CHARACTER
`----
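
For anyone who wants to use the table programmatically, here is a minimal
sketch in Python of reading the three-column format described in the file
header and decoding NextStep bytes with it.  The function names
parse_mapping and decode_nextstep are my own, purely illustrative, and the
handling of the "| " quoting prefix is an assumption tied to how the file is
quoted in this message.  Bytes below 0x80 are passed through as ASCII, as
the general notes say, and anything missing from the table falls back to
U+FFFD.

```python
def parse_mapping(text):
    """Parse lines of the form 0xXX<TAB>0xXXXX<TAB># name into {byte: code point}."""
    table = {}
    for line in text.splitlines():
        # Drop the "|" quoting prefix used in this message, if present.
        if line.startswith("|"):
            line = line[1:]
        # Everything after '#' is a comment (the character name).
        fields = line.split("#", 1)[0].split()
        # Skip blank lines, comment-only lines, and the box-quote borders.
        if len(fields) < 2 or not fields[0].lower().startswith("0x"):
            continue
        table[int(fields[0], 16)] = int(fields[1], 16)
    return table


def decode_nextstep(data, table):
    """Decode bytes: 0x00-0x7f pass through as ASCII, the rest use the table."""
    return "".join(
        chr(b) if b < 0x80 else chr(table.get(b, 0xFFFD)) for b in data
    )


# Three rows copied from the table above, for illustration only.
sample = (
    "| 0x80\t0x00a0\t# NO-BREAK SPACE\n"
    "| 0xae\t0xfb01\t# LATIN SMALL LIGATURE FI\n"
    "| 0xdd\t0x00e9\t# LATIN SMALL LETTER E WITH ACUTE\n"
)
table = parse_mapping(sample)
text = decode_nextstep(b"caf\xdd", table)
```

With the whole of NEXTSTEP.TXT passed to parse_mapping, the same
decode_nextstep call should cover the full 0x80 - 0xFF range.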

¹ http://www.cl.cam.ac.uk/~mgk25/download/uniset.tar.gz

-JimC
-- 
James Cloos <cloos at jhcloos.com>         OpenPGP: 1024D/ED7DAEA6

