[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: UNICODE glyph standard
re: UNICODE becoming a glyph standard
It is happening as we speak :=)
Check out the unicode FTP site (ftp.unicode.org), which contains the UNICODE
2.0 CD-ROM files. Buried deep down you will find:
unix/mappings/vendors/apple/corpchr.txt
(Here appended) This is the `corporate use subarea for Apple.' I understand
other vendors (MicroSoft, Adobe, MonoType) are doing similar things. We
can only hope they are coordinating to avoid stepping on each others toes.
What all this does is flesh out UNICODE 2.0 so it can be somewhat useful for
glyph encoding.
This process is not unlike the development of Windows ANSI, which started
with the not-so-great-at-all (*) ISO Latin 1, and added a bunch of
typographically useful glyphs between 128 and 160 (quoteleft, quoteright,
quotedblleft, quotedblright, quotesinglbase, quotedblbase, guilsinglleft,
guilsinglright, endash, emdash, OE, oe, Scaron, scaron, Ydieresis, florin
dagger, daggerdbl, bullet, ellipsis, perthousand, trademark, circumflex,
tilde) to make a somewhat useful kludge.
Regards, Berthold.
(*) ISO Latin 1:
Has grave, acute, dieresis, macron, cedilla,
but not breve, dotaccent, and dotlessi. why?
Has `grave accent' where ASCII has `quoteleft', ugh!
Has `quotesingle' (`apostrophe') instead of `quoteright',
Has `minus' where `hyphen' should be.
Has none of the left and right single or double quotes.
Does not have endash and emdash.
Has several ambiguities (minus <=> hyphen,
circumflex <=> asciicircum, tilde <=> asciitilde).
Is an encoding that does not match ASCII in the low range.
Some essentially useless characters like `Ydieresis' and `currency'.
Macintosh Corporate use subarea:
##########################################################################
#
# Name: MacOS_CorpChars
# Unicode versions: 1.1, 2.0
# Table version/date:
# Current - 0.4, 15 Nov 1995 (from internal version <8>)
# Add chars for Hebrew and Thai
# Previous - 0.3, 15 Apr 1995 (from internal version <5>)
# Author: Peter Edberg <edberg1@applelink.apple.com>
#
# Copyright (c) 1995 Apple Computer, Inc. All Rights reserved.
#
# Apple, the Apple logo, and Macintosh are trademarks of Apple
# Computer, Inc., registered in the United States and other countries.
# Unicode is a trademark of Unicode Inc. For the sake of brevity,
# throughout this document, "Macintosh" can be used to refer to
# Macintosh computers and "Unicode" can be used to refer to the
# Unicode standard.
#
# Apple makes no warranty or representation, either express or
# implied, with respect to these tables, their quality, accuracy, or
# fitness for a particular purpose. In no event will Apple be liable
# for direct, indirect, special, incidental, or consequential damages
# resulting from any defect or inaccuracy in this document or the
# accompanying tables.
#
# These mapping tables and character lists are preliminary and
# subject to change. Updated tables will be available from the
# Unicode Inc. ftp site (unicode.org), the Apple Computer ftp site
# (ftp.info.apple.com), the Apple Computer World-Wide Web pages
# (http://www.info.apple.com), and possibly on diskette from APDA
# (Apple's mail-order distribution service for developers).
#
# Format:
# -------
#
# Three tab-separated columns;
# '#' begins a comment which continues to the end of the line.
# Column #1 is the Unicode corporate character code point
# (in hex as 0xNNNN)
# Column #2 is a comment containing:
# 1) an informal name describing the MacOS usage of the character
# ("presentation" is abbreviated as "pres.")
# 2) another '#'
# 3) a list of the corresponding code points in MacOS encodings
# (Japanese is abbreviated as just J)
#
# The entries are in Unicode order.
#_______________________________________________________________________
*/
# The following (22) are added for MacOS Thai encoding.
# In this encoding, positional variants of upper vowels, tone marks,
# and other marks are normally handled automatically by Worldscript I.
# However, the Thai-DTP keyboard allows the codes for the positional
# variants to be entered directly, so they must be treated as
# characters. When the abstract character is treated as a positional
# variant, it has the right (and high, if relevant) position.
0xF884 # form for THAI CHARACTER MAI HAN-AKAT, left position # Thai-x92
0xF885 # form for THAI CHARACTER SARA I, left position # Thai-x94
0xF886 # form for THAI CHARACTER SARA II, left position # Thai-x95
0xF887 # form for THAI CHARACTER SARA UE, left position # Thai-x96
0xF888 # form for THAI CHARACTER SARA UEE, left position # Thai-x97
0xF889 # form for THAI CHARACTER MAITAIKHU, left position # Thai-x93
0xF88A # form for THAI CHARACTER MAI EK, left position # Thai-x98
0xF88B # form for THAI CHARACTER MAI EK, low position # Thai-x88
0xF88C # form for THAI CHARACTER MAI EK, low left position # Thai-x83
0xF88D # form for THAI CHARACTER MAI THO, left position # Thai-x99
0xF88E # form for THAI CHARACTER MAI THO, low position # Thai-x89
0xF88F # form for THAI CHARACTER MAI THO, low left position # Thai-x84
0xF890 # form for THAI CHARACTER MAI TRI, left position # Thai-x9A
0xF891 # form for THAI CHARACTER MAI TRI, low position # Thai-x8A
0xF892 # form for THAI CHARACTER MAI TRI, low left position # Thai-x85
0xF893 # form for THAI CHARACTER MAI CHATTAWA, left position # Thai-x9B
0xF894 # form for THAI CHARACTER MAI CHATTAWA, low position # Thai-x8B
0xF895 # form for THAI CHARACTER MAI CHATTAWA, low left position # Thai-x86
0xF896 # form for THAI CHARACTER THANTHAKHAT, left position # Thai-x9C
0xF897 # form for THAI CHARACTER THANTHAKHAT, low position # Thai-x8C
0xF898 # form for THAI CHARACTER THANTHAKHAT, low left position # Thai-x87
0xF899 # form for THAI CHARACTER NIKHAHIT, left position # Thai-x8F
# The following (6) are added for MacOS Hebrew encoding. Four of
# these are for the obsolete "canoral" codes that were used before
# System7.1/Worldscript to control positioning of nikud marks (points).
# In the future these 4 code points may be redefined.
0xF89A # Hebrew ligature lamed holam # Hebrew-xC0
0xF89B # Hebrew canoral 1 # Hebrew-xC2
0xF89C # Hebrew canoral 2 # Hebrew-xC3
0xF89D # Hebrew canoral 3 # Hebrew-xC4
0xF89E # Hebrew canoral 4 # Hebrew-xC5
0xF89F # Hebrew point qamats qatan # Hebrew-xDE
# The following (1) is added to handle mapping of the single undefined
# code point in MacOS Greek and Turkish encodings, thus permitting full
# round-trip fidelity.
0xF8A0 # undefined1 # Greek-0xFF, Turkish-0xF5
# The following (54) were added for MacOS Japanese encoding
# part 1 - Apple corporate Unicode chars for MacOS Japanese extended
# characters not in Unicode.
0xF8A1 # digit zero full stop # J-0x8591
0xF8A2 # roman numeral thirteen # J-0x85AB
0xF8A3 # roman numeral fourteen # J-0x85AC
0xF8A4 # roman numeral fifteen # J-0x85AD
0xF8A5 # small roman numeral thirteen # J-0x85BF
0xF8A6 # small roman numeral fourteen # J-0x85C0
0xF8A7 # small roman numeral fifteen # J-0x85C1
0xF8A8 # square m (meter?) # J-0x8645
0xF8A9 # square g (gram?) # J-0x864B
0xF8AA # square l (liter?) # J-0x8650
0xF8AB # square TB # J-0x865D
0xF8AC # FAX sign # J-0x869E
0xF8AD # downwards arrow leftwards of upwards arrow # J-0x86CE
0xF8AE # rightwards black arrow # J-0x86D3
0xF8AF # leftwards black arrow # J-0x86D4
0xF8B0 # upwards black arrow # J-0x86D5
0xF8B1 # downwards black arrow # J-0x86D6
0xF8B2 # square "limited company, ltd. [yuugen gaisha]" # J-0x87FB
0xF8B3 # square "foundation [zaidan houjin]" # J-0x87FC
0xF8B4 # inverted double prime quotation mark # J-0x8855
# part 2 - Apple corporate Unicode chars for MacOS Japanese vertical
# forms not in Unicode.
0xF8B5 # pres. form for vertical IDEOGRAPHIC COMMA # J-0xEB41
0xF8B6 # pres. form for vertical IDEOGRAPHIC FULL STOP # J-0xEB42
0xF8B7 # pres. form for vertical OVERLINE # J-0xEB50
0xF8B8 # pres. form for vertical KATAKANA-HIRAGANA PROLONGED SOUND MARK #
J-0xEB5B
0xF8B9 # pres. form for vertical HYPHEN # J-0xEB5D
0xF8BA # pres. form for vertical WAVE DASH # J-0xEB60
0xF8BB # pres. form for vertical DOUBLE VERTICAL LINE # J-0xEB61
0xF8BC # pres. form for vertical FULLWIDTH VERTICAL LINE # J-0xEB62
0xF8BD # pres. form for vertical MIDLINE HORIZONTAL ELLIPSIS # J-0xEB63
0xF8BE # pres. form for vertical FULLWIDTH LEFT SQUARE BRACKET # J-0xEB6D
0xF8BF # pres. form for vertical FULLWIDTH RIGHT SQUARE BRACKET # J-0xEB6E
0xF8C0 # pres. form for vertical FULLWIDTH EQUALS SIGN # J-0xEB81
0xF8C1 # pres. form for vertical HIRAGANA LETTER SMALL A # J-0xEC9F
0xF8C2 # pres. form for vertical HIRAGANA LETTER SMALL I # J-0xECA1
0xF8C3 # pres. form for vertical HIRAGANA LETTER SMALL U # J-0xECA3
0xF8C4 # pres. form for vertical HIRAGANA LETTER SMALL E # J-0xECA5
0xF8C5 # pres. form for vertical HIRAGANA LETTER SMALL O # J-0xECA7
0xF8C6 # pres. form for vertical HIRAGANA LETTER SMALL TU # J-0xECC1
0xF8C7 # pres. form for vertical HIRAGANA LETTER SMALL YA # J-0xECE1
0xF8C8 # pres. form for vertical HIRAGANA LETTER SMALL YU # J-0xECE3
0xF8C9 # pres. form for vertical HIRAGANA LETTER SMALL YO # J-0xECE5
0xF8CA # pres. form for vertical HIRAGANA LETTER SMALL WA # J-0xECEC
0xF8CB # pres. form for vertical KATAKANA LETTER SMALL A # J-0xED40
0xF8CC # pres. form for vertical KATAKANA LETTER SMALL I # J-0xED42
0xF8CD # pres. form for vertical KATAKANA LETTER SMALL U # J-0xED44
0xF8CE # pres. form for vertical KATAKANA LETTER SMALL E # J-0xED46
0xF8CF # pres. form for vertical KATAKANA LETTER SMALL O # J-0xED48
0xF8D0 # pres. form for vertical KATAKANA LETTER SMALL TU # J-0xED62
0xF8D1 # pres. form for vertical KATAKANA LETTER SMALL YA # J-0xED83
0xF8D2 # pres. form for vertical KATAKANA LETTER SMALL YU # J-0xED85
0xF8D3 # pres. form for vertical KATAKANA LETTER SMALL YO # J-0xED87
0xF8D4 # pres. form for vertical KATAKANA LETTER SMALL WA # J-0xED8E
0xF8D5 # pres. form for vertical KATAKANA LETTER SMALL KA # J-0xED95
0xF8D6 # pres. form for vertical KATAKANA LETTER SMALL KE # J-0xED96
# The following (14) were added for MacOS Dingbats encoding
0xF8D7 # medium left parenthesis ornament # Dingbats-0x80
0xF8D8 # medium right parenthesis ornament # Dingbats-0x81
0xF8D9 # medium flattened left parenthesis ornament # Dingbats-0x82
0xF8DA # medium flattened right parenthesis ornament # Dingbats-0x83
0xF8DB # medium left-pointing angle bracket ornament # Dingbats-0x84
0xF8DC # medium right-pointing angle bracket ornament # Dingbats-0x85
0xF8DD # heavy left-pointing angle quotation mark ornament # Dingbats-0x86
0xF8DE # heavy right-pointing angle quotation mark ornament # Dingbats-0x87
0xF8DF # heavy left-pointing angle bracket ornament # Dingbats-0x88
0xF8E0 # heavy right-pointing angle bracket ornament # Dingbats-0x89
0xF8E1 # light left tortoise shell bracket ornament # Dingbats-0x8A
0xF8E2 # light right tortoise shell bracket ornament # Dingbats-0x8B
0xF8E3 # medium left curly bracket ornament # Dingbats-0x8C
0xF8E4 # medium right curly bracket ornament # Dingbats-0x8D
# The following (26) were added for MacOS Symbol encoding
0xF8E5 # radical extender # Symbol-0x60
0xF8E6 # vertical arrow extender # Symbol-0xBD
0xF8E7 # horizontal arrow extender # Symbol-0xBE
0xF8E8 # registered sign sans serif # Symbol-0xE2
0xF8E9 # copyright sign sans serif # Symbol-0xE3
0xF8EA # trade mark sign sans serif # Symbol-0xE4
0xF8EB # left parenthesis top # Symbol-0xE6
0xF8EC # left parenthesis extender # Symbol-0xE7
0xF8ED # left parenthesis bottom # Symbol-0xE8
0xF8EE # left square bracket top # Symbol-0xE9
0xF8EF # left square bracket extender # Symbol-0xEA
0xF8F0 # left square bracket bottom # Symbol-0xEB
0xF8F1 # left curly bracket top # Symbol-0xEC
0xF8F2 # left curly bracket center # Symbol-0xED
0xF8F3 # left curly bracket bottom # Symbol-0xEE
0xF8F4 # curly bracket extender # Symbol-0xEF
0xF8F5 # integral extender # Symbol-0xF4
0xF8F6 # right parenthesis top # Symbol-0xF6
0xF8F7 # right parenthesis extender # Symbol-0xF7
0xF8F8 # right parenthesis bottom # Symbol-0xF8
0xF8F9 # right square bracket top # Symbol-0xF9
0xF8FA # right square bracket extender # Symbol-0xFA
0xF8FB # right square bracket bottom # Symbol-0xFB
0xF8FC # right curly bracket top # Symbol-0xFC
0xF8FD # right curly bracket center # Symbol-0xFD
0xF8FE # right curly bracket bottom # Symbol-0xFE
# The following (1) was added for MacOS Roman encoding
0xF8FF # Apple logo # Roman-0xF0, Symbol-0xF0, Croatian-0xD8
# NOTE: The graphic image associated with the Apple logo character is
# not authorized for use without permission of Apple, and unauthorized
# use might constitute trademark infringement.