summaryrefslogtreecommitdiffstats
path: root/tools/encoding/macIceland.txt
diff options
context:
space:
mode:
authorhobbs <hobbs>2001-10-04 20:23:40 (GMT)
committerhobbs <hobbs>2001-10-04 20:23:40 (GMT)
commit025509dc5eada37e29ca3211508c19448f3f3fa6 (patch)
tree9a04d3bea5a349da76931caf5a44bba327c7f54b /tools/encoding/macIceland.txt
parenta50e0e7d7635a3d06cea406824f322bea74f4905 (diff)
downloadtcl-025509dc5eada37e29ca3211508c19448f3f3fa6.zip
tcl-025509dc5eada37e29ca3211508c19448f3f3fa6.tar.gz
tcl-025509dc5eada37e29ca3211508c19448f3f3fa6.tar.bz2
Updated encodings with latest mappings from www.unicode.org. This
did not include some Mac encodings that have special multi-unichar translations now (like symbols, dingbats and japanese). Also does not include big5, gb or euc* as those have different formats in the latest Unicode version that need new conversion tools. Not all related .enc files changed as some had been updates separately.
Diffstat (limited to 'tools/encoding/macIceland.txt')
-rw-r--r--tools/encoding/macIceland.txt148
1 files changed, 114 insertions, 34 deletions
diff --git a/tools/encoding/macIceland.txt b/tools/encoding/macIceland.txt
index ee79a37..d28bd9d 100644
--- a/tools/encoding/macIceland.txt
+++ b/tools/encoding/macIceland.txt
@@ -1,11 +1,31 @@
+#=======================================================================
+# FTP file name: ICELAND.TXT
#
-# Name: MacOS_Icelandic [to Unicode]
-# Unicode versions: 1.1, 2.0
-# Table version: 0.2 (from internal ufrm version <5>)
-# Date: 15 April 1995
-# Author: Peter Edberg <edberg1@applelink.apple.com>
+# Contents: Map (external version) from Mac OS Icelandic
+# character set to Unicode 2.1
#
-# Copyright (c) 1995 Apple Computer, Inc. All Rights reserved.
+# Copyright: (c) 1995-1999 by Apple Computer, Inc., all rights
+# reserved.
+#
+# Contact: charsets@apple.com
+#
+# Changes:
+#
+# b02 1999-Sep-22 Encoding changed for Mac OS 8.5; change
+# mapping of 0xDB from CURRENCY SIGN to EURO
+# SIGN. Update contact e-mail address. Matches
+# internal utom<b2>, ufrm<b2>, and Text
+# Encoding Converter version 1.5.
+# n06 1998-Feb-05 Minor update to header comments, add
+# information on font variants
+# n03 1997-Dec-14 Update to match internal utom<n4>, ufrm<n16>:
+# Change standard mapping for 0xBD from U+2126
+# to its canonical decomposition, U+03A9.
+# n02 1995-Apr-15 First version (after fixing some typos).
+# Matches internal ufrm<n5>.
+#
+# Standard header:
+# ----------------
#
# Apple, the Apple logo, and Macintosh are trademarks of Apple
# Computer, Inc., registered in the United States and other countries.
@@ -17,45 +37,105 @@
# Apple makes no warranty or representation, either express or
# implied, with respect to these tables, their quality, accuracy, or
# fitness for a particular purpose. In no event will Apple be liable
-# for direct, indirect, special, incidental, or consequential damages
+# for direct, indirect, special, incidental, or consequential damages
# resulting from any defect or inaccuracy in this document or the
# accompanying tables.
#
-# These mapping tables and character lists are preliminary and
-# subject to change. Updated tables will be available from the
-# Unicode Inc. ftp site (unicode.org), the Apple Computer ftp site
-# (ftp.info.apple.com), the Apple Computer World-Wide Web pages
-# (http://www.info.apple.com), and possibly on diskette from APDA
-# (Apple's mail-order distribution service for developers).
+# These mapping tables and character lists are subject to change.
+# The latest tables should be available from the following:
#
-# Format:
-# -------
+# <ftp://ftp.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
+# <ftp://dev.apple.com/devworld/Technical_Documentation/Misc._Standards/>
+#
+# For general information about Mac OS encodings and these mapping
+# tables, see the file "README.TXT".
+#
+# Format:
+# -------
#
# Three tab-separated columns;
# '#' begins a comment which continues to the end of the line.
-# Column #1 is the MacOS Icelandic code (in hex as 0xNN)
-# Column #2 is the Unicode (in hex as 0xNNNN)
-# Column #3 is the Unicode name (follows a comment sign, '#')
+# Column #1 is the Mac OS Icelandic code (in hex as 0xNN)
+# Column #2 is the corresponding Unicode (in hex as 0xNNNN)
+# Column #3 is a comment containing the Unicode name
#
-# The entries are in MacOS Icelandic code order.
+# The entries are in Mac OS Icelandic code order.
#
-# One of these mappings requires the use of a corporate character
-# (for the Apple logo character). See the file "MacOS-CorpCharList".
-# NOTE: The graphic image associated with the Apple logo character
-# is not authorized for use without permission of Apple, and
-# unauthorized use might constitute trademark infringement.
+# One of these mappings requires the use of a corporate character.
+# See the file "CORPCHAR.TXT" and notes below.
+#
+# Control character mappings are not shown in this table, following
+# the conventions of the standard UTC mapping tables. However, the
+# Mac OS Roman character set uses the standard control characters at
+# 0x00-0x1F and 0x7F.
+#
+# Notes on Mac OS Icelandic:
+# --------------------------
#
-# Notes on MacOS Icelandic:
-# -------------------------
+# 1. General
#
-# The MacOS Icelandic encoding shares the script code smRoman
-# (0) with the standard MacOS Roman encoding. To determine if
+# Mac OS Icelandic is used for Icelandic and Faroese.
+#
+# The Mac OS Icelandic encoding shares the script code smRoman
+# (0) with the standard Mac OS Roman encoding. To determine if
# the Icelandic encoding is being used, you must also check if
# the system region code is 21, verIceland.
#
-# This character set is a variant of standard MacOS Roman,
+# This character set is a variant of standard Mac OS Roman,
# adding upper and lower eth, thorn, and Y acute. It has 6 code
-# point differences from standard MacOS Roman.
+# point differences from standard Mac OS Roman.
+#
+# Before Mac OS 8.5, code point 0xDB was CURRENCY SIGN, and was
+# mapped to U+00A4. In Mac OS 8.5 and later versions, code point
+# 0xDB is changed to EURO SIGN and maps to U+20AC; the standard
+# Apple fonts are updated for Mac OS 8.5 to reflect this. There are
+# "currency sign" variants of the Mac OS Icelandic encoding that
+# still map 0xDB to U+00A4; these can be used for older fonts.
+# Note: U+20AC is new with Unicode 2.1; for earlier Unicode
+# versions, Mac OS Icelandic 0xDB may be mapped to private-use
+# character U+F8A0.
+#
+# 2. Font variants
+#
+# The table in this file gives the Unicode mappings for the standard
+# Mac OS Icelandic encoding. This encoding is supported by the
+# Icelandic versions of the fonts Chicago, Geneva, Monaco, and New
+# York, and is the encoding supported by the text processing
+# utilities. However, other TrueType fonts implement a slightly
+# different encoding; the difference is only in two code points.
+# For the standard variant, these are:
+# 0xBB -> 0x00AA FEMININE ORDINAL INDICATOR
+# 0xBC -> 0x00BA MASCULINE ORDINAL INDICATOR
+#
+# For the TrueType variant (used by the Icelandic versions of the
+# fonts Courier, Helvetica, Palatino, and Times), these are:
+# 0xBB -> 0xFB01 LATIN SMALL LIGATURE FI
+# 0xBC -> 0xFB02 LATIN SMALL LIGATURE FL
+#
+# Unicode mapping issues and notes:
+# ---------------------------------
+#
+# The following corporate zone Unicode character is used in this
+# mapping:
+#
+# 0xF8FF Apple logo
+#
+# NOTE: The graphic image associated with the Apple logo character
+# is not authorized for use without permission of Apple, and
+# unauthorized use might constitute trademark infringement.
+#
+# Details of mapping changes in each version:
+# -------------------------------------------
+#
+# Changes from version n06 to version b02:
+#
+# - Encoding changed for Mac OS 8.5; change mapping of 0xDB from
+# CURRENCY SIGN (U+00A4) to EURO SIGN (U+20AC).
+#
+# Changes from version n02 to version n03:
+#
+# - Change mapping of 0xBD from U+2126 to its canonical
+# decomposition, U+03A9.
#
##################
@@ -201,7 +281,7 @@
0xAB 0x00B4 # ACUTE ACCENT
0xAC 0x00A8 # DIAERESIS
0xAD 0x2260 # NOT EQUAL TO
-0xAE 0x00C6 # LATIN CAPITAL LIGATURE AE
+0xAE 0x00C6 # LATIN CAPITAL LETTER AE
0xAF 0x00D8 # LATIN CAPITAL LETTER O WITH STROKE
0xB0 0x221E # INFINITY
0xB1 0x00B1 # PLUS-MINUS SIGN
@@ -216,8 +296,8 @@
0xBA 0x222B # INTEGRAL
0xBB 0x00AA # FEMININE ORDINAL INDICATOR
0xBC 0x00BA # MASCULINE ORDINAL INDICATOR
-0xBD 0x2126 # OHM SIGN
-0xBE 0x00E6 # LATIN SMALL LIGATURE AE
+0xBD 0x03A9 # GREEK CAPITAL LETTER OMEGA
+0xBE 0x00E6 # LATIN SMALL LETTER AE
0xBF 0x00F8 # LATIN SMALL LETTER O WITH STROKE
0xC0 0x00BF # INVERTED QUESTION MARK
0xC1 0x00A1 # INVERTED EXCLAMATION MARK
@@ -246,7 +326,7 @@
0xD8 0x00FF # LATIN SMALL LETTER Y WITH DIAERESIS
0xD9 0x0178 # LATIN CAPITAL LETTER Y WITH DIAERESIS
0xDA 0x2044 # FRACTION SLASH
-0xDB 0x00A4 # CURRENCY SIGN
+0xDB 0x20AC # EURO SIGN
0xDC 0x00D0 # LATIN CAPITAL LETTER ETH
0xDD 0x00F0 # LATIN SMALL LETTER ETH
0xDE 0x00DE # LATIN CAPITAL LETTER THORN