Wu, P.-C., "Plain Base32 ASCII-compatible Encoding and 8-bit Dual-mode Transformation Format of ISO 10646," Computer Standards & Interfaces, Vol. 23, No. 5, Nov. 2001, pp. 457-466. (SCI Expanded, EI)

論文題目: ISO 10646簡單基底三十二ASCII相容編碼及八位元雙模式轉換格式

Abstract

Several formats have been proposed for the ASCII-compatible encoding (ACE) of Internationalized Domain Names (IDN), and the question of supporting UTF-8 in IDN has also been raised in that discussion. In this paper, we present alternatives to these encoding methods. First, we propose a plain base32 ACE, which achieves a space efficiency of 3.2 bytes per 16-bit character (16 bits at 5 bits per base32 character). Second, we propose an 8-bit dual-mode transformation format of the ISO 10646 Universal Character Set, in which the 128 base characters (80)₁₆ through (FF)₁₆ represent non-ASCII characters. This yields a space efficiency of 2.29 bytes per 16-bit character (16 bits at 7 bits per byte).

Key Words: Internationalized Domain Name, Universal Character Set, base128, simplicity, space efficiency.
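
The two space-efficiency figures in the abstract follow from the number of payload bits each output byte carries: 5 bits for a 32-symbol alphabet and 7 bits for a 128-symbol alphabet. The sketch below reproduces that arithmetic and shows one plausible way to pack 16-bit characters into 5-bit symbols. The names `bytes_per_char`, `pack_base32`, and `ALPHABET` are hypothetical, and the alphabet and zero-padding rule are assumptions made for illustration; they are not taken from the paper.

```python
import math

def bytes_per_char(alphabet_size: int, char_bits: int = 16) -> float:
    """Output bytes per input character when each byte carries log2(alphabet_size) bits."""
    bits_per_symbol = math.log2(alphabet_size)
    return char_bits / bits_per_symbol

# Assumed 32-symbol alphabet (a-z, 2-7); the paper's actual alphabet may differ.
ALPHABET = "abcdefghijklmnopqrstuvwxyz234567"

def pack_base32(code_units: list[int]) -> str:
    """Pack 16-bit code units into 5-bit symbols, zero-padding the last group (assumed rule)."""
    value = 0
    bits = 0
    for unit in code_units:
        value = (value << 16) | (unit & 0xFFFF)
        bits += 16
    pad = (-bits) % 5          # bits needed to reach a multiple of 5
    value <<= pad
    bits += pad
    symbols = []
    for shift in range(bits - 5, -1, -5):
        symbols.append(ALPHABET[(value >> shift) & 0x1F])
    return "".join(symbols)

print(bytes_per_char(32))             # base32: 16 / 5
print(round(bytes_per_char(128), 2))  # base128: 16 / 7
print(len(pack_base32([0x4E2D] * 5))) # 5 characters = 80 bits = 16 symbols
```

Five 16-bit characters occupy 80 bits, which divide evenly into 16 five-bit symbols, so the 3.2 bytes-per-character ratio is reached exactly on inputs whose length is a multiple of five; shorter inputs pay a small padding cost under the assumed rule.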

 

摘要

目前已有許多ASCII相容編碼的國際化網域名稱提案。在國際化網域名稱中支援UTF-8的議題也同時受到討論。本文提出替代這些編碼的方法。我們首先提出簡單基底三十二ASCII相容編碼,達到每一16位元字元3.2位元組的空間效率。其次,我們提出ISO 10646通用字元集的八位元雙模式轉換格式。128個基底字元(80)₁₆至(FF)₁₆用來表示非ASCII字元,如此可達到每一16位元字元佔2.29位元組的空間效率。

關鍵詞:國際化網域名稱、通用字元集、基底128、簡單性、空間效率。