Theiling Online    Sitemap    Conlang Mailing List HQ   

Re: ConScript Unicode Registry

From:David McCann <david@...>
Date:Friday, April 11, 2008, 16:11
On Fri, 2008-04-11 Tristan McLeay wrote:
> Actually,I thought [the private use areas] were appropriate for > interchanging text as long as you are in agreement with your > associates as to how it should be interchanged? What other purpose > are there for PUA characters? (I am aware some fonts put small > caps letters in them, or other fancy styles --- this is less than > optimal and strictly unnecessary especially with modern font > formats like Open Type.) The CSUR would seem to be appropriate for > that.
The Unicode manual says "Private-use characters are often used to implement end-user defined characters (EUDC), which are common in East Asian computing environments." "Systems vendors and/or software developers may need to reserve some private-use characters for internal use by their software. The corporate use subarea is the preferred area for such reservations. Assignments of character semantics in this subarea may be completely internal, hidden from end users, and used only for vendor-specific application support, or they may be published as vendor-specific character assignments available to applications and end users. An example of the former case would be the assignment of a character code to a system support operation such as <MOVE> or <COPY>; an example of the latter case would be the assignment of a character code to a vendor-specific logo character such as Apple’s apple character." This is one of the problems with the Conlang Script registry: it ignores the statement that "By convention, the primary Private Use Area is divided into a corporate use subarea for platform writers, starting at U+F8FF and extending downward in values, and an end-user subarea, starting at U+E000 and extending upward." and writes all over the corporate area. The other problem is that most of the scripts registered sink without trace. Apart from Tengwar, Cirth, Klingon, and Vedurian, most of the links given are now broken. There doesn't seem to be much sense in registering scripts unless there is a real community of users, not just a proud inventor. I can see why Unicode is cautious. Incidentally, isn't Open Type just a Microsoft format? I don't know if it's available for the Mac as well as AAT, but it certainly isn't available for Linux. Another problem for the CSUR, which assumes the availability of OT in places. The CSUR also encodes too much, ignoring the instruction that punctuation and diacritics should not be duplicated for each script.