Re: CHAT: Wordwrapping (was: File converter)

From:Paul Bennett <paul.bennett@...>
Date:Friday, November 12, 1999, 9:27
Pablo>>>>>>
Paul Bennett <Paul.Bennett@...> wrote:

> While I'm on the subject, would anyone appreciate a program to "Romanise" a
> Unicode UTF-8 file into 7-Bit PC-Ascii? I've been thinking about writing one,
> but I need a stronger reason to start a project of this size than just my own
> idle curiosity.

How would you do that? I mean, would it have a substitution table for the Unicode characters beyond ASCII?
<<<<<<

Yes. A lookup table with 64k entries, each of which points to a sequence of 7-bit characters. (Possibly a sequence of 8-bit Latin-1 characters?) For example, the entry for the hiragana syllable {ka} would point to the ASCII character sequence {k}{a}.

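A rough sketch of the lookup-table idea, in Python. This is only an illustration under assumptions: the names `ROMAN_TABLE`, `romanise` and `romanise_utf8` are made up for the example, the table holds a handful of sample entries rather than the full 64k, and unmapped characters fall back to a `?` placeholder.

```python
# Hypothetical lookup table: Unicode code point -> 7-bit ASCII sequence.
# Only a few sample entries are filled in; a real table would cover far more.
ROMAN_TABLE = {
    0x304B: "ka",  # HIRAGANA LETTER KA
    0x304D: "ki",  # HIRAGANA LETTER KI
    0x00E9: "e",   # LATIN SMALL LETTER E WITH ACUTE
    0x00F1: "n",   # LATIN SMALL LETTER N WITH TILDE
}


def romanise(text, table=ROMAN_TABLE, unknown="?"):
    """Replace every non-ASCII character with its table entry."""
    out = []
    for ch in text:
        cp = ord(ch)
        if cp < 0x80:
            # Already 7-bit ASCII: copy through unchanged.
            out.append(ch)
        else:
            # Look up the replacement; use the placeholder if unmapped.
            out.append(table.get(cp, unknown))
    return "".join(out)


def romanise_utf8(data):
    """Decode UTF-8 bytes, romanise, and return 7-bit ASCII bytes."""
    return romanise(data.decode("utf-8")).encode("ascii")
```

A dictionary keeps the sketch short; a flat 64k-entry array indexed by code point, as described above, would work the same way for characters in that range.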
>>>>>>
While you're at it, do you know, or have you made a program that can word-wrap text (not on screen, but actually inserting line breaks)? It should be easy, but I've tried with VBA and the resulting program somehow mangles the output in one way or another every time!
<<<<<<

Once again, on Unix, there's a built-in command for this (in fact, there are several). Boudewijn has posted a good demonstration of a homegrown one. By the sounds of the above, you're not a Python-head, and you're not on Unix. I'll have a tinker in "real" Basic, instead of "toy" Basic, and see what I can do...

---
Pb
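
For anyone who wants to see the shape of a homegrown word-wrapper, here is a minimal greedy sketch in Python. It is not the demonstration posted earlier in the thread, just an illustration; the function name `wrap` and the default width of 72 are assumptions made for the example.

```python
def wrap(text, width=72):
    """Greedy word wrap: returns text with line breaks inserted."""
    lines = []
    current = ""
    for word in text.split():
        if not current:
            # First word on the line always goes in, even if too long.
            current = word
        elif len(current) + 1 + len(word) <= width:
            # The word fits after a single space.
            current += " " + word
        else:
            # The word does not fit: close this line and start a new one.
            lines.append(current)
            current = word
    if current:
        lines.append(current)
    return "\n".join(lines)
```

For example, `wrap("the quick brown fox jumps", 10)` yields three lines: `the quick`, `brown fox` and `jumps`. The sketch collapses runs of whitespace and leaves an over-long word on a line of its own rather than splitting it. Python's standard library also offers `textwrap.fill` for the same job.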