Re: CHAT: Wordwrapping (was: File converter)
From: | Paul Bennett <paul.bennett@...> |
Date: | Friday, November 12, 1999, 9:27 |
Pablo>>>>>>
Paul Bennett <Paul.Bennett@...> wrote:
> While I'm on the subject, would anyone appreciate a program to "Romanise" a
> Unicode UTF-8 file into 7-Bit PC-Ascii? I've been thinking about writing one,
> but I need a stronger reason to start a project of this size than just my own
> idle curiosity.
How would you do that? I mean, would it have a substitution table
for the Unicode characters beyond ASCII?
<<<<<<
Yes. A lookup table with 64k-entries, each of which points to a sequence of
7-bit characters. (Possibly a sequence of 8-bit Latin-1 characters?). For
example, the entry for the hirigana syllable {ka} would point to the ascii
character sequence {k}{a}.
>>>>>>
While you're at it, do you know, or have you made a program that can
word-wrap text (not on screen, but actually inserting line breaks)?
It should be easy, but I've tried with VBA and the resulting program
somehow mangles the output in one way or another every time!
<<<<<<
Once again, on Unix, there's a built-in command for this (in fact there's
several). Boudewijn has posted a good demonstration of a homegrown one. By the
sounds of the above, you're not a Python-head, and you're not on Unix. I'll
have a tinker in "real" Basic, instead of "toy" Basic, and see what I can do...
---
Pb
*************************************************************
This email and any files transmitted with it are confidential
and intended solely for the use of the individual or entity
to whom they are addressed.
If you have received this email in error please notify the
sender. This footnote also confirms that this email message
has been scanned for the presence of computer viruses.
*************************************************************