Re: Dictionary Programs?
From: | Peter Clark <peter-clark@...> |
Date: | Wednesday, September 4, 2002, 14:20 |
On Tuesday 03 September 2002 14:03, Le cavallero wrote:
> I probably will make use of that generous offer when I get around to
> compiling my lexicon. What is Pydima, will it output lexical data using
> Latex?
Pydima is the Python Dictionary Maker that Pablo Flores and I are working on.
It unfortunately is not yet ready for public consumption yet, but I'm working
on it. Basically, it's to scratch the itch for a good dictionary creation
program. Open up any professional bi-lingual dictionary and you will soon
appreciate how difficult that would be to duplicate using just some word
processing program or text editor. There are cross-referenced entries, plus
you essentially double the amount of data by having two langauges. That means
that you need to keep two separate files, or at least two separate sections
in the same file, and remember what links to what, and (if you are really
interested in a professional look) what type of sub-entries follow what. This
is doable with small dictionaries, but as one's langauge grows, it becomes
unmanagable for anything more than simple entries.
The idea behind Pydima is that the user creates only one text file (later on
I intend to add a graphical interface, but first things first) in which the
entries are sorted and cross-linked by the program. Here's an example:
\lx agua
\ge water
\ps n
\pn f
\xv ~ abajo
\xe downstream
\xv ~ bendita
\xe holy water
\xv ~ corriente
\xe running water
\xv ~ de colonia
\xe eau-de-cologne
\xv se me hizo ~ la boca
\xe it made my mouth water
Spanish Dictionary results:
agua f. water; ~ abajo downstream. ~ bendita holy water. ~ corriente running
water. ~ de colonia eau-de-cologne. se me hizo ~ la boca it made my mouth
water.
English Dictionary results:
water n agua f.
You may complain, "But the English entry is too small!" Yes, but we only have
one entry. If we were to add another entry, let's say for "regar" and
"llorar", Pydima would automatically sort the relevant information, without
any further user intervention. Thus, with the addition of those two entries,
the English dictionary would begin to look more like:
water n agua f. · vt regar (plants etc) · vi llorar (eyes)
The more entries added on the Spanish side, the larger the English side will
grow.
As for output, Pydima already outputs HTML (courtesy of Pablo), and LaTeX is
next. Another possibility would be RTF, but my eyes keep glazing over
whenever I try to read the specs. :)
:Peter
Reply