Conlang: Re: Arabic and BACK TO Self-segregating morphology (Thomas Hart Chappell, Dec 21 '05, 20:35)

Re: Arabic and BACK TO Self-segregating morphology

From:	Thomas Hart Chappell <tomhchappell@...>
Date:	Wednesday, December 21, 2005, 20:35

From:

Thomas Hart Chappell <tomhchappell@...>

Date:

Wednesday, December 21, 2005, 20:35

On Wed, 21 Dec 2005 11:37:32 -0500, Jim Henry <jimhenry1973@...>

>wrote:

>On 12/20/05, Patrick Littell <puchitao@...> wrote: >> [snip] > >> ........for things like (prefix)i(word)u(word)o(suffix) there are at >> least >> five ways to parse it. We can figure out which morphemes are >> prefixes, which are roots, and which are suffixes, but not their >> hierarchical structure. > >In other words, it's sometimes unclear whether a prefix >modifies the first element of a compound, or modifies >the whole compound -- & ditto with suffxes w.r.t. the >final element of the compound or the whole compound word. > >> Anyway, there are answers; I just want to see if anyone has any >> interesting solutions that don't require infixation, circumfixation, >> or the equivalent. > >What about having high- and low-precedence binder morphemes?

I was just about to mention that, when I read your post!

>That would require a larger inventory of vowels, probably. But suppose >i, o and u are the high-precedence binder morphemes and order of >compounding precedence is left to right, then

See below for varying order-of-binding.

>kainalijaupalitaosa > >could only be parsed as >(((ka + nalija) + palita) + sa) > >But if y, e, and a are the low-precedence binder morphemes, >you could also have, e.g.: > >kaynalijaepalitaosa >= (( ka + nalija) + ( palita + sa )) > >This scheme may be too complex to speak and hear in real time; >we would have to experiement for a while and see how >it works. > >-- >Jim Henry

Operator-precedence grammars are studied in Compiler Theory; and here's a summary of what you'd need to know. 1) Each (binary infix) operator is either Left-Associative or Right- Associative or Non-Associative. So, if /a/ were Left-Associative, /bacad/ could only be parsed as (bac)ad; and if /e/ were Right-Associative, /beced/ could only be parsed as be(ced); and if /u/ were Non-Associative, /bucud/ would be ungrammatical and make no sense; /buc/ would be ok, and /cud/ would be ok, but not /bucud/. *1*) As a variant, you could allow multiple non-associative operators of the same precedence to be grammatical, as long as there were no way to assign any difference to the groupings. In other words, /bucud/ in the example above, might be OK, provided it could not be interpreted as (buc)ud nor as bu(cud). 2) Given any two (binary infix) operators, either the first takes precedence over the second, or the second takes precedence over the first, or they have equal precedence. 3) Precedence is a totally-ordering relationship; if /a/ takes precedence over /e/, and /e/ takes precedence over /i/, then /a/ takes precedence over /i/. 4) Operators of equal precedence _must_ have the same kind of associativity; they are either all Left-Associative, or all Right- Associative, or all Non-Associative. To put it another way, operators with differing associativity must also have differing precedence. ----- OK, imagine a language with seven vowels /a/ /e/ /i/ /o/ /u/ /w/ /y/ and 19 consonants. Imagine all lexical, semantically non-empty, morphemes are consonant- clusters. Imagine the vowels all represent connectors -- makes no difference whether they are compounding-connectors, or affixing-connectors, or whatever. Imagine their precedence order is: /w/ /y/ Left-associative, equal precedence at first; /o/ /u/ Right-associative, equal precedence at second; /i/ Left-associative (third highest precedence); /e/ Right-associative (fourth highest precedence); /a/ Non-associative (lowest precedence); Now consider the pattern /bVcVdVf/. If each V could be any one of /a/ /e/ /i/ /o/ /u/ /w/ /y/, this could lead to 7*7*7=343 different "words"; however, no more than one of the Vs can be replaced by /a/, so there are actually 6*6*6 + 3*6*6 = 216+108 = 324 possible words. /becedef/ is ((bec)ed)ef /bacedef/ is ba((ced)ef) /becadef/ is (bec)a(def) /becedaf/ is ((bec)ed)af /bicidif/ is bi(ci(dif)) /bacidif/ is ba(ci(dif)) /bicadif/ is (bic)a(dif) /bicidaf/ is (bi(cid))af /becedef/ is ((bec)ed)ef /becedif/ is (bec)e(dif) /becidef/ is (be(cid))ef /becidif/ is be(ci(dif)) /bicedef/ is ((bic)ed)ef /bicedif/ is (bic)e(dif) /bicidef/ is (bi(cid))ef /bicidif/ is bi(ci(dif)) /bicidif/ is bi(ci(dif)) /bicidof/ is bi(ci(dof)) /bicodif/ is bi((cod)if) /bicodof/ is bi((cod)of) /bocidif/ is (boc)i(dif) /bocidof/ is (boc)i(dof) /bocodif/ is ((boc)od)if /bocodof/ is ((boc)od)of /becedef/ is ((bec)ed)ef /becedof/ is (bec)e(dof) /becodef/ is (be(cod))ef /becodof/ is be((cod)of) /bocedef/ is ((boc)ed)ef /bocedof/ is (boc)e(dof) /bocodef/ is ((boc)od)ef /bocodof/ is ((boc)o)dof /bicidif/ is bi(ci(dif)) /bicidwf/ is bi(ci(dwf)) /bicwdif/ is bi((cwd)if) /bicwdwf/ is bi(cw(dwf)) /bwcidif/ is (bwc)i(dif) /bwcidwf/ is (bwc)i(dwf) /bwcwdif/ is (bw(cwd))if /bwcwdwf/ is bw(cw(dwf)) ----- You could vary the number of levels of precedence, and the associativity or lack of it at each level, as well as the direction of associativity at each level where there is any, to solve most of your problems. I'd probably recommend a lowest-level non-associative one for compounding words, a highest-level left-associative one for introducing a bias in favor of suffixes as opposed to prefixes, and run through the whole "Left, Right, Non", cycle twice to allow each kind of associativity to have precedence over each kind. (That would make six levels, for those keeping count. With only one binder per level, you would only need six binders; however you might want make a semantic or grammatical difference in two or more different binders at the same level.) This could completely solve the problem of whether a prefix or suffix on a compound word modifies just the component, or just the whole compound. ----- Well, that may be more than you wanted to read. I hope it helped, instead. Tom H.C. in MI