5. Words, words, words
Perhaps the greatest difference among languages and even speakers of the same language lies in the lexicon. Indeed, one definition of the lexicon is the location of all of the idiosyncratic aspects of language.
"The creative powers of English morphology are pathetic compared to what we find in other languages. 127....In effect, Kivunjo and languages like it are building an entire sentence inside a single complex word, the verb. 128"
BUT RECALL DISCUSSION OF "STOP!" It too can be interpreted as a single word sentence in English -- but not a very complex or interesting one.
morphological creativity
"words not simply retrieved from a mental archive"
"The engineering trick behind human language--its being a discrete combinatorial system--is used in at least two different places: sentences and phrases are built out of words by the rules of syntax, and the words themselves are built out of smaller bits by another set of rules, the rules of morphology. 127"
Both words and phrases can have hierarchical structure.
NP->det+N+(S)
Adj->"un"+Adj (prefix+Adj)
Adj->verb stem+"able" (stem+suffix) for example the complex word "unfixable" where the stem is "fix."
types of morphemes
While all words in a language share much in common, almost by definition we can assume they are different in form and meaning and that classes of words are distinctive in one or more ways from other classes.
bound and free morphemes
The morphemes like -ed, -s, that do not occur on their own are known as bound morphemes. In contrast, nouns, verbs, adjectives, etc. are known as free morphemes since they can appear on their own without modification.
function (closed) and content (open) morpheme classes
Morphemes known
inflections
Inflections are bound morphemes that modify the form of a word. Some languages, for example those with extensive case structure like Russian or Latin, rely more on inflections than does English--which has lost most of its Indo-European inflection heritage.
derivational rules (English examples p.128-9
|
Derivational morphemes |
Inflectional morphemes |
|
adjective and verb prefix un |
adjective suffix er, r |
|
adverbializer ly |
adjective suffix est, st |
|
nominalizer er |
noun suffix ie |
|
noun prefix ex |
noun suffix s, es |
|
verb prefix dis |
noun suffix 's, ' |
|
verb prefix mis |
verb suffix s, es |
|
verb prefix out |
verb suffix ed, d |
|
verb prefix over |
verb suffix ing |
|
verb prefix pre |
verb suffix en |
|
verb prefix pro |
|
|
verb prefix re |
|
|
Clitics (fragments attached to stems) |
|
|
noun phrase post-clitic 'd |
v:aux|would, v|have&PAST |
|
noun phrase post-clitic 'll |
v:aux|will |
|
noun phrase post-clitic 'm |
v|be&1S, v:aux|be&1S |
|
noun phrase post-clitic 're |
v|be&PRES, v:aux|be&PRES |
|
noun phrase post-clitic 's |
v|be&3S, v:aux|be&3S |
|
verbal post-clitic n't |
neg|not |
)
morphological structure "rules"
N -> Nstem + Ninflection
N-> dog + (-s)
"A noun can consist of a noun stem followed by a noun inflection. 131" The root of a word is the smallest bit that cannot be cut into smaller parts. Roots combine with inflections to form stems and stems can recursively form other stems..to an extent.
(Kiparsky noted cases like "mice-infested" but not "rats-infested" were probably due the fact that "mice" has its own lexical entry whereas "rats" is derived from "rat" + /pl/.
Other apparent m-rules exist but the meaning is not generally predictable from the structured parts of N roots and their inflections.136-7.
application of phonological rules
Note the commonality of formation of plurals, 3 person singular verbs, and possessives.
There are several ways to state this rule. Clearly speakers have not memorized plurals since the same phonological processes occur in several inflectional situations including possessives and verb inflections. As Morris Halle points out, how else can we explain how English speakers correctly say "I like Johann Sebastian Bach's music" since there no English final [-ch]. Speakers do recognize it as unvoiced however.
rule
Look at the final phonological segment (phoneme) of the morpheme to be inflected. If that morpheme ends in a phoneme segment that is an affricate (e.g. fish, church, gauge) add [ez], ie. create a new syllable. Otherwise add a short continuous fricative whose voicing is the same as that final phoneme onto the final syllable of the inflected morpheme.
(cat, dog, fish,--> cats, dogz, fishez)
mental dictionary
Information about the "arbitrary signs" of language is "stored" in our mental dictionary (a.k.a mental lexicon, vocabulary store).
Details of individual words may be idiosyncratic to a degree.
words as syntactic atoms
(The old chemistry model again?)
words as string of linguistic stuff: "listeme"
This coined word is used to indicate that lots of information about words and some phrases just must be memorized in the lexicon. 148.
level information: pronunciation, syntax, semantics
morpheme meaning as contribution ...
Each morpheme contributes something, often unique, to the meaning of its phrases and consequently its sentences.
be careful about sense/reference of expressions
Morphemes themselves do not refer and their only sense is in regard to contribution to their phrases (expressions)
Thus "John", "the instructor in 712", and "father of Kristin and Alexandra" all can be used to refer to John Limber, writer of these words, but these 3 expressions do not MEAN the same thing. The meaning sense of these expressions is given by the meanings of the basic morphemes in their syntactic arrangements (phrases)
See Frege on the "the morning/evening star" and "Venus."
associations?
possible words: phonology and the lexicon
The lexicon can be thought of as a repository for the possible words of a language as defined by the phonology. Unlike combinations of numbers, not all combinations of phonological segments can be meaningful (MIller, 1990). The actual number of words is much, much less than the possible number of combinations because certain combinations are ruled out by that phonology--so-called morpheme structure rules or phonotactic rules of a language. In English, for example, "ftik" or "tsar" are not possible words despite our ability to articulate them; other languages may find them perfectly good forms.
(note that similarities in phonotactic rules may be a major factor in APPARENT similarities across languages since phonetic values "mutate" within those structures.)
linguistic levels in the lexicon
Pinker, p.130 points out words are structured like sentences and cannot be generated by a chaining (Markoff) device.
N->Nstem + Ninflection (N-> dog + -s)
Nstem->Nstem +Nstem (N->Yugoslavia report)
Nstem->Nroot +Nrootaffix (N->Darwin + -ian)
Pinker, p.133 notes how this structure can lead to ambiguity, e.g. blackboard vs black board and how one can "test" whether something like Yugoslavia report is a compound word or phrase.
See Burke et al (1990) for an example of "frisbee"
referent of the phrase (pragmatics)
sense or meaning contribution of words to their phrase
this is the contribution the entire word makes; it is determined combinatorially by its internal structure; a "sense" derived from the lexical entries for its "head" and other component morphemes.
in some cases root-affixe combinations have unpredictable meanings and stems entered meaning entered in the lexicon.
morphological "Syntactic" structure
syllable structure
phonological segment structure (phonemic structure)
phonetic structure
distinctive feature structure
movements
acoustic structure
redundancy
Put another way, the possible words in every language are redundant. Choices in sequences of elements are not completely independent of each other as experienced Scrabble players know. The redundancy of language manifests itself in various places including the structure of words.
actual number of words-individuals and languages?
empirical estimates
One estimate (Nagy and Anderson in Miller, 1990, p.138) puts the vocabulary of high school students at between 45,000 and 60,000 depending on exactly what words are involved. There is of course considerable variability.
A self-rating estimate based on a sample of words carried out in my 712 class in 1993 averaged 45,000 words "used" and 91,000 words "known."
dictionaries

Figure x Exponential growth of English dictionaries
types of morphemes
open class (content words)
closed class (function words)
proper names
See Semenza and Zettin (1989) They argue from the clinical case of aphasia reported here, that proper names can be pure referring expressions -- they have no "sense" as discussed above.
acquisition of vocabulary
rate is accelerating
(Compare to training apes to use symbols or signs.)
the "gavagai" problem
See Pinker and 2nd language video--W. Quine's problem.
other issues
the lexicon: cognition & perception
frequency and familiarity
age of acquisition
priming effects
methods
associations among words
Word associations were perhaps first used by Francis Galton (1822-1911) as tap into the mind ["They lay bare the foundations of a man's thought..in Miller, p.154]. He measured the time one word gave rise to another idea. Carl Jung (1875-1961) tried to apply the idea to diagnosing psychiatric problems in a famous paper, relating it to some of Freud's ideas. Much research has been done on the topic, from collecting norms to building mental models generating these associations.
Jung, C. G. (1910). "The association method." American Journal of Psychology 21: 219-269.
Jung's 3 Clark lectures, Sept.,1909 translated from the German by Dr. A.A. Brill. CJ references German papers, 1906.
are derivations entered in our mental lexicons?
Do we compute the meaning of "unfixable" from its morphemes and structure or just remember it wholistically despite its origins?
words are special
word superiority effect
phoneme restoration effect
idioms (listemes)
Information must be in the lexicon that cannot be derived from the component elements. Since phonetic elements carry negligible "phonetic symbolism," this means the minimal meaningful elements are morphemes. Idiom, however, are word or phrase structures with their own unique meanings.
idiom translation-non lexical?
ENG: That's nothing to write home about
Swedish: "You wouldn't hang that on your Christmas tree" (LINGUIST List: Vol-6-1006.)