Edward Sapir (1884–1939). Language: An Introduction to the Study of Speech. 1921.
Chapter 8Language as a Historical Product: Phonetic Law
I
These parallelisms in drift may operate in the phonetic as well as in the morphological sphere, or they may affect both at the same time. Here is an interesting example. The English type of plural represented by foot: feet, mouse: mice is strictly parallel to the German Fuss: Füsse, Maus: Mäuse. One would be inclined to surmise that these dialectic forms go back to old Germanic or West-Germanic alternations of the same type. But the documentary evidence shows conclusively that there could have been no plurals of this type in primitive Germanic. There is no trace of such vocalic mutation (“umlaut”) in Gothic, our most archaic Germanic language. More significant still is the fact that it does not appear in our oldest Old High German texts and begins to develop only at the very end of the Old High German period (circa 1000
How did such strikingly individual alternations as fot: fet, fuoss: füesse develop? We have now reached what is probably the most central problem in linguistic history, gradual phonetic change. “Phonetic laws” make up a large and fundamental share of the subject-matter of linguistics. Their influence reaches far beyond the proper sphere of phonetics and invades that of morphology, as we shall see. A drift that begins as a slight phonetic readjustment or unsettlement may in the course of millennia bring about the most profound structural changes. The mere fact, for instance, that there is a growing tendency to throw the stress automatically on the first syllable of a word may eventually change the fundamental type of the language, reducing its final syllables to zero and driving it to the use of more and more analytical or symbolic methods. The English phonetic laws involved in the rise of the words foot, feet, mouse and mice from their early West-Germanic prototypes fot, foti, mus, musi may be briefly summarized as follows:
1. In foti “feet” the long o was colored by the following i to long ö, that is, o kept its lip-rounded quality and its middle height of tongue position but anticipated the front tongue position of the i; ö is the resulting compromise. This assimilatory change was regular, i.e., every accented long o followed by an i in the following syllable automatically developed to long ö; hence tothi “teeth” became töthi, fodian “to feed” became födian. At first there is no doubt the alternation between o and ö was not felt as intrinsically significant. It could only have been an unconscious mechanical adjustment such as may be observed in the speech of many to-day who modify the “oo” sound of words like you and few in the direction of German ü without, however, actually departing far enough from the “oo” vowel to prevent their acceptance of who and you as satisfactory rhyming words. Later on the quality of the ö vowel must have departed widely enough from that of o to enable ö to rise in consciousness as a neatly distinct vowel. As soon as this happened, the expression of plurality in föti, töthi, and analogous words became symbolic and fusional, not merely fusional.
2. In musi “mice” the long u was colored by the following i to long ü. This change also was regular; lusi “lice” became lüsi, kui “cows” became küi (later simplified to kü; still preserved as ki- in kine), fulian “to make foul” became fülian (still preserved as -file in defile). The psychology of this phonetic law is entirely analogous to that of 1.
3. The old drift toward reducing final syllables, a rhythmic consequence of the strong Germanic stress on the first syllable, now manifested itself. The final -i, originally an important functional element, had long lost a great share of its value, transferred as that was to the symbolic vowel change (o: ö). It had little power of resistance, therefore, to the drift. It became dulled to a colorless -e; föti became föte.
4. The weak -e finally disappeared. Probably the forms föte and föt long coexisted as prosodic variants according to the rhythmic requirements of the sentence, very much as Füsse and Füss’ now coexist in German.
5. The ö of föt became “unrounded” to long e (our present a of fade). The alternation of fot: foti, transitionally fot: föti, föte, föt, now appears as fot: fet. Analogously, töth appears as teth, födian as fedian, later fedan. The new long e-vowel “fell together” with the older e- vowel already existent (e.g., her “here,” he “he”). Henceforward the two are merged and their later history is in common. Thus our present he has the same vowel as feet, teeth, and feed. In other words, the old sound pattern o, e, after an interim of o, ö, e, reappeared as o, e, except that now the e had greater “weight” than before.
6. Fot: fet, mus: müs (written mys) are the typical forms of Anglo-Saxon literature. At the very end of the Anglo-Saxon period, say about 1050 to 1100
7. In Chaucer’s day (circa 1350–1400
8. About the same time the long u- vowels were diphthongized to ou (i.e., o of present Scotch not + u of full). The Chaucerian mus: mis now appears as the Shakespearean mous: meis. This change may have manifested itself somewhat later than 7; all English dialects have diphthongized old Germanic long i. but the long undiphthongized u is still preserved in Lowland Scotch, in which house and mouse rhyme with our loose. 7 and 8 are analogous developments, as were 5 and 6; 8 apparently lags behind 7 as 6, centuries earlier, lagged behind 7.
9. Some time before 1550 the long e of fet (written feet) took the position that had been vacated by the old long i, now diphthongized (see 7), i.e., e took the higher tongue position of i. Our (and Shakespeare’s) “long e” is, then, phonetically the same as the old long i. Feet now rhymed with the old write and the present beat.
10. About the same time the long o of fot (written foot) took the position that had been vacated by the old long u, now diphthongized (see 8), i.e., o took the higher tongue position of u. Our (and Shakespeare’s) “long oo” is phonetically the same as the old long u. Foot now rhymed with the old out and the present boot. To summarize 7 to 10, Shakespeare pronounced meis, mous, fit, fut, of which meis and mous would affect our ears as a rather “mincing” rendering of our present mice and mouse, fit would sound practically identical with (but probably a bit more “drawled” than) our present feet, while foot, rhyming with boot, would now be set down as “broad Scotch.”
11. Gradually the first vowel of the diphthong in mice (see 7) was retracted and lowered in position. The resulting diphthong now varies in different English dialects, but ai (i.e., a of father, but shorter, + short i) may be taken as a fairly accurate rendering of its average quality. What we now call the “long i” (of words like ride, bite, mice) is, of course, an ai-diphthong. Mice is now pronounced mais.
12. Analogously to 11, the first vowel of the diphthong in mouse (see 8) was unrounded and lowered in position. The resulting diphthong may be phonetically rendered au, though it too varies considerably according to dialect. Mouse, then, is now pronounced maus.
13. The vowel of foot (see 10) became “open” in quality and shorter in quantity, i.e., it fell together with the old short u-vowel of words like full, wolf, wool. This change has taken place in a number of words with an originally long u (Chaucerian long close o), such as forsook, hook, book, look, rook, shook, all of which formerly had the vowel of boot. The older vowel, however, is still preserved in most words of this class, such as fool, moon, spool, stoop. It is highly significant of the nature of the slow spread of a “phonetic law” that there is local vacillation at present in several words. One hears roof, soot, and hoop, for instance, both with the “long” vowel of boot and the “short” of foot. It is impossible now, in other words, to state in a definitive manner what is the “phonetic law” that regulated the change of the older foot (rhyming with boot) to the present foot. We know that there is a strong drift towards the short, open vowel of foot, but whether or not all the old “long oo” words will eventually be affected we cannot presume to say. If they all, or practically all, are taken by the drift, phonetic law 13 will be as “regular,” as sweeping, as most of the twelve that have preceded it. If not, it may eventually be possible, if past experience is a safe guide, to show that the modified words form a natural phonetic group, that is, that the “law” will have operated under certain definable limiting conditions, e.g., that all words ending in a voiceless consonant (such as p, t, k, f) were affected (e.g., hoof, foot, look, roof), but that all words ending in the oo-vowel or in a voiced consonant remained unaffected (e.g., do, food, move, fool). Whatever the upshot, we may be reasonably certain that when the “phonetic law” has run its course, the distribution of “long” and “short” vowels in the old oo- words will not seem quite as erratic as at the present transitional moment. We learn, incidentally, the fundamental fact that phonetic laws do not work with spontaneous automatism, that they are simply a formula for a consummated drift that sets in at a psychologically exposed point and gradually worms its way through a gamut of phonetically analogous forms.
It will be instructive to set down a table of form sequences, a kind of gross history of the words foot, feet, mouse, mice for the last 1500 years:
- fot: foti; mus: musi (West Germanic)
- fot: föti; mus: müsi
- fot: föte; mus: müse
- fot: föt; mus: müs
- fot: fet; mus: müs (Anglo-Saxon)
- fot: fet; mus: mis (Chaucer)
- fot: fet; mous: meis
- fut (rhymes with boot: fit; mous: meis (Shakespeare)
- fut: fit; maus: mais
- fut (rhymes with put): fit; maus: mais (English of 1900)
It will not be necessary to list the phonetic laws that gradually differentiated the modern German equivalents of the original West Germanic forms from their English cognates. The following table gives a rough idea of the form sequences in German:
- fot: foti; mus: musi (West Germanic)
- foss: fossi; mus: musi
- fuoss: fuossi; mus: musi (Old High German)
- fuoss: füessi; mus: müsi
- fuoss: füesse; mus: müse (Middle High German)
- fuoss: füesse; mus: müze
- fuos: füese; mus: müze
- fuos: füese; mous: möüze
- fus: füse; mous: möüze (Luther)
- fus: füse; maus: moize (German of 1900)
We cannot even begin to ferret out and discuss all the psychological problems that are concealed behind these bland tables. Their general parallelism is obvious. Indeed we might say that to-day the English and German forms resemble each other more than does either set the West Germanic prototypes from which each is independently derived. Each table illustrates the tendency to reduction of unaccented syllables, the vocalic modification of the radical element under the influence of the following vowel, the rise in tongue position of the long middle vowels (English o to u, e to i; German o to uo to u, üe to ü), the diphthongizing of the old high vowels (English i to ei to ai; English and German u to ou to au; German ü to öü to oi). These dialectic parallels cannot be accidental. They are rooted in a common, pre-dialectic drift.
Phonetic changes are “regular.” All but one (English table, X.), and that as yet uncompleted, of the particular phonetic laws represented in our tables affect all examples of the sound in question or, if the phonetic change is conditional, all examples of the same sound that are analogously circumstanced. An example of the first type of change is the passage in English of all old long i-vowels to diphthongal ai via ei. The passage could hardly have been sudden or automatic, but it was rapid enough to prevent an irregularity of development due to cross drifts. The second type of change is illustrated in the development of Anglo-Saxon long o to long e, via ö, under the influence of a following i. In the first case we may say that au mechanically replaced long u, in the second that the old long o “split” into two sounds—long o, eventually u, and long e, eventually i. The former type of change did no violence to the old phonetic pattern, the formal distribution of sounds into groups; the latter type rearranged the pattern somewhat. If neither of the two sounds into which an old one “splits” is a new sound, it means that there has been a phonetic leveling, that two groups of words, each with a distinct sound or sound combination, have fallen together into one group. This kind of leveling is quite frequent in the history of language. In English, for instance, we have seen that all the old long ü-vowels, after they had become unrounded, were indistinguishable from the mass of long i-vowels. This meant that the long i-vowel became a more heavily weighted point of the phonetic pattern than before. It is curious to observe how often languages have striven to drive originally distinct sounds into certain favorite positions, regardless of resulting confusions. In Modern Greek, for instance, the vowel i is the historical resultant of no less than ten etymologically distinct vowels (long and short) and diphthongs of the classical speech of Athens. There is, then, good evidence to show that there are general phonetic drifts toward particular sounds.
More often the phonetic drift is of a more general character. It is not so much a movement toward a particular set of sounds as toward particular types of articulation. The vowels tend to become higher or lower, the diphthongs tend to coalesce into monophthongs, the voiceless consonants tend to become voiced, stops tend to become spirants. As a matter of fact, practically all the phonetic laws enumerated in the two tables are but specific instances of such far-reaching phonetic drifts. The raising of English long o to u and of long e to i, for instance, was part of a general tendency to raise the position of the long vowels, just as the change of t to ss in Old High German was part of a general tendency to make voiceless spirants of the old voiceless stopped consonants. A single sound change, even if there is no phonetic leveling, generally threatens to upset the old phonetic pattern because it brings about a disharmony in the grouping of sounds. To reëstablish the old pattern without going back on the drift the only possible method is to have the other sounds of the series shift in analogous fashion. If, for some reason or other, p becomes shifted to its voiced correspondent b, the old series p, t, k appears in the unsymmetrical form b, t, k. Such a series is, in phonetic effect, not the equivalent of the old series, however it may answer to it in etymology. The general phonetic pattern is impaired to that extent. But if t and k are also shifted to their voiced correspondents d and g, the old series is reëstablished in a new form: b, d, g. The pattern as such is preserved, or restored. Provided that the new series b, d, g does not become confused with an old series b, d, g of distinct historical antecedents. If there is no such older series, the creation of a b, d, g series causes no difficulties. If there is, the old patterning of sounds can be kept intact only by shifting the old b, d, g sounds in some way. They may become aspirated to bh, dh, gh or spirantized or nasalized or they may develop any other peculiarity that keeps them intact as a series and serves to differentiate them from other series. And this sort of shifting about without loss of pattern, or with a minimum loss of it, is probably the most important tendency in the history of speech sounds. Phonetic leveling and “splitting” counteract it to some extent but, on the whole, it remains the central unconscious regulator of the course and speed of sound changes.
The desire to hold on to a pattern, the tendency to “correct” a disturbance by an elaborate chain of supplementary changes, often spread over centuries or even millennia—these psychic undercurrents of language are exceedingly difficult to understand in terms of individual psychology, though there can be no denial of their historical reality. What is the primary cause of the unsettling of a phonetic pattern and what is the cumulative force that selects these or those particular variations of the individual on which to float the pattern readjustments we hardly know. Many linguistic students have made the fatal error of thinking of sound change as a quasi-physiological instead of as a strictly psychological phenomenon, or they have tried to dispose of the problem by bandying such catchwords as “the tendency to increased ease of articulation” or “the cumulative result of faulty perception” (on the part of children, say, in learning to speak). These easy explanations will not do. “Ease of articulation” may enter in as a factor, but it is a rather subjective concept at best. Indians find hopelessly difficult sounds and sound combinations that are simple to us; one language encourages a phonetic drift that another does everything to fight. “Faulty perception” does not explain that impressive drift in speech sounds which I have insisted upon. It is much better to admit that we do not yet understand the primary cause or causes of the slow drift in phonetics, though we can frequently point to contributing factors. It is likely that we shall not advance seriously until we study the intuitional bases of speech. How can we understand the nature of the drift that frays and reforms phonetic patterns when we have never thought of studying sound patterning as such and the “weights” and psychic relations of the single elements (the individual sounds) in these patterns?
Every linguist knows that phonetic change is frequently followed by morphological rearrangements, but he is apt to assume that morphology exercises little or no influence on the course of phonetic history. I am inclined to believe that our present tendency to isolate phonetics and grammar as mutually irrelevant linguistic provinces is unfortunate. There are likely to be fundamental relations between them and their respective histories that we do not yet fully grasp. After all, if speech sounds exist merely because they are the symbolic carriers of significant concepts and groupings of concepts, why may not a strong drift or a permanent feature in the conceptual sphere exercise a furthering or retarding influence on the phonetic drift? I believe that such influences may be demonstrated and that they deserve far more careful study than they have received.
This brings us back to our unanswered question: How is it that both English and German developed the curious alternation of unmodified vowel in the singular (foot, Fuss) and modified vowel in the plural (feet, Füsse)? Was the pre-Anglo-Saxon alternation of fot and föti an absolutely mechanical matter, without other than incidental morphological interest? It is always so represented, and, indeed, all the external facts support such a view. The change from o to ö, later e, is by no means peculiar to the plural. It is found also in the dative singular (fet), for it too goes back to an older foti. Moreover, fet of the plural applies only to the nominative and accusative; the genitive has fota, the dative fotum. Only centuries later was the alternation of o and e reinterpreted as a means of distinguishing number; o was generalized for the singular, e for the plural. Only when this reassortment of forms took place was the modern symbolic value of the foot: feet alternation clearly established. Again, we must not forget that o was modified to ö (e) in all manner of other grammatical and derivative formations. Thus, a pre-Anglo-Saxon hohan (later hon) “to hang” corresponded to a höhith, hehith (later hehth) “hangs”; to dom “doom,” blod “blood,” and fod “food” corresponded the verbal derivatives dömian (later deman) “to deem,” blödian (later bledan) “to bleed,” and födian (later fedan) “to feed.” All this seems to point to the purely mechanical nature of the modification of o to ö to e. So many unrelated functions were ultimately served by the vocalic change that we cannot believe that it was motivated by any one of them.
The German facts are entirely analogous. Only later in the history of the language was the vocalic alternation made significant for number. And yet consider the following facts. The change of foti to föti antedated that of föti to föte, föt. This may be looked upon as a “lucky accident,” for if foti had become fote, fot before the -i had had the chance to exert a retroactive influence on the o, there would have been no difference between the singular and the plural. This would have been anomalous in Anglo-Saxon for a masculine noun. But was the sequence of phonetic changes an “accident”? Consider two further facts. All the Germanic languages were familiar with vocalic change as possessed of functional significance. Alternations like sing, sang, sung (Anglo-Saxon singan, sang, sungen) were ingrained in the linguistic consciousness. Further, the tendency toward the weakening of final syllables was very strong even then and had been manifesting itself in one way and another for centuries. I believe that these further facts help us to understand the actual sequence of phonetic changes. We may go so far as to say that the o (and u) could afford to stay the change to ö (and ü) until the destructive drift had advanced to the point where failure to modify the vowel would soon result in morphological embarrassment. At a certain moment the -i ending of the plural (and analogous endings with i in other formations) was felt to be too weak to quite bear its functional burden. The unconscious Anglo-Saxon mind, if I may be allowed a somewhat summary way of putting the complex facts, was glad of the opportunity afforded by certain individual variations, until then automatically canceled out, to have some share of the burden thrown on them. These particular variations won through because they so beautifully allowed the general phonetic drift to take its course without unsettling the morphological contours of the language. And the presence of symbolic variation (sing, sang, sung) acted as an attracting force on the rise of a new variation of similar character. All these factors were equally true of the German vocalic shift. Owing to the fact that the destructive phonetic drift was proceeding at a slower rate in German than in English, the preservative change of uo to üe (u to ü) did not need to set in until 300 years or more after the analogous English change. Nor did it. And this is to my mind a highly significant fact. Phonetic changes may sometimes be unconsciously encouraged in order to keep intact the psychological spaces between words and word forms. The general drift seizes upon those individual sound variations that help to preserve the morphological balance or to lead to the new balance that the language is striving for.
I would suggest, then, that phonetic change is compacted of at least three basic strands: (1) A general drift in one direction, concerning the nature of which we know almost nothing but which may be suspected to be of prevailingly dynamic character (tendencies, e.g., to greater or less stress, greater or less voicing of elements); (2) A readjusting tendency which aims to preserve serve or restore the fundamental phonetic pattern of the language; (3) A preservative tendency which sets in when a too serious morphological unsettlement is threatened by the main drift. I do not imagine for a moment that it is always possible to separate these strands or that this purely schematic statement does justice to the complex forces that guide the phonetic drift. The phonetic pattern of a language is not invariable, but it changes far less readily than the sounds that compose it. Every phonetic element that it possesses may change radically and yet the pattern remain unaffected. It would be absurd to claim that our present English pattern is identical with the old Indo-European one, yet it is impressive to note that even at this late day the English series of initial consonants:
p | t | k |
b | d | g |
f | th | h |
corresponds point for point to the Sanskrit series:
b | d | g |
bh | dh | gh |
p | t | k |
The relation between phonetic pattern and individual sound is roughly parallel to that which obtains between the morphologic type of a language and one of its specific morphological features. Both phonetic pattern and fundamental type are exceedingly conservative, all superficial appearances to the contrary notwithstanding. Which is more so we cannot say. I suspect that they hang together in a way that we cannot at present quite understand.
If all the phonetic changes brought about by the phonetic drift were allowed to stand, it is probable that most languages would present such irregularities of morphological contour as to lose touch with their formal ground-plan. Sound changes work mechanically. Hence they are likely to affect a whole morphological group here—this does not matter—, only part of a morphological group there—and this may be disturbing. Thus, the old Anglo-Saxon paradigm:
Sing. | Plur. | ||
N. Ac. | fot | fet (older foti) | |
G. | fotes | fota | |
D. | fet (older foti) | fotum |
could not long stand unmodified. The o—e alternation was welcome in so far as it roughly distinguished the singular from the plural. The dative singular fet, however, though justified historically, was soon felt to be an intrusive feature. The analogy of simpler and more numerously represented paradigms created the form fote (compare, e.g., fisc “fish,” dative singular fisce). Fet as a dative becomes obsolete. The singular now had o throughout. But this very fact made the genitive and dative o-forms of the plural seem out of place. The nominative and accusative fet was naturally far more frequently in use than were the corresponding form of the genitive and dative. These, in the end, could not but follow the analogy of fet. At the very beginning of the Middle English period, therefore, we find that the old paradigm has yielded to a more regular one:
Sing. | Plur. | |
N. Ac. | +fot | +fet |
G. | +fotes | fete |
D. | fote | feten |
The starred forms are the old nucleus around which the new paradigm is built. The unstarred forms are not genealogical kin of their formal prototypes. They are analogical replacements.
The history of the English language teems with such levelings or extensions. Elder and eldest were at one time the only possible comparative and superlative forms of old (compare German alt, älter, der älteste; the vowel following the old-, alt- was originally an i, which modified the quality of the stem vowel). The general analogy of the vast majority of English adjectives, however, has caused the replacement of the forms elder and eldest by the forms with unmodified vowel, older and oldest. Elder and eldest survive only as somewhat archaic terms for the older and oldest brother or sister. This illustrates the tendency for words that are psychologically disconnected from their etymological or formal group to preserve traces of phonetic laws that have otherwise left no recognizable trace or to preserve a vestige of a morphological process that has long lost its vitality. A careful study of these survivals or atrophied forms is not without value for the reconstruction of the earlier history of a language or for suggestive hints as to its remoter affiliations.
Analogy may not only refashion forms within the confines of a related cluster of forms (a “paradigm”) but may extend its influence far beyond. Of a number of functionally equivalent elements, for instance, only one may survive, the rest yielding to its constantly widening influence. This is what happened with the English -s plural. Originally confined to a particular class of masculines, though an important class, the -s plural was gradually generalized for all nouns but a mere handful that still illustrate plural types now all but extinct (foot: feet, goose: geese, tooth: teeth, mouse: mice, louse: lice; ox: oxen; child: children; sheep: sheep, deer: deer). Thus analogy not only regularizes irregularities that have come in the wake of phonetic processes but introduces disturbances, generally in favor of greater simplicity or regularity, in a long established system of forms. These analogical adjustments are practically always symptoms of the general morphological drift of the language.
A morphological feature that appears as the incidental consequence of a phonetic process, like the English plural with modified vowel, may spread by analogy no less readily than old features that owe their origin to other than phonetic causes. Once the e- vowel of Middle English fet had become confined to the plural, there was no theoretical reason why alternations of the type fot: fet and mus: mis might not have become established as a productive type of number distinction in the noun. As a matter of fact, it did not so become established. The fot: fet type of plural secured but a momentary foothold. It was swept into being by one of the surface drifts of the language, to be swept aside in the Middle English period by the more powerful drift toward the use of simple distinctive forms. It was too late in the day for our language to be seriously interested in such pretty symbolisms as foot: feet. What examples of the type arose legitimately, in other words via purely phonetic processes, were tolerated for a time, but the type as such never had a serious future.
It was different in German. The whole series of phonetic changes comprised under the term “umlaut,” of which u: ü and au: oi (written äu) are but specific examples, struck the German language at a time when the general drift to morphological simplification was not so strong but that the resulting formal types (e.g., Fuss: Füsse; fallen “to fall”: fällen “to fell”; Horn “horn”: Gehörne “group of horns”; Haus “house”: Häuslein “little house”) could keep themselves intact and even extend to forms that did not legitimately come within their sphere of influence. “Umlaut” is still a very live symbolic process in German, possibly more alive to-day than in medieval times. Such analogical plurals as Baum “tree”: Bäume (contrast Middle High German boum: boume) and derivatives as lachen “to laugh”: Gelächter “laughter” (contrast Middle High German gelach) show that vocalic mutation has won through to the status of a productive morphologic process. Some of the dialects have even gone further than standard German, at least in certain respects. In Yiddish, for instance, “umlaut” plurals have been formed where there are no Middle High German prototypes or modern literary parallels, e.g., tog “day”: teg “days” (but German Tag: Tage) on the analogy of gast “guest”: gest “guests” (German Gast: Gäste), shuch “shoe”: shich “shoes” (but German Schuh: Schuhe) on the analogy of fus “foot”: fis “feet.” It is possible that “umlaut” will run its course and cease to operate as a live functional process in German, but that time is still distant. Meanwhile all consciousness of the merely phonetic nature of “umlaut” vanished centuries ago. It is now a strictly morphological process, not in the least a mechanical phonetic adjustment. We have in it a splendid example of how a simple phonetic law, meaningless in itself, may eventually color or transform large reaches of the morphology of a language.