Swadesh list

From Wikipedia, the free encyclopedia

Jump to: navigation, search

A Swadesh list is one of several lists of vocabulary with basic meanings, developed by Morris Swadesh in the 1940–50s, which is used in lexicostatistics (quantitative language relatedness assessment) and glottochronology (language divergence dating).

There are two basic versions of Swadesh list, one with 200 meanings, the other with 100 meanings. In the composite listing on this page, there are actually 207 meanings in total, since seven of the entries in the 100-meaning list (breast, fingernail, full, horn, knee, moon, round) were not in the original 200-meaning list. To see which words are in which lists, see these Wiktionary's Swadesh list appendix.

Contents

[edit] Usage in lexicostatistics and glottochronology

The Swadesh word list is used in lexicostatistics and glottochronology to determine the approximate date of first separation of genetically related languages, though other lists may be used. The closeness of the relationship of the languages is suggested to be roughly proportional to the number of cognate words present in the list. The reason that a fixed set of concepts is used, rather than a list of arbitrary words, is that the basic vocabulary learned during early childhood is assumed to change very slowly over time. Note that the task of counting the number of cognate words in the list is far from trivial, and may be subject to dispute, because cognates do not necessarily look similar, and recognition of cognates presupposes knowledge of the sound laws of the respective languages. For example, English 'wheel' and Hindi 'chakra' are cognates, although they are not recognizable as such without knowledge of the history of both languages. Also, even in cases where the number of cognates is undisputed, use of Swadesh lists for dating is disputed, because of the underlying assumption that the rate of replacement of basic vocabulary is constant over long periods of time. While Swadesh lists are a useful tool to get a rough idea, mainstream historical linguistics is usually very sceptical about claims of relatedness based on Swadesh lists exclusively.

The use of Swadesh lists in glottochronology was most popular during the 1960s and 1970s, after which enthusiasm waned and the discussion of the method's merit became emotional, leading to a temporary demise of the method. Refinements since the early 1970s include the incorporation of a geographical dimension into the equations, accounting for borrowing, and the use of robust statistical models, borrowed from phylogenetics.

A recent example of the use of Swadesh lists for absolute dating is the study of Gray and Atkinson (2003), calculating a tree of Indo-European languages with absolute dates for its nodes, using Bayesian principles, dating the Proto-Indo-European language to ca. 7000 BC (see Indo-Hittite). The study, which begins with a merciless criticism of the earlier forms of glottochronology, is based on the set of 200-word swadesh lists compiled by Isidore Dyen for 87 Indo-European languages. This 200-word swadesh list was already early abandoned by Swadesh for suspect with too many borrowed items, and has additionally been shown to be very unreliable (cf. Embleton 1995). (Swadesh later introduced a 100 item list which he considered more universal and culture-free. Because of this and false underlying assumptions of rates in language change, the work is generally argued against by practitioners of historical linguistics (cf. e.g. Campbell 1998:177ff), although the criticism has very little concrete basis, apart from verbal argument.[citation needed]) Gray and Atkinson use models developed for the analysis of phylogenetic relationships in biology and it remains unclear whether any critical violations of the models' assumptions are violated in the course of language evolution. It remains to be seen if the method will achieve wide acceptance in linguistics.

[edit] Swadesh list in English

Below is the Swadesh list of 207 words in the English language. For a Swadesh list that compares English, French, German, Italian, Spanish, Dutch, Esperanto, Swedish, and Latin (with links to other lists in other languages), see Wiktionary:Swadesh list.

  1. I
  2. you (singular)
  3. he
  4. we
  5. you (plural)
  6. they
  7. this
  8. that
  9. here
  10. there
  11. who
  12. what
  13. where
  14. when
  15. how
  16. not
  17. all
  18. many
  19. some
  20. few
  21. other
  22. one
  23. two
  24. three
  25. four
  26. five
  27. big
  28. long
  29. wide
  30. thick
  31. heavy
  32. small
  33. short
  34. narrow
  35. thin
  36. woman
  37. man (adult male)
  38. Man (human being)
  39. child
  40. wife
  41. husband
  42. mother
  43. father
  44. animal
  45. fish
  46. bird
  47. dog
  48. louse
  49. snake
  50. worm
  51. tree
  52. forest
  53. stick
  54. fruit
  55. seed
  56. leaf
  57. root
  58. bark
  59. flower
  60. grass
  61. rope
  62. skin
  63. meat
  64. blood
  65. bone
  66. fat (n.)
  67. egg
  68. horn
  69. tail
  70. feather
  71. hair
  72. head
  73. ear
  74. eye
  75. nose
  76. mouth
  77. tooth
  78. tongue
  79. fingernail
  80. foot
  81. leg
  82. knee
  83. hand
  84. wing
  85. belly
  86. guts
  87. neck
  88. back
  89. breast
  90. heart
  91. liver
  92. drink
  93. eat
  94. bite
  95. suck
  96. spit
  97. vomit
  98. blow
  99. breathe
  100. laugh
  101. see
  102. hear
  103. know
  104. think
  105. smell
  106. fear
  107. sleep
  108. live
  109. die
  110. kill
  111. fight
  112. hunt
  113. hit
  114. cut
  115. split
  116. stab
  117. scratch
  118. dig
  119. swim
  120. fly (v.)
  121. walk
  122. come
  123. lie
  124. sit
  125. stand
  126. turn
  127. fall
  128. give
  129. hold
  130. squeeze
  131. rub
  132. wash
  133. wipe
  134. pull
  135. push
  136. throw
  137. tie
  138. sew
  139. count
  140. say
  141. sing
  142. play
  143. float
  144. flow
  145. freeze
  146. swell
  147. sun
  148. moon
  149. star
  150. water
  151. rain
  152. river
  153. lake
  154. sea
  155. salt
  156. stone
  157. sand
  158. dust
  159. earth
  160. cloud
  161. fog
  162. sky
  163. wind
  164. snow
  165. ice
  166. smoke
  167. fire
  168. ashes
  169. burn
  170. road
  171. mountain
  172. red
  173. green
  174. yellow
  175. white
  176. black
  177. night
  178. day
  179. year
  180. warm
  181. cold
  182. full
  183. new
  184. old
  185. good
  186. bad
  187. rotten
  188. dirty
  189. straight
  190. round
  191. sharp
  192. dull
  193. smooth
  194. wet
  195. dry
  196. correct
  197. near
  198. far
  199. right
  200. left
  201. at
  202. in
  203. with
  204. and
  205. if
  206. because
  207. name


[edit] How to make a Swadesh list

1. Make sure you do not confuse nouns, verbs and adjectives. Most words are assumed to be nouns by default, hence "the rain", not "to rain"; "the fat", not "fat" as an adjective.

2. In a Swadesh list, it is tacitly assumed that no words have synonyms, and all of them differ from each other to some extent, either in meaning or style. For this reason, only the most basic, common, simple, currently used words are supposed to be included, e.g. "straight", not "direct" (in English). Evidently, this rule is often neglected, especially when it comes to lexemes with complex meaning, so synonyms unavoidably find their way into the lists. In many cases, the lists are composed by rather inexperienced language speakers, who cannot differentiate finer aspects of meaning. In that case, to avoid mistakes, please do a dictionary search and add a short note to synonyms for further editing, e.g. "straight" (road), "direct" (flight). That is better than just piling up several synonyms with misleading semantics. Do not include false friends into genetically similar lists, e.g. German "gross" : English "great". The English counterpart in this case should be "big", which is the most basic, simple word for something large in size.

3. If in doubt, which meaning to choose, try to select the most basic one, e.g. "thin as paper", not "a thin person".

4. Be particularly careful with such ambiguous lexemes as "split", "fight", "rub", "squeeze", "swell", "blow", etc. In most languages, they produce 2-5 several synonyms depending on context. They were excluded from the 100-word Swadesh list for reasons of semantical instability.

5. Do not take lists composed by other researchers and editors for granted, they may contain errors or minor inconsistencies. Do your own checks before doing lexicostatistical research.

[edit] See also

[edit] References

  • Campbell, Lyle. (1998). Historical linguistics: An Introduction. Edinburgh: Edinburgh University Press. ISBN 0262532670.
  • Embleton, Sheila (1995). Review of ‘An Indo-European classification: A lexicostatistical experiment’ by I. Dyen; J.B. Kruskal & P.Black. TAPS Monograph 82-5, Philadelphia. inDiachronica 12-2/1992:263–68.
  • Gray, Russell D.; & Atkinson, Quentin D. Language-tree divergence times support the Anatolian theory of Indo-European origin, Nature, 426.
  • Gudschinsky, Sarah. (1956). The ABC's of lexicostatistics (glottochronology). Word,12, 175–210.
  • Hoijer, Harry. (1956). Lexicostatistics: A critique. Language, 32, 49–60.
  • Holm, Hans J. (2007). The new Arboretum of Indo-European "Trees". Can New Algorithms Reveal the Phylogeny and Even Prehistory of Indo-European? Journal of Quantitative Linguistics, vol. 14, 167–214.
  • Sankoff, David (1970). "On the Rate of Replacement of Word-Meaning Relationships."Language 46.564–569.
  • Swadesh, Morris. (1950). Salish internal relationships. International Journal of American Linguistics, 16, 157–167.
  • Swadesh, Morris. (1952). Lexicostatistic dating of prehistoric ethnic contacts. Proceedings American Philosophical Society, 96, 452–463.
  • Swadesh, Morris. (1955). Towards greater accuracy in lexicostatistic dating. International Journal of American Linguistics, 21, 121–137.
  • Swadesh, Morris. (1971). The origin and diversification of language. Edited post mortem by Joel Sherzer. Chicago: Aldine. ISBN 202-01001-5. Contains p 283 final 100-word list!
  • Swadesh, Morris, et al. (1972). What is glottochronology? In M. Swadesh, The Origin and Diversification of Language (pp. 271–284). London: Routledge & Kegan Paul. ISBN 0202308413.
  • Wittmann, Henri (1969). "A lexico-statistic inquiry into the diachrony of Hittite."Indogermanische Forschungen 74.1-10.[1]
  • Wittmann, Henri (1973). "The lexicostatistical classification of the French-based Creole languages." Lexicostatistics in genetic linguistics: Proceedings of the Yale conference, April 3–4, 1971, dir. Isidore Dyen, 89–99. La Haye: Mouton.[2]

[edit] External links

Personal tools