Linguistic landscaping of South Asia using digital language resources: Genetic vs. areal linguistics
2014 (English). In: Proceedings of LREC 2014, Reykjavik: ELRA, 2014, 3137-3144 p. Conference paper (Refereed)
Like many other research fields, linguistics is entering the age of big data. We are now at a point where new research questions can be formulated, old research questions addressed from a new angle, and established results verified on the basis of exhaustive collections of data rather than small, carefully selected samples. For example, South Asia is often mentioned in the literature as a classic example of a linguistic area, but there is no systematic, empirical study substantiating this claim. Examining the genealogical and areal relationships among South Asian languages requires a large-scale quantitative and qualitative comparative study encompassing more than one language family. Further, such a study cannot be conducted manually; it needs to draw on extensive digitized language resources and state-of-the-art computational tools. We present some preliminary results of our large-scale investigation of the genealogical and areal relationships among the languages of this region, based on the linguistic descriptions available in the 19 tomes of Grierson's monumental Linguistic Survey of India (LSI; 1903-1927). The LSI is currently being digitized with the aim of turning its linguistic information into a digital language resource suitable for a broad array of linguistic investigations.
Place, publisher, year, edition, pages
Reykjavik: ELRA, 2014. 3137-3144 p.
Keywords: genetic linguistics, areal linguistics, lexicostatistics, historical linguistics, language typology, contact linguistics, language contact
General Language Studies and Linguistics; Language Technology (Computational Linguistics)
Research subject: Linguistics; Computational Linguistics
Identifiers: URN: urn:nbn:se:uu:diva-241011; ISI: 000355611004124; ISBN: 978-2-9517408-8-4; OAI: oai:DiVA.org:uu-241011; DiVA: diva2:777475
Conference: 9th International Conference on Language Resources and Evaluation (LREC), May 26-31, 2014, Reykjavik, Iceland
Projects: Digital areal linguistics
Funder: Swedish Research Council