uu.seUppsala University Publications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Linguistic landscaping of South Asia using digital language resources: Genetic vs. areal linguistics
Svenska språket, Göteborgs universitet. (Språkbanken)
Uppsala University, Disciplinary Domain of Humanities and Social Sciences, Faculty of Languages, Department of Linguistics and Philology. (Lingvistik)
Göteborgs universitet. (Språkbanken)
Max Planck Institute for Evolutionary Anthropology, Leipzig. (Linguistics)
2014 (English)In: Proceedings of LREC 2014, Reykjavik: ELRA , 2014, 3137-3144 p.Conference paper, Published paper (Refereed)
Abstract [en]

Like many other research fields, linguistics is entering the age of big data. We are now at a point where it is possible to see how new research questions can be formulated and old research questions addressed from a new angle or established results verified on the basis of exhaustive collections of data, rather than small, carefully selected samples. For example, South Asia is often mentioned in the literature as a classic example of a linguistic area, but there is no systematic, empirical study substantiating this claim. Examination of genealogical and areal relationships among South Asian languages requires a large-scale quantitative and qualitative comparative study, encompassing more than one language family. Further, such a study cannot be conducted manually, but needs to draw on extensive digitized language resources and state-of-the-art computational tools. We present some preliminary results of our large-scale investigation of the genealogical and areal relationships among the languages of this region, based on the linguistic descriptions available in the 19 tomes of Grierson's monumental Linguistic Survey of India (1903-1927), which is currently being digitized with the aim of turning the linguistic information in the LSI into a digital language resource suitable for a broad array of linguistic investigations.

Place, publisher, year, edition, pages
Reykjavik: ELRA , 2014. 3137-3144 p.
Keyword [en]
genetic linguistics, areal linguistics, lexicostatistics, historical linguistics, language typology, contact linguistics, language contact
National Category
General Language Studies and Linguistics Language Technology (Computational Linguistics)
Research subject
Linguistics; Computational Linguistics
Identifiers
URN: urn:nbn:se:uu:diva-241011ISI: 000355611004124ISBN: 978-2-9517408-8-4 (print)OAI: oai:DiVA.org:uu-241011DiVA: diva2:777475
Conference
9th International Conference on Language Resources and Evaluation (LREC), MAY 26-31, 2014, Reykjavik, ICELAND
Projects
Digital areal linguistics
Funder
Swedish Research Council
Available from: 2015-01-08 Created: 2015-01-08 Last updated: 2015-10-06Bibliographically approved

Open Access in DiVA

No full text

Other links

http://www.lrec-conf.org/proceedings/lrec2014/pdf/159_Paper.pdf

Authority records BETA

Saxena, Anju

Search in DiVA

By author/editor
Saxena, Anju
By organisation
Department of Linguistics and Philology
General Language Studies and LinguisticsLanguage Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 780 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf