Anastasia Panova presented the Corpus of Daghestanian Russian at the Twelfth conference of the Slavic Linguistics Society
Anastasia Panova, intern-researcher at the Linguistic Convergence Laboratory, participated in the Twelfth conference of the Slavic Linguistics Society, from 21 to 24 September in Ljubljana (Slovenia), where she gave a poster presentation "A corpus outlook on regional varieties of Russian in Daghestan".
The poster presentation was part of a workshop on Slavic Spoken Corpora, chaired by Ruprecht von Waldenfels. The Ustja River Basin Corpus (Michael Daniel, Nina Dobrushina), which is part of another project of the Linguistic Convergence Laboratory, was also represented at the conference, alongside other corpora that contain spoken language. In addition, Sergey Say and Maria Ovsyannikova from St.-Petersburg, who won a grant for the creation of a spoken text corpus in our grant competition, also took part in the conference. Their project is called "The Bashkir language of the villages Rachmetovo and Baimovo of the Abzelilov region of the Republic of Bashkortostan". At the conference they discussed possible problems that might occur when texts recorded during an expedition to Bashkortostan are transferred to the platform that is also used for the Corpus of Daghestanian Russian.
Participation in the conference helped solve some problems and questions which occurred in the process of working on the corpora. It was suggested to add the interview texts of the Daghestanian Russian Corpus to the Russian Learners Corpus, which automatically marks "mistakes" and shows the language norm. This platform can help to identify peculiarities of the Daghestanian variety in comparison with standard Russian.
Currently, the corpus is actively being expanded with more data. Over 19 hours of recorded interviews with residents of Daghestanian villages will be added in the near future.