[SlawKaus] Workshops zur (polnischen) Korpuslinguistik

Ivan Levin akasaiko at gmail.com
Do Jun 6 16:26:45 CEST 2024


Liebe Liste,

ich möchte Sie gerne an die zwei *Workshops zur Korpuslinguistik* erinnern,
die unser Institut zusammen mit dem Alexander-Brückner-Zentrum nächste und
übernächste Woche organisiert. Für die Teilnahme ist keine
computerlinguistische Erfahrung notwendig, jedoch wird ein zumindest
*begrenztes
passives Verständnis des Polnischen* erwünscht.

On June 14 and 17, there will be two workshops on corpus linguistics with a
focus on the Polish language. They are organized by the Institute of Slavic
and Caucasus Studies of the Jena University and by the
Aleksander-Brückner-Zentrum (Jena/Halle). Both workshops will be conducted
in person in Jena as well as online. There are no prerequisites for
participation. However, openness to learning new computer tools and at
least some passive understanding of Polish are very much encouraged. If you
want to participate in either (or both) of the workshops, please fill out *this
registration form <https://forms.gle/wZNWcyAnGcHMdgwcA>*. If you have any
special needs or expectations, please express them in the registration form.

*EXPLORING SOCIOLINGUISTIC VARIATION VIA A DIALECTAL CORPUS: THE SPISZ
CORPUS  (June 14, 9:00-14:00)*
Dialectal machine-readable corpora are a valuable source of sociolinguistic
data if the questions are correctly addressed. The aim of the course is to
familiarize the students with the exploration of sociolinguistic variation
via a corpus, in this case, the corpus which documents the dialect of the
Polish part of Spisz. In the first part of the course, we shall formulate
queries that pick up dialectal and standard features in the speech of the
speakers. Once the data are gathered, we shall explore the social variables
that determine the use of the dialect or standard forms. The data will be
analyzed and visualized with Excel.
The workshop will be conducted by *Dr. Rafał L. Górski*, professor at
Jagiellonian University and Institute of Polish Language, Polish Academy of
Sciences. He collaborated in compiling a number of corpora of Polish:
National Corpus of Polish, Spisz Corpus, Corpus of Polish Speech, Old
Polish Corpus. In his research he focuses on the use corpus-based methods
to investigate grammar.

*MAKING YOUR OWN LINGUISTIC CORPUS WITH SPOCO (June 17 9:00-15:00)*
The workshop focuses on concordancing with the corpus manager Spoco, a
simple and adaptable web interface for dialect corpora. An introductory
lecture will be followed by hands-on exercises. The workshop will cover
both exploring existing corpora and creating tailored resources with Spoco.
It will also most likely include some programming with Python. Students
should be open to this; however, no previous programming skills are
required.
The workshop will be conducted by *Dr. Michał Woźniak* (Institute of Polish
Language, Polish Academy of Sciences), who specializes in natural language
processing, computational linguistics, and corpus linguistics. Dr. Woźniak
participated in the creation of numerous corpora.

Mit freundlichen Grüßen
Ivan Levin
-------------- nächster Teil --------------
Ein Dateianhang mit HTML-Daten wurde abgetrennt...
URL: <https://lserv.uni-jena.de/pipermail/slawkaus/attachments/20240606/7a69b8ae/attachment.html>


Mehr Informationen über die Mailingliste SlawKaus