Uni-Tübingen


Tutorial of the Computational Linguistics Section
 

Topic: Information-Theoretic Analyses of Natural Languages

Tuesday, 22 February 2022, 10:00 - 17:00

The tutorial is open to all registered participants of the annual conference. Separate registration is not required.

The workshop language is English.

Instructors:
Christian Bentz (Universität Tübingen) & Ximena Gutierrez-Vasques (Universität Zürich)

Languages transmit information. They are used to send messages across meters, kilometers, and around the globe. To better understand their information-carrying potential, we can harness information theory. In fact, one of its first applications, back in the early 1950s, was a study estimating the amount of uncertainty in English text. Since then, information-theoretic measures have been applied in a multitude of quantitative, computational, and psycholinguistic studies of natural languages. This workshop will, firstly, give a brief introduction to the conceptual underpinnings of information-theoretic measures such as entropy, conditional entropy, and mutual information. Secondly, some problems, pitfalls, and possible solutions for their estimation will be discussed. Thirdly, we will give some hands-on exercises for using these measures in research on natural languages. The workshop will provide all relevant data and code online. It will not require students to have a strong programming background.

Contacts:
Christian Bentz (chris@christianbentz.de)
Ximena Gutierrez-Vasques (ximena.gutierrezvasques@uzh.ch)
