Large Language Models (LLMs) have rapidly become one of the most relevant scientific topics, both as models of language and cognition and as tools for research. This trend has accelerated further since applications such as ChatGPT became available to the public. This course will first give a basic introduction to LLMs, outlining the core components of their architecture and how they are trained. For this we will take a bird’s eye perspective, focusing on key concepts rather than mathematical details: the training data, the training objective (predicting words in context), the basics of (deep) neural network models, simple language models, the central components of the transformer architecture underlying common LLMs, and the difference between unidirectional and bidirectional models. This part of the course will end with an overview of how users can interact with LLMs beyond chatbot interfaces (for example, via API calls), and how we can access not only their outputs but also some of their internal representations and weights.
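To make the idea of programmatic access concrete, the sketch below assembles an HTTP request of the kind most hosted LLM APIs accept. The endpoint URL, model name, and API key are placeholders (assumptions), not references to a specific provider; the request is built but deliberately not sent.

```python
import json
import urllib.request

# Placeholder endpoint -- most hosted LLM APIs expose a similar
# JSON-over-HTTP chat interface, but the URL and model name here
# are illustrative assumptions, not a real service.
API_URL = "https://api.example.com/v1/chat/completions"

def build_request(prompt: str, api_key: str,
                  model: str = "example-model") -> urllib.request.Request:
    """Assemble (but do not send) a chat-style LLM API request."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": 0.0,  # low temperature: more reproducible outputs
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )

req = build_request("Summarize: LLMs as research tools.", api_key="YOUR_KEY")
```

Sending the request (e.g. with `urllib.request.urlopen`) would return a JSON response containing the model's output; the same pattern is what provider-specific client libraries wrap for you.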
The second part of the course will focus on use cases of LLMs in research, in their role as “research assistant systems” that allow us to automate and “outsource” labor-intensive tasks. This will include using LLMs
• as “pseudo-participants” to pilot or simulate behavioral studies
• to automatically generate and simulate new datasets (or extrapolate and scale up existing datasets)
• to automatically annotate data and automate qualitative analyses
• to analyze open text responses, thus taking a quantitative rather than a qualitative approach to text analysis
• for systematic literature reviews
Prerequisites: The course is aimed at a general audience with no prior expertise in the topic. General coding skills are useful, but knowledge of a specific programming language is not required for participation.