Course number: 423211

Treating words as data is becoming a popular approach to analysing text documents in the social sciences. The main goal of this seminar is to provide students with the basics for understanding the possibilities and pitfalls of automated content analysis with R. Starting with a brief overview of text-as-data methods, this seminar delves into specific text mining techniques, including algorithms for supervised and unsupervised ideological scaling. The course includes theoretical sessions introducing and discussing conceptual frameworks as described in the reading material, as well as hands-on instruction in the application of key pre-processing and text-mining methods. Proficiency in R is not a prerequisite for participation in this seminar, although basic knowledge may be helpful.