Modern Alchemy: Hands-On with the Distant Reader
In this hands-on workshop participants will first learn how to use a system called the Distant Reader to transform sets of unstructured data (like journal articles) into structured data affectionally called “study carrels”. These data sets are amenable to analysis by both people as well as computers. Second, participants will learn how to use a Python-based command-line tool (the Reader Toolbox) to apply text mining and natural language processing tasks against the structured data. These processess include things such as feature extraction, concordancing, topic modeling, full text indexing, semantic indexing, network analysis, etc. In the end, participants will learn of an additional way to turn data into information – modern alchemy.
Learning Objectives: Particpants will learn how to create data sets from collections of narrative text, and then they will learn how to analyze ("read") the data sets.