Matthew Wilkens F'14
ACLS Digital Innovation Fellowships 2014
University of Notre Dame
Literary Geography at Scale
Literary Geography at Scale uses natural language processing algorithms and automated geocoding to extract geographic information from nearly eleven million digitized volumes held by the HathiTrust Digital Library. The project extends existing computationally assisted work on American and international literary geography to new regions, to new historical periods – including the present day – and to a vastly larger collection of texts. It also provides scholars in the humanities and social sciences with an enormous yet accessible trove of geographic information. Because the HathiTrust corpus includes books published over many centuries, in a variety of languages, and across nearly all disciplines, the derived data is potentially useful to researchers in a range of humanities and computational fields. Literary Geography at Scale is one of the largest humanities text-mining projects to date and the first truly large-scale study of 20th- and 21st-century literature.