Skip to main content

Digital Humanities and Cornell University: Research Guide: Text Analysis

Selected resources on digital humanities; links to support at Cornell University Library.

Corpora for text analysis

JSTOR: Data for Research
Online/Dowloadable Corpora
APIs for Scholarly Resources (MIT)

Textual analysis/text mining

OCR correction crowdsourcing games

Purposeful gaming from the Biodiversity Heritage Library (BHL)

Two OCR correction games were developed within BHL (a collaboratiion among Cornell, Harvard, the Missouri Botanical Garden and the New York Botanical Garden).

Smorball  (http://smorballgame.org) is fast-paced. 

Beanstalk (http://beanstalkgame.org) is leisurely.

Both can be hosted and run for other OCR correction projects. Instructions are on the BHL wiki (linked above).