A Humanist’s Cookbook for Natural Language Processing in Python

initial release of an open educational resource. – A Humanist’s Cookbook for Natural Language Processing in Python.

The project is presented as a series of notebooks, a series of Python 3 recipes for common problems and issues associated with preparing data for text analysis and natural language processing. The target audience is students—intermediate programmers who have begun to learn their way around Python but who need a little help pulling the pieces together to get something done.

The project has two main goals:

  • Present code blocks for common problems.
  • Contextualize those blocks with humanists in mind.

Brandon Walsh and Rebecca Draughon. “A Humanist’s Cookbook For Natural Language Processing In Python”. Published September 10, 2020. https://scholarslab.lib.virginia.edu/blog/a-humanists-cookbook-for-natural-language-processing-in-python/.


Matplotlib is versatile and powerful Python data visualization library. It has been recently used for rendering the first picture of a black hole and to illustrate the existence of gravitational waves.

Matplotlib is a Python 2D plotting library which produces publication quality figures in a variety of hardcopy formats and interactive environments across platforms. Matplotlib can been be used in Python scripts, the Python and IPython shells and has been implemented in Jupyter notebooks, web application servers, and four graphical user interface toolkits.