VEP Early Modern Science Collection

VEP proudly presents two corpora of early modern scientific writing, curated by Alan Hogarth. They are released under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

Before downloading the corpora, please read about the format of the text files and about our text processing workflow. The corpora are generated from Text Creation Partnership (TCP) XML files.

Please note that our downloadable corpora do not contain texts from EEBO-TCP Phase II, which will not enter the public domain until five years after the TCP project completes Phase II. However, metadata for EEBO-TCP Phase II texts is available for download.


Early Modern Science Corpora

The Big Names of Science
This corpus contains 329 early modern scientific texts by 100 ‘Big Name’ authors, published between 1530 and 1724. The authors were selected on the basis of their reputation and influence as early modern writers who address scientific subjects.


The Super Science
The Super Science corpus lists information about every ‘scientific’ text from EEBO-TCP Phases I and II, ECCO-TCP, and Evans-TCP. The corpus comprises 1,979 texts published between 1482 and 1710.

Credits: Metadata prepared by Alan Hogarth, with supervision by Jonathan Hope. XML files processed and curated by Deidre Stuffer for release as plain text files.