Home

The History Lab's mission is to use data science to recover and repair the fabric of the past. We are beginning with declassified documents, which include some of the earliest examples of electronic records. By bringing together fragmented collections in a common database, we can use natural language processing and machine learning tools to explore them. The ultimate goal is to develop history as a data science so that citizens can keep the government accountable in the age of big data and AI.

Our multidisciplinary team of researchers has gathered nearly 5 million documents, comprising over 18 million pages, to create the Freedom of Information Archive (FOIArchive), the world's largest database of declassified government records. 

bank icon
FOIArchive

Learn more about the Freedom of Information Archive.

gears icon
Our Methods

How we turn documents into data.

code icon
Our Code

Explore the open-source code that powers the FOIArchive and our tools.

gift icon
Donate

Help support our efforts.

Recent News