Epi Archive automatically and continuously collects global notifiable disease data and makes it uniform and readily accessible.
Governments widely report notifiable disease data, but most countries do not publish it in a machine-readable format. This is despite the World Health Organization (WHO) International Health Regulations stating that "[e]ach State Party shall notify WHO, by the most efficient means of communication available." [1] Data are often published as a file containing text, tables and graphs that summarize weekly or monthly case counts. This presents a problem when information is needed for more data-intensive approaches to epidemiology, biosurveillance and public health. Epi Archive solves this problem.
[1] WHO International Health Regulations, third edition.
Epi Archive collects data by automatically scraping government public health websites, downloading the artifacts that contain the data (often PDF files), parsing the data and persisting it to a database. It then serves the data to users through a simple web interface and a REST API.
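The download, parse and persist stages described above can be sketched roughly as follows. This is a minimal illustration, not Epi Archive's actual implementation: the table layout, the line format matched by ROW, and the function names (parse_report, init_db, ingest) are all assumptions made for the example. It also assumes the artifact is already plain text, so PDF text extraction and the website scraping step are left out.

```python
import hashlib
import re
import sqlite3
from typing import Callable, Iterator

# Hypothetical line format: "<region>  <disease>  <ISO week>  <cases>",
# with columns separated by two or more spaces.
ROW = re.compile(
    r"^(?P<region>.+?)\s{2,}(?P<disease>.+?)\s{2,}"
    r"(?P<week>\d{4}-W\d{2})\s{2,}(?P<cases>\d+)$"
)


def parse_report(text: str) -> Iterator[tuple[str, str, str, int]]:
    """Yield (region, disease, week, cases) for each line that matches ROW."""
    for line in text.splitlines():
        match = ROW.match(line.strip())
        if match:  # header lines and free text are skipped
            yield (match["region"], match["disease"],
                   match["week"], int(match["cases"]))


def init_db(conn: sqlite3.Connection) -> None:
    """Create an illustrative schema: cached artifacts plus parsed counts."""
    conn.executescript("""
        CREATE TABLE IF NOT EXISTS artifact (
            sha256 TEXT PRIMARY KEY, url TEXT, body BLOB);
        CREATE TABLE IF NOT EXISTS incidence (
            region TEXT, disease TEXT, week TEXT, cases INTEGER, sha256 TEXT,
            PRIMARY KEY (region, disease, week));
    """)


def ingest(conn: sqlite3.Connection, url: str,
           fetch: Callable[[str], bytes]) -> int:
    """Download one artifact, cache it, parse it and persist the rows."""
    body = fetch(url)
    digest = hashlib.sha256(body).hexdigest()
    # Keep the original artifact so users can trace data back to its source.
    conn.execute("INSERT OR IGNORE INTO artifact VALUES (?, ?, ?)",
                 (digest, url, body))
    rows = list(parse_report(body.decode("utf-8")))
    # Re-ingesting the same report overwrites rows instead of duplicating them.
    conn.executemany("INSERT OR REPLACE INTO incidence VALUES (?, ?, ?, ?, ?)",
                     [(*row, digest) for row in rows])
    conn.commit()
    return len(rows)
```

Passing the download step in as the fetch argument keeps the sketch runnable without network access, and keying artifacts by their SHA-256 digest means an unchanged report is cached only once.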
The web interface allows users to browse incidence data for specific regions, diseases and time intervals. Users can generate a chart and download the data they are viewing. The "Archived Files" section of the site allows users to view the original artifacts from which data were parsed and to download each artifact from our local cache or from the original source (if it is still available). Finally, the data in Epi Archive are available to disease models (or other software) through a Representational State Transfer Application Programming Interface (REST API).
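As an illustration of how a disease model might consume such a REST API, here is a minimal client sketch. The base URL, the query parameter names (region, disease, start, end) and the JSON response shape are hypothetical placeholders, since the text does not specify the actual endpoints.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Hypothetical endpoint; the real Epi Archive API paths are not given here.
BASE_URL = "https://epiarchive.example/api/v1/incidence"


def build_query_url(region: str, disease: str, start: str, end: str,
                    base_url: str = BASE_URL) -> str:
    """Encode the region, disease and time interval as query parameters."""
    params = {"region": region, "disease": disease, "start": start, "end": end}
    return f"{base_url}?{urlencode(params)}"


def parse_records(payload: str) -> list[tuple[str, int]]:
    """Turn an assumed JSON array of {"week", "cases"} objects into tuples."""
    return [(rec["week"], int(rec["cases"])) for rec in json.loads(payload)]


def fetch_incidence(region: str, disease: str,
                    start: str, end: str) -> list[tuple[str, int]]:
    """Request case counts for one region and disease over a time interval."""
    url = build_query_url(region, disease, start, end)
    with urlopen(url, timeout=30) as response:
        return parse_records(response.read().decode("utf-8"))
```

Under these assumptions, a model would call fetch_incidence("Region A", "measles", "2019-01-01", "2019-03-31") and receive a list of (week, cases) pairs ready for fitting.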