Earlier this year Readex published the Territorial Papers of the United States, 1764-1953, the most important early American content not yet digitized—until now.
More than half of America’s states began as territories. From the 1760s to the 1950s the United States of America expanded southward and westward, acquiring territories that spanned from Florida to California to Alaska. Before they evolved into twenty-seven American states, these territories were managed by the U.S. State and Interior departments. The official history of their formative territorial years is recorded in Territorial Papers of the United States—a collection of Native American negotiations and treaties, official correspondence with the federal government, military records, judicial proceedings, population data, financial statistics, land records, and more.
About two thirds of these documents are in manuscript form. This means they cannot be made full-text searchable through the application of Optical Character Recognition (“OCR”) technologies. Yes, there are technologies today that can do a fairly decent job applying OCR to certain types of manuscripts, but the handwriting needs to be very clear, and extremely uniform, for the technology to work at all, and even then the results don’t match the quality that can be achieved from printed (as opposed to manuscript) documents.
The documents in Territorial Papers of the United States are from many time periods and in many handwritings, making them poor candidates for OCR application.