When The New York Times decided to make all of its public domain articles from 1851-1922 available free of charge, it needed a tool that was up to the task. All 11 million articles from 1851-1980 were available as images in PDF format. Generating the PDF version of an article takes quite a bit of work: each article is composed of numerous smaller TIFF images that must be scaled and glued together in a coherent fashion.
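To make the "scale and glue" step concrete, here is a minimal sketch of the layout arithmetic only. It is not the Times' actual pipeline, which this excerpt does not describe. The `Tile` and `Placement` types, the `layout_tiles` function, the file names, and the choice to stack fragments vertically at a common width are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Tile:
    name: str    # hypothetical file name of one TIFF fragment
    width: int   # pixel width of the fragment
    height: int  # pixel height of the fragment


@dataclass
class Placement:
    name: str
    x: int       # left edge on the composed page
    y: int       # top edge on the composed page
    width: int   # width after scaling
    height: int  # height after scaling


def layout_tiles(tiles, target_width):
    """Scale each tile to target_width (keeping its aspect ratio)
    and stack the results top to bottom. Assumed layout, for illustration."""
    placements = []
    y = 0
    for tile in tiles:
        scale = target_width / tile.width
        scaled_height = round(tile.height * scale)
        placements.append(Placement(tile.name, 0, y, target_width, scaled_height))
        y += scaled_height
    # y now holds the total height of the composed page
    return placements, y


# Two made-up fragments with different widths
tiles = [Tile("fragment1.tif", 400, 1000), Tile("fragment2.tif", 800, 500)]
placements, total_height = layout_tiles(tiles, target_width=800)
for p in placements:
    print(p)
```

A real implementation would then resample each image to its computed size and paste it at the computed offset before writing the page out as a PDF. That rendering step is left out here because it depends on the imaging library in use.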
Read the full article: Open Blog