Paper
29 January 2007 Google Books: making the public domain universally accessible
Adam Langley, Dan S. Bloomberg
Author Affiliations +
Proceedings Volume 6500, Document Recognition and Retrieval XIV; 65000H (2007) https://doi.org/10.1117/12.710609
Event: Electronic Imaging 2007, 2007, San Jose, CA, United States
Abstract
Google Book Search is working with libraries and publishers around the world to digitally scan books. Some of those works are now in the public domain and, in keeping with Google's mission to make all the world's information useful and universally accessible, we wish to allow users to download them all. For users, it is important that the files are as small as possible and of printable quality. This means that a single codec for both text and images is impractical. We use PDF as a container for a mixture of JBIG2 and JPEG2000 images which are composed into a final set of pages. We discuss both the implementation of an open source JBIG2 encoder, which we use to compress text data, and the design of the infrastructure needed to meet the technical, legal and user requirements of serving many scanned works. We also cover the lessons learnt about dealing with different PDF readers and how to write files that work on most of the readers, most of the time.
© (2007) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Adam Langley and Dan S. Bloomberg "Google Books: making the public domain universally accessible", Proc. SPIE 6500, Document Recognition and Retrieval XIV, 65000H (29 January 2007); https://doi.org/10.1117/12.710609
Lens.org Logo
CITATIONS
Cited by 10 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Computer programming

Image compression

JPEG2000

Associative arrays

Halftones

Image quality

Legal

RELATED CONTENT

High-flexibility scalable image coding
Proceedings of SPIE (June 23 2003)
Coding Of Data For Laser Recorders
Proceedings of SPIE (October 26 1983)
Efficient transcoding of JPEG2000 images
Proceedings of SPIE (November 19 2003)

Back to Top