Mass book digitization: The deeper story of Google Books and the Open Content Alliance

Kalev Leetaru

Abstract


The Google Books and Open Content Alliance (OCA) initiatives have become the poster children of the access digitization revolution. With their sights firmly set on creating digital copies of millions upon millions of books and making them available to the world for free, the two projects have captured the popular imagination. Yet, such scale comes at a price, and certain sacrifices must be made to achieve this volume. With its greater visibility, most studies have focused on Google Books, addressing limitations of its image and metadata quality. Yet, there has been surprisingly little comparative work of the two endeavors, exploring the relationship between these two peers and their deeper similarities, rather than their obvious surface differences. While the academic community has lauded OCA's "open" model and condemned the proprietary Google, all is not always as it seems. Upon delving deeper into the underpinnings of both projects, we find Google achieves greater transparency in many regards, while OCA's operational reality is more proprietary than often thought. Further, significant concerns are raised about the long-term sustainability of the OCA rights model, its metadata management, and its transparency that must be addressed as OCA moves forward.

Full Text:

HTML


DOI: http://dx.doi.org/10.5210/fm.v13i10.2101



A Great Cities Initiative of the University of Illinois at Chicago University Library.

© First Monday, 1995-2017. ISSN 1396-0466.