A Release and a Call - Collections as Data Facets

Consensus around what collections as data means and consequently what it takes to think about, prepare, provision, and support the use of them remains unsettled. However, lack of consensus has not stopped a range of creative activity in this space. Rather it might be said that the unsettled nature of collections as data fosters a generative space that encourages novel alignments of people, purpose, and resources. In an effort to collect and communicate collections as data activity, the Collections as Data project team presents Collections as Data Facets.

A facet documents a collections as data implementation. An implementation consists of the people, services, practices, technologies, and infrastructure that aim to encourage computational use of cultural heritage collections. Each facet suggests practical entry points to engaging collections as data. The practical orientation of the questions that comprise the facet are directly informed by stakeholder experience. A facet covers the administrative case that was made to allow an implementation to take place, the people and roles involved in the implementation, workflows and code where applicable, assessment, and approaches to supporting use. A growing collection of facets presents a multifaceted argument for the present and future state of collections as data.


Facet 1 - MIT Libraries Text and Data Mining

  • Richard Rodgers, Massachusetts Institute of Technology Libraries

Facet 2 - Carnegie Museum of Art Collection Data

  • David Newbury, Carnegie Museum of Art and Daniel Fowler, Open Knowledge International

Facet 3 - CalCOFI Hydrobiological Survey of Monterey Bay

  • Amanda Whitmire, Stanford University Libraries

Facet 4 - American Philosophical Society Open Data Projects

  • Scott Ziegler, American Philosophical Society

Facet 5 - OPENN

  • Dot Porter, University of Pennsylvania Libraries

Facet 6 - Chronicling America

  • Deborah Thomas, Nathan Yarasavage, and Robin Butterhof, Library of Congress

Facet 7 - La Gaceta de la Habana

  • Paige Morgan, Elliot Williams, and Laura Capell, University of Miami Libraries

Facet 8 - Text as Data Initiative

  • Zach Coble, Nick Wolf, and Scott Collard, New York University Libraries

Facet 9 - #HackFSM

  • Mary Elings and Quinn Dombrowski, University of California Berkeley

Facet 10 - HathiTrust Research Center Extracted Features Dataset

  • Eleanor Dickson, University of Illinois at Urbana Champaign

Facet 11 - Beyond Penn’s Treaty

  • Michael Zarafonetis and Sarah M. Horowitz, Haverford College

Facet 12 - Ticha: A Digital Text Explorer for Colonial Zapotec

  • Brook Lillehaugen and Michael Zarafonetis, Haverford College

Facet 13 - Vanderbilt Library Legacy Data Projects

  • Veronica Ikeshoji-Orlati, Vanderbilt University

Facet 14 - The Museum of Modern Art Exhibition Index

  • Jonathan Lill, MoMA Archives

Facet 15 - Social Feed Manager

  • Laura Wrubel, Justin Littman, and Dan Kerchner, George Washington University

Call for Submissions

We welcome submission of additional facets. Facets can describe scalable, non-scalable, experimental, work in progress, and permanent collections as data implementations. Facets from a range of sources are encouraged, e.g. libraries, museums, archives, research centers, academic departments.

The next round of facets are due by January 15, 2018.

Submissions should follow the Facet template.

Submit Facets to Thomas Padilla - thomas.padilla@unlv.edu