Consensus around what collections as data means and consequently what it takes to think about, prepare, provision, and support the use of them remains unsettled. However, lack of consensus has not stopped a range of creative activity in this space. Rather it might be said that the unsettled nature of collections as data fosters a generative space that encourages novel alignments of people, purpose, and resources. In an effort to collect and communicate collections as data activity, the Collections as Data project team presents Collections as Data Facets.
A facet documents a collections as data implementation. An implementation consists of the people, services, practices, technologies, and infrastructure that aim to encourage computational use of cultural heritage collections. Each facet suggests practical entry points to engaging collections as data. The practical orientation of the questions that comprise the facet are directly informed by stakeholder experience. A facet covers the administrative case that was made to allow an implementation to take place, the people and roles involved in the implementation, workflows and code where applicable, assessment, and approaches to supporting use. A growing collection of facets presents a multifaceted argument for the present and future state of collections as data.
Facet 1 - MIT Libraries Text and Data Mining
- Richard Rodgers, Massachusetts Institute of Technology Libraries
Facet 2 - Carnegie Museum of Art Collection Data
- David Newbury, Carnegie Museum of Art and Daniel Fowler, Open Knowledge International
Facet 3 - CalCOFI Hydrobiological Survey of Monterey Bay
- Amanda Whitmire, Stanford University Libraries
Facet 4 - American Philosophical Society Open Data Projects
- Scott Ziegler, American Philosophical Society
Facet 5 - OPENN
- Dot Porter, University of Pennsylvania Libraries
Facet 6 - Chronicling America
- Deborah Thomas, Nathan Yarasavage, and Robin Butterhof, Library of Congress
Facet 7 - La Gaceta de la Habana
- Paige Morgan, Elliot Williams, and Laura Capell, University of Miami Libraries
Facet 8 - Text as Data Initiative
- Zach Coble, Nick Wolf, and Scott Collard, New York University Libraries
Facet 9 - #HackFSM
- Mary Elings and Quinn Dombrowski, University of California Berkeley
Facet 10 - HathiTrust Research Center Extracted Features Dataset
- Eleanor Dickson, University of Illinois at Urbana Champaign
Facet 11 - Beyond Penn’s Treaty
- Michael Zarafonetis and Sarah M. Horowitz, Haverford College
Facet 12 - Ticha: A Digital Text Explorer for Colonial Zapotec
- Brook Lillehaugen and Michael Zarafonetis, Haverford College
Facet 13 - Vanderbilt Library Legacy Data Projects
- Veronica Ikeshoji-Orlati, Vanderbilt University
Facet 14 - The Museum of Modern Art Exhibition Index
- Jonathan Lill, MoMA Archives
Facet 15 - Social Feed Manager
- Laura Wrubel, Justin Littman, and Dan Kerchner, George Washington University
Call for Submissions
We welcome submission of additional facets. Facets can describe scalable, non-scalable, experimental, work in progress, and permanent collections as data implementations. Facets from a range of sources are encouraged, e.g. libraries, museums, archives, research centers, academic departments.
The next round of facets are due by January 15, 2018.
Submissions should follow the Facet template.
Submit Facets to Thomas Padilla - firstname.lastname@example.org