The Authors Guild, Inc. et al v. Hathitrust et al
Filing
110
DECLARATION of John Wilkin in Support re: 100 MOTION for Summary Judgment.. Document filed by Hathitrust. (Attachments: # 1 Exhibit A (Part 1 of 2), # 2 Exhibit A (Part 2 of 2), # 3 Exhibit B-C)(Petersen, Joseph)
EXHIBIT B
Partnership Community | HathiTrust Digital Library
Page 1 of 3
Help
Feedback
Digital Library
Home
About
Collections
My Collections
Search About HathiTrust
Go
Partnership Community
HathiTrust is an international community of research libraries committed to the long-term curation and availability of the cultural record. Through their
common efforts and deep commitment to the public good, the libraries support the teaching and learning activities of the faculty, students or
researchers at their home institutions, and the scholarly needs of the broader public as well.
Current HathiTrust Partners
Consortia
• Committee on Institutional Cooperation (#)
About the Committee on Institutional Cooperation
Headquartered in the Midwest, the Committee on Institutional Cooperation (CIC) is a consortium of the Big Ten universities plus the University
of Chicago. For half a century, these 13 world-class research institutions have advanced their academic missions, generated unique
opportunities for students and faculty, and served the common good by sharing expertise, leveraging campus resources, and collaborating on
innovative programs. Governed and funded by the Provosts of the member universities, CIC mandates are coordinated by a staff from its
Champaign, Illinois headquarters. Learn more at www.cic.net (http://www.cic.net) .
• Triangle Research Libraries Network (#)
About TRLN
Triangle Research Libraries Network (TRLN) is a collaborative organization of Duke University, North Carolina Central University, North
Carolina State University, and The University of North Carolina at Chapel Hill, the purpose of which is to marshal the financial, human, and
information resources of their research libraries through cooperative efforts in order to create a rich and unparalleled knowledge environment
that furthers the universities' teaching, research, and service missions. TRLN is one of the oldest consortia in the country. The hallmark of
cooperation among the four universities has been a highly successful cooperative collection development program that has resulted in little
duplication across institutions and a high degree of uniqueness in the respective collections. With the licensing of electronic content, the
cooperation has expanded to include electronic resources. The shared collections are made available to users from any TRLN library through
reciprocal borrowing arrangements and through a privileged delivery service. Another of TRLN's strengths is the broad range of staff
development, continuing education, and training opportunities that are offered locally and attended widely by all levels of staff.
More information can be found at http://www.trln.org (http://www.trln.org)
• University of California (#)
About the University of California Libraries
The University of California libraries comprise world-class collections and innovative services. The University of California has built 10 campus
libraries of distinction, comprising world-class collections that give a competitive edge to UC research and instruction.
More than 100 libraries on the UC campuses support the University's teaching and research. Collectively, the UC libraries make up the largest
research/academic library in the world, with over 35 million volumes in their holdings and significant digital collections. At UC, faculty, students,
and staff have access to nearly all of the resources that combine to comprise the libraries' collective holdings; that is, to the largest university
research library in the world.
Visit http://libraries.universityofcalifornia.edu/ (http://libraries.universityofcalifornia.edu/) for more information.
Individual Institutions
• Arizona State University (#)
• Baylor University (#)
• Boston College (#)
• Boston University (#)
• California Digital Library (#)
• Columbia University (#)
• Cornell University (#)
• Dartmouth College (#)
• Duke University (#)
• Emory University (#)
• Harvard University Library (#)
(http://hul.harvard.edu/)
• Indiana University (#)
http://www.hathitrust.org/community
6/22/2012
Partnership Community | HathiTrust Digital Library
Page 2 of 3
(http://www.libraries.iub.edu/)
• Johns Hopkins University (#)
• Lafayette College (#)
• Library of Congress (#)
• Massachusetts Institute of Technology (#)
(http://libraries.mit.edu/about/about.html)
• McGill University (#)
• Michigan State University (#)
• New York University (#)
• New York Public Library (#)
• North Carolina Central University (#)
• North Carolina State University (#)
• Northwestern University (#)
• The Ohio State University (#)
• The Pennsylvania State University (#)
• Princeton University (#)
• Purdue University (#)
• Stanford University (#)
• Texas A&M University (#)
• Universidad Complutense de Madrid (#)
• University of Arizona (#)
• University of California Berkeley (#)
(http://www.lib.berkeley.edu/AboutLibrary/index.html)
• University of California Davis (#)
• University of California Irvine (#)
(http://www.lib.uci.edu/about/about-us.html)
• University of California Los Angeles (#)
• University of California Merced (#)
• University of California Riverside (#)
• University of California San Diego (#)
• University of California San Francisco (#)
• University of California Santa Barbara (#)
• University of California Santa Cruz (#)
• The University of Chicago (#)
• University of Connecticut (#)
• University of Delaware (#)
• University of Florida (#)
• University of Illinois (#)
• University of Illinois at Chicago (#)
• The University of Iowa (#)
• University of Maryland (#)
• University of Michigan (#)
• University of Minnesota (#)
• University of Missouri (#)
• University of Nebraska-Lincoln (#)
(http://libraries.unl.edu/about)
• The University of North Carolina at Chapel Hill (#)
• University of Notre Dame (#)
• University of Pennsylvania (#)
• University of Pittsburgh (#)
• University of Utah (#)
• University of Virginia (#)
• University of Washington (#)
• University of Wisconsin-Madison (#)
• Utah State University (#)
http://www.hathitrust.org/community
6/22/2012
Partnership Community | HathiTrust Digital Library
Page 3 of 3
• Washington University (#)
• Yale University Library (#)
Please see our Eligibility and Agreements (eligibility_agreements) page for information about how to be involved!
About
http://www.hathitrust.org/community
Help
Feedback
Take-Down Policy
Privacy
Contact
6/22/2012
EXHIBIT C
Certification Report on the
HathiTrust Digital Repository
CRL Report on HathiTrust Audit and Certification
M arc h 2 0 11
Executive Summary
The Center for Research Libraries (CRL) conducted a preservation audit of HathiTrust (www.HathiTrust.org)
between November 2009 and December 2010, and on the basis of that audit certifies HathiTrust as a trustworthy
digital repository. The CRL Certification Advisory Panel has concluded that the practices and services described
in HathiTrust public communications and published documentation are generally sound and appropriate to the
content being archived and to the general needs of the CRL community. Moreover the Panel expects that in the
future, HathiTrust will continue to be able to deliver content that is understandable and usable by its designated
community.
CRL certification applies to the repository’s ability to preserve and manage digital files of books digitized by the
University of Michigan, Google, and the Internet Archive, as well as the digital files generated from books digitized by
other providers that conform to comparable standards.
This certification is based upon review by CRL and members of its Certification Advisory Panel of extensive
documentation gathered by CRL as well as data and documentation provided by HathiTrust between November
2009 and December 2010; and a site visit held in May 2010. CRL’s analysis was guided by the criteria included in the
Trustworthy Repositories Audit and Certification Checklist (TRAC), and other metrics developed by CRL through its
various digital repository assessment activities.
CRL conducted its audit with reference to generally accepted best practices in the management of digital systems;
and with reference to the interests of its community of research libraries and the practices and needs of scholarly
researchers in the humanities, sciences and social sciences in the United States and Canada. The purpose of the audit
was to obtain reasonable assurance that HathiTrust provides, and is likely to continue to provide, services adequate
to those interests and needs without material flaws or defects and as described in HathiTrust’s public disclosures. The
CRL audit provides a reasonable basis for these findings.
CRL assigns HathiTrust the following levels of certification (the numeric rating is based on a scale of 1 through 5, with
5 being the highest level, and 1 being the minimum certifiable level):1
Category
HathiTrust Rating
A. Organizational Infrastructure
2
B. Digital Object Management
3
C. Technologies, Technical Infrastructure, Security
4
The ratings reflect the existence of robust systems and sound processes in most areas, in particular those areas
addressed in TRAC category C and, to a lesser extent, in category B metrics; and the still emerging status of systems
and processes addressed in category A metrics. In the course of the audit, the Certification Advisory Panel identified
several issues that CRL urges HathiTrust to address to more fully satisfy the concerns of CRL libraries. Those issues
are described in Section B, Detailed Audit Findings, below, and pertain to specific criteria in the TRAC checklist.
HathiTrust has agreed to address these issues and, as a condition of continued certification, to make certain
disclosures to CRL periodically. Those requirements for periodic disclosure are outlined in Section C of this report.
1
A working version of the schema CRL uses in providing summary ratings of a repository’s compliance with the TRAC criteria is available at:
http://www.crl.edu/archiving-preservation/digital-archives/metrics-assessing-and-certifying/crl-ratings.
PA G E
2
CRL Report on HathiTrust Audit and Certification
M arc h 2 0 11
About the Audit Participants
HathiTrust
The HathiTrust (www.HathiTrust.org) digital repository was established by the University of Michigan Libraries in
October 2008. HathiTrust offers academic and research libraries the opportunity to combine resources to build and
maintain a large scale repository. Participating members include research libraries in the United States and Europe.
The majority of HathiTrust’s content was digitized through the Google Books project, which has worked with libraries
to scan books on library shelves. Additional content comes from the Internet Archive and from collections digitized by
partner libraries. As of December 2010, HathiTrust had ingested approximately 7.5 million volumes. Repository costs
are shared by the participating members. The actual repository systems are located at the University of Michigan in
Ann Arbor and at Indiana University in Bloomington.
Center for Research Libraries
The Center for Research Libraries (CRL - www.crl.edu) is an international consortium of university, college, and
independent research libraries. CRL supports advanced research and learning in the humanities, sciences, and social
sciences by ensuring the survival and accessibility of source materials vital to those disciplines. In order to enable its
community to accelerate the shift to electronic-only resources in a careful and responsible manner, CRL has a hybrid
strategy of preserving and maintaining shared physical collections of materials and certifying digital repositories of
interest to its community.
CRL analysis of HathiTrust documentation and operations was undertaken by Marie Waltz and other CRL staff.
Additional technical support for the site visit and the assessment of HathiTrust repository systems and architecture
was provided by James A. Jacobs, Data Services Librarian Emeritus, University of California, San Diego.
To guide its HathiTrust audit CRL enlisted a panel of advisors representing the various sectors of its membership.
The Certification Advisory Panel includes leaders in collection development, preservation, library administration, and
digital information technology, and is so constituted as to ensure that the certification process addresses the interests
of the entire CRL community.
Martha Brogan (Chair)
Director of Collection Development & Management
University of Pennsylvania
Anne Pottier
Associate University Librarian
McMaster University
Winston Atkins
Preservation Officer
Duke University
Oya Y. Rieger
Associate University Librarian for Information
Technologies
Cornell University
Bart Harloe
Director of Libraries
St. Lawrence University
Perry Willett
Digital Preservation Services Manager
California Digital Library
William Parod
Senior Repository Developer
Northwestern University Libraries
PA G E
3
CRL Report on HathiTrust Audit and Certification
M arc h 2 0 11
A. Audit and Assessment Methodology and Criteria
This assessment was undertaken to determine whether or not HathiTrust meets the commitments it has made
in regard to the long-term preservation of digital scholarly content for the academic community and whether the
repository complies with established criteria for trusted digital repositories. The assessment included a site visit, a
review of the information independently gathered by CRL from published and unpublished sources, and a review of
documents and documentation provided by HathiTrust.
CRL conducted its audit with reference to:
n
generally accepted best practices in the management of digital systems
n
the interests of the CRL community of research libraries
n
t
he practices and needs of scholarly researchers in the humanities, sciences, and social sciences in the
United States and Canada.
n
the criteria included in Trustworthy Repositories Audit & Certification: Criteria and Checklist2
n
the Open Archive Information System reference model3 (OAIS)
n
other metrics developed by CRL through its analyses of digital repositories.
The general metrics used by CRL in such assessments are based on the Trustworthy Repositories Audit and
Certification (TRAC) checklist, and on other metrics developed by CRL through its analyses of digital repositories.
TRAC was developed by a joint task force, created by the Research Libraries Group (RLG) and the National Archives
and Records Administration in 2003 to provide criteria for use in identifying digital repositories capable of reliably
storing, migrating, and providing long-term access to digital collections. TRAC represents best current practice and
thinking about the organizational and technical infrastructure required to be considered trustworthy and thus worthy
of investment by the research and libraries communities.
CRL assessed HathiTrust on each of the three categories of criteria specified in TRAC and has assigned a level of
certification for each. The numeric rating (below) is based on a scale of 1 through 5, with 5 being the highest level, and
1 being the minimum certifiable level:
TRAC Category
HathiTrust Rating
Organizational Infrastructure
2
Digital Object Management
3
Technologies, Technical Infrastructure, Security
4
The basis for assignment of these ratings is provided in Section B, Detailed Audit Findings, below.
It should be noted that CRL certification of HathiTrust applies specifically to the repository’s ability to preserve and
manage digital files of books digitized by the University of Michigan, Google, and the Internet Archive, as well as the
digital files generated from books digitized by other providers that conform to comparable standards. CRL did not
assess SDR procedures and processes for acquiring and managing more complex digital objects such as audio,
video, archived Web sites, or other types of content.
2
3
TRAC - http://www.crl.edu/sites/default/files/attachments/pages/trac_0.pdf3 OAIS - http://public.ccsds.org/publications/archive/650x0b1.pdf
OAIS - http://public.ccsds.org/publications/archive/650x0b1.pdf
PA G E
4
CRL Report on HathiTrust Audit and Certification
M arc h 2 0 11
B. Detailed Audit Findings
On the basis of the audit, CRL identified areas in which HathiTrust will need to improve processes or provide greater
disclosure of information about those processes. These areas correspond to specific TRAC criteria or to features of the
repository that members of the Certification Advisory Panel believe are important to the CRL community.
The specific areas identified for improvement are:
1. Definition of Rights and Ownership of HathiTrust Enterprise Assets
(TRAC criteria A3.3, A3.7, and A4.3)
“Repository commits to transparency and accountability in all actions supporting the operation and management of
the repository, especially those that affect the preservation of digital content over time.” (TRAC criterion A3.7)
HT is not a separate legal entity, and so cannot legally own the capital equipment, content, metadata, and other assets
acquired or generated by the partnership. HathiTrust therefore must clearly define and establish where ownership
and control of the repository’s major assets and other property essential to the continued access and preservation of
repository content reside.
The ownership of the individual objects within the repository is clearly specified in agreements with the depositors.
However, it would be appropriate to clarify the ownership of the aggregate HathiTrust database; the rights and
ownership of new content (such as derivative files, metadata, etc.); and rights to the non-content assets of the
operation, including software and system tools to be developed in the future.
2. Succession or Disposition Plan for HathiTrust Assets (TRAC A1.2)
TRAC A1.2 requires “an appropriate, formal succession plan, contingency plans, and/or escrow arrangements in place
in case the repository ceases to operate or the governing or funding institution substantially changes its scope.”
Most 501c3 organizations provide in their bylaws for the disposition of property in the event of the organization’s
dissolution. HathiTrust should explicitly address the transferability or disposition of its assets in the event of
discontinuation of repository operations. Under present circumstances the University of Michigan, as parent
organization of the repository, would seem to be the owner of those assets and arbiter of such decisions, absent
agreements to the contrary.
3. larify and Strengthen the Quality Assurance and Print Archiving Components of
C
the HathiTrust Program (TRAC A3.8 and B1.1, B1.8, B1.7 and B2.4)
TRAC criterion A3.8 requires that the repository “commits to defining, collecting, tracking, and providing, on
demand, its information integrity measurements.” Criterion B1.1 requires that the “Repository identifies properties it
will preserve for digital objects,” and B1.7, 1.8, and 2.4 address the recording of information about rejected SIPs.
One explicit goal described in the HathiTrust mission statement is to “coordinate shared storage strategies among
libraries, thus reducing long-term capital and operating costs of libraries associated with the storage and care of
print collections.” The repository should put in place and clarify its plan for achieving that goal, as the cost reduction
described is a relevant metric of the value of HathiTrust and its services. The new HathiTrust pricing model, to be
introduced in 2013, will directly correlate the overlap between the repository corpus and the print holdings of the
participating libraries. This will increase pressure for participating libraries to divest of print volumes available
through the repository.
The quality assurance measures for HathiTrust digital content do not yet support this goal. Inspection criteria and
standards are in place for materials ingested from the Google Books project, but it is not clear what results when an
object fails such inspection. It is also unclear to what level of quality review materials digitized by partner institutions
PA G E
5
CRL Report on HathiTrust Audit and Certification
M arc h 2 0 11
or those made available through entities such as the Internet Archive are subjected. This will be material to library
decisions on whether to retain, conserve, or dispose of corresponding physical copies of books represented in the
repository.
Currently, and despite significant efforts to identify and correct systemic problems in digitization, HathiTrust only
attests to the integrity of the transferred file, and not to the completeness of the original digitization effort. This may
impact institutions’ workflow for print archiving and divestiture.
C. Ongoing Requirements
The TRAC document notes that “. . . attaining trusted status is not a one-time accomplishment—achieved and
forgotten. To retain trusted status, a repository will need to undertake a regular cycle of audit and/or certification.”
To that end CRL expects that in addition to acting to remedy the issues identified above HathiTrust will also make
certain disclosures on a regular basis. CRL and HathiTrust have agreed that ongoing certification is contingent upon
HathiTrust making the following disclosures every two years:
n
A
n item- or volume-level listing of new content added to the repository since prior certification. (Public
disclosure through the current HathiFiles inventory would fulfill this requirement.)
n
D
escription of any significant changes in repository system architecture or configurations, operating systems
and/or critical software;
n
N
ew agreements and contracts with key depositors of content, content users, major funders or sources of
revenue, and providers of critical repository services;
n
N
ew key policies regarding acquisition, management, and disposition of archived content and related files and
metadata;
n
R
ecords of significant events (such as content migrations, system failures, loss or corruption of digital
content) and significant changes in the characteristics of digital content ingested since the most recent audit,
such as server logs; and of significant events and changes in the operations of the repository.
n
T
he most recent three years of financial statements for the repository organization or service unit. The
financial statements should indicate the categories and, where appropriate, sources of revenue and the level
of same; the functional allocation of expenses; and changes in the financial position of the organization
supporting the service unit.
n
R
evenue and expense projections by function, for the repository organization or service unit, for the next
three years.
Certification is also contingent upon HathiTrust’s agreement to a periodic, systematic sampling and inspection of
the repository’s archived content by CRL, or by a third party designated by CRL, using either a manual or automated
process as determined by mutual agreement between CRL and HathiTrust.
PA G E
6
Disclaimer: Justia Dockets & Filings provides public litigation records from the federal appellate and district courts. These filings and docket sheets should not be considered findings of fact or liability, nor do they necessarily reflect the view of Justia.
Why Is My Information Online?