NDSA:Content WG March 2, 2011 Meeting Minutes

From DLF Wiki

Return to NDSA:Content Working Group Home


NDSA Content Meeting Minutes

Wednesday, March 2, 2011 11am EST

Attendees (17)

[Note to attendees: If I left anyone off, please add your name or let me know and I'll add it!]

  • Anderson, Martha | Director, NDIIPP, Library of Congress | mande@loc.gov
  • Askari, Emilia | Intern at Library of Congress, UMICH student | easkari@umich.edu
  • Baker, Timothy D. | Maryland State Archives | timb@MDSA.NET
  • Cornwall, Daniel | Alaska State Library | daniel.cornwall@ALASKA.GOV
  • Faundeen, John, Archivist | U.S. Geological Survey | faundeen@usgs.gov
  • Gallinger, Michelle | Library of Congress | mgal@loc.gov
  • Grotke, Abbie | Web Archiving Team Lead, Library of Congress, and Co-Chair of the NDSA Content Working Group | abgr@LOC.GOV | 202-707-2833 | @agrotke
  • Hartman, Cathy | Associate Dean of Libraries, University of North Texas/ Co-Chair of the NDSA Content Working Group | cathy.hartman@UNT.EDU
  • Howard, Rachel | Digital Initiatives Librarian, University of Louisville | rachel.howard@LOUISVILLE.EDU
  • McAninch, Glen | Kentucky Department for Libraries and Archives | Glen.McAninch@ky.gov
  • Moffatt, Christie | National Library of Medicine | moffattc@mail.nlm.nih.gov
  • Owens, Trevor | Library of Congress | trow@loc.gov
  • Rossum, Deborah |Digital Content Manager| SCOLA |712-566-2202| drossum@SCOLA.ORG

(and Brian from SCOLA, too - I didn't catch his last name)

  • Stoller, Michael | New York University | Michael.stoller@NYU.EDU
  • Watters, Pete | Arizona State Library, Archives, and Public Records | pwatters@LIB.AZ.US
  • Weise, John | Manager of the Digital Library Production Service (DLPS) at the University of Michigan | jweise@UMICH.EDU
  • Whitacre, Lindsay Skay | WGBH | lindsay_whitacre@WGBH.ORG

Clearinghouse/Group B Report Out

Glen gave a report on the activities of NDSA:Clearinghouse Action Team (Group B). They had a call last week and continued to brainstorm on the list of at-risk content. Next step is to define categories, group things, refine descriptions, and identify stakeholders. Kristine and Glen will work on that, they welcome others to help. Still a question of what we mean by at-risk; what is valuable or not.

Cathy mentioned that Mark is working with a group at Los Alamos National Laboratory on analysis of links and different kinds of content. This work is showing gaps in what has been preserved. They are working on a paper. Cathy will send contact information to Kristine and Glen in case they want to follow up.

Martha said that while NSF has a focus on taking care of data sets created through funding, there's a real gap in preservation of data sets that are not NSF funded. There is a grey area about who would care for that. She said they had a White House visitor recently that was looking at Health data related to this topic.

Content Registry Group A Report Out

Daniel reported on the NDSA:Content Registry Action Team task to identify existing registry projects. Rachel, Abbie, Arlene, Christie worked to compile the list of 14 sites, available here: http://www.loc.gov/extranet/wiki/osi/ndiip/ndsa/index.php?title=Existing_Registry_Projects . All agreed that that likely exhausted the effort to locate.

His initial take shows that three focus on digital preservation, others might be about digital preservation, and the rest are digitization clearinghouses. We need to group the entries, possible groupings include digitization vs. preservation, or born-digital vs. digitized collections. Grouping by content type and geography was also discussed.

Daniel reviewed the workplan for Group A. Step 1 is complete, Step 2 is basically covered by the ALA definition of what's preserved that was discussed previously. Next step is to take the list of registries and figure out how many NDSA members are represented in each registry. If NDSA members are already using one more prominently, that will help us decide whether or not we should adopt more broadly. He plans to work out a time table for next steps by next month's CWG meeting.

A suggestion was made to distribute the list of registries to the NDSA-ALL listserv and ask who is in what registry, rather than our group trying to search each registry to find out that information. Others agreed - Daniel will draft a message and send to that list. Will give folks 2 weeks to respond, and move on from there.

Others noted that with the list we came up with, some are very focused on specific categories of content but none covers all types of content. Would be useful to have one centralized location. This strengthens the argument that NDSA could be providing a valuable resource with a single source for all preserved content.

Demo of NDIIPP Partner Collections

Trevor gave a demo/overview of the NDSA:http://digitalpreservation.gov/collections/collections.html NDIIPP Partner collections view on digitalpreservation.gov. Built using Recollection, which is a web application that LC developed for with a company called Zepheira, the Partner Collections view is based on a spreadsheet that NDIIPP had about its partners. Trevor is happy to share the underlying spreadsheet with anyone who would like to see it. His email is trow@loc.gov.

The site allows faceted searching, plotting things on a timeline and a map, among other features to easily navigate and drill down to the data. Spreadsheet and XML data can currently be loaded into Recollection very quickly and then augmentation services are available to enable plotting on maps or creation of timelines. There is a webform also that can be used to add collections to this site, which, if NDSA is interested in adopting this as the NDSA registry, could be modified to fit our needs. This effort would actually benefit greatly from NDSA contributions, to determine what data would be useful to display about these collections and others. One great thing is that this can be embedded in any website, so if this were adopted/built on by the NDSA, potentially any NDSA member could publish a view of all preserved collections on their own websites.

Recollection is available for anyone to have an account and play around with - Trevor is happy to set up accounts for anyone who is interested in testing it out. LC is in the process of making it open-source. Trevor is also happy to hear any feedback about the tool or answer any questions, and to talk further about whether or not NDSA would like to explore using this tool/approach for the NDSA Content Registry.

Next Meeting

Our next call will be April 6th at 11am EST. Agenda and call details to be sent a few days prior.

-End-