NDSA:Web Archiving Survey: Difference between revisions

From DLF Wiki
Jump to navigation Jump to search
(-----------------------------------------------------------------------------------)
Line 14: Line 14:
***K-12 School
***K-12 School
***Federal Government
***Federal Government
***State Government (including Archives or Libraries (or keep separate?)
***State Government (including Archives, state records centers, or Libraries (or keep separate?)(I would include.  They generally do this work based on state statute or regs.)
***Local Government
***Local Government (what is the difference in "local" and "city"?)
***City Government
***City Government
***County Government
***County Government
***Other (please describe)
***Other (please describe)
*How long have you been archiving
*How long have you been archiving
*Are you using a service or company to archive or crawling in-house
*Are you using a service or company to archive, or crawling in-house
**if service what one (Archive-IT, IA's crawling services, Hanzo, Iterasi etc.)
**if service what one (Archive-IT, IA's crawling services, Hanzo, Iterasi etc.)
**if in-house, what crawling tools used (heritrix, httrack, other)
**if in-house, what crawling tools used (heritrix, httrack, other)
Line 40: Line 40:
***Youtube and other video
***Youtube and other video
***All of above as part of regular collecting of websites
***All of above as part of regular collecting of websites
**Computers and Technology  
**Computers and Technology
***software
***gaming
***other?
**Government  
**Government  
***State
***State
Line 46: Line 49:
***City
***City
***County
***County
**Spontaneous Events, disasters, tragedy
**Spontaneous Events, for example: natural disasters, tragedy, environmental events, spontaneous political demonstrations
**Politics and Elections
**Politics and Elections
***Local elections
***Local elections
Line 59: Line 62:
***Broadcast/Television
***Broadcast/Television
**International content (leave open-ended, let them describe what they are collecting internationally?)
**International content (leave open-ended, let them describe what they are collecting internationally?)
***topical (specific subject area - sports, political parties, cultural events)
***geographical (one country or many)
*Permissions/Copyright
*Permissions/Copyright
**Crawl permissions
**Crawl permissions
Line 67: Line 72:
**Full text indexing?
**Full text indexing?
**Public access URL:  
**Public access URL:  
*Researchers (do we want any questions on research use?)
*Researchers (do we want any questions on research use?)(Yes.  Could ask in an open-ended way how researchers are using the content)
*Ever participated in a collaborative web archive (give examples), yes/no
*Ever participated in a collaborative web archive (give examples), yes/no
**if so, describe role/project
**if so, describe role/project
Line 76: Line 81:
*IIPC Curators list
*IIPC Curators list
*Archive-IT list
*Archive-IT list
*ALA groups (need to find people we don't normally talk to)

Revision as of 13:48, 4 August 2011

The goals of the survey are to find out the scope of collecting web content in the United States: what organizations collection development policies state (if they have one), what they are actually collecting, and what services are being used to archive, among other things.

If anyone is interested in helping develop this survey, contact Abbie Grotke (abgr@loc.gov).

Draft Survey Questions

  • Organization information (name, URL, contact, etc.)
    • Type of organization:
      • Historical Society
      • College or University
      • Museum
      • Public Library
      • Consortium
      • K-12 School
      • Federal Government
      • State Government (including Archives, state records centers, or Libraries (or keep separate?)(I would include. They generally do this work based on state statute or regs.)
      • Local Government (what is the difference in "local" and "city"?)
      • City Government
      • County Government
      • Other (please describe)
  • How long have you been archiving
  • Are you using a service or company to archive, or crawling in-house
    • if service what one (Archive-IT, IA's crawling services, Hanzo, Iterasi etc.)
    • if in-house, what crawling tools used (heritrix, httrack, other)
  • Does your organization have collection policies that cover web archiving?
    • are these publicly accessible (provide URL)
    • If not but you are willing to share... (give instructions for emailing?)
  • Scope of collecting, various Qs about what they archive (initial list from Archive-IT, will flesh this out to include more, allow for comments or description):
    • Arts & Humanities
      • Dance
      • Music
      • Art
      • Literature
      • Architecture
      • Film/Television (?)
    • Blogs and Social Media
      • Blogs
      • Facebook
      • Twitter
      • Youtube and other video
      • All of above as part of regular collecting of websites
    • Computers and Technology
      • software
      • gaming
      • other?
    • Government
      • State
      • Federal
      • City
      • County
    • Spontaneous Events, for example: natural disasters, tragedy, environmental events, spontaneous political demonstrations
    • Politics and Elections
      • Local elections
      • State elections
      • Federal elections
    • Science and Health
    • Society and Culture
    • Universities and Libraries
    • News
      • Newspapers
      • Citizen Journalism/Community News
      • Broadcast/Television
    • International content (leave open-ended, let them describe what they are collecting internationally?)
      • topical (specific subject area - sports, political parties, cultural events)
      • geographical (one country or many)
  • Permissions/Copyright
    • Crawl permissions
    • Access permissions
    • Respect robots (?)
  • Access
    • Access tool
    • Full text indexing?
    • Public access URL:
  • Researchers (do we want any questions on research use?)(Yes. Could ask in an open-ended way how researchers are using the content)
  • Ever participated in a collaborative web archive (give examples), yes/no
    • if so, describe role/project
  • Interested in collaborating on future projects?

Distribution

  • NDSA/NDIIPP listservs/blog/twitter, etc.
  • IIPC Curators list
  • Archive-IT list
  • ALA groups (need to find people we don't normally talk to)