NDSA:Web Archiving Survey: Difference between revisions

From DLF Wiki
Abgr (talk | contribs)
Nowviskie (talk | contribs)
adding logo
 
(17 intermediate revisions by 3 users not shown)
Line 1: Line 1:
The goals of the survey are to find out the scope of collecting web content in the United States: what organizations collection development policies state (if they have one), what they are actually collecting, and what services are being used to archive, among other things.  
[[File:NDSA Logo.png|thumb]]
=2013 Survey=


If anyone is interested in helping develop this survey, contact Abbie Grotke (abgr@loc.gov).
[[NDSA:August 21, 2013 Meeting Minutes]]


==Draft Survey Questions==
Survey is CLOSED. Analysis is underway. Contact Abbie if you want to get involved. 


*Organization information (name, URL, contact, etc.)
Blog post: http://blogs.loc.gov/digitalpreservation/2013/10/archiving-web-content-take-the-2013-ndsa-survey/  
**Type of organization:
***Historical Society
***College or University
***Museum
***Public Library
***Consortium
***K-12 School
***Federal Government
***State Government (including Archives, state records centers, or Libraries (or keep separate?)(I would include. They generally do this work based on state statute or regs.)
***Local Government (what is the difference in "local" and "city"?)
***City Government
***County Government
***Other (please describe)
*What year did you begin archiving?
*Are you using an external service or company to archive, or crawling in-house?
**If using an external service, what one? (Archive-IT, IA's crawling services, Hanzo, Iterasi, other)
**If you are crawling in-house, what crawling tools are you using (Heritrix, Httrack, other)
*Does your organization have collection policies that cover web archiving?
**Are these publicly accessible (provide URL)?
**If not, but you are willing to share with NDSA members, please email to ndsa@loc.gov with the subject line: Web Archiving Survey Selection Policy
*Do you use web archiving primarily to a) Archive your own web site as a type of institutional record or b)Archive content from other organizations for future research use.  or c) both    [with an option for comments/description]
*Scope of collecting, various Qs about what they archive (allow for comments or description):
**Arts & Humanities
***Dance
***Music
***Art
***Literature
***Architecture
***Film
***Television
**Blogs and Social Media
***Blogs
****Do you try to archive comments and imbedded media?
***Facebook
***Twitter
***YouTube and other video
***All of above as part of regular collecting of websites
***Other
**Computers and Technology
***software
***gaming
***other?
**Government
***State
***Federal
***City
***County
**Spontaneous Events, for example: natural disasters, tragedy, environmental events, spontaneous political demonstrations
**Politics and Elections
***Local elections
***State elections
***Federal elections
**Science and Health
**Society and Culture
**Corporate, Organizational sites
****University or College sites
****
**News
***Newspapers
***Citizen Journalism/Community News
***Broadcast/Television
**International content (leave open-ended, let them describe what they are collecting internationally?)
***topical (specific subject area - sports, political parties, cultural events)
***geographical (one country or many)
*Permissions/Copyright
**Do you ask permission to crawl?  (always, never, sometimes (depends on the content))
**Do you ask display permissions (access)? (always, never, sometimes (depends on the content)
**Do you respect robots.txt when crawling? (always, never, sometimes
**Describe[comments box to explain any of these, further describe]
*Access
**What access tool do you use (if any) for viewing Web archives?
**Do you do full text indexing?  [yes, for testing only; yes, researchers can utilize; no]
**Public access URL:
*Researchers (do we want any questions on research use?)(Yes.  Could ask in an open-ended way how researchers are using the content)
*Ever participated in a collaborative web archive (give examples), yes/no
**if so, describe role/project
*Interested in collaborating on future projects?


==Distribution==
Survey questions: http://www.digitalpreservation.gov/ndsa/documents/ndsa_web_archiving_survey_2013.pdf?loclr=blogsig
*NDSA/NDIIPP listservs/blog/twitter, etc.
 
*IIPC Curators list
Survey Data (identifying information removed): [[File:Ndsa wa survey 2013.xls ]]
*Archive-IT list
 
*ALA groups (need to find people we don't normally talk to)
=2011 survey=  
 
The goal of the survey was to find out the scope of collecting web content in the United States: what organizations collection development policies state (if they have one), what they are actually collecting, and what services are being used to archive, among other things.
 
The Survey was conducted in October 2011 and 91 results were received.
 
Raw results are here: http://www.surveymonkey.com/sr.aspx?sm=35pr1N8b4guFUbdaGYf08gvZGp99eG9r2dfnbeuOSzY_3d
Password is CWG
 
A final report, completed in July 2012, is here: [[File:ndsa_web_archiving_survey_report_2012.pdf]]
 
See these blog posts for more information:
 
http://blogs.loc.gov/digitalpreservation/2012/05/web-archiving-arrives-results-from-the-ndsa-web-archiving-survey/
 
http://blogs.loc.gov/digitalpreservation/2012/07/the-ndsa-web-archiving-survey/
 
And an addendum for NDSA members is here: [[File:ndsa_web_archiving_survey_report_2012_addendum.pdf]]. Please do not distribute this addendum beyond NDSA members.

Latest revision as of 16:57, 29 November 2016

2013 Survey

NDSA:August 21, 2013 Meeting Minutes

Survey is CLOSED. Analysis is underway. Contact Abbie if you want to get involved.

Blog post: http://blogs.loc.gov/digitalpreservation/2013/10/archiving-web-content-take-the-2013-ndsa-survey/

Survey questions: http://www.digitalpreservation.gov/ndsa/documents/ndsa_web_archiving_survey_2013.pdf?loclr=blogsig

Survey Data (identifying information removed): File:Ndsa wa survey 2013.xls

2011 survey

The goal of the survey was to find out the scope of collecting web content in the United States: what organizations collection development policies state (if they have one), what they are actually collecting, and what services are being used to archive, among other things.

The Survey was conducted in October 2011 and 91 results were received.

Raw results are here: http://www.surveymonkey.com/sr.aspx?sm=35pr1N8b4guFUbdaGYf08gvZGp99eG9r2dfnbeuOSzY_3d Password is CWG

A final report, completed in July 2012, is here: File:Ndsa web archiving survey report 2012.pdf

See these blog posts for more information:

http://blogs.loc.gov/digitalpreservation/2012/05/web-archiving-arrives-results-from-the-ndsa-web-archiving-survey/

http://blogs.loc.gov/digitalpreservation/2012/07/the-ndsa-web-archiving-survey/

And an addendum for NDSA members is here: File:Ndsa web archiving survey report 2012 addendum.pdf. Please do not distribute this addendum beyond NDSA members.