NDSA:Columbia University
- What is the particular preservation goal or challenge you need to accomplish? (for example, re-use, public access, internal access, legal mandate, etc.)
- Design & implement coherent & comprehensive preservation program for ensuring survival & continued accessibility of Libraries’ digital content. Develop & budget for long-term digital archiving strategy for content created by the Libraries, whether “born-digital” or converted from analog formats.
- Provide stable, secure storage for large-scale access & long- term preservation
- Support efficient creation & management of administrative, descriptive, structural, preservation & rights metadata
- Support object relationships, actions, behaviors, fine-grained access control policies
- What large scale storage or cloud technologies are you using to meet that challenge? Further, why did you choose these particular technologies?
- SUN SAM-FS platform, four copies, two on disk, two on tape
- 70TB effective storage with 9.6TB tier I disk cache
- Offsite disk storage at NYSERNet Data Center, Syracuse, New York, dedicated 1Gb/s network link to Columbia
- Risk averse - use "tried and true" technologies
- Open to maximize sustainability and flexibility
- Entrance and exit strategy
- Specifically, what kind of materials are you preserving (text, data sets, images, moving images, web pages, etc.)
- text, images, data sets, audio, limited video
- How big is your collection? (In terms of number of objects and storage space required)
- What are your performance requirements? Further, why are these your particular requirements?
- What storage media have you elected to use? (Disk, Tape, etc) Further, why did you choose these particular media?
- What do you think the key advantages of the system you use?
- What do you think are the key problems or disadvantages your system present?
- What important principles informed your decision about the particular tool or service you chose to use?
- How frequently do you migrate from one system to another? Further, what is it that prompts you to make these migrations?
- What characteristics of the storage system(s) you use do you feel are particularly well-suited to long-term digital preservation? (High levels of redundancy/resiliency, internal checksumming capabilities, automated tape refresh, etc)
- What functionality or processes have you developed to augment your storage systems in order to meet preservation goals? (Periodic checksum validation, limited human access or novel use of permissions schemes)
- Are there tough requirements for digital preservation, e.g. TRAC certification, that you wish were more readily handled by your storage system?