What sort of use cases is your system designed to support? What doesn't this support?
repository backup
backup from a file directory
disaster recovery backup of content
single file recovery
Preservation activities that require additional compute resources and space
Staging area for pre-preservation-ready content
Predictable URLs; we maintain the content ID you provide
Activities not currently supported: file format migration, explicit versioning, repository functions (it is not a repository system, so there are no collection or hierarchy mechanisms), policy management implementations, automatic repair of the local file copy
What preservation strategies would your system support?
multiple copies in multiple locations under multiple administrations
automatic synchronization with the primary copy
all copies are web accessible and can be viewed or downloaded
bit integrity checking that compares primary and secondary copies against a manifest
format identification (in progress)
provenance auditing (on roadmap)
repair of secondary copies (on roadmap)
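The bit integrity check mentioned above can be illustrated with a minimal sketch. It is not DuraCloud's own implementation: the manifest is assumed to be a simple mapping of content ID to expected MD5 checksum, and the function names `md5_of_file` and `compare_with_manifest` are hypothetical.

```python
import hashlib
from pathlib import Path


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def compare_with_manifest(local_root, manifest):
    """Compare local files against a manifest.

    `manifest` is assumed to map a content ID (a path relative to
    `local_root`) to the expected MD5 checksum as a hex string.
    """
    report = {"ok": [], "mismatch": [], "missing": []}
    for content_id, expected in sorted(manifest.items()):
        path = Path(local_root) / content_id
        if not path.is_file():
            report["missing"].append(content_id)
        elif md5_of_file(path) == expected.lower():
            report["ok"].append(content_id)
        else:
            report["mismatch"].append(content_id)
    return report
```

Anything listed under "mismatch" or "missing" would then be a candidate for repair from another copy.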
What preservation standards would your system support?
Any that involve specifications for a "bundle" of bits, such as BagIt
Compatible with storing any type of package (e.g., an AIP)
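As an illustration of what a "bundle of bits" specification involves, the sketch below writes the two required tag files of a BagIt bag (`bagit.txt` and a payload manifest) for a directory that already contains a `data/` payload folder. The function name `write_bag_tag_files` is hypothetical, and the choice of SHA-256 for the manifest is an assumption; this is not part of DuraCloud itself.

```python
import hashlib
from pathlib import Path


def write_bag_tag_files(bag_dir):
    """Write bagit.txt and manifest-sha256.txt for an existing data/ payload.

    Returns the manifest lines that were written.
    """
    bag = Path(bag_dir)
    payload = sorted(p for p in (bag / "data").rglob("*") if p.is_file())

    lines = []
    for path in payload:
        # read_bytes() loads the whole file; fine for a sketch, but large
        # files would be better hashed in chunks.
        checksum = hashlib.sha256(path.read_bytes()).hexdigest()
        lines.append(f"{checksum}  {path.relative_to(bag).as_posix()}")

    # Bag declaration: version and tag-file encoding.
    (bag / "bagit.txt").write_text(
        "BagIt-Version: 1.0\nTag-File-Character-Encoding: UTF-8\n",
        encoding="utf-8",
    )
    # Payload manifest: one "<checksum>  <path>" line per payload file.
    (bag / "manifest-sha256.txt").write_text(
        "".join(line + "\n" for line in lines), encoding="utf-8"
    )
    return lines
```

The resulting directory can be stored like any other content, since the package is only a set of files.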
What resources are required to support a solution implemented in your environment?
almost none
you need one administrator to manage the DuraCloud account
you might require some technical help to get your content out of your local system and push a copy to DuraCloud
What infrastructure do you rely on?
public cloud storage
public cloud compute
private cloud storage
How can the cloud environment impact digital preservation activities?
hopefully makes it easier to carry out and support activities that are difficult to provision and manage internally
relieves the pressure of managing and upgrading internal hardware and of forecasting server and storage requirements
If we put data in your system today what systems and processes are in place so that we can get it back 50 years from now? (Take for granted a sophisticated audience that knows about multiple copies etc.)
You own and manage your own account and data. You are not handing it over to us, so you can do what you want with it at any time.
The software is all open source, so if you ever decide to run the whole stack/application on your own, you can.
The system is tied to multiple cloud providers, which lowers the risk if one goes out of business.
Your original copy is your local copy, and most likely the copy of record. DuraCloud is just a backup.
If one provider goes out of business, we will help you move your content out and to another provider.
Concerns to address
confidential data
DuraCloud is one low-level component of an overall preservation strategy. It does not address fine-grained policy and access control considerations. It can be used to house entire collections of confidential data, and/or support a system which provides granular controls, but it does not do so itself. It does support basic authentication, and you can make spaces within DuraCloud dark or light.
encrypted data
DuraCloud can store any "bundle of bits". It does not provide its own primitives for encryption. Due to the remote nature of many DuraCloud use cases, maintaining encryption on an end-to-end basis is out of scope.
auditing
auditing of content
system audit potential
preservation risks
Cloud is an emerging market
ability to fund preservation solutions, particularly when online
legal compliance
Content access and copyright are controlled and managed by the user/account holder