<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.diglib.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Lesliej</id>
	<title>DLF Wiki - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.diglib.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Lesliej"/>
	<link rel="alternate" type="text/html" href="https://wiki.diglib.org/Special:Contributions/Lesliej"/>
	<updated>2026-05-30T19:03:11Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.44.0</generator>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:Cloud_Presentations&amp;diff=2026</id>
		<title>NDSA:Cloud Presentations</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:Cloud_Presentations&amp;diff=2026"/>
		<updated>2011-04-04T21:55:44Z</updated>

		<summary type="html">&lt;p&gt;Lesliej: /* iRODS direct responses */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;In each case we would want to identify who would present, who will contact them. Then when they will present. &lt;br /&gt;
&lt;br /&gt;
From there we can include specific questions we would like them to respond to. &lt;br /&gt;
&lt;br /&gt;
==Presentation Schedule and Slides==&lt;br /&gt;
# Feb 1, Tues, 1:00 EST call with iRods Reagan Moore ([[NDSA:Media:NIAID.ppt|presentation]])&lt;br /&gt;
# Feb 14, Monday, 11:00 EST call with Duracloud ([[NDSA:Media:DuracloudNDSA.ppt|presentation]])&lt;br /&gt;
# Feb 17, Thurs, 11:00 EST call with MetaArchive/GDDP Katherine Skinner, Matt Schultz and Martin Halbert MetaArchive NDSA ([[NDSA:Media:MetaArchive NDSA Infrastructure.ppt|presentation]])&lt;br /&gt;
&lt;br /&gt;
==People/Projects to Contact==&lt;br /&gt;
*Chronopolis (Mike Smorul will contact)&lt;br /&gt;
*Open questions from the Educopia Guide to Distributed Digital Preservation &lt;br /&gt;
*Commercial providers? (Who specifically would we want here? Please add them.)&lt;br /&gt;
**Azure (Leslie to contact)&lt;br /&gt;
**Amazon (Who will contact?)&lt;br /&gt;
&lt;br /&gt;
==General Questions for Cloud Service Presenters==&lt;br /&gt;
Here we are working on a set of general questions for presenters to develop talks around. &lt;br /&gt;
&lt;br /&gt;
# What sort of use cases is your system designed to support? What doesn&#039;t this support?&lt;br /&gt;
# What preservation standards would your system support? &lt;br /&gt;
# What resources are required to support a solution implemented in your environment? &lt;br /&gt;
# What infrastructure do you rely on?&lt;br /&gt;
# How can your system impact digital preservation activities?&lt;br /&gt;
# If we put data in your system today what systems and processes are in place so that we can get it back 10 years from now? (Take for granted a sophisticated audience that knows about multiple copies etc.)&lt;br /&gt;
# What types of materials does your system handle? (documents, audio files, video file, stills, data sets, etc) And give examples of those types in practice&lt;br /&gt;
&lt;br /&gt;
==Questions for Member Institution Implementations of Large Scale Storage Architectures==&lt;br /&gt;
#What is the particular preservation goal or challenge you need to accomplish? (for example, re-use, public access, internal access, legal mandate, etc.)&lt;br /&gt;
#What large scale storage or cloud technologies are you using to meet that challenge? Further, which service providers or tools did you consider and how did you make your choice? &lt;br /&gt;
#Specifically, what kind of materials are you preserving (text, data sets, images, moving images, web pages, etc.) &lt;br /&gt;
#How big is your collection? (In terms of number of objects and storage space required)&lt;br /&gt;
#What are your performance requirements?&lt;br /&gt;
#What storage media have you elected to use? (Disk, Tape, etc)&lt;br /&gt;
#What do you think the key advantages of the system you use?&lt;br /&gt;
#What do you think are the key problems or disadvantages your system present?&lt;br /&gt;
#What important principles informed your decision about the particular tool or service you chose to use? &lt;br /&gt;
#How frequently do you migrate from one system to another?&lt;br /&gt;
# What characteristics of the storage system(s) you use do you feel are particularly well-suited to long-term digital preservation? (High levels of redundancy/resiliency, internal checksumming capabilities, automated tape refresh, etc)&lt;br /&gt;
# What functionality or processes have you developed to augment your storage systems in order to meet preservation goals? (Periodic checksum validation, limited human access or novel use of permissions schemes)&lt;br /&gt;
# Are there tough requirements for digital preservation, e.g. TRAC certification, that you wish were more readily handled by your storage system?&lt;br /&gt;
&lt;br /&gt;
===Responses to questions===&lt;br /&gt;
====[[NDSA:iRODS]] direct responses====&lt;br /&gt;
&lt;br /&gt;
Other general notes:&lt;br /&gt;
&lt;br /&gt;
* [Snavely] The need for each storage target to support a specific set of operations, and consistently with other storage targets, seems like a risk that comes along with the elegant abstraction that iRODS provides. Clear specifications help mitigate this risk.&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:DuraCloud]] direct responses====&lt;br /&gt;
Other general notes:&lt;br /&gt;
&lt;br /&gt;
* [Snavely] Treatment of cloud provider is generally as a black box, without a strong sense of actual reliability of underlying storage systems. Cloud providers tend to promise checksum validation of contents, but recourse if validation fails was unknown (right?). Additional checksum validation has been augmented on top of cloud storage service by Duracloud.&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:MetaArchive/GDDP]] direct responses====&lt;br /&gt;
Other general notes:&lt;br /&gt;
&lt;br /&gt;
* [Snavely] Built on LOCKSS, so data integrity assurances are provided by robust networked software model augmented to commodity hardware and storage. Federated nature provides integrity assurance but also a lack of central control in that the accidental loss of multiple caches is unlikely but e.g. scheduled maintenance or upgrades could coincidentally collide.&lt;br /&gt;
&lt;br /&gt;
====Chronopolis====&lt;br /&gt;
# ...&lt;br /&gt;
====MicroSoft Azure====&lt;br /&gt;
# ...&lt;br /&gt;
====Amazon S3/EC2====&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
==General Concerns==&lt;br /&gt;
# confidential data&lt;br /&gt;
# encrypted data&lt;br /&gt;
# auditing&lt;br /&gt;
# preservation risks&lt;br /&gt;
# legal compliance&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
==Solution Models and Environments==&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Name&lt;br /&gt;
!Offered as Service&lt;br /&gt;
!Deployed Locally&lt;br /&gt;
!Opensource&lt;br /&gt;
!Authentication Scheme&lt;br /&gt;
!Ingest Mechanism&lt;br /&gt;
!Export Mechanism&lt;br /&gt;
!Integrity/Validation Mechanism&lt;br /&gt;
!Replication Mechanism&lt;br /&gt;
!Administration Model (Federated, etc.)&lt;br /&gt;
!Tiering Support&lt;br /&gt;
|-&lt;br /&gt;
|iRODS&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|DuraCloud&lt;br /&gt;
|yes&lt;br /&gt;
|yes&lt;br /&gt;
|yes (Apache2)&lt;br /&gt;
|Basic Auth&lt;br /&gt;
|1:web-ui, 2:client-side utility, 3:REST-API&lt;br /&gt;
|1:web-ui, 2:client-side utility, 3:REST-API&lt;br /&gt;
|Checksum verified on ingest. On-demand checksum verification service.&lt;br /&gt;
|Built-in support for cross-cloud replication.&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|MetaArchive/GDDP&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Chronopolis&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Microsoft Azure&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Amazon S3/EC2&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Lesliej</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:IRODS&amp;diff=2348</id>
		<title>NDSA:IRODS</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:IRODS&amp;diff=2348"/>
		<updated>2011-04-04T21:55:18Z</updated>

		<summary type="html">&lt;p&gt;Lesliej: Created page with &amp;#039;# What sort of use cases is your system designed to support? What doesn&amp;#039;t this support? ## Share Data ## Build Digital Libraries ## Build a Preservation Environment  ## Any group…&amp;#039;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;# What sort of use cases is your system designed to support? What doesn&#039;t this support?&lt;br /&gt;
## Share Data&lt;br /&gt;
## Build Digital Libraries&lt;br /&gt;
## Build a Preservation Environment &lt;br /&gt;
## Any group that needs to manage distributed data or to migrate data should consider iRODS.&lt;br /&gt;
Question: Is this a system or a prototype?  Answer: It is definitely in production, although there is a separate prototype for NARA.&lt;br /&gt;
&lt;br /&gt;
Question: Who is using it for a preservation use case?  Answer:  CDR and the Taiwan National Archive&lt;br /&gt;
# What preservation strategies would your system support?&lt;br /&gt;
## The principle strategy is the instantiation of a standard set of enforceable policies in a preservation archive.&lt;br /&gt;
## 120 policies have been identified to date.  In identifying and reviewing the policies at a SAA workshop, there was a subset of 20 that at least 50% wanted.  there is a long tail of policies that at least 1 organization wanted.&lt;br /&gt;
Question:  What issues are there of mismatched semantics across system?  Answer:  An example is NCAR, with mass storage form the 1960s that understood tape get/put.  A disk cache had to be put in place on top of tape to interact with.  It&#039;s the same with Cloud services, which also deals in get/put, and need a cache on top.&lt;br /&gt;
&lt;br /&gt;
Question:  What is the base level of functionality to be a part of iRODS?  Answer:  This varies.  There are specific functions for each local environment.  What data processing needs are there? Where must they be run? etc.&lt;br /&gt;
&lt;br /&gt;
Question:  When managing large data collections, is  distributed data integrity checking built into the system?  Answer:  Yes, at the whichever locations where the data is stored.  You can create procedures for independent checking.&lt;br /&gt;
# What infrastructure do you rely on? AND What resources are required to support a solution implemented in your environment?&lt;br /&gt;
## Any operating system&lt;br /&gt;
## Up to 1 million files cam run in a standalone instance&lt;br /&gt;
## Over 1 million files, a distributed system is needed.&lt;br /&gt;
## The number of files is the primary gating factor for the database.  There is use with Postgres, MySQL, and Oracle, but most use Postgres.&lt;br /&gt;
Question: What is the largest current installation?  Answer:  NASA, with 700 TB, 65-85 million files.  A Particle data project in France has 2-3 PB.&lt;br /&gt;
Question:  Why is the catalog in one central location?  Answer: Efficiency.  There are three models in use in various installations.  NIH required a central catalog, which can have slaves.  NAO required multiple  chained into a catalog.  A project in the UK is using multiple Grids chained together.&lt;br /&gt;
&lt;br /&gt;
Question:  Are there any moving image projects?  Answer:  Yes, Cinegrid.  Also an ocean observation project with video observation files.&lt;br /&gt;
# How can the cloud environment impact digital preservation activities?&lt;br /&gt;
## There are no assertions about integrity or any properties.  The system has to independently record and assert.&lt;br /&gt;
## Once a project reaches a certain amount of data, it should really work locally, not in the cloud.  It can be more cost efficient/predictable if you expect to be running within local capacity.&lt;br /&gt;
# If we put data in your system today what systems and processes  are in place so that we can get it back 50 years from now? (Take for  granted a sophisticated audience that knows about multiple copies etc.)&lt;br /&gt;
## In 50 years, NONE of our current infrastructure components will still be in place.&lt;br /&gt;
## We need infrastructure independence - we have to be able to migrate based on policies, not a specific infrastructure, to a new infrastructure.&lt;br /&gt;
## To do that, we must know all previous versions of policies, and which are applied to which objects.  That is potentially easiest with a policy-based system like iRODS.&lt;br /&gt;
Question:  What cloud services are supported so far?  Answer:  S3 and EC3.  It can also be run in a virtualized environment, such as the VCL project at NCSU.&lt;br /&gt;
&lt;br /&gt;
Question:  What about distributed checksums?  Answer: Can check in each and/or compare across multiple copies.  In a local environment, it checks against the central catalog.&lt;br /&gt;
&lt;br /&gt;
Question:  Are there any privacy use cases to be aware of?  Answer: They have worked with a group on IRB issues, and can implement policies against a local IRB catalog.  That data is NOT stored in the central catalog.&lt;br /&gt;
# Anything else?&lt;br /&gt;
## The next release is February 2011.&lt;/div&gt;</summary>
		<author><name>Lesliej</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:Cloud_Presentations&amp;diff=2025</id>
		<title>NDSA:Cloud Presentations</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:Cloud_Presentations&amp;diff=2025"/>
		<updated>2011-04-04T21:17:32Z</updated>

		<summary type="html">&lt;p&gt;Lesliej: /* iRODS */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;In each case we would want to identify who would present, who will contact them. Then when they will present. &lt;br /&gt;
&lt;br /&gt;
From there we can include specific questions we would like them to respond to. &lt;br /&gt;
&lt;br /&gt;
==Presentation Schedule and Slides==&lt;br /&gt;
# Feb 1, Tues, 1:00 EST call with iRods Reagan Moore ([[NDSA:Media:NIAID.ppt|presentation]])&lt;br /&gt;
# Feb 14, Monday, 11:00 EST call with Duracloud ([[NDSA:Media:DuracloudNDSA.ppt|presentation]])&lt;br /&gt;
# Feb 17, Thurs, 11:00 EST call with MetaArchive/GDDP Katherine Skinner, Matt Schultz and Martin Halbert MetaArchive NDSA ([[NDSA:Media:MetaArchive NDSA Infrastructure.ppt|presentation]])&lt;br /&gt;
&lt;br /&gt;
==People/Projects to Contact==&lt;br /&gt;
*Chronopolis (Mike Smorul will contact)&lt;br /&gt;
*Open questions from the Educopia Guide to Distributed Digital Preservation &lt;br /&gt;
*Commercial providers? (Who specifically would we want here? Please add them.)&lt;br /&gt;
**Azure (Leslie to contact)&lt;br /&gt;
**Amazon (Who will contact?)&lt;br /&gt;
&lt;br /&gt;
==General Questions for Cloud Service Presenters==&lt;br /&gt;
Here we are working on a set of general questions for presenters to develop talks around. &lt;br /&gt;
&lt;br /&gt;
# What sort of use cases is your system designed to support? What doesn&#039;t this support?&lt;br /&gt;
# What preservation standards would your system support? &lt;br /&gt;
# What resources are required to support a solution implemented in your environment? &lt;br /&gt;
# What infrastructure do you rely on?&lt;br /&gt;
# How can your system impact digital preservation activities?&lt;br /&gt;
# If we put data in your system today what systems and processes are in place so that we can get it back 10 years from now? (Take for granted a sophisticated audience that knows about multiple copies etc.)&lt;br /&gt;
# What types of materials does your system handle? (documents, audio files, video file, stills, data sets, etc) And give examples of those types in practice&lt;br /&gt;
&lt;br /&gt;
==Questions for Member Institution Implementations of Large Scale Storage Architectures==&lt;br /&gt;
#What is the particular preservation goal or challenge you need to accomplish? (for example, re-use, public access, internal access, legal mandate, etc.)&lt;br /&gt;
#What large scale storage or cloud technologies are you using to meet that challenge? Further, which service providers or tools did you consider and how did you make your choice? &lt;br /&gt;
#Specifically, what kind of materials are you preserving (text, data sets, images, moving images, web pages, etc.) &lt;br /&gt;
#How big is your collection? (In terms of number of objects and storage space required)&lt;br /&gt;
#What are your performance requirements?&lt;br /&gt;
#What storage media have you elected to use? (Disk, Tape, etc)&lt;br /&gt;
#What do you think the key advantages of the system you use?&lt;br /&gt;
#What do you think are the key problems or disadvantages your system present?&lt;br /&gt;
#What important principles informed your decision about the particular tool or service you chose to use? &lt;br /&gt;
#How frequently do you migrate from one system to another?&lt;br /&gt;
# What characteristics of the storage system(s) you use do you feel are particularly well-suited to long-term digital preservation? (High levels of redundancy/resiliency, internal checksumming capabilities, automated tape refresh, etc)&lt;br /&gt;
# What functionality or processes have you developed to augment your storage systems in order to meet preservation goals? (Periodic checksum validation, limited human access or novel use of permissions schemes)&lt;br /&gt;
# Are there tough requirements for digital preservation, e.g. TRAC certification, that you wish were more readily handled by your storage system?&lt;br /&gt;
&lt;br /&gt;
===Responses to questions===&lt;br /&gt;
====[[NDSA:iRODS]] direct responses====&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
Other general notes:&lt;br /&gt;
&lt;br /&gt;
* [Snavely] The need for each storage target to support a specific set of operations, and consistently with other storage targets, seems like a risk that comes along with the elegant abstraction that iRODS provides. Clear specifications help mitigate this risk.&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:DuraCloud]] direct responses====&lt;br /&gt;
Other general notes:&lt;br /&gt;
&lt;br /&gt;
* [Snavely] Treatment of cloud provider is generally as a black box, without a strong sense of actual reliability of underlying storage systems. Cloud providers tend to promise checksum validation of contents, but recourse if validation fails was unknown (right?). Additional checksum validation has been augmented on top of cloud storage service by Duracloud.&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:MetaArchive/GDDP]] direct responses====&lt;br /&gt;
Other general notes:&lt;br /&gt;
&lt;br /&gt;
* [Snavely] Built on LOCKSS, so data integrity assurances are provided by robust networked software model augmented to commodity hardware and storage. Federated nature provides integrity assurance but also a lack of central control in that the accidental loss of multiple caches is unlikely but e.g. scheduled maintenance or upgrades could coincidentally collide.&lt;br /&gt;
&lt;br /&gt;
====Chronopolis====&lt;br /&gt;
# ...&lt;br /&gt;
====MicroSoft Azure====&lt;br /&gt;
# ...&lt;br /&gt;
====Amazon S3/EC2====&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
==General Concerns==&lt;br /&gt;
# confidential data&lt;br /&gt;
# encrypted data&lt;br /&gt;
# auditing&lt;br /&gt;
# preservation risks&lt;br /&gt;
# legal compliance&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
==Solution Models and Environments==&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Name&lt;br /&gt;
!Offered as Service&lt;br /&gt;
!Deployed Locally&lt;br /&gt;
!Opensource&lt;br /&gt;
!Authentication Scheme&lt;br /&gt;
!Ingest Mechanism&lt;br /&gt;
!Export Mechanism&lt;br /&gt;
!Integrity/Validation Mechanism&lt;br /&gt;
!Replication Mechanism&lt;br /&gt;
!Administration Model (Federated, etc.)&lt;br /&gt;
!Tiering Support&lt;br /&gt;
|-&lt;br /&gt;
|iRODS&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|DuraCloud&lt;br /&gt;
|yes&lt;br /&gt;
|yes&lt;br /&gt;
|yes (Apache2)&lt;br /&gt;
|Basic Auth&lt;br /&gt;
|1:web-ui, 2:client-side utility, 3:REST-API&lt;br /&gt;
|1:web-ui, 2:client-side utility, 3:REST-API&lt;br /&gt;
|Checksum verified on ingest. On-demand checksum verification service.&lt;br /&gt;
|Built-in support for cross-cloud replication.&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|MetaArchive/GDDP&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Chronopolis&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Microsoft Azure&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Amazon S3/EC2&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Lesliej</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:Cloud_Presentations&amp;diff=1987</id>
		<title>NDSA:Cloud Presentations</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:Cloud_Presentations&amp;diff=1987"/>
		<updated>2011-02-01T19:09:04Z</updated>

		<summary type="html">&lt;p&gt;Lesliej: /* People/Projects to Contact */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;In each case we would want to identify who would present, who will contact them. Then when they will present. &lt;br /&gt;
&lt;br /&gt;
From there we can include specific questions we would like them to respond to. &lt;br /&gt;
&lt;br /&gt;
==Presentation Schedule==&lt;br /&gt;
Once we start scheduling presenters we will keep a list of the talks here. &lt;br /&gt;
&lt;br /&gt;
==People/Projects to Contact==&lt;br /&gt;
*DuraCloud/Duraspace (Leslie to contact)&lt;br /&gt;
*Chronopolis (Mike Smorul will contact)&lt;br /&gt;
*Open questions from the Educopia Guide to Distributed Digital Preservation http://www.metaarchive.org/GDDP (Martin will contact)&lt;br /&gt;
*Irods: Reagan Moore, 2/1/2011&lt;br /&gt;
*Commercial providers? (Who specifically would we want here? Please add them.)&lt;br /&gt;
**Azure (Leslie to contact)&lt;br /&gt;
**Amazon (Who will contact?)&lt;br /&gt;
&lt;br /&gt;
==General Guiding Questions for Presenters==&lt;br /&gt;
Here we are working on a set of general questions for presenters to develop talks around. &lt;br /&gt;
&lt;br /&gt;
*What sort of use cases is your system designed to support? What doesn&#039;t this support?&lt;br /&gt;
*What preservation strategies or standards would your system support? &lt;br /&gt;
*What resources are required to support a solution implemented in your environment &lt;br /&gt;
*How can the cloud environment impact digital preservation activities?&lt;br /&gt;
*If we put data in your system today what systems and processes are in place so that we can get it back 50 years from now? (Take for granted a sophisticated audience that knows about multiple copies etc.)&lt;br /&gt;
*What infrastructure do you rely on?&lt;/div&gt;</summary>
		<author><name>Lesliej</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:Cloud_Presentations&amp;diff=1986</id>
		<title>NDSA:Cloud Presentations</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:Cloud_Presentations&amp;diff=1986"/>
		<updated>2011-02-01T19:08:49Z</updated>

		<summary type="html">&lt;p&gt;Lesliej: /* People/Projects to Contact */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;In each case we would want to identify who would present, who will contact them. Then when they will present. &lt;br /&gt;
&lt;br /&gt;
From there we can include specific questions we would like them to respond to. &lt;br /&gt;
&lt;br /&gt;
==Presentation Schedule==&lt;br /&gt;
Once we start scheduling presenters we will keep a list of the talks here. &lt;br /&gt;
&lt;br /&gt;
==People/Projects to Contact==&lt;br /&gt;
*DuraCloud/Duraspace (Leslie to contact)&lt;br /&gt;
*Chronopolis (Mike Smorul will contact)&lt;br /&gt;
*Open questions from the Educopia Guide to Distributed Digital Preservation http://www.metaarchive.org/GDDP (Martin will contact)&lt;br /&gt;
*Irods 2/1/2011&lt;br /&gt;
*Commercial providers? (Who specifically would we want here? Please add them.)&lt;br /&gt;
**Azure (Leslie to contact)&lt;br /&gt;
**Amazon (Who will contact?)&lt;br /&gt;
&lt;br /&gt;
==General Guiding Questions for Presenters==&lt;br /&gt;
Here we are working on a set of general questions for presenters to develop talks around. &lt;br /&gt;
&lt;br /&gt;
*What sort of use cases is your system designed to support? What doesn&#039;t this support?&lt;br /&gt;
*What preservation strategies or standards would your system support? &lt;br /&gt;
*What resources are required to support a solution implemented in your environment &lt;br /&gt;
*How can the cloud environment impact digital preservation activities?&lt;br /&gt;
*If we put data in your system today what systems and processes are in place so that we can get it back 50 years from now? (Take for granted a sophisticated audience that knows about multiple copies etc.)&lt;br /&gt;
*What infrastructure do you rely on?&lt;/div&gt;</summary>
		<author><name>Lesliej</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:Infrastructure_Working_Group_Members&amp;diff=1487</id>
		<title>NDSA:Infrastructure Working Group Members</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:Infrastructure_Working_Group_Members&amp;diff=1487"/>
		<updated>2011-01-20T22:09:54Z</updated>

		<summary type="html">&lt;p&gt;Lesliej: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;*Micah Altman&lt;br /&gt;
*Joseph Pawletko&lt;br /&gt;
*Elizabeth Perkes&lt;br /&gt;
*Bryan Beecher&lt;br /&gt;
*Karen Cariani&lt;br /&gt;
*Kris Carpenter&lt;br /&gt;
*Patricia Cruse&lt;br /&gt;
*Daphane DeLeon&lt;br /&gt;
*Blane Dessy&lt;br /&gt;
*Daniel Dodge&lt;br /&gt;
*Erin Engle&lt;br /&gt;
*Dean Farrell&lt;br /&gt;
*Eileen Fenton&lt;br /&gt;
*Michelle Gallinger&lt;br /&gt;
*Michael J. Giarlo&lt;br /&gt;
*U Andrea Goethals&lt;br /&gt;
*Abbie Grotke&lt;br /&gt;
*Matt Guzzi&lt;br /&gt;
*Martin Halbert&lt;br /&gt;
*Christine Marie Hopper&lt;br /&gt;
*Bob Horton&lt;br /&gt;
*Howard, Barrie&lt;br /&gt;
*Martin Jacobson&lt;br /&gt;
*Joseph JaJa&lt;br /&gt;
*Leslie Johnston&lt;br /&gt;
*Jimi Jones&lt;br /&gt;
*Butch Lazorchak&lt;br /&gt;
*Cal Lee&lt;br /&gt;
*Jane Mandelbaum&lt;br /&gt;
*Jonathan Marmor&lt;br /&gt;
*David Minor&lt;br /&gt;
*Eugene Mopsik&lt;br /&gt;
*Michael Nelson&lt;br /&gt;
*Trevor Owens&lt;br /&gt;
*Abbey Potter&lt;br /&gt;
*Curtis Pulford&lt;br /&gt;
*Patricia Smith-Mansfield&lt;br /&gt;
*Mike Smorul&lt;br /&gt;
*Cory Snavely&lt;br /&gt;
*Herbert Van de Sompel&lt;br /&gt;
*John Spencer&lt;br /&gt;
*Taylor Surface&lt;br /&gt;
*John Unsworth&lt;br /&gt;
*William Ying&lt;br /&gt;
*Andrew Woods&lt;/div&gt;</summary>
		<author><name>Lesliej</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:Tuesday,_Jan_18,_2011&amp;diff=2078</id>
		<title>NDSA:Tuesday, Jan 18, 2011</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:Tuesday,_Jan_18,_2011&amp;diff=2078"/>
		<updated>2011-01-20T22:02:23Z</updated>

		<summary type="html">&lt;p&gt;Lesliej: Created page with &amp;#039;NDSA Infrastructure call, 01/18/2011  On the call:  *Karen Cariani *Leslie Johnston *Elizabeth Perkes *Cal Lee *Dan Dodge *Mike Smorul *Mike Giarlo *Micah Altman *Joe Pawletko *M…&amp;#039;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;NDSA Infrastructure call, 01/18/2011&lt;br /&gt;
&lt;br /&gt;
On the call:&lt;br /&gt;
&lt;br /&gt;
*Karen Cariani&lt;br /&gt;
*Leslie Johnston&lt;br /&gt;
*Elizabeth Perkes&lt;br /&gt;
*Cal Lee&lt;br /&gt;
*Dan Dodge&lt;br /&gt;
*Mike Smorul&lt;br /&gt;
*Mike Giarlo&lt;br /&gt;
*Micah Altman&lt;br /&gt;
*Joe Pawletko&lt;br /&gt;
*Michelle Galliger&lt;br /&gt;
*Cory Snavely&lt;br /&gt;
*Curtis Pulford &lt;br /&gt;
*Trevor Owens&lt;br /&gt;
*Dean Farrell&lt;br /&gt;
*Taylor Surface&lt;br /&gt;
*Kris Carpenter&lt;br /&gt;
*Martin Halbert&lt;br /&gt;
*(Apologies to anyone we missed)&lt;br /&gt;
&lt;br /&gt;
==Agenda==&lt;br /&gt;
&lt;br /&gt;
*Determine if the third Tues of every month at 11:00 works best for the working group call&lt;br /&gt;
*Update of the NDSA meeting in December&lt;br /&gt;
*Wiki overview - discuss how we want to use it&lt;br /&gt;
*Discussion and planning to move forward with our two work issues&lt;br /&gt;
&lt;br /&gt;
==Call Schedule==&lt;br /&gt;
&lt;br /&gt;
It works for most people. Some have conflicts, but most can make it work.&lt;br /&gt;
&lt;br /&gt;
==December 2010 NDSA Organizing Workshop==&lt;br /&gt;
&lt;br /&gt;
Over 40 people were in the room, discussing how NDSA could and should be organized.  It was a fruitful conversation, with a wide spectrum of views in some areas, with others very close to consensus.  The [[NDSA:Organizing Workshop Notes]] have been circulated to the organizing group.  There is a timeline for document review and ratification by all NDSA members:&lt;br /&gt;
&lt;br /&gt;
*Send revised draft of organizing documents to all members: 2/04/11&lt;br /&gt;
*Review and comment period for documents: 2/18/11&lt;br /&gt;
*Incorporation of revisions from entire membership: 2/25/11&lt;br /&gt;
*Vote by entire membership for ratification: 3/01/11&lt;br /&gt;
*Voting period ends:  3/15/11&lt;br /&gt;
&lt;br /&gt;
Please send comments to Michelle or use the wiki.&lt;br /&gt;
&lt;br /&gt;
Kris:  One point – During the meeting there was a lot of discussion about tightening down the focus of each working group.  One specific topic was coordinating the participation of folks who are signed up for many groups. &lt;br /&gt;
&lt;br /&gt;
Leslie:  There will be some natural group membership fluctuations while people select the most appropriate group to be on.&lt;br /&gt;
&lt;br /&gt;
==Using the Wiki==&lt;br /&gt;
&lt;br /&gt;
Karen:  Do all agree that we should use the wiki as a collaborative work space?  &lt;br /&gt;
&lt;br /&gt;
All:  Yes, but we need to get notifications about new work by email.&lt;br /&gt;
&lt;br /&gt;
Everyone should have an account (Michelle set them up). Log in to generate your password.&lt;br /&gt;
Your User Name will be the front part of your email address (for example, lesliej or abgr)&lt;br /&gt;
Then click on &amp;quot;email new password&amp;quot; and a password should be sent to you.&lt;br /&gt;
&lt;br /&gt;
The working group charter we developed during our last call that will be guiding our work can be found on the wiki.  It can be found here: [[NDSA:Infrastructure_Working_Group]]&lt;br /&gt;
&lt;br /&gt;
Meeting notes are going up soon.&lt;br /&gt;
&lt;br /&gt;
==Work==&lt;br /&gt;
&lt;br /&gt;
The top topic from the Doodle poll the topic is:&lt;br /&gt;
&lt;br /&gt;
* Investigate, recognize, and document potential preservation emerging practices in the use of large-scale storage and cloud infrastructures.&lt;br /&gt;
&lt;br /&gt;
This one was a very close second so we should consider how we might address both:&lt;br /&gt;
&lt;br /&gt;
* Investigate, share, and recognize emerging practices for use and development and sharing of open source tools and other software that enable digital preservation.&lt;br /&gt;
&lt;br /&gt;
All:  Let’s start with cloud storage.  Let’s invite experts to speak with us.&lt;br /&gt;
&lt;br /&gt;
Cal: What do we want to learn?&lt;br /&gt;
&lt;br /&gt;
Martin: What are the issues in architecting systems?    It is compelling that they can potentially grow seamlessly from small to very large capacity with the same functional capabilities. Can they really? What are the issues?&lt;br /&gt;
&lt;br /&gt;
Kris: Are we asking about services offered by others, or about deploying our own cloud environments? It would be nice to hear about both.&lt;br /&gt;
&lt;br /&gt;
All:  Yes.&lt;br /&gt;
&lt;br /&gt;
Unidentified speaker:  We need to identify preservation issues for speakers to address.  As an example, how will we deal with confidential information, encrypted files, and auditing?  What are the preservation risks?&lt;br /&gt;
&lt;br /&gt;
Who will we invite?&lt;br /&gt;
&lt;br /&gt;
*DuraCloud (has worked with multiple commercial providers)&lt;br /&gt;
*Microsoft Azure&lt;br /&gt;
*Amazon (hosts a contest and shares best examples of use of the cloud)&lt;br /&gt;
*NSF (has been making awards around the use of Google and Azure clouds for computation)&lt;br /&gt;
*Chronopolis&lt;br /&gt;
*Irods&lt;br /&gt;
*LOCKSS&lt;br /&gt;
*Authors of the Guide to Distributed Digital Preservation&lt;br /&gt;
&lt;br /&gt;
Trevor:  Will we set up an Action Group?&lt;br /&gt;
&lt;br /&gt;
All:  No, not for this.  It’s better as a full group activity.  &lt;br /&gt;
&lt;br /&gt;
Mike Smorul:  Volunteers to talk about how they have worked with Chronopolis and Duraspace.  &lt;br /&gt;
&lt;br /&gt;
Martin:  As one of the authors of the Guide to Distributed Digital Preservation, they are working on a follow-up. The folks continuing the work that could give a presentation.  &lt;br /&gt;
&lt;br /&gt;
Martin – will talk to Katherine Skinner, and Matt Schultz. (Did Martin also say he would talk to Stanford and LOCKSS)&lt;br /&gt;
&lt;br /&gt;
Leslie:  Will invite DuraCloud as the first speaker.  Can also talk to Microsoft about Azure. DuraCloud team may be able to help us connect with commercial providers.&lt;br /&gt;
&lt;br /&gt;
Cal: Will talk to Irods team.&lt;br /&gt;
&lt;br /&gt;
Kris -  Let’s not forget about personal archiving in the cloud.  There are some Silicon Valley companies we might be able to connect with.  &lt;br /&gt;
&lt;br /&gt;
Cal:  What questions do we need to ask of them to compare each of the different solutions?&lt;br /&gt;
&lt;br /&gt;
Trevor:  Suggests that we develop a set of questions as a group to give to presenter s before hand so that all the presenters have common questions.  &lt;br /&gt;
&lt;br /&gt;
==Draft Questions==&lt;br /&gt;
&lt;br /&gt;
*What sort of use cases is your system designed to support? What doesn&#039;t this support?&lt;br /&gt;
*What preservation strategies or standards would your system support? &lt;br /&gt;
*What resources are required to support a solution implemented in your environment &lt;br /&gt;
*How can the cloud environment impact digital preservation activities?&lt;br /&gt;
*If we put data in your system today what systems and processes are in place so that we can get it back 50 years from now? (Take for granted a sophisticated audience that knows about multiple copies etc.)&lt;br /&gt;
*What infrastructure do you rely on?&lt;br /&gt;
&lt;br /&gt;
Kris: There is also a lot of movement toward solutions for legal compliance. We should make sure to bring that up.&lt;br /&gt;
&lt;br /&gt;
Martin: Did anything come out of the LOC _Preservation Storage meeting about the Cloud?&lt;br /&gt;
&lt;br /&gt;
Leslie.  Not so much.  Will send link to meeting notes.&lt;br /&gt;
&lt;br /&gt;
Martin:  When we ask about standards, which do we mean?  Do we know?&lt;br /&gt;
&lt;br /&gt;
Cal:  We need to find out what standards they are prioritizing.&lt;br /&gt;
&lt;br /&gt;
Martin: If they don’t have an answer, that’s a red flag.&lt;br /&gt;
&lt;br /&gt;
Karen: the [[NDSA:Cloud Presentations]] questions will go on the wiki.&lt;/div&gt;</summary>
		<author><name>Lesliej</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:Cloud_Presentations&amp;diff=1985</id>
		<title>NDSA:Cloud Presentations</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:Cloud_Presentations&amp;diff=1985"/>
		<updated>2011-01-20T21:44:37Z</updated>

		<summary type="html">&lt;p&gt;Lesliej: /* People/Projects to Contact */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;In each case we would want to identify who would present, who will contact them. Then when they will present. &lt;br /&gt;
&lt;br /&gt;
From there we can include specific questions we would like them to respond to. &lt;br /&gt;
&lt;br /&gt;
==Presentation Schedule==&lt;br /&gt;
Once we start scheduling presenters we will keep a list of the talks here. &lt;br /&gt;
&lt;br /&gt;
==People/Projects to Contact==&lt;br /&gt;
*DuraCloud/Duraspace (Leslie to contact)&lt;br /&gt;
*Chronopolis (Mike Smorul will contact)&lt;br /&gt;
*Open questions from the Educopia Guide to Distributed Digital Preservation http://www.metaarchive.org/GDDP (Martin will contact)&lt;br /&gt;
*Irods (Cal Lee will contact)&lt;br /&gt;
*Commercial providers? (Who specifically would we want here? Please add them.)&lt;br /&gt;
**Azure (Leslie to contact)&lt;br /&gt;
**Amazon (Who will contact?)&lt;br /&gt;
&lt;br /&gt;
==General Guiding Questions for Presenters==&lt;br /&gt;
Here we are working on a set of general questions for presenters to develop talks around. &lt;br /&gt;
&lt;br /&gt;
*What sort of use cases is your system designed to support? What doesn&#039;t this support?&lt;br /&gt;
*What preservation strategies or standards would your system support? &lt;br /&gt;
*What resources are required to support a solution implemented in your environment &lt;br /&gt;
*How can the cloud environment impact digital preservation activities?&lt;br /&gt;
*If we put data in your system today what systems and processes are in place so that we can get it back 50 years from now? (Take for granted a sophisticated audience that knows about multiple copies etc.)&lt;br /&gt;
*What infrastructure do you rely on?&lt;/div&gt;</summary>
		<author><name>Lesliej</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:Cloud_Presentations&amp;diff=1984</id>
		<title>NDSA:Cloud Presentations</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:Cloud_Presentations&amp;diff=1984"/>
		<updated>2011-01-20T21:28:28Z</updated>

		<summary type="html">&lt;p&gt;Lesliej: /* General Guiding Questions for Presenters */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;In each case we would want to identify who would present, who will contact them. Then when they will present. &lt;br /&gt;
&lt;br /&gt;
From there we can include specific questions we would like them to respond to. &lt;br /&gt;
&lt;br /&gt;
==Presentation Schedule==&lt;br /&gt;
Once we start scheduling presenters we will keep a list of the talks here. &lt;br /&gt;
&lt;br /&gt;
==People/Projects to Contact==&lt;br /&gt;
*DuraCloud/Duraspace (Leslie to contact)&lt;br /&gt;
*Chronopolis (Who will contact)&lt;br /&gt;
*Open questions from the Educopia Guide to Distributed Digital Preservation http://www.metaarchive.org/GDDP (Who will contact?)&lt;br /&gt;
*Irods (Kell will contact)&lt;br /&gt;
*Commercial providers? (Who specifically would we want here? Please add them.)&lt;br /&gt;
**Azure (Leslie to contact)&lt;br /&gt;
**Amazon (Who will contact?)&lt;br /&gt;
&lt;br /&gt;
==General Guiding Questions for Presenters==&lt;br /&gt;
Here we are working on a set of general questions for presenters to develop talks around. &lt;br /&gt;
&lt;br /&gt;
*What sort of use cases is your system designed to support? What doesn&#039;t this support?&lt;br /&gt;
*What preservation strategies or standards would your system support? &lt;br /&gt;
*What resources are required to support a solution implemented in your environment &lt;br /&gt;
*How can the cloud environment impact digital preservation activities?&lt;br /&gt;
*If we put data in your system today what systems and processes are in place so that we can get it back 50 years from now? (Take for granted a sophisticated audience that knows about multiple copies etc.)&lt;br /&gt;
*What infrastructure do you rely on?&lt;/div&gt;</summary>
		<author><name>Lesliej</name></author>
	</entry>
</feed>