<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.diglib.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Rtc</id>
	<title>DLF Wiki - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.diglib.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Rtc"/>
	<link rel="alternate" type="text/html" href="https://wiki.diglib.org/Special:Contributions/Rtc"/>
	<updated>2026-05-07T15:39:52Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.44.0</generator>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:Columbia_University&amp;diff=2609</id>
		<title>NDSA:Columbia University</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:Columbia_University&amp;diff=2609"/>
		<updated>2011-06-08T20:27:26Z</updated>

		<summary type="html">&lt;p&gt;Rtc: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;#What is the particular preservation goal or challenge you need to accomplish? (for example, re-use, public access, internal access, legal mandate, etc.)&lt;br /&gt;
#* Design &amp;amp; implement coherent &amp;amp; comprehensive preservation program for ensuring survival &amp;amp; continued accessibility of Libraries’ digital content. Develop &amp;amp; budget for long-term digital archiving strategy for content created by the Libraries, whether “born-digital” or converted from analog formats.&lt;br /&gt;
#* Provide stable, secure storage for large-scale access &amp;amp; long- term preservation&lt;br /&gt;
#* Support efficient creation &amp;amp; management of administrative, descriptive, structural, preservation &amp;amp; rights metadata&lt;br /&gt;
#* Support object relationships, actions, behaviors, fine-grained access control policies&lt;br /&gt;
#What large scale storage or cloud technologies are you using to meet that challenge? Further, why did you choose these particular technologies?&lt;br /&gt;
#* Fedora version 3&lt;br /&gt;
#* SUN SAM-FS platform, four copies, two on disk, two on tape&lt;br /&gt;
#* 70TB effective storage with 9.6TB tier I disk cache&lt;br /&gt;
#* Offsite disk storage at NYSERNet Data Center, Syracuse, New York, dedicated 1Gb/s network link to Columbia&lt;br /&gt;
#* Risk averse - use &amp;quot;tried and true&amp;quot; technologies&lt;br /&gt;
#* Open to maximize sustainability and flexibility&lt;br /&gt;
#* Entrance and exit strategy&lt;br /&gt;
#Specifically, what kind of materials are you preserving (text, data sets, images, moving images, web pages, etc.) &lt;br /&gt;
#* text, images, data sets, audio, limited video&lt;br /&gt;
#How big is your collection? (In terms of number of objects and storage space required)&lt;br /&gt;
#* TBD&lt;br /&gt;
#What are your performance requirements? Further, why are these your particular requirements?&lt;br /&gt;
#* System is an &amp;quot;accessible repository&amp;quot; with low latency access to data.  &lt;br /&gt;
#* Decision to build consolidated system based on current size of collection, desire to provide ready access to materials.&lt;br /&gt;
#What storage media have you elected to use? (Disk, Tape, etc) Further, why did you choose these particular media?&lt;br /&gt;
#* Two copies of disk, two copies on tape, with one remote disk copy in Syracuse.&lt;br /&gt;
#* Two copies on disk support fixity checking.  &lt;br /&gt;
#* Tape copy supports offline and offsite backup.&lt;br /&gt;
#What do you think the key advantages of the system you use?&lt;br /&gt;
#* SAM automatically replicates data based on defined policies&lt;br /&gt;
#* SAM automatically brings data from SATA storage into higher performance fibre-channel storage&lt;br /&gt;
#What do you think are the key problems or disadvantages your system present?&lt;br /&gt;
#* System was a conservative choice, has commercial support, and has met our needs.&lt;br /&gt;
#* Oracle acquisition has created some uncertainty regarding hardware. &lt;br /&gt;
#What important principles informed your decision about the particular tool or service you chose to use? &lt;br /&gt;
#How frequently do you migrate from one system to another? Further, what is it that prompts you to make these migrations? &lt;br /&gt;
#* We will migrate at the end of the equipment lifecycle (4-5 years).  We haven&#039;t decided if we will migrate off of SAM-FS.&lt;br /&gt;
# What characteristics of the storage system(s) you use do you feel are particularly well-suited to long-term digital preservation? (High levels of redundancy/resiliency, internal checksumming capabilities, automated tape refresh, etc)&lt;br /&gt;
# What functionality or processes have you developed to augment your storage systems in order to meet preservation goals? (Periodic checksum validation, limited human access or novel use of permissions schemes)&lt;br /&gt;
# Are there tough requirements for digital preservation, e.g. TRAC certification, that you wish were more readily handled by your storage system?&lt;/div&gt;</summary>
		<author><name>Rtc</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:Columbia_University&amp;diff=2608</id>
		<title>NDSA:Columbia University</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:Columbia_University&amp;diff=2608"/>
		<updated>2011-06-08T20:24:01Z</updated>

		<summary type="html">&lt;p&gt;Rtc: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;#What is the particular preservation goal or challenge you need to accomplish? (for example, re-use, public access, internal access, legal mandate, etc.)&lt;br /&gt;
#* Design &amp;amp; implement coherent &amp;amp; comprehensive preservation program for ensuring survival &amp;amp; continued accessibility of Libraries’ digital content. Develop &amp;amp; budget for long-term digital archiving strategy for content created by the Libraries, whether “born-digital” or converted from analog formats.&lt;br /&gt;
#* Provide stable, secure storage for large-scale access &amp;amp; long- term preservation&lt;br /&gt;
#* Support efficient creation &amp;amp; management of administrative, descriptive, structural, preservation &amp;amp; rights metadata&lt;br /&gt;
#* Support object relationships, actions, behaviors, fine-grained access control policies&lt;br /&gt;
#What large scale storage or cloud technologies are you using to meet that challenge? Further, why did you choose these particular technologies?&lt;br /&gt;
#* Fedora version 3&lt;br /&gt;
#* SUN SAM-FS platform, four copies, two on disk, two on tape&lt;br /&gt;
#* 70TB effective storage with 9.6TB tier I disk cache&lt;br /&gt;
#* Offsite disk storage at NYSERNet Data Center, Syracuse, New York, dedicated 1Gb/s network link to Columbia&lt;br /&gt;
#* Risk averse - use &amp;quot;tried and true&amp;quot; technologies&lt;br /&gt;
#* Open to maximize sustainability and flexibility&lt;br /&gt;
#* Entrance and exit strategy&lt;br /&gt;
#Specifically, what kind of materials are you preserving (text, data sets, images, moving images, web pages, etc.) &lt;br /&gt;
#* text, images, data sets, audio, limited video&lt;br /&gt;
#How big is your collection? (In terms of number of objects and storage space required)&lt;br /&gt;
#* TBD&lt;br /&gt;
#What are your performance requirements? Further, why are these your particular requirements?&lt;br /&gt;
#* System is an &amp;quot;accessible repository&amp;quot; with low latency access to data.  &lt;br /&gt;
#* Decision to build consolidated system based on current size of collection, desire to provide ready access to materials.&lt;br /&gt;
#What storage media have you elected to use? (Disk, Tape, etc) Further, why did you choose these particular media?&lt;br /&gt;
#* Two copies of disk, two copies on tape, with one remote disk copy in Syracuse.&lt;br /&gt;
#* Two copies on disk support fixity checking.  &lt;br /&gt;
#* Tape copy supports offline and offsite backup.&lt;br /&gt;
#What do you think the key advantages of the system you use?&lt;br /&gt;
#* SAM automatically replicates data based on defined policies&lt;br /&gt;
#* SAM automatically brings data from SATA storage into higher performance fibre-channel storage&lt;br /&gt;
#* &lt;br /&gt;
#What do you think are the key problems or disadvantages your system present?&lt;br /&gt;
#*  &lt;br /&gt;
#What important principles informed your decision about the particular tool or service you chose to use? &lt;br /&gt;
#How frequently do you migrate from one system to another? Further, what is it that prompts you to make these migrations? &lt;br /&gt;
#* NA&lt;br /&gt;
# What characteristics of the storage system(s) you use do you feel are particularly well-suited to long-term digital preservation? (High levels of redundancy/resiliency, internal checksumming capabilities, automated tape refresh, etc)&lt;br /&gt;
# What functionality or processes have you developed to augment your storage systems in order to meet preservation goals? (Periodic checksum validation, limited human access or novel use of permissions schemes)&lt;br /&gt;
# Are there tough requirements for digital preservation, e.g. TRAC certification, that you wish were more readily handled by your storage system?&lt;/div&gt;</summary>
		<author><name>Rtc</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:Columbia_University&amp;diff=2607</id>
		<title>NDSA:Columbia University</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:Columbia_University&amp;diff=2607"/>
		<updated>2011-06-08T20:23:15Z</updated>

		<summary type="html">&lt;p&gt;Rtc: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;#What is the particular preservation goal or challenge you need to accomplish? (for example, re-use, public access, internal access, legal mandate, etc.)&lt;br /&gt;
#* Design &amp;amp; implement coherent &amp;amp; comprehensive preservation program for ensuring survival &amp;amp; continued accessibility of Libraries’ digital content. Develop &amp;amp; budget for long-term digital archiving strategy for content created by the Libraries, whether “born-digital” or converted from analog formats.&lt;br /&gt;
#* Provide stable, secure storage for large-scale access &amp;amp; long- term preservation&lt;br /&gt;
#* Support efficient creation &amp;amp; management of administrative, descriptive, structural, preservation &amp;amp; rights metadata&lt;br /&gt;
#* Support object relationships, actions, behaviors, fine-grained access control policies&lt;br /&gt;
#What large scale storage or cloud technologies are you using to meet that challenge? Further, why did you choose these particular technologies?&lt;br /&gt;
#* Fedora version 3&lt;br /&gt;
#* SUN SAM-FS platform, four copies, two on disk, two on tape&lt;br /&gt;
#* 70TB effective storage with 9.6TB tier I disk cache&lt;br /&gt;
#* Offsite disk storage at NYSERNet Data Center, Syracuse, New York, dedicated 1Gb/s network link to Columbia&lt;br /&gt;
#* Risk averse - use &amp;quot;tried and true&amp;quot; technologies&lt;br /&gt;
#* Open to maximize sustainability and flexibility&lt;br /&gt;
#* Entrance and exit strategy&lt;br /&gt;
#Specifically, what kind of materials are you preserving (text, data sets, images, moving images, web pages, etc.) &lt;br /&gt;
#* text, images, data sets, audio, limited video&lt;br /&gt;
#How big is your collection? (In terms of number of objects and storage space required)&lt;br /&gt;
#* TBD&lt;br /&gt;
#What are your performance requirements? Further, why are these your particular requirements?&lt;br /&gt;
#* System is an &amp;quot;accessible repository&amp;quot; with low latency access to data.  &lt;br /&gt;
#* Decision to build consolidated system based on current size of collection, desire to provide ready access to materials.&lt;br /&gt;
#What storage media have you elected to use? (Disk, Tape, etc) Further, why did you choose these particular media?&lt;br /&gt;
#* Two copies of disk, two copies on tape, with one remote disk copy in Syracuse.&lt;br /&gt;
#* Two copies on disk support fixity checking.  &lt;br /&gt;
#* Tape copy supports offline and offsite backup.&lt;br /&gt;
#What do you think the key advantages of the system you use?&lt;br /&gt;
#* SAM automatically replicates data based on defined policies&lt;br /&gt;
#* SAM automatically brings data from SATA storage into higher performance fibre-channel storage&lt;br /&gt;
#* &lt;br /&gt;
&lt;br /&gt;
#What do you think are the key problems or disadvantages your system present?&lt;br /&gt;
#*  &lt;br /&gt;
#What important principles informed your decision about the particular tool or service you chose to use? &lt;br /&gt;
#How frequently do you migrate from one system to another? Further, what is it that prompts you to make these migrations? &lt;br /&gt;
# What characteristics of the storage system(s) you use do you feel are particularly well-suited to long-term digital preservation? (High levels of redundancy/resiliency, internal checksumming capabilities, automated tape refresh, etc)&lt;br /&gt;
# What functionality or processes have you developed to augment your storage systems in order to meet preservation goals? (Periodic checksum validation, limited human access or novel use of permissions schemes)&lt;br /&gt;
# Are there tough requirements for digital preservation, e.g. TRAC certification, that you wish were more readily handled by your storage system?&lt;/div&gt;</summary>
		<author><name>Rtc</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:Columbia_University&amp;diff=2606</id>
		<title>NDSA:Columbia University</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:Columbia_University&amp;diff=2606"/>
		<updated>2011-06-08T20:17:29Z</updated>

		<summary type="html">&lt;p&gt;Rtc: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;#What is the particular preservation goal or challenge you need to accomplish? (for example, re-use, public access, internal access, legal mandate, etc.)&lt;br /&gt;
#* Design &amp;amp; implement coherent &amp;amp; comprehensive preservation program for ensuring survival &amp;amp; continued accessibility of Libraries’ digital content. Develop &amp;amp; budget for long-term digital archiving strategy for content created by the Libraries, whether “born-digital” or converted from analog formats.&lt;br /&gt;
#* Provide stable, secure storage for large-scale access &amp;amp; long- term preservation&lt;br /&gt;
#* Support efficient creation &amp;amp; management of administrative, descriptive, structural, preservation &amp;amp; rights metadata&lt;br /&gt;
#* Support object relationships, actions, behaviors, fine-grained access control policies&lt;br /&gt;
#What large scale storage or cloud technologies are you using to meet that challenge? Further, why did you choose these particular technologies?&lt;br /&gt;
#* SUN SAM-FS platform, four copies, two on disk, two on tape&lt;br /&gt;
#* 70TB effective storage with 9.6TB tier I disk cache&lt;br /&gt;
#* Offsite disk storage at NYSERNet Data Center, Syracuse, New York, dedicated 1Gb/s network link to Columbia&lt;br /&gt;
#* Risk averse - use &amp;quot;tried and true&amp;quot; technologies&lt;br /&gt;
#* Open to maximize sustainability and flexibility&lt;br /&gt;
#* Entrance and exit strategy&lt;br /&gt;
#Specifically, what kind of materials are you preserving (text, data sets, images, moving images, web pages, etc.) &lt;br /&gt;
#* text, images, data sets, audio, limited video&lt;br /&gt;
#How big is your collection? (In terms of number of objects and storage space required)&lt;br /&gt;
#*&lt;br /&gt;
&lt;br /&gt;
#What are your performance requirements? Further, why are these your particular requirements?&lt;br /&gt;
#What storage media have you elected to use? (Disk, Tape, etc) Further, why did you choose these particular media?&lt;br /&gt;
#What do you think the key advantages of the system you use?&lt;br /&gt;
#What do you think are the key problems or disadvantages your system present?&lt;br /&gt;
#What important principles informed your decision about the particular tool or service you chose to use? &lt;br /&gt;
#How frequently do you migrate from one system to another? Further, what is it that prompts you to make these migrations? &lt;br /&gt;
# What characteristics of the storage system(s) you use do you feel are particularly well-suited to long-term digital preservation? (High levels of redundancy/resiliency, internal checksumming capabilities, automated tape refresh, etc)&lt;br /&gt;
# What functionality or processes have you developed to augment your storage systems in order to meet preservation goals? (Periodic checksum validation, limited human access or novel use of permissions schemes)&lt;br /&gt;
# Are there tough requirements for digital preservation, e.g. TRAC certification, that you wish were more readily handled by your storage system?&lt;/div&gt;</summary>
		<author><name>Rtc</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:Columbia_University&amp;diff=2605</id>
		<title>NDSA:Columbia University</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:Columbia_University&amp;diff=2605"/>
		<updated>2011-06-08T18:56:39Z</updated>

		<summary type="html">&lt;p&gt;Rtc: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;#What is the particular preservation goal or challenge you need to accomplish? (for example, re-use, public access, internal access, legal mandate, etc.)&lt;br /&gt;
#* Design &amp;amp; implement coherent &amp;amp; comprehensive preservation program for ensuring survival &amp;amp; continued accessibility of Libraries’ digital content. Develop &amp;amp; budget for long-term digital archiving strategy for content created by the Libraries, whether “born-digital” or converted from analog formats.&lt;br /&gt;
#What large scale storage or cloud technologies are you using to meet that challenge? Further, why did you choose these particular technologies?&lt;br /&gt;
#Specifically, what kind of materials are you preserving (text, data sets, images, moving images, web pages, etc.) &lt;br /&gt;
#How big is your collection? (In terms of number of objects and storage space required)&lt;br /&gt;
#What are your performance requirements? Further, why are these your particular requirements?&lt;br /&gt;
#What storage media have you elected to use? (Disk, Tape, etc) Further, why did you choose these particular media?&lt;br /&gt;
#What do you think the key advantages of the system you use?&lt;br /&gt;
#What do you think are the key problems or disadvantages your system present?&lt;br /&gt;
#What important principles informed your decision about the particular tool or service you chose to use? &lt;br /&gt;
#How frequently do you migrate from one system to another? Further, what is it that prompts you to make these migrations? &lt;br /&gt;
# What characteristics of the storage system(s) you use do you feel are particularly well-suited to long-term digital preservation? (High levels of redundancy/resiliency, internal checksumming capabilities, automated tape refresh, etc)&lt;br /&gt;
# What functionality or processes have you developed to augment your storage systems in order to meet preservation goals? (Periodic checksum validation, limited human access or novel use of permissions schemes)&lt;br /&gt;
# Are there tough requirements for digital preservation, e.g. TRAC certification, that you wish were more readily handled by your storage system?&lt;/div&gt;</summary>
		<author><name>Rtc</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:Columbia_University&amp;diff=2604</id>
		<title>NDSA:Columbia University</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:Columbia_University&amp;diff=2604"/>
		<updated>2011-06-08T18:56:19Z</updated>

		<summary type="html">&lt;p&gt;Rtc: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;#What is the particular preservation goal or challenge you need to accomplish? (for example, re-use, public access, internal access, legal mandate, etc.)&lt;br /&gt;
&lt;br /&gt;
#* Design &amp;amp; implement coherent &amp;amp; comprehensive preservation program for ensuring survival &amp;amp; continued accessibility of Libraries’ digital content. Develop &amp;amp; budget for long-term digital archiving strategy for content created by the Libraries, whether “born-digital” or converted from analog formats.&lt;br /&gt;
&lt;br /&gt;
#What large scale storage or cloud technologies are you using to meet that challenge? Further, why did you choose these particular technologies?&lt;br /&gt;
#Specifically, what kind of materials are you preserving (text, data sets, images, moving images, web pages, etc.) &lt;br /&gt;
#How big is your collection? (In terms of number of objects and storage space required)&lt;br /&gt;
#What are your performance requirements? Further, why are these your particular requirements?&lt;br /&gt;
#What storage media have you elected to use? (Disk, Tape, etc) Further, why did you choose these particular media?&lt;br /&gt;
#What do you think the key advantages of the system you use?&lt;br /&gt;
#What do you think are the key problems or disadvantages your system present?&lt;br /&gt;
#What important principles informed your decision about the particular tool or service you chose to use? &lt;br /&gt;
#How frequently do you migrate from one system to another? Further, what is it that prompts you to make these migrations? &lt;br /&gt;
# What characteristics of the storage system(s) you use do you feel are particularly well-suited to long-term digital preservation? (High levels of redundancy/resiliency, internal checksumming capabilities, automated tape refresh, etc)&lt;br /&gt;
# What functionality or processes have you developed to augment your storage systems in order to meet preservation goals? (Periodic checksum validation, limited human access or novel use of permissions schemes)&lt;br /&gt;
# Are there tough requirements for digital preservation, e.g. TRAC certification, that you wish were more readily handled by your storage system?&lt;/div&gt;</summary>
		<author><name>Rtc</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:Columbia_University&amp;diff=2603</id>
		<title>NDSA:Columbia University</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:Columbia_University&amp;diff=2603"/>
		<updated>2011-06-08T18:54:28Z</updated>

		<summary type="html">&lt;p&gt;Rtc: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;#What is the particular preservation goal or challenge you need to accomplish? (for example, re-use, public access, internal access, legal mandate, etc.)&lt;br /&gt;
&lt;br /&gt;
Design &amp;amp; implement coherent &amp;amp; comprehensive preservation program for ensuring survival &amp;amp; continued accessibility of Libraries’ digital content. Develop &amp;amp; budget for long-term digital archiving strategy for content created by the Libraries, whether “born-digital” or converted from analog formats.&lt;br /&gt;
&lt;br /&gt;
#What large scale storage or cloud technologies are you using to meet that challenge? Further, why did you choose these particular technologies?&lt;br /&gt;
#Specifically, what kind of materials are you preserving (text, data sets, images, moving images, web pages, etc.) &lt;br /&gt;
#How big is your collection? (In terms of number of objects and storage space required)&lt;br /&gt;
#What are your performance requirements? Further, why are these your particular requirements?&lt;br /&gt;
#What storage media have you elected to use? (Disk, Tape, etc) Further, why did you choose these particular media?&lt;br /&gt;
#What do you think the key advantages of the system you use?&lt;br /&gt;
#What do you think are the key problems or disadvantages your system present?&lt;br /&gt;
#What important principles informed your decision about the particular tool or service you chose to use? &lt;br /&gt;
#How frequently do you migrate from one system to another? Further, what is it that prompts you to make these migrations? &lt;br /&gt;
# What characteristics of the storage system(s) you use do you feel are particularly well-suited to long-term digital preservation? (High levels of redundancy/resiliency, internal checksumming capabilities, automated tape refresh, etc)&lt;br /&gt;
# What functionality or processes have you developed to augment your storage systems in order to meet preservation goals? (Periodic checksum validation, limited human access or novel use of permissions schemes)&lt;br /&gt;
# Are there tough requirements for digital preservation, e.g. TRAC certification, that you wish were more readily handled by your storage system?&lt;/div&gt;</summary>
		<author><name>Rtc</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:Columbia_University&amp;diff=2602</id>
		<title>NDSA:Columbia University</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:Columbia_University&amp;diff=2602"/>
		<updated>2011-06-08T18:47:30Z</updated>

		<summary type="html">&lt;p&gt;Rtc: Created page with &amp;#039; #What is the particular preservation goal or challenge you need to accomplish? (for example, re-use, public access, internal access, legal mandate, etc.) #What large scale stora…&amp;#039;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
#What is the particular preservation goal or challenge you need to accomplish? (for example, re-use, public access, internal access, legal mandate, etc.)&lt;br /&gt;
#What large scale storage or cloud technologies are you using to meet that challenge? Further, why did you choose these particular technologies?&lt;br /&gt;
#Specifically, what kind of materials are you preserving (text, data sets, images, moving images, web pages, etc.) &lt;br /&gt;
#How big is your collection? (In terms of number of objects and storage space required)&lt;br /&gt;
#What are your performance requirements? Further, why are these your particular requirements?&lt;br /&gt;
#What storage media have you elected to use? (Disk, Tape, etc) Further, why did you choose these particular media?&lt;br /&gt;
#What do you think the key advantages of the system you use?&lt;br /&gt;
#What do you think are the key problems or disadvantages your system present?&lt;br /&gt;
#What important principles informed your decision about the particular tool or service you chose to use? &lt;br /&gt;
#How frequently do you migrate from one system to another? Further, what is it that prompts you to make these migrations? &lt;br /&gt;
# What characteristics of the storage system(s) you use do you feel are particularly well-suited to long-term digital preservation? (High levels of redundancy/resiliency, internal checksumming capabilities, automated tape refresh, etc)&lt;br /&gt;
# What functionality or processes have you developed to augment your storage systems in order to meet preservation goals? (Periodic checksum validation, limited human access or novel use of permissions schemes)&lt;br /&gt;
# Are there tough requirements for digital preservation, e.g. TRAC certification, that you wish were more readily handled by your storage system?&lt;/div&gt;</summary>
		<author><name>Rtc</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:Cloud_Presentations&amp;diff=2044</id>
		<title>NDSA:Cloud Presentations</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:Cloud_Presentations&amp;diff=2044"/>
		<updated>2011-06-08T18:47:00Z</updated>

		<summary type="html">&lt;p&gt;Rtc: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;In each case we would want to identify who would present, who will contact them. Then when they will present. &lt;br /&gt;
&lt;br /&gt;
From there we can include specific questions we would like them to respond to. &lt;br /&gt;
&lt;br /&gt;
==Presentation Schedule and Slides==&lt;br /&gt;
# Feb 1, Tues, 1:00 EST call with iRods Reagan Moore ([[NDSA:Media:NIAID.ppt|presentation]])&lt;br /&gt;
# Feb 14, Monday, 11:00 EST call with Duracloud ([[NDSA:Media:DuracloudNDSA.ppt|presentation]])&lt;br /&gt;
# Feb 17, Thurs, 11:00 EST call with MetaArchive/GDDP Katherine Skinner, Matt Schultz and Martin Halbert MetaArchive NDSA ([[NDSA:Media:MetaArchive NDSA Infrastructure.ppt|presentation]])&lt;br /&gt;
&lt;br /&gt;
==People/Projects to Contact==&lt;br /&gt;
*Chronopolis (Mike Smorul will contact)&lt;br /&gt;
*Open questions from the Educopia Guide to Distributed Digital Preservation &lt;br /&gt;
*Commercial providers? (Who specifically would we want here? Please add them.)&lt;br /&gt;
**Azure (Leslie to contact)&lt;br /&gt;
**Amazon (Who will contact?)&lt;br /&gt;
&lt;br /&gt;
==General Questions for Cloud Service Presenters==&lt;br /&gt;
Here we are working on a set of general questions for presenters to develop talks around. &lt;br /&gt;
&lt;br /&gt;
# What sort of use cases is your system designed to support? What doesn&#039;t this support?&lt;br /&gt;
# What preservation standards would your system support? &lt;br /&gt;
# What resources are required to support a solution implemented in your environment? &lt;br /&gt;
# What infrastructure do you rely on?&lt;br /&gt;
# How can your system impact digital preservation activities?&lt;br /&gt;
# If we put data in your system today what systems and processes are in place so that we can get it back 10 years from now? (Take for granted a sophisticated audience that knows about multiple copies etc.)&lt;br /&gt;
# What types of materials does your system handle? (documents, audio files, video file, stills, data sets, etc) And give examples of those types in practice&lt;br /&gt;
&lt;br /&gt;
===Responses to questions===&lt;br /&gt;
====[[NDSA:iRODS]] direct responses====&lt;br /&gt;
&lt;br /&gt;
Other general notes:&lt;br /&gt;
&lt;br /&gt;
* [Snavely] The need for each storage target to support a specific set of operations, and consistently with other storage targets, seems like a risk that comes along with the elegant abstraction that iRODS provides. Clear specifications help mitigate this risk.&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:DuraCloud]] direct responses====&lt;br /&gt;
Other general notes:&lt;br /&gt;
&lt;br /&gt;
* [Snavely] Treatment of cloud provider is generally as a black box, without a strong sense of actual reliability of underlying storage systems. Cloud providers tend to promise checksum validation of contents, but recourse if validation fails was unknown (right?). Additional checksum validation has been augmented on top of cloud storage service by Duracloud.&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:MetaArchive/GDDP]] direct responses====&lt;br /&gt;
Other general notes:&lt;br /&gt;
&lt;br /&gt;
* [Snavely] Built on LOCKSS, so data integrity assurances are provided by robust networked software model augmented to commodity hardware and storage. Federated nature provides integrity assurance but also a lack of central control in that the accidental loss of multiple caches is unlikely but e.g. scheduled maintenance or upgrades could coincidentally collide.&lt;br /&gt;
&lt;br /&gt;
====Chronopolis====&lt;br /&gt;
# ...&lt;br /&gt;
====MicroSoft Azure====&lt;br /&gt;
# ...&lt;br /&gt;
====Amazon S3/EC2====&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
==Questions for Member Institution Implementations of Large Scale Storage Architectures==&lt;br /&gt;
#What is the particular preservation goal or challenge you need to accomplish? (for example, re-use, public access, internal access, legal mandate, etc.)&lt;br /&gt;
#What large scale storage or cloud technologies are you using to meet that challenge? Further, why did you choose these particular technologies?&lt;br /&gt;
#Specifically, what kind of materials are you preserving (text, data sets, images, moving images, web pages, etc.) &lt;br /&gt;
#How big is your collection? (In terms of number of objects and storage space required)&lt;br /&gt;
#What are your performance requirements? Further, why are these your particular requirements?&lt;br /&gt;
#What storage media have you elected to use? (Disk, Tape, etc) Further, why did you choose these particular media?&lt;br /&gt;
#What do you think the key advantages of the system you use?&lt;br /&gt;
#What do you think are the key problems or disadvantages your system present?&lt;br /&gt;
#What important principles informed your decision about the particular tool or service you chose to use? &lt;br /&gt;
#How frequently do you migrate from one system to another? Further, what is it that prompts you to make these migrations? &lt;br /&gt;
# What characteristics of the storage system(s) you use do you feel are particularly well-suited to long-term digital preservation? (High levels of redundancy/resiliency, internal checksumming capabilities, automated tape refresh, etc)&lt;br /&gt;
# What functionality or processes have you developed to augment your storage systems in order to meet preservation goals? (Periodic checksum validation, limited human access or novel use of permissions schemes)&lt;br /&gt;
# Are there tough requirements for digital preservation, e.g. TRAC certification, that you wish were more readily handled by your storage system?&lt;br /&gt;
 &lt;br /&gt;
===Responses to questions===&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Florida Center for Library Automation]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Harvard Library]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:HathiTrust]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:National Library of Medicine Responses]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Penn State]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:WGBH Responses]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:NYU Response]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Library of Congress]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Columbia University]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Your Institution Here]]====&lt;br /&gt;
&lt;br /&gt;
==General Concerns==&lt;br /&gt;
# confidential data&lt;br /&gt;
# encrypted data&lt;br /&gt;
# auditing&lt;br /&gt;
# preservation risks&lt;br /&gt;
# legal compliance&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
==Solution Models and Environments==&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Name&lt;br /&gt;
!Offered as Service&lt;br /&gt;
!Deployed Locally&lt;br /&gt;
!Opensource&lt;br /&gt;
!Authentication Scheme&lt;br /&gt;
!Ingest Mechanism&lt;br /&gt;
!Export Mechanism&lt;br /&gt;
!Integrity/Validation Mechanism&lt;br /&gt;
!Replication Mechanism&lt;br /&gt;
!Administration Model (Federated, etc.)&lt;br /&gt;
!Tiering Support&lt;br /&gt;
|-&lt;br /&gt;
|iRODS&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|DuraCloud&lt;br /&gt;
|yes&lt;br /&gt;
|yes&lt;br /&gt;
|yes (Apache2)&lt;br /&gt;
|Basic Auth&lt;br /&gt;
|1:web-ui, 2:client-side utility, 3:REST-API&lt;br /&gt;
|1:web-ui, 2:client-side utility, 3:REST-API&lt;br /&gt;
|Checksum verified on ingest. On-demand checksum verification service.&lt;br /&gt;
|Built-in support for cross-cloud replication.&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|MetaArchive/GDDP&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Chronopolis&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Microsoft Azure&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Amazon S3/EC2&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Rtc</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:Cloud_Presentations&amp;diff=2043</id>
		<title>NDSA:Cloud Presentations</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:Cloud_Presentations&amp;diff=2043"/>
		<updated>2011-06-08T18:46:12Z</updated>

		<summary type="html">&lt;p&gt;Rtc: Undo revision 1176 by Rtc (Talk)&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;In each case we would want to identify who would present, who will contact them. Then when they will present. &lt;br /&gt;
&lt;br /&gt;
From there we can include specific questions we would like them to respond to. &lt;br /&gt;
&lt;br /&gt;
==Presentation Schedule and Slides==&lt;br /&gt;
# Feb 1, Tues, 1:00 EST call with iRods Reagan Moore ([[NDSA:Media:NIAID.ppt|presentation]])&lt;br /&gt;
# Feb 14, Monday, 11:00 EST call with Duracloud ([[NDSA:Media:DuracloudNDSA.ppt|presentation]])&lt;br /&gt;
# Feb 17, Thurs, 11:00 EST call with MetaArchive/GDDP Katherine Skinner, Matt Schultz and Martin Halbert MetaArchive NDSA ([[NDSA:Media:MetaArchive NDSA Infrastructure.ppt|presentation]])&lt;br /&gt;
&lt;br /&gt;
==People/Projects to Contact==&lt;br /&gt;
*Chronopolis (Mike Smorul will contact)&lt;br /&gt;
*Open questions from the Educopia Guide to Distributed Digital Preservation &lt;br /&gt;
*Commercial providers? (Who specifically would we want here? Please add them.)&lt;br /&gt;
**Azure (Leslie to contact)&lt;br /&gt;
**Amazon (Who will contact?)&lt;br /&gt;
&lt;br /&gt;
==General Questions for Cloud Service Presenters==&lt;br /&gt;
Here we are working on a set of general questions for presenters to develop talks around. &lt;br /&gt;
&lt;br /&gt;
# What sort of use cases is your system designed to support? What doesn&#039;t this support?&lt;br /&gt;
# What preservation standards would your system support? &lt;br /&gt;
# What resources are required to support a solution implemented in your environment? &lt;br /&gt;
# What infrastructure do you rely on?&lt;br /&gt;
# How can your system impact digital preservation activities?&lt;br /&gt;
# If we put data in your system today what systems and processes are in place so that we can get it back 10 years from now? (Take for granted a sophisticated audience that knows about multiple copies etc.)&lt;br /&gt;
# What types of materials does your system handle? (documents, audio files, video file, stills, data sets, etc) And give examples of those types in practice&lt;br /&gt;
&lt;br /&gt;
===Responses to questions===&lt;br /&gt;
====[[NDSA:iRODS]] direct responses====&lt;br /&gt;
&lt;br /&gt;
Other general notes:&lt;br /&gt;
&lt;br /&gt;
* [Snavely] The need for each storage target to support a specific set of operations, and consistently with other storage targets, seems like a risk that comes along with the elegant abstraction that iRODS provides. Clear specifications help mitigate this risk.&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:DuraCloud]] direct responses====&lt;br /&gt;
Other general notes:&lt;br /&gt;
&lt;br /&gt;
* [Snavely] Treatment of cloud provider is generally as a black box, without a strong sense of actual reliability of underlying storage systems. Cloud providers tend to promise checksum validation of contents, but recourse if validation fails was unknown (right?). Additional checksum validation has been augmented on top of cloud storage service by Duracloud.&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:MetaArchive/GDDP]] direct responses====&lt;br /&gt;
Other general notes:&lt;br /&gt;
&lt;br /&gt;
* [Snavely] Built on LOCKSS, so data integrity assurances are provided by robust networked software model augmented to commodity hardware and storage. Federated nature provides integrity assurance but also a lack of central control in that the accidental loss of multiple caches is unlikely but e.g. scheduled maintenance or upgrades could coincidentally collide.&lt;br /&gt;
&lt;br /&gt;
====Chronopolis====&lt;br /&gt;
# ...&lt;br /&gt;
====MicroSoft Azure====&lt;br /&gt;
# ...&lt;br /&gt;
====Amazon S3/EC2====&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
==Questions for Member Institution Implementations of Large Scale Storage Architectures==&lt;br /&gt;
#What is the particular preservation goal or challenge you need to accomplish? (for example, re-use, public access, internal access, legal mandate, etc.)&lt;br /&gt;
#What large scale storage or cloud technologies are you using to meet that challenge? Further, why did you choose these particular technologies?&lt;br /&gt;
#Specifically, what kind of materials are you preserving (text, data sets, images, moving images, web pages, etc.) &lt;br /&gt;
#How big is your collection? (In terms of number of objects and storage space required)&lt;br /&gt;
#What are your performance requirements? Further, why are these your particular requirements?&lt;br /&gt;
#What storage media have you elected to use? (Disk, Tape, etc) Further, why did you choose these particular media?&lt;br /&gt;
#What do you think the key advantages of the system you use?&lt;br /&gt;
#What do you think are the key problems or disadvantages your system present?&lt;br /&gt;
#What important principles informed your decision about the particular tool or service you chose to use? &lt;br /&gt;
#How frequently do you migrate from one system to another? Further, what is it that prompts you to make these migrations? &lt;br /&gt;
# What characteristics of the storage system(s) you use do you feel are particularly well-suited to long-term digital preservation? (High levels of redundancy/resiliency, internal checksumming capabilities, automated tape refresh, etc)&lt;br /&gt;
# What functionality or processes have you developed to augment your storage systems in order to meet preservation goals? (Periodic checksum validation, limited human access or novel use of permissions schemes)&lt;br /&gt;
# Are there tough requirements for digital preservation, e.g. TRAC certification, that you wish were more readily handled by your storage system?&lt;br /&gt;
 &lt;br /&gt;
===Responses to questions===&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Florida Center for Library Automation]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Harvard Library]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:HathiTrust]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:National Library of Medicine Responses]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Penn State]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:WGBH Responses]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:NYU Response]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Library of Congress]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Columbia University]]====&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==General Concerns==&lt;br /&gt;
# confidential data&lt;br /&gt;
# encrypted data&lt;br /&gt;
# auditing&lt;br /&gt;
# preservation risks&lt;br /&gt;
# legal compliance&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
==Solution Models and Environments==&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Name&lt;br /&gt;
!Offered as Service&lt;br /&gt;
!Deployed Locally&lt;br /&gt;
!Opensource&lt;br /&gt;
!Authentication Scheme&lt;br /&gt;
!Ingest Mechanism&lt;br /&gt;
!Export Mechanism&lt;br /&gt;
!Integrity/Validation Mechanism&lt;br /&gt;
!Replication Mechanism&lt;br /&gt;
!Administration Model (Federated, etc.)&lt;br /&gt;
!Tiering Support&lt;br /&gt;
|-&lt;br /&gt;
|iRODS&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|DuraCloud&lt;br /&gt;
|yes&lt;br /&gt;
|yes&lt;br /&gt;
|yes (Apache2)&lt;br /&gt;
|Basic Auth&lt;br /&gt;
|1:web-ui, 2:client-side utility, 3:REST-API&lt;br /&gt;
|1:web-ui, 2:client-side utility, 3:REST-API&lt;br /&gt;
|Checksum verified on ingest. On-demand checksum verification service.&lt;br /&gt;
|Built-in support for cross-cloud replication.&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|MetaArchive/GDDP&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Chronopolis&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Microsoft Azure&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Amazon S3/EC2&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Rtc</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:Cloud_Presentations&amp;diff=2042</id>
		<title>NDSA:Cloud Presentations</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:Cloud_Presentations&amp;diff=2042"/>
		<updated>2011-06-08T18:45:39Z</updated>

		<summary type="html">&lt;p&gt;Rtc: Undo revision 1175 by Rtc (Talk)&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;In each case we would want to identify who would present, who will contact them. Then when they will present. &lt;br /&gt;
&lt;br /&gt;
From there we can include specific questions we would like them to respond to. &lt;br /&gt;
&lt;br /&gt;
==Presentation Schedule and Slides==&lt;br /&gt;
# Feb 1, Tues, 1:00 EST call with iRods Reagan Moore ([[NDSA:Media:NIAID.ppt|presentation]])&lt;br /&gt;
# Feb 14, Monday, 11:00 EST call with Duracloud ([[NDSA:Media:DuracloudNDSA.ppt|presentation]])&lt;br /&gt;
# Feb 17, Thurs, 11:00 EST call with MetaArchive/GDDP Katherine Skinner, Matt Schultz and Martin Halbert MetaArchive NDSA ([[NDSA:Media:MetaArchive NDSA Infrastructure.ppt|presentation]])&lt;br /&gt;
&lt;br /&gt;
==People/Projects to Contact==&lt;br /&gt;
*Chronopolis (Mike Smorul will contact)&lt;br /&gt;
*Open questions from the Educopia Guide to Distributed Digital Preservation &lt;br /&gt;
*Commercial providers? (Who specifically would we want here? Please add them.)&lt;br /&gt;
**Azure (Leslie to contact)&lt;br /&gt;
**Amazon (Who will contact?)&lt;br /&gt;
&lt;br /&gt;
==General Questions for Cloud Service Presenters==&lt;br /&gt;
Here we are working on a set of general questions for presenters to develop talks around. &lt;br /&gt;
&lt;br /&gt;
# What sort of use cases is your system designed to support? What doesn&#039;t this support?&lt;br /&gt;
# What preservation standards would your system support? &lt;br /&gt;
# What resources are required to support a solution implemented in your environment? &lt;br /&gt;
# What infrastructure do you rely on?&lt;br /&gt;
# How can your system impact digital preservation activities?&lt;br /&gt;
# If we put data in your system today what systems and processes are in place so that we can get it back 10 years from now? (Take for granted a sophisticated audience that knows about multiple copies etc.)&lt;br /&gt;
# What types of materials does your system handle? (documents, audio files, video file, stills, data sets, etc) And give examples of those types in practice&lt;br /&gt;
&lt;br /&gt;
===Responses to questions===&lt;br /&gt;
====[[NDSA:iRODS]] direct responses====&lt;br /&gt;
&lt;br /&gt;
Other general notes:&lt;br /&gt;
&lt;br /&gt;
* [Snavely] The need for each storage target to support a specific set of operations, and consistently with other storage targets, seems like a risk that comes along with the elegant abstraction that iRODS provides. Clear specifications help mitigate this risk.&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:DuraCloud]] direct responses====&lt;br /&gt;
Other general notes:&lt;br /&gt;
&lt;br /&gt;
* [Snavely] Treatment of cloud provider is generally as a black box, without a strong sense of actual reliability of underlying storage systems. Cloud providers tend to promise checksum validation of contents, but recourse if validation fails was unknown (right?). Additional checksum validation has been augmented on top of cloud storage service by Duracloud.&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:MetaArchive/GDDP]] direct responses====&lt;br /&gt;
Other general notes:&lt;br /&gt;
&lt;br /&gt;
* [Snavely] Built on LOCKSS, so data integrity assurances are provided by robust networked software model augmented to commodity hardware and storage. Federated nature provides integrity assurance but also a lack of central control in that the accidental loss of multiple caches is unlikely but e.g. scheduled maintenance or upgrades could coincidentally collide.&lt;br /&gt;
&lt;br /&gt;
====Chronopolis====&lt;br /&gt;
# ...&lt;br /&gt;
====MicroSoft Azure====&lt;br /&gt;
# ...&lt;br /&gt;
====Amazon S3/EC2====&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
==Questions for Member Institution Implementations of Large Scale Storage Architectures==&lt;br /&gt;
#What is the particular preservation goal or challenge you need to accomplish? (for example, re-use, public access, internal access, legal mandate, etc.)&lt;br /&gt;
#What large scale storage or cloud technologies are you using to meet that challenge? Further, why did you choose these particular technologies?&lt;br /&gt;
#Specifically, what kind of materials are you preserving (text, data sets, images, moving images, web pages, etc.) &lt;br /&gt;
#How big is your collection? (In terms of number of objects and storage space required)&lt;br /&gt;
#What are your performance requirements? Further, why are these your particular requirements?&lt;br /&gt;
#What storage media have you elected to use? (Disk, Tape, etc) Further, why did you choose these particular media?&lt;br /&gt;
#What do you think the key advantages of the system you use?&lt;br /&gt;
#What do you think are the key problems or disadvantages your system present?&lt;br /&gt;
#What important principles informed your decision about the particular tool or service you chose to use? &lt;br /&gt;
#How frequently do you migrate from one system to another? Further, what is it that prompts you to make these migrations? &lt;br /&gt;
# What characteristics of the storage system(s) you use do you feel are particularly well-suited to long-term digital preservation? (High levels of redundancy/resiliency, internal checksumming capabilities, automated tape refresh, etc)&lt;br /&gt;
# What functionality or processes have you developed to augment your storage systems in order to meet preservation goals? (Periodic checksum validation, limited human access or novel use of permissions schemes)&lt;br /&gt;
# Are there tough requirements for digital preservation, e.g. TRAC certification, that you wish were more readily handled by your storage system?&lt;br /&gt;
 &lt;br /&gt;
===Responses to questions===&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Florida Center for Library Automation]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Harvard Library]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:HathiTrust]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:National Library of Medicine Responses]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Penn State]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:WGBH Responses]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:NYU Response]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Library of Congress]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Columbia University]]====&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Questions for Member Institution Implementations of Large Scale Storage Architectures==&lt;br /&gt;
#What is the particular preservation goal or challenge you need to accomplish? (for example, re-use, public access, internal access, legal mandate, etc.)&lt;br /&gt;
#What large scale storage or cloud technologies are you using to meet that challenge? Further, why did you choose these particular technologies?&lt;br /&gt;
#Specifically, what kind of materials are you preserving (text, data sets, images, moving images, web pages, etc.) &lt;br /&gt;
#How big is your collection? (In terms of number of objects and storage space required)&lt;br /&gt;
#What are your performance requirements? Further, why are these your particular requirements?&lt;br /&gt;
#What storage media have you elected to use? (Disk, Tape, etc) Further, why did you choose these particular media?&lt;br /&gt;
#What do you think the key advantages of the system you use?&lt;br /&gt;
#What do you think are the key problems or disadvantages your system present?&lt;br /&gt;
#What important principles informed your decision about the particular tool or service you chose to use? &lt;br /&gt;
#How frequently do you migrate from one system to another? Further, what is it that prompts you to make these migrations? &lt;br /&gt;
# What characteristics of the storage system(s) you use do you feel are particularly well-suited to long-term digital preservation? (High levels of redundancy/resiliency, internal checksumming capabilities, automated tape refresh, etc)&lt;br /&gt;
# What functionality or processes have you developed to augment your storage systems in order to meet preservation goals? (Periodic checksum validation, limited human access or novel use of permissions schemes)&lt;br /&gt;
# Are there tough requirements for digital preservation, e.g. TRAC certification, that you wish were more readily handled by your storage system?&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Your Institution Here]]====&lt;br /&gt;
&lt;br /&gt;
==General Concerns==&lt;br /&gt;
# confidential data&lt;br /&gt;
# encrypted data&lt;br /&gt;
# auditing&lt;br /&gt;
# preservation risks&lt;br /&gt;
# legal compliance&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
==Solution Models and Environments==&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Name&lt;br /&gt;
!Offered as Service&lt;br /&gt;
!Deployed Locally&lt;br /&gt;
!Opensource&lt;br /&gt;
!Authentication Scheme&lt;br /&gt;
!Ingest Mechanism&lt;br /&gt;
!Export Mechanism&lt;br /&gt;
!Integrity/Validation Mechanism&lt;br /&gt;
!Replication Mechanism&lt;br /&gt;
!Administration Model (Federated, etc.)&lt;br /&gt;
!Tiering Support&lt;br /&gt;
|-&lt;br /&gt;
|iRODS&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|DuraCloud&lt;br /&gt;
|yes&lt;br /&gt;
|yes&lt;br /&gt;
|yes (Apache2)&lt;br /&gt;
|Basic Auth&lt;br /&gt;
|1:web-ui, 2:client-side utility, 3:REST-API&lt;br /&gt;
|1:web-ui, 2:client-side utility, 3:REST-API&lt;br /&gt;
|Checksum verified on ingest. On-demand checksum verification service.&lt;br /&gt;
|Built-in support for cross-cloud replication.&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|MetaArchive/GDDP&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Chronopolis&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Microsoft Azure&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Amazon S3/EC2&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Rtc</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:Cloud_Presentations&amp;diff=2041</id>
		<title>NDSA:Cloud Presentations</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:Cloud_Presentations&amp;diff=2041"/>
		<updated>2011-06-08T18:44:35Z</updated>

		<summary type="html">&lt;p&gt;Rtc: /* Questions for Member Institution Implementations of Large Scale Storage Architectures */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;In each case we would want to identify who would present, who will contact them. Then when they will present. &lt;br /&gt;
&lt;br /&gt;
From there we can include specific questions we would like them to respond to. &lt;br /&gt;
&lt;br /&gt;
==Presentation Schedule and Slides==&lt;br /&gt;
# Feb 1, Tues, 1:00 EST call with iRods Reagan Moore ([[NDSA:Media:NIAID.ppt|presentation]])&lt;br /&gt;
# Feb 14, Monday, 11:00 EST call with Duracloud ([[NDSA:Media:DuracloudNDSA.ppt|presentation]])&lt;br /&gt;
# Feb 17, Thurs, 11:00 EST call with MetaArchive/GDDP Katherine Skinner, Matt Schultz and Martin Halbert MetaArchive NDSA ([[NDSA:Media:MetaArchive NDSA Infrastructure.ppt|presentation]])&lt;br /&gt;
&lt;br /&gt;
==People/Projects to Contact==&lt;br /&gt;
*Chronopolis (Mike Smorul will contact)&lt;br /&gt;
*Open questions from the Educopia Guide to Distributed Digital Preservation &lt;br /&gt;
*Commercial providers? (Who specifically would we want here? Please add them.)&lt;br /&gt;
**Azure (Leslie to contact)&lt;br /&gt;
**Amazon (Who will contact?)&lt;br /&gt;
&lt;br /&gt;
==General Questions for Cloud Service Presenters==&lt;br /&gt;
Here we are working on a set of general questions for presenters to develop talks around. &lt;br /&gt;
&lt;br /&gt;
# What sort of use cases is your system designed to support? What doesn&#039;t this support?&lt;br /&gt;
# What preservation standards would your system support? &lt;br /&gt;
# What resources are required to support a solution implemented in your environment? &lt;br /&gt;
# What infrastructure do you rely on?&lt;br /&gt;
# How can your system impact digital preservation activities?&lt;br /&gt;
# If we put data in your system today what systems and processes are in place so that we can get it back 10 years from now? (Take for granted a sophisticated audience that knows about multiple copies etc.)&lt;br /&gt;
# What types of materials does your system handle? (documents, audio files, video file, stills, data sets, etc) And give examples of those types in practice&lt;br /&gt;
&lt;br /&gt;
===Responses to questions===&lt;br /&gt;
====[[NDSA:iRODS]] direct responses====&lt;br /&gt;
&lt;br /&gt;
Other general notes:&lt;br /&gt;
&lt;br /&gt;
* [Snavely] The need for each storage target to support a specific set of operations, and consistently with other storage targets, seems like a risk that comes along with the elegant abstraction that iRODS provides. Clear specifications help mitigate this risk.&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:DuraCloud]] direct responses====&lt;br /&gt;
Other general notes:&lt;br /&gt;
&lt;br /&gt;
* [Snavely] Treatment of cloud provider is generally as a black box, without a strong sense of actual reliability of underlying storage systems. Cloud providers tend to promise checksum validation of contents, but recourse if validation fails was unknown (right?). Additional checksum validation has been augmented on top of cloud storage service by Duracloud.&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:MetaArchive/GDDP]] direct responses====&lt;br /&gt;
Other general notes:&lt;br /&gt;
&lt;br /&gt;
* [Snavely] Built on LOCKSS, so data integrity assurances are provided by robust networked software model augmented to commodity hardware and storage. Federated nature provides integrity assurance but also a lack of central control in that the accidental loss of multiple caches is unlikely but e.g. scheduled maintenance or upgrades could coincidentally collide.&lt;br /&gt;
&lt;br /&gt;
====Chronopolis====&lt;br /&gt;
# ...&lt;br /&gt;
====MicroSoft Azure====&lt;br /&gt;
# ...&lt;br /&gt;
====Amazon S3/EC2====&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
==Questions for Member Institution Implementations of Large Scale Storage Architectures==&lt;br /&gt;
#What is the particular preservation goal or challenge you need to accomplish? (for example, re-use, public access, internal access, legal mandate, etc.)&lt;br /&gt;
#What large scale storage or cloud technologies are you using to meet that challenge? Further, why did you choose these particular technologies?&lt;br /&gt;
#Specifically, what kind of materials are you preserving (text, data sets, images, moving images, web pages, etc.) &lt;br /&gt;
#How big is your collection? (In terms of number of objects and storage space required)&lt;br /&gt;
#What are your performance requirements? Further, why are these your particular requirements?&lt;br /&gt;
#What storage media have you elected to use? (Disk, Tape, etc) Further, why did you choose these particular media?&lt;br /&gt;
#What do you think the key advantages of the system you use?&lt;br /&gt;
#What do you think are the key problems or disadvantages your system present?&lt;br /&gt;
#What important principles informed your decision about the particular tool or service you chose to use? &lt;br /&gt;
#How frequently do you migrate from one system to another? Further, what is it that prompts you to make these migrations? &lt;br /&gt;
# What characteristics of the storage system(s) you use do you feel are particularly well-suited to long-term digital preservation? (High levels of redundancy/resiliency, internal checksumming capabilities, automated tape refresh, etc)&lt;br /&gt;
# What functionality or processes have you developed to augment your storage systems in order to meet preservation goals? (Periodic checksum validation, limited human access or novel use of permissions schemes)&lt;br /&gt;
# Are there tough requirements for digital preservation, e.g. TRAC certification, that you wish were more readily handled by your storage system?&lt;br /&gt;
 &lt;br /&gt;
===Responses to questions===&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Florida Center for Library Automation]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Harvard Library]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:HathiTrust]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:National Library of Medicine Responses]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Penn State]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:WGBH Responses]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:NYU Response]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Library of Congress]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Columbia University]]====&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==General Concerns==&lt;br /&gt;
# confidential data&lt;br /&gt;
# encrypted data&lt;br /&gt;
# auditing&lt;br /&gt;
# preservation risks&lt;br /&gt;
# legal compliance&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
==Solution Models and Environments==&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Name&lt;br /&gt;
!Offered as Service&lt;br /&gt;
!Deployed Locally&lt;br /&gt;
!Opensource&lt;br /&gt;
!Authentication Scheme&lt;br /&gt;
!Ingest Mechanism&lt;br /&gt;
!Export Mechanism&lt;br /&gt;
!Integrity/Validation Mechanism&lt;br /&gt;
!Replication Mechanism&lt;br /&gt;
!Administration Model (Federated, etc.)&lt;br /&gt;
!Tiering Support&lt;br /&gt;
|-&lt;br /&gt;
|iRODS&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|DuraCloud&lt;br /&gt;
|yes&lt;br /&gt;
|yes&lt;br /&gt;
|yes (Apache2)&lt;br /&gt;
|Basic Auth&lt;br /&gt;
|1:web-ui, 2:client-side utility, 3:REST-API&lt;br /&gt;
|1:web-ui, 2:client-side utility, 3:REST-API&lt;br /&gt;
|Checksum verified on ingest. On-demand checksum verification service.&lt;br /&gt;
|Built-in support for cross-cloud replication.&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|MetaArchive/GDDP&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Chronopolis&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Microsoft Azure&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Amazon S3/EC2&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Rtc</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:Cloud_Presentations&amp;diff=2040</id>
		<title>NDSA:Cloud Presentations</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:Cloud_Presentations&amp;diff=2040"/>
		<updated>2011-06-08T18:43:55Z</updated>

		<summary type="html">&lt;p&gt;Rtc: /* Columbia University */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;In each case we would want to identify who would present, who will contact them. Then when they will present. &lt;br /&gt;
&lt;br /&gt;
From there we can include specific questions we would like them to respond to. &lt;br /&gt;
&lt;br /&gt;
==Presentation Schedule and Slides==&lt;br /&gt;
# Feb 1, Tues, 1:00 EST call with iRods Reagan Moore ([[NDSA:Media:NIAID.ppt|presentation]])&lt;br /&gt;
# Feb 14, Monday, 11:00 EST call with Duracloud ([[NDSA:Media:DuracloudNDSA.ppt|presentation]])&lt;br /&gt;
# Feb 17, Thurs, 11:00 EST call with MetaArchive/GDDP Katherine Skinner, Matt Schultz and Martin Halbert MetaArchive NDSA ([[NDSA:Media:MetaArchive NDSA Infrastructure.ppt|presentation]])&lt;br /&gt;
&lt;br /&gt;
==People/Projects to Contact==&lt;br /&gt;
*Chronopolis (Mike Smorul will contact)&lt;br /&gt;
*Open questions from the Educopia Guide to Distributed Digital Preservation &lt;br /&gt;
*Commercial providers? (Who specifically would we want here? Please add them.)&lt;br /&gt;
**Azure (Leslie to contact)&lt;br /&gt;
**Amazon (Who will contact?)&lt;br /&gt;
&lt;br /&gt;
==General Questions for Cloud Service Presenters==&lt;br /&gt;
Here we are working on a set of general questions for presenters to develop talks around. &lt;br /&gt;
&lt;br /&gt;
# What sort of use cases is your system designed to support? What doesn&#039;t this support?&lt;br /&gt;
# What preservation standards would your system support? &lt;br /&gt;
# What resources are required to support a solution implemented in your environment? &lt;br /&gt;
# What infrastructure do you rely on?&lt;br /&gt;
# How can your system impact digital preservation activities?&lt;br /&gt;
# If we put data in your system today what systems and processes are in place so that we can get it back 10 years from now? (Take for granted a sophisticated audience that knows about multiple copies etc.)&lt;br /&gt;
# What types of materials does your system handle? (documents, audio files, video file, stills, data sets, etc) And give examples of those types in practice&lt;br /&gt;
&lt;br /&gt;
===Responses to questions===&lt;br /&gt;
====[[NDSA:iRODS]] direct responses====&lt;br /&gt;
&lt;br /&gt;
Other general notes:&lt;br /&gt;
&lt;br /&gt;
* [Snavely] The need for each storage target to support a specific set of operations, and consistently with other storage targets, seems like a risk that comes along with the elegant abstraction that iRODS provides. Clear specifications help mitigate this risk.&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:DuraCloud]] direct responses====&lt;br /&gt;
Other general notes:&lt;br /&gt;
&lt;br /&gt;
* [Snavely] Treatment of cloud provider is generally as a black box, without a strong sense of actual reliability of underlying storage systems. Cloud providers tend to promise checksum validation of contents, but recourse if validation fails was unknown (right?). Additional checksum validation has been augmented on top of cloud storage service by Duracloud.&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:MetaArchive/GDDP]] direct responses====&lt;br /&gt;
Other general notes:&lt;br /&gt;
&lt;br /&gt;
* [Snavely] Built on LOCKSS, so data integrity assurances are provided by robust networked software model augmented to commodity hardware and storage. Federated nature provides integrity assurance but also a lack of central control in that the accidental loss of multiple caches is unlikely but e.g. scheduled maintenance or upgrades could coincidentally collide.&lt;br /&gt;
&lt;br /&gt;
====Chronopolis====&lt;br /&gt;
# ...&lt;br /&gt;
====MicroSoft Azure====&lt;br /&gt;
# ...&lt;br /&gt;
====Amazon S3/EC2====&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
==Questions for Member Institution Implementations of Large Scale Storage Architectures==&lt;br /&gt;
#What is the particular preservation goal or challenge you need to accomplish? (for example, re-use, public access, internal access, legal mandate, etc.)&lt;br /&gt;
#What large scale storage or cloud technologies are you using to meet that challenge? Further, why did you choose these particular technologies?&lt;br /&gt;
#Specifically, what kind of materials are you preserving (text, data sets, images, moving images, web pages, etc.) &lt;br /&gt;
#How big is your collection? (In terms of number of objects and storage space required)&lt;br /&gt;
#What are your performance requirements? Further, why are these your particular requirements?&lt;br /&gt;
#What storage media have you elected to use? (Disk, Tape, etc) Further, why did you choose these particular media?&lt;br /&gt;
#What do you think the key advantages of the system you use?&lt;br /&gt;
#What do you think are the key problems or disadvantages your system present?&lt;br /&gt;
#What important principles informed your decision about the particular tool or service you chose to use? &lt;br /&gt;
#How frequently do you migrate from one system to another? Further, what is it that prompts you to make these migrations? &lt;br /&gt;
# What characteristics of the storage system(s) you use do you feel are particularly well-suited to long-term digital preservation? (High levels of redundancy/resiliency, internal checksumming capabilities, automated tape refresh, etc)&lt;br /&gt;
# What functionality or processes have you developed to augment your storage systems in order to meet preservation goals? (Periodic checksum validation, limited human access or novel use of permissions schemes)&lt;br /&gt;
# Are there tough requirements for digital preservation, e.g. TRAC certification, that you wish were more readily handled by your storage system?&lt;br /&gt;
 &lt;br /&gt;
===Responses to questions===&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Florida Center for Library Automation]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Harvard Library]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:HathiTrust]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:National Library of Medicine Responses]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Penn State]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:WGBH Responses]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:NYU Response]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Library of Congress]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Columbia University]]====&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Questions for Member Institution Implementations of Large Scale Storage Architectures==&lt;br /&gt;
#What is the particular preservation goal or challenge you need to accomplish? (for example, re-use, public access, internal access, legal mandate, etc.)&lt;br /&gt;
#What large scale storage or cloud technologies are you using to meet that challenge? Further, why did you choose these particular technologies?&lt;br /&gt;
#Specifically, what kind of materials are you preserving (text, data sets, images, moving images, web pages, etc.) &lt;br /&gt;
#How big is your collection? (In terms of number of objects and storage space required)&lt;br /&gt;
#What are your performance requirements? Further, why are these your particular requirements?&lt;br /&gt;
#What storage media have you elected to use? (Disk, Tape, etc) Further, why did you choose these particular media?&lt;br /&gt;
#What do you think the key advantages of the system you use?&lt;br /&gt;
#What do you think are the key problems or disadvantages your system present?&lt;br /&gt;
#What important principles informed your decision about the particular tool or service you chose to use? &lt;br /&gt;
#How frequently do you migrate from one system to another? Further, what is it that prompts you to make these migrations? &lt;br /&gt;
# What characteristics of the storage system(s) you use do you feel are particularly well-suited to long-term digital preservation? (High levels of redundancy/resiliency, internal checksumming capabilities, automated tape refresh, etc)&lt;br /&gt;
# What functionality or processes have you developed to augment your storage systems in order to meet preservation goals? (Periodic checksum validation, limited human access or novel use of permissions schemes)&lt;br /&gt;
# Are there tough requirements for digital preservation, e.g. TRAC certification, that you wish were more readily handled by your storage system?&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Your Institution Here]]====&lt;br /&gt;
&lt;br /&gt;
==General Concerns==&lt;br /&gt;
# confidential data&lt;br /&gt;
# encrypted data&lt;br /&gt;
# auditing&lt;br /&gt;
# preservation risks&lt;br /&gt;
# legal compliance&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
==Solution Models and Environments==&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Name&lt;br /&gt;
!Offered as Service&lt;br /&gt;
!Deployed Locally&lt;br /&gt;
!Opensource&lt;br /&gt;
!Authentication Scheme&lt;br /&gt;
!Ingest Mechanism&lt;br /&gt;
!Export Mechanism&lt;br /&gt;
!Integrity/Validation Mechanism&lt;br /&gt;
!Replication Mechanism&lt;br /&gt;
!Administration Model (Federated, etc.)&lt;br /&gt;
!Tiering Support&lt;br /&gt;
|-&lt;br /&gt;
|iRODS&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|DuraCloud&lt;br /&gt;
|yes&lt;br /&gt;
|yes&lt;br /&gt;
|yes (Apache2)&lt;br /&gt;
|Basic Auth&lt;br /&gt;
|1:web-ui, 2:client-side utility, 3:REST-API&lt;br /&gt;
|1:web-ui, 2:client-side utility, 3:REST-API&lt;br /&gt;
|Checksum verified on ingest. On-demand checksum verification service.&lt;br /&gt;
|Built-in support for cross-cloud replication.&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|MetaArchive/GDDP&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Chronopolis&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Microsoft Azure&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Amazon S3/EC2&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Rtc</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:Cloud_Presentations&amp;diff=2039</id>
		<title>NDSA:Cloud Presentations</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:Cloud_Presentations&amp;diff=2039"/>
		<updated>2011-06-08T18:43:34Z</updated>

		<summary type="html">&lt;p&gt;Rtc: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;In each case we would want to identify who would present, who will contact them. Then when they will present. &lt;br /&gt;
&lt;br /&gt;
From there we can include specific questions we would like them to respond to. &lt;br /&gt;
&lt;br /&gt;
==Presentation Schedule and Slides==&lt;br /&gt;
# Feb 1, Tues, 1:00 EST call with iRods Reagan Moore ([[NDSA:Media:NIAID.ppt|presentation]])&lt;br /&gt;
# Feb 14, Monday, 11:00 EST call with Duracloud ([[NDSA:Media:DuracloudNDSA.ppt|presentation]])&lt;br /&gt;
# Feb 17, Thurs, 11:00 EST call with MetaArchive/GDDP Katherine Skinner, Matt Schultz and Martin Halbert MetaArchive NDSA ([[NDSA:Media:MetaArchive NDSA Infrastructure.ppt|presentation]])&lt;br /&gt;
&lt;br /&gt;
==People/Projects to Contact==&lt;br /&gt;
*Chronopolis (Mike Smorul will contact)&lt;br /&gt;
*Open questions from the Educopia Guide to Distributed Digital Preservation &lt;br /&gt;
*Commercial providers? (Who specifically would we want here? Please add them.)&lt;br /&gt;
**Azure (Leslie to contact)&lt;br /&gt;
**Amazon (Who will contact?)&lt;br /&gt;
&lt;br /&gt;
==General Questions for Cloud Service Presenters==&lt;br /&gt;
Here we are working on a set of general questions for presenters to develop talks around. &lt;br /&gt;
&lt;br /&gt;
# What sort of use cases is your system designed to support? What doesn&#039;t this support?&lt;br /&gt;
# What preservation standards would your system support? &lt;br /&gt;
# What resources are required to support a solution implemented in your environment? &lt;br /&gt;
# What infrastructure do you rely on?&lt;br /&gt;
# How can your system impact digital preservation activities?&lt;br /&gt;
# If we put data in your system today what systems and processes are in place so that we can get it back 10 years from now? (Take for granted a sophisticated audience that knows about multiple copies etc.)&lt;br /&gt;
# What types of materials does your system handle? (documents, audio files, video file, stills, data sets, etc) And give examples of those types in practice&lt;br /&gt;
&lt;br /&gt;
===Responses to questions===&lt;br /&gt;
====[[NDSA:iRODS]] direct responses====&lt;br /&gt;
&lt;br /&gt;
Other general notes:&lt;br /&gt;
&lt;br /&gt;
* [Snavely] The need for each storage target to support a specific set of operations, and consistently with other storage targets, seems like a risk that comes along with the elegant abstraction that iRODS provides. Clear specifications help mitigate this risk.&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:DuraCloud]] direct responses====&lt;br /&gt;
Other general notes:&lt;br /&gt;
&lt;br /&gt;
* [Snavely] Treatment of cloud provider is generally as a black box, without a strong sense of actual reliability of underlying storage systems. Cloud providers tend to promise checksum validation of contents, but recourse if validation fails was unknown (right?). Additional checksum validation has been augmented on top of cloud storage service by Duracloud.&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:MetaArchive/GDDP]] direct responses====&lt;br /&gt;
Other general notes:&lt;br /&gt;
&lt;br /&gt;
* [Snavely] Built on LOCKSS, so data integrity assurances are provided by robust networked software model augmented to commodity hardware and storage. Federated nature provides integrity assurance but also a lack of central control in that the accidental loss of multiple caches is unlikely but e.g. scheduled maintenance or upgrades could coincidentally collide.&lt;br /&gt;
&lt;br /&gt;
====Chronopolis====&lt;br /&gt;
# ...&lt;br /&gt;
====MicroSoft Azure====&lt;br /&gt;
# ...&lt;br /&gt;
====Amazon S3/EC2====&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
==Questions for Member Institution Implementations of Large Scale Storage Architectures==&lt;br /&gt;
#What is the particular preservation goal or challenge you need to accomplish? (for example, re-use, public access, internal access, legal mandate, etc.)&lt;br /&gt;
#What large scale storage or cloud technologies are you using to meet that challenge? Further, why did you choose these particular technologies?&lt;br /&gt;
#Specifically, what kind of materials are you preserving (text, data sets, images, moving images, web pages, etc.) &lt;br /&gt;
#How big is your collection? (In terms of number of objects and storage space required)&lt;br /&gt;
#What are your performance requirements? Further, why are these your particular requirements?&lt;br /&gt;
#What storage media have you elected to use? (Disk, Tape, etc) Further, why did you choose these particular media?&lt;br /&gt;
#What do you think the key advantages of the system you use?&lt;br /&gt;
#What do you think are the key problems or disadvantages your system present?&lt;br /&gt;
#What important principles informed your decision about the particular tool or service you chose to use? &lt;br /&gt;
#How frequently do you migrate from one system to another? Further, what is it that prompts you to make these migrations? &lt;br /&gt;
# What characteristics of the storage system(s) you use do you feel are particularly well-suited to long-term digital preservation? (High levels of redundancy/resiliency, internal checksumming capabilities, automated tape refresh, etc)&lt;br /&gt;
# What functionality or processes have you developed to augment your storage systems in order to meet preservation goals? (Periodic checksum validation, limited human access or novel use of permissions schemes)&lt;br /&gt;
# Are there tough requirements for digital preservation, e.g. TRAC certification, that you wish were more readily handled by your storage system?&lt;br /&gt;
 &lt;br /&gt;
===Responses to questions===&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Florida Center for Library Automation]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Harvard Library]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:HathiTrust]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:National Library of Medicine Responses]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Penn State]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:WGBH Responses]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:NYU Response]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Library of Congress]]====&lt;br /&gt;
&lt;br /&gt;
====[[NDSA:Columbia University]]====&lt;br /&gt;
====[[NDSA:Your Institution Here]]====&lt;br /&gt;
&lt;br /&gt;
==General Concerns==&lt;br /&gt;
# confidential data&lt;br /&gt;
# encrypted data&lt;br /&gt;
# auditing&lt;br /&gt;
# preservation risks&lt;br /&gt;
# legal compliance&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
==Solution Models and Environments==&lt;br /&gt;
{| border=&amp;quot;1&amp;quot;&lt;br /&gt;
!Name&lt;br /&gt;
!Offered as Service&lt;br /&gt;
!Deployed Locally&lt;br /&gt;
!Opensource&lt;br /&gt;
!Authentication Scheme&lt;br /&gt;
!Ingest Mechanism&lt;br /&gt;
!Export Mechanism&lt;br /&gt;
!Integrity/Validation Mechanism&lt;br /&gt;
!Replication Mechanism&lt;br /&gt;
!Administration Model (Federated, etc.)&lt;br /&gt;
!Tiering Support&lt;br /&gt;
|-&lt;br /&gt;
|iRODS&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|DuraCloud&lt;br /&gt;
|yes&lt;br /&gt;
|yes&lt;br /&gt;
|yes (Apache2)&lt;br /&gt;
|Basic Auth&lt;br /&gt;
|1:web-ui, 2:client-side utility, 3:REST-API&lt;br /&gt;
|1:web-ui, 2:client-side utility, 3:REST-API&lt;br /&gt;
|Checksum verified on ingest. On-demand checksum verification service.&lt;br /&gt;
|Built-in support for cross-cloud replication.&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|MetaArchive/GDDP&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Chronopolis&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Microsoft Azure&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Amazon S3/EC2&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Rtc</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:Infrastructure_Working_Group_Members&amp;diff=1492</id>
		<title>NDSA:Infrastructure Working Group Members</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:Infrastructure_Working_Group_Members&amp;diff=1492"/>
		<updated>2011-06-08T18:40:57Z</updated>

		<summary type="html">&lt;p&gt;Rtc: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;*Micah Altman&lt;br /&gt;
*Elizabeth Perkes&lt;br /&gt;
*Bryan Beecher&lt;br /&gt;
*Priscilla Caplan&lt;br /&gt;
*Karen Cariani&lt;br /&gt;
*Kris Carpenter&lt;br /&gt;
*Robert Cartolano&lt;br /&gt;
*Patricia Cruse&lt;br /&gt;
*Daphane DeLeon&lt;br /&gt;
*Blane Dessy&lt;br /&gt;
*Daniel Dodge&lt;br /&gt;
*Erin Engle&lt;br /&gt;
*Dean Farrell&lt;br /&gt;
*Eileen Fenton&lt;br /&gt;
*Michelle Gallinger&lt;br /&gt;
*Michael J. Giarlo&lt;br /&gt;
*Andrea Goethals&lt;br /&gt;
*Abbie Grotke&lt;br /&gt;
*Matt Guzzi&lt;br /&gt;
*Martin Halbert&lt;br /&gt;
*Christine Marie Hopper&lt;br /&gt;
*Bob Horton&lt;br /&gt;
*Howard, Barrie&lt;br /&gt;
*Martin Jacobson&lt;br /&gt;
*Joseph JaJa&lt;br /&gt;
*Leslie Johnston&lt;br /&gt;
*Jimi Jones&lt;br /&gt;
*Butch Lazorchak&lt;br /&gt;
*Cal Lee&lt;br /&gt;
*Jane Mandelbaum&lt;br /&gt;
*Jonathan Marmor&lt;br /&gt;
*David Minor&lt;br /&gt;
*Eugene Mopsik&lt;br /&gt;
*Michael Nelson&lt;br /&gt;
*Trevor Owens&lt;br /&gt;
*Joseph Pawletko&lt;br /&gt;
*Abbey Potter&lt;br /&gt;
*Curtis Pulford&lt;br /&gt;
*Patricia Smith-Mansfield&lt;br /&gt;
*Mike Smorul&lt;br /&gt;
*Cory Snavely&lt;br /&gt;
*Herbert Van de Sompel&lt;br /&gt;
*John Spencer&lt;br /&gt;
*Taylor Surface&lt;br /&gt;
*John Unsworth&lt;br /&gt;
*William Ying&lt;br /&gt;
*Andrew Woods&lt;br /&gt;
*Gene Hurr&lt;/div&gt;</summary>
		<author><name>Rtc</name></author>
	</entry>
</feed>