<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.diglib.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Chris+Dietrich</id>
	<title>DLF Wiki - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.diglib.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Chris+Dietrich"/>
	<link rel="alternate" type="text/html" href="https://wiki.diglib.org/Special:Contributions/Chris_Dietrich"/>
	<updated>2026-05-10T20:13:32Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.44.0</generator>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:March_29,_2013_Call&amp;diff=5417</id>
		<title>NDSA:March 29, 2013 Call</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:March_29,_2013_Call&amp;diff=5417"/>
		<updated>2013-03-29T22:25:39Z</updated>

		<summary type="html">&lt;p&gt;Chris Dietrich: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Back to [[NDSA:Standards_and_Best_Practices_Working_Group | Standards Working Group Main Page]]&lt;br /&gt;
&lt;br /&gt;
Back to [[NDSA:PDF_Exploration | PDF Exploration Page]]&lt;br /&gt;
&lt;br /&gt;
=Agenda=&lt;br /&gt;
Discussion with Stephen Levenson, an IT Specialist for Policy and Planning at the Office of the US Courts and the chair of the PDF/A working group.&lt;br /&gt;
&lt;br /&gt;
=Participants=&lt;br /&gt;
&lt;br /&gt;
Don Chalfant, Kate Murray, Sheila Morrissey, Kevin DeVorsey, Chris Dietrich, Carl Fleischhauer, Stephen Levenson, Butch Lazorchak&lt;br /&gt;
&lt;br /&gt;
=Meeting Notes=&lt;br /&gt;
*A rough transcription of our conversation*&lt;br /&gt;
&lt;br /&gt;
Stephen Levenson discussing how the PDF/A family of standards is being implemented by the ISO community. &lt;br /&gt;
&lt;br /&gt;
Steve: PDF/A-3 doesn&#039;t necessarily replace A-1 or A-2. Should be able to use a PDF/A-1 file 30 years from now. The methodology used should not change for rendering these files in the future. &lt;br /&gt;
&lt;br /&gt;
PDF/A movement highly influenced by manufacturers, now PDF/A center, dominated by the Germans. Had many use cases for instances where creators wanted to include the original files within a PDF/A document. &lt;br /&gt;
&lt;br /&gt;
Brazilian government wanted to preserve their material as XML but XML wasn&#039;t trusted by users because of complexity. Wanted to make a more presentation-ready format but didn&#039;t want to throw away the XML.&lt;br /&gt;
&lt;br /&gt;
U.S. Courts, bankruptcy court, claims, when an individual goes into court, and the claims are laid out, in order for someone to assert the claim, we print a document for them. That&#039;s what they bring back to court to assert their claim. Ginnie Mae started this and Mastercard is also moving on this. We get the PDF but then have to reenter the data from this document in their case management systems. &lt;br /&gt;
&lt;br /&gt;
We&#039;re putting an XML output of what the claim represents inside the PDF document. Ginnie Mae&#039;s automated processes work on this XML.&lt;br /&gt;
&lt;br /&gt;
With PDF/A-3 we have downstream functions that leverage the inner materials. Adobe&#039;s server product does not currently output A-3 files. &lt;br /&gt;
&lt;br /&gt;
Chris: How will hidden content be protected from certain readers of the document?&lt;br /&gt;
&lt;br /&gt;
Steve: For A-3 you can still include the information as &amp;quot;private data&amp;quot; that would make it hidden. Conforming reader would recognize it as A-3 and set up an additional dialog.&lt;br /&gt;
&lt;br /&gt;
Caroline: we&#039;re in the position of having to preserve files that somebody else created. We need tools to characterize files. Some PDF/A-3 files may not have any embedded content so they&#039;d actually behave like a PDF/A-2.&lt;br /&gt;
&lt;br /&gt;
Steve: We&#039;d have to talk to the developers about that. There is a vendor that is looking to create an independent service to validate the writers to ensure that they are actually complying with the standards. The software would have to get certified that it works. Then we&#039;d actually have validators at ingestion. We have to get somebody interested in creating this software as a business and we think we finally have somebody who will do this.&lt;br /&gt;
&lt;br /&gt;
DOD standard 5015.2 that says if you&#039;re going to be a document management system you have to do certain things. Has to be sent to Fort Huachuca in AZ to a testing center to ensure that it conforms to DOD 5015.2. And this validator would do the same kind of thing.&lt;br /&gt;
&lt;br /&gt;
Right now we have no validator that says a Word document is actually a Word document. There are a lot of bad writers out there. &lt;br /&gt;
&lt;br /&gt;
Kevin: we&#039;re working on policy and guidance side. We ask people to keep temporary, permanent and non-record material separate from each other. Does PDF/A-3 run counter to that? Might encourage people to mix record and non-record material together in the same file?&lt;br /&gt;
&lt;br /&gt;
Steve: If there&#039;s a relationship, don&#039;t you need that for provenance information?&lt;br /&gt;
&lt;br /&gt;
Kevin: we need to educate our folks.&lt;br /&gt;
&lt;br /&gt;
Steve: We&#039;ve been dealing with current technologies on these things and who knows what technology will afford us in the future. We&#039;ll be able to hedge our bets.&lt;br /&gt;
&lt;br /&gt;
Sheila: Question to me is what is the relationship between the PDF/A-3 container and the embedded XML, that is, in the Brazilian example, which one has the force of law? And how do you ensure that they say the same thing? In Germany they&#039;re planning on using this for commerce and the validation of invoices, but processing things en masse pragmatically means that you&#039;re going to look at one or the other. What warrant is there that they&#039;re going to stay the same way? The Germans said that the &amp;quot;embedded content&amp;quot; has no standing. Only the archival version has standing.&lt;br /&gt;
&lt;br /&gt;
Steve: I can talk about legal. We would assume that the entire document was the legal evidence. In the case of dual content, according to the &amp;quot;best evidence&amp;quot; rule. If the company put a courtesy copy inside and that it&#039;s the main document that is their record. For example, if you requested an invoice and you received what you might see in a PDF versus XML data, then that&#039;s the evidence. &lt;br /&gt;
&lt;br /&gt;
Folks coming in to a reading room. We may have to set up rules at ingestion, they could either strip it out and put the file on a diet and store the other content. We, in the committee, didn&#039;t want to dictate to preservationists how to do their job.&lt;br /&gt;
&lt;br /&gt;
If NARA said they didn&#039;t want a part of A-3 then the agencies shouldn&#039;t store in A-3. In our PACER system you can pull down a PDF document but stored inside is an MP3 file that allows you to understand the provenance a little more. The PDF is the metadata around the MP3 file. These are temporary records, so it&#039;s not the same issues.&lt;br /&gt;
&lt;br /&gt;
Chris: So the PDF is acting as a manifest for the MP3 file. No validators, if someone is processing a bunch of files into PDF and embedding an XML version. Something could go wrong and you embed the wrong versions of the XML. There&#039;s no way to validate the right coordinated content. &lt;br /&gt;
&lt;br /&gt;
Steve: archivists are going to have to get more involved in advising creators on the types of files they create.&lt;br /&gt;
&lt;br /&gt;
PDF/E and A are essentially the same, they just couldn&#039;t find an open codec for rendering 3D documents. &lt;br /&gt;
&lt;br /&gt;
PDF/A is pretty much asleep now, making sure we keep up with the latest version of the PDF specification. &lt;br /&gt;
&lt;br /&gt;
Caroline:&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Chris Dietrich&#039;s Notes:&lt;br /&gt;
PDF/A-3 Use Cases 20130329&lt;br /&gt;
&lt;br /&gt;
NDSA PDF/A-3 work group discuss (positive) use cases with Stephen Levenson of the U.S. Courts&lt;br /&gt;
&lt;br /&gt;
PDF/A has 3 versions: No versions (A-1, A-2, or A-3) will be replaced by subsequent versions, reducing the need for migration&lt;br /&gt;
&lt;br /&gt;
• PDF/A-1 = ???? [didn’t capture what makes a PDF/A-1 different from a PDF]&lt;br /&gt;
&lt;br /&gt;
• PDF/A-2 = PDF/A-1 + ability to embed a second PDF/A-2 copy of the same document(?) (recursive?)&lt;br /&gt;
&lt;br /&gt;
• PDF/A-3 = PDF/A-2 + any file embedded&lt;br /&gt;
&lt;br /&gt;
• Genesis for A-3 was to include other types like XML along with the PDF (not just PDF/A-2)&lt;br /&gt;
&lt;br /&gt;
• Private Data section in A-2:&lt;br /&gt;
&lt;br /&gt;
• Use Case: Brazil uses to embed other data like XML version of the file (legal docs)&lt;br /&gt;
&lt;br /&gt;
• Is/can be protected from view&lt;br /&gt;
&lt;br /&gt;
• Was the genesis for creating A-3 to store other data not protected from view (?)&lt;br /&gt;
&lt;br /&gt;
• Allows for human- and machine-readable versions of a document in the same container&lt;br /&gt;
&lt;br /&gt;
• Viewing software (e.g. Adobe Reader et al.) will not render additional content but will notify consumer that additional content exists &lt;br /&gt;
&lt;br /&gt;
All three versions are PDFs, no need to distinguish between the three versions except by the presentation interface which should be designed to detect and display embedded additional content&lt;br /&gt;
&lt;br /&gt;
Currently no validators to ensure that any particular file (PDF, .doc, etc.) is actually what the extension indicates or is high-quality (e.g. some PDF generators are better than others)&lt;br /&gt;
&lt;br /&gt;
• There is a vendor looking to create an ISO-compliant PDF validator which could be used by repositories to validate incoming PDF files (still years away)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Ability to embed XML allows both human- and machine-readable versions of a document in the same container&lt;br /&gt;
&lt;br /&gt;
• Useful for automated ingest and presentation by repositories&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Question arises which of the two versions (PDF or XML) is the authoritative version&lt;br /&gt;
&lt;br /&gt;
• Use-case: PACER system for courtroom audio stored as MP3 and packaged in PDF/A-3&lt;br /&gt;
&lt;br /&gt;
• PDF/A-3 acts as the metadata/manifest for the embedded recording&lt;br /&gt;
&lt;br /&gt;
• Recordings are access copies, not high-quality originals for long-term preservation/archiving&lt;br /&gt;
&lt;br /&gt;
• These are temp records, disposed of after 5 years&lt;/div&gt;</summary>
		<author><name>Chris Dietrich</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:March_29,_2013_Call&amp;diff=5416</id>
		<title>NDSA:March 29, 2013 Call</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:March_29,_2013_Call&amp;diff=5416"/>
		<updated>2013-03-29T22:25:18Z</updated>

		<summary type="html">&lt;p&gt;Chris Dietrich: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Back to [[NDSA:Standards_and_Best_Practices_Working_Group | Standards Working Group Main Page]]&lt;br /&gt;
&lt;br /&gt;
Back to [[NDSA:PDF_Exploration | PDF Exploration Page]]&lt;br /&gt;
&lt;br /&gt;
=Agenda=&lt;br /&gt;
Discussion with Stephen Levenson, an IT Specialist for Policy and Planning at the Office of the US Courts and the chair of the PDF/A working group.&lt;br /&gt;
&lt;br /&gt;
=Participants=&lt;br /&gt;
&lt;br /&gt;
Don Chalfant, Kate Murray, Sheila Morrissey, Kevin DeVorsey, Chris Dietrich, Carl Fleischhauer, Stephen Levenson, Butch Lazorchak&lt;br /&gt;
&lt;br /&gt;
=Meeting Notes=&lt;br /&gt;
*A rough transcription of our conversation*&lt;br /&gt;
&lt;br /&gt;
Stephen Levenson discussing how the PDF/A family of standards is being implemented by the ISO community. &lt;br /&gt;
&lt;br /&gt;
Steve: PDF/A-3 doesn&#039;t necessarily replace A-1 or A-2. Should be able to use a PDF/A-1 file 30 years from now. The methodology used should not change for rendering these files in the future. &lt;br /&gt;
&lt;br /&gt;
PDF/A movement highly influenced by manufacturers, now PDF/A center, dominated by the Germans. Had many use cases for instances where creators wanted to include the original files within a PDF/A document. &lt;br /&gt;
&lt;br /&gt;
Brazilian government wanted to preserve their material as XML but XML wasn&#039;t trusted by users because of complexity. Wanted to make a more presentation-ready format but didn&#039;t want to throw away the XML.&lt;br /&gt;
&lt;br /&gt;
U.S. Courts, bankruptcy court, claims, when an individual goes into court, and the claims are laid out, in order for someone to assert the claim, we print a document for them. That&#039;s what they bring back to court to assert their claim. Ginnie Mae started this and Mastercard is also moving on this. We get the PDF but then have to reenter the data from this document in their case management systems. &lt;br /&gt;
&lt;br /&gt;
We&#039;re putting an XML output of what the claim represents inside the PDF document. Ginnie Mae&#039;s automated processes work on this XML.&lt;br /&gt;
&lt;br /&gt;
With PDF/A-3 we have downstream functions that leverage the inner materials. Adobe&#039;s server product does not currently output A-3 files. &lt;br /&gt;
&lt;br /&gt;
Chris: How will hidden content be protected from certain readers of the document?&lt;br /&gt;
&lt;br /&gt;
Steve: For A-3 you can still include the information as &amp;quot;private data&amp;quot; that would make it hidden. Conforming reader would recognize it as A-3 and set up an additional dialog.&lt;br /&gt;
&lt;br /&gt;
Caroline: we&#039;re in the position of having to preserve files that somebody else created. We need tools to characterize files. Some PDF/A-3 files may not have any embedded content so they&#039;d actually behave like a PDF/A-2.&lt;br /&gt;
&lt;br /&gt;
Steve: We&#039;d have to talk to the developers about that. There is a vendor that is looking to create an independent service to validate the writers to ensure that they are actually complying with the standards. The software would have to get certified that it works. Then we&#039;d actually have validators at ingestion. We have to get somebody interested in creating this software as a business and we think we finally have somebody who will do this.&lt;br /&gt;
&lt;br /&gt;
DOD standard 5015.2 that says if you&#039;re going to be a document management system you have to do certain things. Has to be sent to Fort Huachuca in AZ to a testing center to ensure that it conforms to DOD 5015.2. And this validator would do the same kind of thing.&lt;br /&gt;
&lt;br /&gt;
Right now we have no validator that says a Word document is actually a Word document. There are a lot of bad writers out there. &lt;br /&gt;
&lt;br /&gt;
Kevin: we&#039;re working on policy and guidance side. We ask people to keep temporary, permanent and non-record material separate from each other. Does PDF/A-3 run counter to that? Might encourage people to mix record and non-record material together in the same file?&lt;br /&gt;
&lt;br /&gt;
Steve: If there&#039;s a relationship, don&#039;t you need that for provenance information?&lt;br /&gt;
&lt;br /&gt;
Kevin: we need to educate our folks.&lt;br /&gt;
&lt;br /&gt;
Steve: We&#039;ve been dealing with current technologies on these things and who knows what technology will afford us in the future. We&#039;ll be able to hedge our bets.&lt;br /&gt;
&lt;br /&gt;
Sheila: Question to me is what is the relationship between the PDF/A-3 container and the embedded XML, that is, in the Brazilian example, which one has the force of law? And how do you ensure that they say the same thing? In Germany they&#039;re planning on using this for commerce and the validation of invoices, but processing things en masse pragmatically means that you&#039;re going to look at one or the other. What warrant is there that they&#039;re going to stay the same way? The Germans said that the &amp;quot;embedded content&amp;quot; has no standing. Only the archival version has standing.&lt;br /&gt;
&lt;br /&gt;
Steve: I can talk about legal. We would assume that the entire document was the legal evidence. In the case of dual content, according to the &amp;quot;best evidence&amp;quot; rule. If the company put a courtesy copy inside and that it&#039;s the main document that is their record. For example, if you requested an invoice and you received what you might see in a PDF versus XML data, then that&#039;s the evidence. &lt;br /&gt;
&lt;br /&gt;
Folks coming in to a reading room. We may have to set up rules at ingestion, they could either strip it out and put the file on a diet and store the other content. We, in the committee, didn&#039;t want to dictate to preservationists how to do their job.&lt;br /&gt;
&lt;br /&gt;
If NARA said they didn&#039;t want a part of A-3 then the agencies shouldn&#039;t store in A-3. In our PACER system you can pull down a PDF document but stored inside is an MP3 file that allows you to understand the provenance a little more. The PDF is the metadata around the MP3 file. These are temporary records, so it&#039;s not the same issues.&lt;br /&gt;
&lt;br /&gt;
Chris: So the PDF is acting as a manifest for the MP3 file. No validators, if someone is processing a bunch of files into PDF and embedding an XML version. Something could go wrong and you embed the wrong versions of the XML. There&#039;s no way to validate the right coordinated content. &lt;br /&gt;
&lt;br /&gt;
Steve: archivists are going to have to get more involved in advising creators on the types of files they create.&lt;br /&gt;
&lt;br /&gt;
PDF/E and A are essentially the same, they just couldn&#039;t find an open codec for rendering 3D documents. &lt;br /&gt;
&lt;br /&gt;
PDF/A is pretty much asleep now, making sure we keep up with the latest version of the PDF specification. &lt;br /&gt;
&lt;br /&gt;
Caroline:&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Chris Dietrich&#039;s Notes:&lt;br /&gt;
PDF/A-3 Use Cases 20130329&lt;br /&gt;
&lt;br /&gt;
NDSA PDF/A-3 work group discuss (positive) use cases with Stephen Levenson of the U.S. Courts&lt;br /&gt;
&lt;br /&gt;
PDF/A has 3 versions: No versions (A-1, A-2, or A-3) will be replaced by subsequent versions, reducing the need for migration&lt;br /&gt;
&lt;br /&gt;
• PDF/A-1 = ???? [didn’t capture what makes a PDF/A-1 different from a PDF]&lt;br /&gt;
&lt;br /&gt;
• PDF/A-2 = PDF/A-1 + ability to embed a second PDF/A-2 copy of the same document(?) (recursive?)&lt;br /&gt;
&lt;br /&gt;
• PDF/A-3 = PDF/A-2 + any file embedded&lt;br /&gt;
&lt;br /&gt;
• Genesis for A-3 was to include other types like XML along with the PDF (not just PDF/A-2)&lt;br /&gt;
&lt;br /&gt;
• Private Data section in A-2:&lt;br /&gt;
&lt;br /&gt;
• Use Case: Brazil uses to embed other data like XML version of the file (legal docs)&lt;br /&gt;
&lt;br /&gt;
• Is/can be protected from view&lt;br /&gt;
&lt;br /&gt;
• Was the genesis for creating A-3 to store other data not protected from view (?)&lt;br /&gt;
&lt;br /&gt;
• Allows for human- and machine-readable versions of a document in the same container&lt;br /&gt;
&lt;br /&gt;
• Viewing software (e.g. Adobe Reader et al.) will not render additional content but will notify consumer that additional content exists &lt;br /&gt;
&lt;br /&gt;
All three versions are PDFs, no need to distinguish between the three versions except by the presentation interface which should be designed to detect and display embedded additional content&lt;br /&gt;
&lt;br /&gt;
Currently no validators to ensure that any particular file (PDF, .doc, etc.) is actually what the extension indicates or is high-quality (e.g. some PDF generators are better than others)&lt;br /&gt;
&lt;br /&gt;
• There is a vendor looking to create an ISO-compliant PDF validator which could be used by repositories to validate incoming PDF files (still years away)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Ability to embed XML allows both human- and machine-readable versions of a document in the same container&lt;br /&gt;
&lt;br /&gt;
• Useful for automated ingest and presentation by repositories&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Question arises which of the two versions (PDF or XML) is the authoritative version&lt;br /&gt;
&lt;br /&gt;
• Use-case: PACER system for courtroom audio stored as MP3 and packaged in PDF/A-3&lt;br /&gt;
&lt;br /&gt;
• PDF/A-3 acts as the metadata/manifest for the embedded recording&lt;br /&gt;
&lt;br /&gt;
• Recordings are access copies, not high-quality originals for long-term preservation/archiving&lt;br /&gt;
&lt;br /&gt;
• These are temp records, disposed of after 5 years&lt;br /&gt;
&lt;br /&gt;
I think I missed another use case here…&lt;/div&gt;</summary>
		<author><name>Chris Dietrich</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:March_29,_2013_Call&amp;diff=5415</id>
		<title>NDSA:March 29, 2013 Call</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:March_29,_2013_Call&amp;diff=5415"/>
		<updated>2013-03-29T22:24:38Z</updated>

		<summary type="html">&lt;p&gt;Chris Dietrich: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Back to [[NDSA:Standards_and_Best_Practices_Working_Group | Standards Working Group Main Page]]&lt;br /&gt;
&lt;br /&gt;
Back to [[NDSA:PDF_Exploration | PDF Exploration Page]]&lt;br /&gt;
&lt;br /&gt;
=Agenda=&lt;br /&gt;
Discussion with Stephen Levenson, an IT Specialist for Policy and Planning at the Office of the US Courts and the chair of the PDF/A working group.&lt;br /&gt;
&lt;br /&gt;
=Participants=&lt;br /&gt;
&lt;br /&gt;
Don Chalfant, Kate Murray, Sheila Morrissey, Kevin DeVorsey, Chris Dietrich, Carl Fleischhauer, Stephen Levenson, Butch Lazorchak&lt;br /&gt;
&lt;br /&gt;
=Meeting Notes=&lt;br /&gt;
*A rough transcription of our conversation*&lt;br /&gt;
&lt;br /&gt;
Stephen Levenson discussing how the PDF/A family of standards is being implemented by the ISO community. &lt;br /&gt;
&lt;br /&gt;
Steve: PDF/A-3 doesn&#039;t necessarily replace A-1 or A-2. Should be able to use a PDF/A-1 file 30 years from now. The methodology used should not change for rendering these files in the future. &lt;br /&gt;
&lt;br /&gt;
PDF/A movement highly influenced by manufacturers, now PDF/A center, dominated by the Germans. Had many use cases for instances where creators wanted to include the original files within a PDF/A document. &lt;br /&gt;
&lt;br /&gt;
Brazilian government wanted to preserve their material as XML but XML wasn&#039;t trusted by users because of complexity. Wanted to make a more presentation-ready format but didn&#039;t want to throw away the XML.&lt;br /&gt;
&lt;br /&gt;
U.S. Courts, bankruptcy court, claims, when an individual goes into court, and the claims are laid out, in order for someone to assert the claim, we print a document for them. That&#039;s what they bring back to court to assert their claim. Ginnie Mae started this and Mastercard is also moving on this. We get the PDF but then have to reenter the data from this document in their case management systems. &lt;br /&gt;
&lt;br /&gt;
We&#039;re putting an XML output of what the claim represents inside the PDF document. Ginnie Mae&#039;s automated processes work on this XML.&lt;br /&gt;
&lt;br /&gt;
With PDF/A-3 we have downstream functions that leverage the inner materials. Adobe&#039;s server product does not currently output A-3 files. &lt;br /&gt;
&lt;br /&gt;
Chris: How will hidden content be protected from certain readers of the document?&lt;br /&gt;
&lt;br /&gt;
Steve: For A-3 you can still include the information as &amp;quot;private data&amp;quot; that would make it hidden. Conforming reader would recognize it as A-3 and set up an additional dialog.&lt;br /&gt;
&lt;br /&gt;
Caroline: we&#039;re in the position of having to preserve files that somebody else created. We need tools to characterize files. Some PDF/A-3 files may not have any embedded content so they&#039;d actually behave like a PDF/A-2.&lt;br /&gt;
&lt;br /&gt;
Steve: We&#039;d have to talk to the developers about that. There is a vendor that is looking to create an independent service to validate the writers to ensure that they are actually complying with the standards. The software would have to get certified that it works. Then we&#039;d actually have validators at ingestion. We have to get somebody interested in creating this software as a business and we think we finally have somebody who will do this.&lt;br /&gt;
&lt;br /&gt;
DOD standard 5015.2 that says if you&#039;re going to be a document management system you have to do certain things. Has to be sent to Fort Huachuca in AZ to a testing center to ensure that it conforms to DOD 5015.2. And this validator would do the same kind of thing.&lt;br /&gt;
&lt;br /&gt;
Right now we have no validator that says a Word document is actually a Word document. There are a lot of bad writers out there. &lt;br /&gt;
&lt;br /&gt;
Kevin: we&#039;re working on policy and guidance side. We ask people to keep temporary, permanent and non-record material separate from each other. Does PDF/A-3 run counter to that? Might encourage people to mix record and non-record material together in the same file?&lt;br /&gt;
&lt;br /&gt;
Steve: If there&#039;s a relationship, don&#039;t you need that for provenance information?&lt;br /&gt;
&lt;br /&gt;
Kevin: we need to educate our folks.&lt;br /&gt;
&lt;br /&gt;
Steve: We&#039;ve been dealing with current technologies on these things and who knows what technology will afford us in the future. We&#039;ll be able to hedge our bets.&lt;br /&gt;
&lt;br /&gt;
Sheila: Question to me is what is the relationship between the PDF/A-3 container and the embedded XML, that is, in the Brazilian example, which one has the force of law? And how do you ensure that they say the same thing? In Germany they&#039;re planning on using this for commerce and the validation of invoices, but processing things en masse pragmatically means that you&#039;re going to look at one or the other. What warrant is there that they&#039;re going to stay the same way? The Germans said that the &amp;quot;embedded content&amp;quot; has no standing. Only the archival version has standing.&lt;br /&gt;
&lt;br /&gt;
Steve: I can talk about legal. We would assume that the entire document was the legal evidence. In the case of dual content, according to the &amp;quot;best evidence&amp;quot; rule. If the company put a courtesy copy inside and that it&#039;s the main document that is their record. For example, if you requested an invoice and you received what you might see in a PDF versus XML data, then that&#039;s the evidence. &lt;br /&gt;
&lt;br /&gt;
Folks coming in to a reading room. We may have to set up rules at ingestion, they could either strip it out and put the file on a diet and store the other content. We, in the committee, didn&#039;t want to dictate to preservationists how to do their job.&lt;br /&gt;
&lt;br /&gt;
If NARA said they didn&#039;t want a part of A-3 then the agencies shouldn&#039;t store in A-3. In our PACER system you can pull down a PDF document but stored inside is an MP3 file that allows you to understand the provenance a little more. The PDF is the metadata around the MP3 file. These are temporary records, so it&#039;s not the same issues.&lt;br /&gt;
&lt;br /&gt;
Chris: So the PDF is acting as a manifest for the MP3 file. No validators, if someone is processing a bunch of files into PDF and embedding an XML version. Something could go wrong and you embed the wrong versions of the XML. There&#039;s no way to validate the right coordinated content. &lt;br /&gt;
&lt;br /&gt;
Steve: archivists are going to have to get more involved in advising creators on the types of files they create.&lt;br /&gt;
&lt;br /&gt;
PDF/E and A are essentially the same, they just couldn&#039;t find an open codec for rendering 3D documents. &lt;br /&gt;
&lt;br /&gt;
PDF/A is pretty much asleep now, making sure we keep up with the latest version of the PDF specification. &lt;br /&gt;
&lt;br /&gt;
Caroline:&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Chris Dietrich&#039;s Notes:&lt;br /&gt;
PDF/A-3 Use Cases 20130329&lt;br /&gt;
&lt;br /&gt;
NDSA PDF/A-3 work group discuss (positive) use cases with Stephen Levenson of the U.S. Courts&lt;br /&gt;
&lt;br /&gt;
PDF/A has 3 versions: No versions (A-1, A-2, or A-3) will be replaced by subsequent versions, reducing the need for migration&lt;br /&gt;
&lt;br /&gt;
• PDF/A-1 = ???? [didn’t capture what makes a PDF/A-1 different from a PDF]&lt;br /&gt;
&lt;br /&gt;
• PDF/A-2 = PDF/A-1 + ability to embed a second PDF/A-2 copy of the same document(?) (recursive?)&lt;br /&gt;
&lt;br /&gt;
• PDF/A-3 = PDF/A-2 + any file embedded&lt;br /&gt;
&lt;br /&gt;
• Genesis for A-3 was to include other types like XML along with the PDF (not just PDF/A-2)&lt;br /&gt;
&lt;br /&gt;
• Private Data section in A-2:&lt;br /&gt;
&lt;br /&gt;
• Use Case: Brazil uses to embed other data like XML version of the file (legal docs)&lt;br /&gt;
&lt;br /&gt;
• Is/can be protected from view&lt;br /&gt;
&lt;br /&gt;
• Was the genesis for creating A-3 to store other data not protected from view (?)&lt;br /&gt;
&lt;br /&gt;
• Allows for human- and machine-readable versions of a document in the same container&lt;br /&gt;
&lt;br /&gt;
• Viewing software (e.g. Adobe Reader et al.) will not render additional content but will notify consumer that additional content exists &lt;br /&gt;
&lt;br /&gt;
All three versions are PDFs, no need to distinguish between the three versions except by the presentation interface which should be designed to detect and display embedded additional content&lt;br /&gt;
&lt;br /&gt;
Currently no validators to ensure that any particular file (PDF, .doc, etc.) is actually what the extension indicates or is high-quality (e.g. some PDF generators are better than others)&lt;br /&gt;
&lt;br /&gt;
There is a vendor looking to create an ISO-compliant PDF validator which could be used by repositories to validate incoming PDF files (still years away)&lt;br /&gt;
&lt;br /&gt;
Ability to embed XML allows both human- and machine-readable versions of a document in the same container&lt;br /&gt;
&lt;br /&gt;
• Useful for automated ingest and presentation by repositories&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Question arises which of the two versions (PDF or XML) is the authoritative version&lt;br /&gt;
&lt;br /&gt;
• Use-case: PACER system for courtroom audio stored as MP3 and packaged in PDF/A-3&lt;br /&gt;
&lt;br /&gt;
• PDF/A-3 acts as the metadata/manifest for the embedded recording&lt;br /&gt;
&lt;br /&gt;
• Recordings are access copies, not high-quality originals for long-term preservation/archiving&lt;br /&gt;
&lt;br /&gt;
• These are temp records, disposed of after 5 years&lt;br /&gt;
&lt;br /&gt;
I think I missed another use case here…&lt;/div&gt;</summary>
		<author><name>Chris Dietrich</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:March_29,_2013_Call&amp;diff=5414</id>
		<title>NDSA:March 29, 2013 Call</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:March_29,_2013_Call&amp;diff=5414"/>
		<updated>2013-03-29T22:23:54Z</updated>

		<summary type="html">&lt;p&gt;Chris Dietrich: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Back to [[NDSA:Standards_and_Best_Practices_Working_Group | Standards Working Group Main Page]]&lt;br /&gt;
&lt;br /&gt;
Back to [[NDSA:PDF_Exploration | PDF Exploration Page]]&lt;br /&gt;
&lt;br /&gt;
=Agenda=&lt;br /&gt;
Discussion with Stephen Levenson, an IT Specialist for Policy and Planning at the Office of the US Courts and the chair of the PDF/A working group.&lt;br /&gt;
&lt;br /&gt;
=Participants=&lt;br /&gt;
&lt;br /&gt;
Don Chalfant, Kate Murray, Sheila Morrissey, Kevin DeVorsey, Chris Dietrich, Carl Fleischhauer, Stephen Levenson, Butch Lazorchak&lt;br /&gt;
&lt;br /&gt;
=Meeting Notes=&lt;br /&gt;
*A rough transcription of our conversation*&lt;br /&gt;
&lt;br /&gt;
Stephen Levenson discussing how the PDF/A family of standards are being implemented by the ISO community. &lt;br /&gt;
&lt;br /&gt;
Steve: PDF/A-3 doesn&#039;t necessarily replace A-1 or A-2. Should be able to use a PDF/A-1 file 30 years from now. The methodology used should not change for rendering these files in the future. &lt;br /&gt;
&lt;br /&gt;
PDF/A movement highly influenced by manufacturers, now PDF/A center, dominated by the Germans. Had many use cases for instances where creators wanted to include the original files within a PDF/A document.&lt;br /&gt;
&lt;br /&gt;
Brazilian government wanted to preserve their material as XML but XML wasn&#039;t trusted by users because of complexity. Wanted to make a more presentation-ready format but didn&#039;t want to throw away the XML.&lt;br /&gt;
&lt;br /&gt;
U.S. Courts, bankruptcy court, claims, when an individual goes into court, and the claims are laid out, in order for someone to assert the claim, we print a document for them. That&#039;s what they bring back to court to assert their claim. Ginnie Mae started this and Mastercard is also moving on this. We get the PDF but then have to reenter the data from this document in their case management systems. &lt;br /&gt;
&lt;br /&gt;
We&#039;re putting an XML output of what the claim represents inside the PDF document. Ginnie Mae&#039;s automated processes work on this XML.&lt;br /&gt;
&lt;br /&gt;
With PDF/A-3 we have downstream functions that leverage the inner materials. Adobe&#039;s server product does not currently output A-3 files. &lt;br /&gt;
&lt;br /&gt;
Chris: How will hidden content be protected from certain readers of the document?&lt;br /&gt;
&lt;br /&gt;
Steve: For A-3 you can still include the information as &amp;quot;private data&amp;quot; that would make it hidden. Conforming reader would recognize it as A-3 and set up an additional dialog.&lt;br /&gt;
&lt;br /&gt;
Caroline: we&#039;re in the position of having to preserve files that somebody else created. We need tools to characterize files. Some PDF/A-3 files may not have any embedded content so they&#039;d actually behave like a PDF/A-2.&lt;br /&gt;
&lt;br /&gt;
Steve: We&#039;d have to talk to the developers about that. There is a vendor that is looking to create an independent service to validate the writers to ensure that they are actually complying with the standards. The software would have to get certified that it works. Then we&#039;d actually have validators at ingestion. We have to get somebody interested in creating this software as a business and we think we finally have somebody who will do this.&lt;br /&gt;
&lt;br /&gt;
DOD standard 5015.2 that says if you&#039;re going to be a document management system you have to do certain things. Has to be sent to Fort Huachuca in AZ to a testing center to ensure that it conforms to DOD 5015.2. And this validator would do the same kind of thing.&lt;br /&gt;
&lt;br /&gt;
Right now we have no validator that says a Word document is actually a Word document. There are a lot of bad writers out there. &lt;br /&gt;
&lt;br /&gt;
Kevin: we&#039;re working on policy and guidance side. We ask people to keep temporary, permanent and non-record material separate from each other. Does PDF/A-3 run counter to that? Might encourage people to mix record and non-record material together in the same file?&lt;br /&gt;
&lt;br /&gt;
Steve: If there&#039;s a relationship, don&#039;t you need that for provenance information?&lt;br /&gt;
&lt;br /&gt;
Kevin: we need to educate our folks.&lt;br /&gt;
&lt;br /&gt;
Steve: We&#039;ve been dealing with current technologies on these things and who knows what technology will afford us in the future. We&#039;ll be able to hedge our bets.&lt;br /&gt;
&lt;br /&gt;
Sheila: Question to me is what is the relationship between the PDF/A-3 container and the embedded XML, that is, in the Brazilian example, which one has the force of law? And how do you ensure that they say the same thing? In Germany they&#039;re planning on using this for commerce and the validation of invoices, but processing things en masse pragmatically means that you&#039;re going to look at one or the other. What warrant is there that they&#039;re going to stay the same way? The Germans said that the &amp;quot;embedded content&amp;quot; has no standing. Only the archival version has standing.&lt;br /&gt;
&lt;br /&gt;
Steve: I can talk about legal. We would assume that the entire document was the legal evidence. In the case of dual content, we would go by the &amp;quot;best evidence&amp;quot; rule: if the company put a courtesy copy inside, then it&#039;s the main document that is their record. For example, if you requested an invoice and you received what you might see in a PDF versus XML data, then that&#039;s the evidence. &lt;br /&gt;
&lt;br /&gt;
Folks coming in to a reading room. We may have to set up rules at ingestion, they could either strip it out and put the file on a diet and store the other content. We, in the committee, didn&#039;t want to dictate to preservationists how to do their job.&lt;br /&gt;
&lt;br /&gt;
If NARA said they didn&#039;t want a part of A-3 then the agencies shouldn&#039;t store in A-3. In our PACER system you can pull down a PDF document but stored inside is an MP3 file that allows you to understand the provenance a little more. The PDF is the metadata around the MP3 file. These are temporary records, so it&#039;s not the same issues.&lt;br /&gt;
&lt;br /&gt;
Chris: So the PDF is acting as a manifest for the MP3 file. No validators: if someone is processing a bunch of files into PDF and embedding an XML version, something could go wrong and you embed the wrong version of the XML. There&#039;s no way to validate the right coordinated content. &lt;br /&gt;
&lt;br /&gt;
Steve: archivists are going to have to get more involved in advising creators on the types of files they create.&lt;br /&gt;
&lt;br /&gt;
PDF/E and A are essentially the same, they just couldn&#039;t find an open codec for rendering 3D documents. &lt;br /&gt;
&lt;br /&gt;
PDF/A is pretty much asleep now, making sure we keep up with the latest version of the PDF specification. &lt;br /&gt;
&lt;br /&gt;
Caroline:&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Chris Dietrich&#039;s Notes:&lt;br /&gt;
PDF/A-3 Use Cases 20130329&lt;br /&gt;
&lt;br /&gt;
NDSA PDF/A-3 work group discuss (positive) use cases with Stephen Levenson of the U.S. Courts&lt;br /&gt;
&lt;br /&gt;
PDF/A has 3 versions: no version (A-1, A-2, or A-3) will be replaced by subsequent versions, reducing the need for migration&lt;br /&gt;
&lt;br /&gt;
• PDF/A-1 = ???? [didn&#039;t capture what makes a PDF/A-1 different from a PDF]&lt;br /&gt;
&lt;br /&gt;
• PDF/A-2 = PDF/A-1 + ability to embed a second PDF/A-2 copy of the same document(?) (recursive?)&lt;br /&gt;
&lt;br /&gt;
• PDF/A-3 = PDF/A-2 + any file embedded&lt;br /&gt;
&lt;br /&gt;
• Genesis for A-3 was to include other types like XML along with the PDF (not just PDF/A-2)&lt;br /&gt;
&lt;br /&gt;
• Private Data section in A-2:&lt;br /&gt;
&lt;br /&gt;
• Use Case: Brazil uses to embed other data like XML version of the file (legal docs)&lt;br /&gt;
&lt;br /&gt;
• Is/can be protected from view&lt;br /&gt;
&lt;br /&gt;
• Was the genesis for creating A-3 to store other data not protected from view (?)&lt;br /&gt;
&lt;br /&gt;
• Allows for human- and machine-readable versions of a document in the same container&lt;br /&gt;
&lt;br /&gt;
• Viewing software (i.e. Adobe Reader et al.) will not render additional content but will notify consumer that additional content exists&lt;br /&gt;
&lt;br /&gt;
All three versions are PDFs, no need to distinguish between the three versions except by the presentation interface which should be designed to detect and display embedded additional content&lt;br /&gt;
&lt;br /&gt;
Currently no validators to ensure that any particular file (PDF, .doc, etc.) is actually what the extension indicates or is high-quality (e.g. some PDF generators are better than others)&lt;br /&gt;
&lt;br /&gt;
There is a vendor looking to create an ISO-compliant PDF validator which could be used by repositories to validate incoming PDF files (still years away)&lt;br /&gt;
&lt;br /&gt;
Ability to embed XML allows both human- and machine-readable versions of a document in the same container&lt;br /&gt;
• Useful for automated ingest and presentation by repositories&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Question arises which of the two versions (PDF or XML) is the authoritative version&lt;br /&gt;
&lt;br /&gt;
• Use-case: PACER system for courtroom audio stored as MP3 and packaged in PDF/A-3&lt;br /&gt;
&lt;br /&gt;
• PDF/A-3 acts as the metadata/manifest for the embedded recording&lt;br /&gt;
&lt;br /&gt;
• Recordings are access copies, not high-quality originals for long-term preservation/archiving&lt;br /&gt;
&lt;br /&gt;
• These are temp records, disposed of after 5 years&lt;br /&gt;
&lt;br /&gt;
I think I missed another use case here…&lt;/div&gt;</summary>
		<author><name>Chris Dietrich</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:March_29,_2013_Call&amp;diff=5413</id>
		<title>NDSA:March 29, 2013 Call</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:March_29,_2013_Call&amp;diff=5413"/>
		<updated>2013-03-29T22:23:12Z</updated>

		<summary type="html">&lt;p&gt;Chris Dietrich: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Back to [[NDSA:Standards_and_Best_Practices_Working_Group | Standards Working Group Main Page]]&lt;br /&gt;
&lt;br /&gt;
Back to [[NDSA:PDF_Exploration | PDF Exploration Page]]&lt;br /&gt;
&lt;br /&gt;
=Agenda=&lt;br /&gt;
Discussion with Stephen Levenson, an IT Specialist for Policy and Planning at the Office of the US Courts and the chair of the PDF/A working group.&lt;br /&gt;
&lt;br /&gt;
=Participants=&lt;br /&gt;
&lt;br /&gt;
Don Chalfant, Kate Murray, Sheila Morrissey, Kevin DeVorsey, Chris Dietrich, Carl Fleischhauer, Stephen Levenson, Butch Lazorchak&lt;br /&gt;
&lt;br /&gt;
=Meeting Notes=&lt;br /&gt;
*A rough transcription of our conversation*&lt;br /&gt;
&lt;br /&gt;
Stephen Levenson discussing how the PDF/A family of standards are being implemented by the ISO community. &lt;br /&gt;
&lt;br /&gt;
Steve: PDF/A-3 doesn&#039;t necessarily replace A-1 or A-2. Should be able to use a PDF/A-1 file 30 years from now. The methodology used should not change for rendering these files in the future. &lt;br /&gt;
&lt;br /&gt;
PDF/A movement highly influenced by manufacturers, now PDF/A center, dominated by the Germans. Had many use cases for instances where creators wanted to include the original files within a PDF/A document.&lt;br /&gt;
&lt;br /&gt;
Brazilian government wanted to preserve their material as XML but XML wasn&#039;t trusted by users because of complexity. Wanted to make a more presentation-ready format but didn&#039;t want to throw away the XML.&lt;br /&gt;
&lt;br /&gt;
U.S. Courts, bankruptcy court, claims, when an individual goes into court, and the claims are laid out, in order for someone to assert the claim, we print a document for them. That&#039;s what they bring back to court to assert their claim. Ginnie Mae started this and Mastercard is also moving on this. We get the PDF but then have to reenter the data from this document in their case management systems. &lt;br /&gt;
&lt;br /&gt;
We&#039;re putting an XML output of what the claim represents inside the PDF document. Ginnie Mae&#039;s automated processes work on this XML.&lt;br /&gt;
&lt;br /&gt;
With PDF/A-3 we have downstream functions that leverage the inner materials. Adobe&#039;s server product does not currently output A-3 files. &lt;br /&gt;
&lt;br /&gt;
Chris: How will hidden content be protected from certain readers of the document?&lt;br /&gt;
&lt;br /&gt;
Steve: For A-3 you can still include the information as &amp;quot;private data&amp;quot; that would make it hidden. Conforming reader would recognize it as A-3 and set up an additional dialog.&lt;br /&gt;
&lt;br /&gt;
Caroline: we&#039;re in the position of having to preserve files that somebody else created. We need tools to characterize files. Some PDF/A-3 files may not have any embedded content so they&#039;d actually behave like a PDF/A-2.&lt;br /&gt;
&lt;br /&gt;
Steve: We&#039;d have to talk to the developers about that. There is a vendor that is looking to create an independent service to validate the writers to ensure that they are actually complying with the standards. The software would have to get certified that it works. Then we&#039;d actually have validators at ingestion. We have to get somebody interested in creating this software as a business and we think we finally have somebody who will do this.&lt;br /&gt;
&lt;br /&gt;
DOD standard 5015.2 that says if you&#039;re going to be a document management system you have to do certain things. Has to be sent to Fort Huachuca in AZ to a testing center to ensure that it conforms to DOD 5015.2. And this validator would do the same kind of thing.&lt;br /&gt;
&lt;br /&gt;
Right now we have no validator that says a Word document is actually a Word document. There are a lot of bad writers out there. &lt;br /&gt;
&lt;br /&gt;
Kevin: we&#039;re working on policy and guidance side. We ask people to keep temporary, permanent and non-record material separate from each other. Does PDF/A-3 run counter to that? Might encourage people to mix record and non-record material together in the same file?&lt;br /&gt;
&lt;br /&gt;
Steve: If there&#039;s a relationship, don&#039;t you need that for provenance information?&lt;br /&gt;
&lt;br /&gt;
Kevin: we need to educate our folks.&lt;br /&gt;
&lt;br /&gt;
Steve: We&#039;ve been dealing with current technologies on these things and who knows what technology will afford us in the future. We&#039;ll be able to hedge our bets.&lt;br /&gt;
&lt;br /&gt;
Sheila: Question to me is what is the relationship between the PDF/A-3 container and the embedded XML, that is, in the Brazilian example, which one has the force of law? And how do you ensure that they say the same thing? In Germany they&#039;re planning on using this for commerce and the validation of invoices, but processing things en masse pragmatically means that you&#039;re going to look at one or the other. What warrant is there that they&#039;re going to stay the same way? The Germans said that the &amp;quot;embedded content&amp;quot; has no standing. Only the archival version has standing.&lt;br /&gt;
&lt;br /&gt;
Steve: I can talk about legal. We would assume that the entire document was the legal evidence. In the case of dual content, we would go by the &amp;quot;best evidence&amp;quot; rule: if the company put a courtesy copy inside, then it&#039;s the main document that is their record. For example, if you requested an invoice and you received what you might see in a PDF versus XML data, then that&#039;s the evidence. &lt;br /&gt;
&lt;br /&gt;
Folks coming in to a reading room. We may have to set up rules at ingestion, they could either strip it out and put the file on a diet and store the other content. We, in the committee, didn&#039;t want to dictate to preservationists how to do their job.&lt;br /&gt;
&lt;br /&gt;
If NARA said they didn&#039;t want a part of A-3 then the agencies shouldn&#039;t store in A-3. In our PACER system you can pull down a PDF document but stored inside is an MP3 file that allows you to understand the provenance a little more. The PDF is the metadata around the MP3 file. These are temporary records, so it&#039;s not the same issues.&lt;br /&gt;
&lt;br /&gt;
Chris: So the PDF is acting as a manifest for the MP3 file. No validators: if someone is processing a bunch of files into PDF and embedding an XML version, something could go wrong and you embed the wrong version of the XML. There&#039;s no way to validate the right coordinated content. &lt;br /&gt;
&lt;br /&gt;
Steve: archivists are going to have to get more involved in advising creators on the types of files they create.&lt;br /&gt;
&lt;br /&gt;
PDF/E and A are essentially the same, they just couldn&#039;t find an open codec for rendering 3D documents. &lt;br /&gt;
&lt;br /&gt;
PDF/A is pretty much asleep now, making sure we keep up with the latest version of the PDF specification. &lt;br /&gt;
&lt;br /&gt;
Caroline:&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Chris Dietrich&#039;s Notes:&lt;br /&gt;
PDF/A-3 Use Cases 20130329&lt;br /&gt;
&lt;br /&gt;
NDSA PDF/A-3 work group discuss (positive) use cases with Stephen Levenson of the U.S. Courts&lt;br /&gt;
&lt;br /&gt;
PDF/A has 3 versions: no version (A-1, A-2, or A-3) will be replaced by subsequent versions, reducing the need for migration&lt;br /&gt;
&lt;br /&gt;
• PDF/A-1 = ???? [didn&#039;t capture what makes a PDF/A-1 different from a PDF]&lt;br /&gt;
&lt;br /&gt;
• PDF/A-2 = PDF/A-1 + ability to embed a second PDF/A-2 copy of the same document(?) (recursive?)&lt;br /&gt;
&lt;br /&gt;
• PDF/A-3 = PDF/A-2 + any file embedded&lt;br /&gt;
&lt;br /&gt;
• Genesis for A-3 was to include other types like XML along with the PDF (not just PDF/A-2)&lt;br /&gt;
&lt;br /&gt;
• Private Data section in A-2:&lt;br /&gt;
&lt;br /&gt;
• Use Case: Brazil uses to embed other data like XML version of the file (legal docs)&lt;br /&gt;
&lt;br /&gt;
• Is/can be protected from view&lt;br /&gt;
&lt;br /&gt;
• Was the genesis for creating A-3 to store other data not protected from view (?)&lt;br /&gt;
&lt;br /&gt;
• Allows for human- and machine-readable versions of a document in the same container&lt;br /&gt;
&lt;br /&gt;
• Viewing software (i.e. Adobe Reader et al.) will not render additional content but will notify consumer that additional content exists&lt;br /&gt;
&lt;br /&gt;
All three versions are PDFs, no need to distinguish between the three versions except by the presentation interface which should be designed to detect and display embedded additional content&lt;br /&gt;
&lt;br /&gt;
Currently no validators to ensure that any particular file (PDF, .doc, etc.) is actually what the extension indicates or is high-quality (e.g. some PDF generators are better than others)&lt;br /&gt;
&lt;br /&gt;
There is a vendor looking to create an ISO-compliant PDF validator which could be used by repositories to validate incoming PDF files (still years away)&lt;br /&gt;
&lt;br /&gt;
Ability to embed XML allows both human- and machine-readable versions of a document in the same container&lt;br /&gt;
Useful for automated ingest and presentation by repositories&lt;br /&gt;
&lt;br /&gt;
Question arises which of the two versions (PDF or XML) is the authoritative version&lt;br /&gt;
-Use-case: PACER system for courtroom audio stored as MP3 and packaged in PDF/A-3&lt;br /&gt;
--PDF/A-3 acts as the metadata/manifest for the embedded recording&lt;br /&gt;
--Recordings are access copies, not high-quality originals for long-term preservation/archiving&lt;br /&gt;
--These are temp records, disposed of after 5 years&lt;br /&gt;
&lt;br /&gt;
I think I missed another use case here…&lt;/div&gt;</summary>
		<author><name>Chris Dietrich</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:March_29,_2013_Call&amp;diff=5412</id>
		<title>NDSA:March 29, 2013 Call</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:March_29,_2013_Call&amp;diff=5412"/>
		<updated>2013-03-29T22:22:41Z</updated>

		<summary type="html">&lt;p&gt;Chris Dietrich: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Back to [[NDSA:Standards_and_Best_Practices_Working_Group | Standards Working Group Main Page]]&lt;br /&gt;
&lt;br /&gt;
Back to [[NDSA:PDF_Exploration | PDF Exploration Page]]&lt;br /&gt;
&lt;br /&gt;
=Agenda=&lt;br /&gt;
Discussion with Stephen Levenson, an IT Specialist for Policy and Planning at the Office of the US Courts and the chair of the PDF/A working group.&lt;br /&gt;
&lt;br /&gt;
=Participants=&lt;br /&gt;
&lt;br /&gt;
Don Chalfant, Kate Murray, Sheila Morrissey, Kevin DeVorsey, Chris Dietrich, Carl Fleischhauer, Stephen Levenson, Butch Lazorchak&lt;br /&gt;
&lt;br /&gt;
=Meeting Notes=&lt;br /&gt;
*A rough transcription of our conversation*&lt;br /&gt;
&lt;br /&gt;
Stephen Levenson discussing how the PDF/A family of standards are being implemented by the ISO community. &lt;br /&gt;
&lt;br /&gt;
Steve: PDF/A-3 doesn&#039;t necessarily replace A-1 or A-2. Should be able to use a PDF/A-1 file 30 years from now. The methodology used should not change for rendering these files in the future. &lt;br /&gt;
&lt;br /&gt;
PDF/A movement highly influenced by manufacturers, now PDF/A center, dominated by the Germans. Had many use cases for instances where creators wanted to include the original files within a PDF/A document.&lt;br /&gt;
&lt;br /&gt;
Brazilian government wanted to preserve their material as XML but XML wasn&#039;t trusted by users because of complexity. Wanted to make a more presentation-ready format but didn&#039;t want to throw away the XML.&lt;br /&gt;
&lt;br /&gt;
U.S. Courts, bankruptcy court, claims, when an individual goes into court, and the claims are laid out, in order for someone to assert the claim, we print a document for them. That&#039;s what they bring back to court to assert their claim. Ginnie Mae started this and Mastercard is also moving on this. We get the PDF but then have to reenter the data from this document in their case management systems. &lt;br /&gt;
&lt;br /&gt;
We&#039;re putting an XML output of what the claim represents inside the PDF document. Ginnie Mae&#039;s automated processes work on this XML.&lt;br /&gt;
&lt;br /&gt;
With PDF/A-3 we have downstream functions that leverage the inner materials. Adobe&#039;s server product does not currently output A-3 files. &lt;br /&gt;
&lt;br /&gt;
Chris: How will hidden content be protected from certain readers of the document?&lt;br /&gt;
&lt;br /&gt;
Steve: For A-3 you can still include the information as &amp;quot;private data&amp;quot; that would make it hidden. Conforming reader would recognize it as A-3 and set up an additional dialog.&lt;br /&gt;
&lt;br /&gt;
Caroline: we&#039;re in the position of having to preserve files that somebody else created. We need tools to characterize files. Some PDF/A-3 files may not have any embedded content so they&#039;d actually behave like a PDF/A-2.&lt;br /&gt;
&lt;br /&gt;
Steve: We&#039;d have to talk to the developers about that. There is a vendor that is looking to create an independent service to validate the writers to ensure that they are actually complying with the standards. The software would have to get certified that it works. Then we&#039;d actually have validators at ingestion. We have to get somebody interested in creating this software as a business and we think we finally have somebody who will do this.&lt;br /&gt;
&lt;br /&gt;
DOD standard 5015.2 that says if you&#039;re going to be a document management system you have to do certain things. Has to be sent to Fort Huachuca in AZ to a testing center to ensure that it conforms to DOD 5015.2. And this validator would do the same kind of thing.&lt;br /&gt;
&lt;br /&gt;
Right now we have no validator that says a Word document is actually a Word document. There are a lot of bad writers out there. &lt;br /&gt;
&lt;br /&gt;
Kevin: we&#039;re working on policy and guidance side. We ask people to keep temporary, permanent and non-record material separate from each other. Does PDF/A-3 run counter to that? Might encourage people to mix record and non-record material together in the same file?&lt;br /&gt;
&lt;br /&gt;
Steve: If there&#039;s a relationship, don&#039;t you need that for provenance information?&lt;br /&gt;
&lt;br /&gt;
Kevin: we need to educate our folks.&lt;br /&gt;
&lt;br /&gt;
Steve: We&#039;ve been dealing with current technologies on these things and who knows what technology will afford us in the future. We&#039;ll be able to hedge our bets.&lt;br /&gt;
&lt;br /&gt;
Sheila: Question to me is what is the relationship between the PDF/A-3 container and the embedded XML, that is, in the Brazilian example, which one has the force of law? And how do you ensure that they say the same thing? In Germany they&#039;re planning on using this for commerce and the validation of invoices, but processing things en masse pragmatically means that you&#039;re going to look at one or the other. What warrant is there that they&#039;re going to stay the same way? The Germans said that the &amp;quot;embedded content&amp;quot; has no standing. Only the archival version has standing.&lt;br /&gt;
&lt;br /&gt;
Steve: I can talk about legal. We would assume that the entire document was the legal evidence. In the case of dual content, we would go by the &amp;quot;best evidence&amp;quot; rule: if the company put a courtesy copy inside, then it&#039;s the main document that is their record. For example, if you requested an invoice and you received what you might see in a PDF versus XML data, then that&#039;s the evidence. &lt;br /&gt;
&lt;br /&gt;
Folks coming in to a reading room. We may have to set up rules at ingestion, they could either strip it out and put the file on a diet and store the other content. We, in the committee, didn&#039;t want to dictate to preservationists how to do their job.&lt;br /&gt;
&lt;br /&gt;
If NARA said they didn&#039;t want a part of A-3 then the agencies shouldn&#039;t store in A-3. In our PACER system you can pull down a PDF document but stored inside is an MP3 file that allows you to understand the provenance a little more. The PDF is the metadata around the MP3 file. These are temporary records, so it&#039;s not the same issues.&lt;br /&gt;
&lt;br /&gt;
Chris: So the PDF is acting as a manifest for the MP3 file. No validators: if someone is processing a bunch of files into PDF and embedding an XML version, something could go wrong and you embed the wrong version of the XML. There&#039;s no way to validate the right coordinated content. &lt;br /&gt;
&lt;br /&gt;
Steve: archivists are going to have to get more involved in advising creators on the types of files they create.&lt;br /&gt;
&lt;br /&gt;
PDF/E and A are essentially the same, they just couldn&#039;t find an open codec for rendering 3D documents. &lt;br /&gt;
&lt;br /&gt;
PDF/A is pretty much asleep now, making sure we keep up with the latest version of the PDF specification. &lt;br /&gt;
&lt;br /&gt;
Caroline:&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Chris Dietrich&#039;s Notes:&lt;br /&gt;
PDF/A-3 Use Cases 20130329&lt;br /&gt;
&lt;br /&gt;
NDSA PDF/A-3 work group discuss (positive) use cases with Stephen Levenson of the U.S. Courts&lt;br /&gt;
&lt;br /&gt;
PDF/A has 3 versions: no version (A-1, A-2, or A-3) will be replaced by subsequent versions, reducing the need for migration&lt;br /&gt;
• PDF/A-1 = ???? [didn&#039;t capture what makes a PDF/A-1 different from a PDF]&lt;br /&gt;
• PDF/A-2 = PDF/A-1 + ability to embed a second PDF/A-2 copy of the same document(?) (recursive?)&lt;br /&gt;
• PDF/A-3 = PDF/A-2 + any file embedded&lt;br /&gt;
• Genesis for A-3 was to include other types like XML along with the PDF (not just PDF/A-2)&lt;br /&gt;
• Private Data section in A-2:&lt;br /&gt;
• Use Case: Brazil uses to embed other data like XML version of the file (legal docs)&lt;br /&gt;
• Is/can be protected from view&lt;br /&gt;
• Was the genesis for creating A-3 to store other data not protected from view (?)&lt;br /&gt;
• Allows for human- and machine-readable versions of a document in the same container&lt;br /&gt;
• Viewing software (i.e. Adobe Reader et al.) will not render additional content but will notify consumer that additional content exists&lt;br /&gt;
&lt;br /&gt;
All three versions are PDFs, no need to distinguish between the three versions except by the presentation interface which should be designed to detect and display embedded additional content&lt;br /&gt;
&lt;br /&gt;
Currently no validators to ensure that any particular file (PDF, .doc, etc.) is actually what the extension indicates or is high-quality (e.g. some PDF generators are better than others)&lt;br /&gt;
&lt;br /&gt;
There is a vendor looking to create an ISO-compliant PDF validator which could be used by repositories to validate incoming PDF files (still years away)&lt;br /&gt;
&lt;br /&gt;
Ability to embed XML allows both human- and machine-readable versions of a document in the same container&lt;br /&gt;
Useful for automated ingest and presentation by repositories&lt;br /&gt;
&lt;br /&gt;
Question arises which of the two versions (PDF or XML) is the authoritative version&lt;br /&gt;
-Use-case: PACER system for courtroom audio stored as MP3 and packaged in PDF/A-3&lt;br /&gt;
--PDF/A-3 acts as the metadata/manifest for the embedded recording&lt;br /&gt;
--Recordings are access copies, not high-quality originals for long-term preservation/archiving&lt;br /&gt;
--These are temp records, disposed of after 5 years&lt;br /&gt;
&lt;br /&gt;
I think I missed another use case here…&lt;/div&gt;</summary>
		<author><name>Chris Dietrich</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:March_29,_2013_Call&amp;diff=5411</id>
		<title>NDSA:March 29, 2013 Call</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:March_29,_2013_Call&amp;diff=5411"/>
		<updated>2013-03-29T22:19:12Z</updated>

		<summary type="html">&lt;p&gt;Chris Dietrich: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Back to [[NDSA:Standards_and_Best_Practices_Working_Group | Standards Working Group Main Page]]&lt;br /&gt;
&lt;br /&gt;
Back to [[NDSA:PDF_Exploration | PDF Exploration Page]]&lt;br /&gt;
&lt;br /&gt;
=Agenda=&lt;br /&gt;
Discussion with Stephen Levenson, an IT Specialist for Policy and Planning at the Office of the US Courts and the chair of the PDF/A working group.&lt;br /&gt;
&lt;br /&gt;
=Participants=&lt;br /&gt;
&lt;br /&gt;
Don Chalfant, Kate Murray, Sheila Morrissey, Kevin DeVorsey, Chris Dietrich, Carl Fleischhauer, Stephen Levenson, Butch Lazorchak&lt;br /&gt;
&lt;br /&gt;
=Meeting Notes=&lt;br /&gt;
*A rough transcription of our conversation*&lt;br /&gt;
&lt;br /&gt;
Stephen Levenson discussing how the PDF/A family of standards are being implemented by the ISO community. &lt;br /&gt;
&lt;br /&gt;
Steve: PDF/A-3 doesn&#039;t necessarily replace A-1 or A-2. Should be able to use a PDF/A-1 file 30 years from now. The methodology used should not change for rendering these files in the future. &lt;br /&gt;
&lt;br /&gt;
PDF/A movement highly influenced by manufacturers, now PDF/A center, dominated by the Germans. Had many use cases for instances where creators wanted to include the original files within a PDF/A document.&lt;br /&gt;
&lt;br /&gt;
Brazilian government wanted to preserve their material as XML but XML wasn&#039;t trusted by users because of complexity. Wanted to make a more presentation-ready format but didn&#039;t want to throw away the XML.&lt;br /&gt;
&lt;br /&gt;
U.S. Courts, bankruptcy court, claims, when an individual goes into court, and the claims are laid out, in order for someone to assert the claim, we print a document for them. That&#039;s what they bring back to court to assert their claim. Ginnie Mae started this and Mastercard is also moving on this. We get the PDF but then have to reenter the data from this document in their case management systems. &lt;br /&gt;
&lt;br /&gt;
We&#039;re putting an XML output of what the claim represents inside the PDF document. Ginnie Mae&#039;s automated processes work on this XML.&lt;br /&gt;
&lt;br /&gt;
With PDF/A-3 we have downstream functions that leverage the inner materials. Adobe&#039;s server product does not currently output A-3 files. &lt;br /&gt;
&lt;br /&gt;
Chris: How will hidden content be protected from certain readers of the document?&lt;br /&gt;
&lt;br /&gt;
Steve: For A-3 you can still include the information as &amp;quot;private data&amp;quot; that would make it hidden. Conforming reader would recognize it as A-3 and set up an additional dialog.&lt;br /&gt;
&lt;br /&gt;
Caroline: We&#039;re in the position of having to preserve files that somebody else created. We need tools to characterize files. Some PDF/A-3 files may not have any embedded content so they&#039;d actually behave like a PDF/A-2.&lt;br /&gt;
&lt;br /&gt;
Steve: We&#039;d have to talk to the developers about that. There is a vendor that is looking to create an independent service to validate the writers to ensure that they are actually complying with the standards. The software would have to get certified that it works. Then we&#039;d actually have validators at ingestion. We have to get somebody interested in creating this software as a business and we think we finally have somebody who will do this.&lt;br /&gt;
&lt;br /&gt;
DOD standard 5015.2 that says if you&#039;re going to be a document management system you have to do certain things. Has to be sent to Fort Huachuca in AZ to a testing center to ensure that it conforms to DOD 5015.2. And this validator would do the same kind of thing.&lt;br /&gt;
&lt;br /&gt;
Right now we have no validator that says a Word document is actually a Word document. There are a lot of bad writers out there. &lt;br /&gt;
&lt;br /&gt;
Kevin: We&#039;re working on the policy and guidance side. We ask people to keep temporary, permanent and non-record material separate from each other. Does PDF/A-3 run counter to that? Might it encourage people to mix record and non-record material together in the same file?&lt;br /&gt;
&lt;br /&gt;
Steve: If there&#039;s a relationship, don&#039;t you need that for provenance information?&lt;br /&gt;
&lt;br /&gt;
Kevin: We need to educate our folks.&lt;br /&gt;
&lt;br /&gt;
Steve: We&#039;ve been dealing with current technologies on these things and who knows what technology will afford us in the future. We&#039;ll be able to hedge our bets.&lt;br /&gt;
&lt;br /&gt;
Sheila: Question to me is what is the relationship between the PDF/A-3 container and the embedded XML, that is, in the Brazilian example, which one has the force of law? And how do you ensure that they say the same thing? In Germany they&#039;re planning on using this for commerce and the validation of invoices, but processing things en masse pragmatically means that you&#039;re going to look at one or the other. What warrant is there that they&#039;re going to stay the same? The Germans said that the &amp;quot;embedded content&amp;quot; has no standing. Only the archival version has standing.&lt;br /&gt;
&lt;br /&gt;
Steve: I can talk about legal. We would assume that the entire document was the legal evidence. In the case of dual content, the &amp;quot;best evidence&amp;quot; rule applies: if the company put a courtesy copy inside, then it&#039;s the main document that is their record. For example, if you requested an invoice and you received what you might see in a PDF versus XML data, then that&#039;s the evidence. &lt;br /&gt;
&lt;br /&gt;
Folks coming in to a reading room. We may have to set up rules at ingestion, they could either strip it out and put the file on a diet and store the other content. We, in the committee, didn&#039;t want to dictate to preservationists how to do their job.&lt;br /&gt;
&lt;br /&gt;
If NARA said they didn&#039;t want a part of A-3 then the agencies shouldn&#039;t store in A-3. In our PACER system you can pull down a PDF document but stored inside is an MP3 file that allows you to understand the provenance a little more. The PDF is the metadata around the MP3 file. These are temporary records, so it&#039;s not the same issues.&lt;br /&gt;
&lt;br /&gt;
Chris: So the PDF is acting as a manifest for the MP3 file. No validators, if someone is processing a bunch of files into PDF and embedding an XML version. Something could go wrong and you embed the wrong versions of the XML. There&#039;s no way to validate the right coordinated content. &lt;br /&gt;
&lt;br /&gt;
Steve: Archivists are going to have to get more involved in advising creators on the types of files they create.&lt;br /&gt;
&lt;br /&gt;
PDF/E and A are essentially the same, they just couldn&#039;t find an open codec for rendering 3D documents. &lt;br /&gt;
&lt;br /&gt;
PDF/A is pretty much asleep now, making sure we keep up with the latest version of the PDF specification. &lt;br /&gt;
&lt;br /&gt;
Caroline:&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
Chris Dietrich&#039;s Notes:&lt;br /&gt;
PDF/A-3 Use Cases 20130329&lt;br /&gt;
&lt;br /&gt;
NDSA PDF/A-3 work group discuss (positive) use cases with Stephen Levenson of the U.S. Courts&lt;br /&gt;
&lt;br /&gt;
PDF/A has 3 versions: No versions (A-1, A-2, or A-3) will be replaced by subsequent versions, reducing the need for migration&lt;br /&gt;
-PDF/A-1 = ???? [didn’t capture what makes a PDF/A-1 different from a PDF]&lt;br /&gt;
-PDF/A-2 = PDF/A-1 + ability to embed a second PDF/A-2 copy of the same document(?) (recursive?)&lt;br /&gt;
-PDF/A-3 = PDF/A-2 + any file embedded&lt;br /&gt;
--Genesis for A-3 was to include other types like XML along with the PDF (not just PDF/A-2)&lt;br /&gt;
--Private Data section in A-2:&lt;br /&gt;
--Use Case: Brazil uses to embed other data like XML version of the file (legal docs)&lt;br /&gt;
--Is/can be protected from view&lt;br /&gt;
--Was the genesis for creating A-3 to store other data not protected from view (?)&lt;br /&gt;
--Allows for human- and machine-readable versions of a document in the same container&lt;br /&gt;
--Viewing software (i.e. Adobe Reader et al.) will not render additional content but will notify consumer that additional content exists &lt;br /&gt;
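As an illustration that was not part of the call: each PDF/A part is normally declared in the file&#039;s XMP metadata through the pdfaid:part and pdfaid:conformance properties, so a repository could read the declared level before deciding how to handle embedded content. The Python below is a minimal sketch under stated assumptions: the XMP packet is stored uncompressed, the conventional pdfaid prefix is used, and the function names are hypothetical. A declaration is only a claim by the writer, not proof of conformance.&lt;br /&gt;

```python
import re

# Assumption: the XMP packet is uncompressed and uses the usual "pdfaid" prefix.
# Matches attribute form (pdfaid:part="3") or element form (pdfaid:part>3).
PART_RE = re.compile(rb"pdfaid:part\s*(?:=\s*[\"']|>)\s*([0-9])")
CONF_RE = re.compile(rb"pdfaid:conformance\s*(?:=\s*[\"']|>)\s*([A-Za-z])")


def declared_pdfa_level(data: bytes):
    """Return the declared PDF/A level such as '3B', or None if none is found.

    Hypothetical helper: it reports what the file claims, nothing more.
    """
    part = PART_RE.search(data)
    if part is None:
        return None
    level = part.group(1).decode("ascii")
    conf = CONF_RE.search(data)
    if conf is not None:
        level += conf.group(1).decode("ascii").upper()
    return level


def has_embedded_files_hint(data: bytes) -> bool:
    """Crude hint: True if the /EmbeddedFiles key appears as plain bytes."""
    return b"/EmbeddedFiles" in data
```

The /EmbeddedFiles check is only a hint, since the key can sit inside a compressed object stream where a plain byte search will not see it.&lt;br /&gt;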
&lt;br /&gt;
All three versions are PDFs, no need to distinguish between the three versions except by the presentation interface which should be designed to detect and display embedded additional content&lt;br /&gt;
&lt;br /&gt;
Currently no validators to ensure that any particular file (PDF, .doc, etc.) is actually what the extension indicates or is high-quality (e.g. some PDF generators are better than others)&lt;br /&gt;
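
A partial stand-in for the missing validators noted above, offered as an illustration and not something discussed on the call: compare a file&#039;s leading bytes with the signature expected for its extension. This minimal Python sketch assumes a small hand-made signature table and hypothetical function names; it shows only that a file starts like the claimed format, not that it is well formed or standards-compliant.&lt;br /&gt;

```python
from pathlib import Path

# Illustrative, non-exhaustive table of leading byte signatures.
SIGNATURES = {
    ".pdf": [b"%PDF-"],
    ".doc": [bytes.fromhex("D0CF11E0A1B11AE1")],  # OLE2 compound file
    ".docx": [b"PK\x03\x04"],                      # ZIP container
}


def extension_matches_signature(name: str, head: bytes):
    """Return True or False for known extensions, None if the extension is unknown."""
    expected = SIGNATURES.get(Path(name).suffix.lower())
    if expected is None:
        return None
    return any(head.startswith(sig) for sig in expected)


def check_file(path: str):
    """Read the first bytes of a file on disk and compare them with its extension."""
    with open(path, "rb") as handle:
        return extension_matches_signature(path, handle.read(8))
```

A result of False flags a mismatch worth a closer look at ingest; a result of True says nothing about the quality of the writer that produced the file.&lt;br /&gt;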
&lt;br /&gt;
There is a vendor looking to create an ISO-compliant PDF validator which could be used by repositories to validate incoming PDF files (still years away)&lt;br /&gt;
&lt;br /&gt;
Ability to embed XML allows both human- and machine-readable versions of a document in the same container&lt;br /&gt;
   Useful for automated ingest and presentation by repositories&lt;br /&gt;
&lt;br /&gt;
Question arises which of the two versions (PDF or XML) is the authoritative version&lt;br /&gt;
-Use-case: PACER system for courtroom audio stored as MP3 and packaged in PDF/A-3&lt;br /&gt;
--PDF/A-3 acts as the metadata/manifest for the embedded recording&lt;br /&gt;
--Recordings are access copies, not high-quality originals for long-term preservation/archiving&lt;br /&gt;
--These are temp records, disposed of after 5 years&lt;br /&gt;
&lt;br /&gt;
I think I missed another use case here…&lt;/div&gt;</summary>
		<author><name>Chris Dietrich</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:PDF_Exploration&amp;diff=4971</id>
		<title>NDSA:PDF Exploration</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:PDF_Exploration&amp;diff=4971"/>
		<updated>2013-01-07T21:44:01Z</updated>

		<summary type="html">&lt;p&gt;Chris Dietrich: /* PDF/A-3 Use Case Scenarios */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Back to [[NDSA:Standards_and_Best_Practices_Working_Group | Standards Working Group Main Page]]&lt;br /&gt;
&lt;br /&gt;
==Title of Activity or Project==&lt;br /&gt;
NDSA PDF/A-3 Scoping Project&lt;br /&gt;
&lt;br /&gt;
==One Sentence Description:==&lt;br /&gt;
NDSA PDF/A-3 Scoping Project working group members will research the pros and cons of using the PDF/A-3 standard as an all-purpose wrapper for various digital asset/media types including: textual, audio, video, photo, and GIS data.&lt;br /&gt;
&lt;br /&gt;
==Statement of the Problem and Goals for Addressing the Problem:==&lt;br /&gt;
It is unclear whether PDF/A-3, which was designed to accommodate supplementary media files for text documents, is appropriate as a de facto normalization wrapper format for all media types. The goal is to develop guidelines for the appropriate use of PDF/A-3 with respect to different media types that include both detailed technical information and a practical quick reference guide for end-users.&lt;br /&gt;
&lt;br /&gt;
==Strategic Value of Activity:==&lt;br /&gt;
* Improve understanding of best practices for using PDF/A-3 in digital preservation activities&lt;br /&gt;
* Enhance consistency and improve long-term viability of digitally preserved content &lt;br /&gt;
* Provide guidance to those considering PDF/A-3 as a long-term archiving format&lt;br /&gt;
&lt;br /&gt;
==Required Resources:==&lt;br /&gt;
* Time of working group members&lt;br /&gt;
* Publishing venue(s)&lt;br /&gt;
* Communication channels&lt;br /&gt;
&lt;br /&gt;
==Roadmap:==&lt;br /&gt;
# Hold regular working group conference calls (monthly, between NDSA Standards WG calls) &lt;br /&gt;
# Draft document and review&lt;br /&gt;
# Invite broader NDSA member feedback&lt;br /&gt;
# Publish document (digitalpreservation.gov, others?)&lt;br /&gt;
&lt;br /&gt;
==Dissemination of Knowledge:==&lt;br /&gt;
* Publish report on digitalpreservation.gov&lt;br /&gt;
* Write a blog post&lt;br /&gt;
* Announce on NDSA member organization communication channels&lt;br /&gt;
* Present at conferences that members (and non-members?) are attending&lt;br /&gt;
&lt;br /&gt;
==Signifiers of Success and Outcomes:==&lt;br /&gt;
* Completed guidelines document published on digitalpreservation.gov&lt;br /&gt;
* Guidelines document referenced on related Wikipedia pages&lt;br /&gt;
* Guidelines in use or recommended by NDSA participating organizations or others&lt;br /&gt;
* Publication at other conferences/other journals&lt;br /&gt;
&lt;br /&gt;
==Questions to Ask and Answer==&lt;br /&gt;
*Talk about background (what is pdf/a-3 and how is it different from earlier versions of PDF/A)&lt;br /&gt;
*Iterate categories of materials/use cases/concrete examples where it makes sense to use A-3 and other categories where it doesn&#039;t make sense. Example: if you&#039;re sending a video file don&#039;t put it in a PDF! If you had a certain kind of a journal article that had a static version of the spreadsheet in the doc but a malleable version embedded perhaps that argues for it. &lt;br /&gt;
*Risks to the format (scenarios in why this might be bad and why)&lt;br /&gt;
*Possibilities of the format (scenarios in why this might be good and why)&lt;br /&gt;
*Have list of defined terms in our document. How do these relate to the terms in the ISO spec? Leverage NDSA Levels of Preservation glossary. Link to glossary.&lt;br /&gt;
&lt;br /&gt;
==PDF/A-3 Use Case Scenarios==&lt;br /&gt;
Add them here! We can create a separate page as necessary. &lt;br /&gt;
----&lt;br /&gt;
Example:  Federal agency with a document management system puts an MPEG video file (and nothing else) into a PDF/A-3 file to store and then, later, to submit as an SIP (Submission Information Package) to NARA for long-term management.&lt;br /&gt;
&lt;br /&gt;
Example: Publisher has a text-only article and puts it into a PDF/A-3 file, even though, in the past, the publisher used PDF/A-2.  The article is then sent to library where it will be preserved for the long term.&lt;br /&gt;
&lt;br /&gt;
Example: Publisher has an article that includes a complicated table, &amp;quot;frozen&amp;quot; in place, and puts it into a PDF/A-3 file, along with the Excel file from which the table was generated, in order to make it easier for a future researcher to have a malleable version of the table for use when writing another article on the same subject.&lt;br /&gt;
&lt;br /&gt;
Example: Data creator has a digital map, a report, a database, digital photos, and detailed metadata that comprise a whole and wants to archive these together for the long-term.&lt;br /&gt;
&lt;br /&gt;
==Members==&lt;br /&gt;
*Caroline Arms, Library of Congress (caar@loc.gov)&lt;br /&gt;
*Don Chalfant, NARA (Donald.Chalfant@nara.gov)&lt;br /&gt;
*Kevin DeVorsey, NARA (Kevin.DeVorsey@nara.gov)&lt;br /&gt;
*Chris Dietrich, National Park Service (chris_dietrich@nps.gov)&lt;br /&gt;
*Carl Fleischhauer, Library of Congress (cfle@loc.gov)&lt;br /&gt;
*Butch Lazorchak, Library of Congress (wlaz@loc.gov)&lt;br /&gt;
*Sheila Morrissey, Ithaka (Sheila.Morrissey@ithaka.org)&lt;br /&gt;
*Kate Murray, NARA (Kate.Murray1@nara.gov)&lt;br /&gt;
&lt;br /&gt;
==Calls and Notes==&lt;br /&gt;
&lt;br /&gt;
Call information:&lt;br /&gt;
&lt;br /&gt;
*Call-in toll-free number (US/Canada):    866-469-3239 &lt;br /&gt;
*Participant access code:          	21408589 &lt;br /&gt;
&lt;br /&gt;
Next call: Tuesday Jan. 22, 2013, 2:00 P.M.&lt;br /&gt;
&lt;br /&gt;
==Background Materials==&lt;br /&gt;
&lt;br /&gt;
*[http://www.digitalpreservation.gov:8081/formats/fdd/fdd000360.shtml Library of Congress Sustainability of Digital Formats DRAFT PDF/A-3 site]&lt;br /&gt;
*[http://blogs.loc.gov/digitalpreservation/2012/11/all-in-embedded-files-in-pdfa/ Blog Post on PDF/A-3 on the Signal]&lt;br /&gt;
*[[NDSA:Media: TheNetworkIsTheFormat.pdf | Sheila M. Morrissey, The Network is the Format: PDF and the Long-term Use of Digital Content, Archiving 2012, pg. 200-203 (2012)]]&lt;br /&gt;
*[[NDSA:Media: CommentsOnISO19005-3_smorrissey.pdf | Ithaka comments on ISO 19005-3 draft]]&lt;br /&gt;
*Caroline&#039;s document &lt;br /&gt;
*In future set up calls with Steve Levenson (U.S. Courts) and Leonard Rosenthal (Adobe)&lt;/div&gt;</summary>
		<author><name>Chris Dietrich</name></author>
	</entry>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:PDF_Exploration&amp;diff=4970</id>
		<title>NDSA:PDF Exploration</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:PDF_Exploration&amp;diff=4970"/>
		<updated>2013-01-07T21:39:10Z</updated>

		<summary type="html">&lt;p&gt;Chris Dietrich: /* PDF/A-3 Use Case Scenarios */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Back to [[NDSA:Standards_and_Best_Practices_Working_Group | Standards Working Group Main Page]]&lt;br /&gt;
&lt;br /&gt;
==Title of Activity or Project==&lt;br /&gt;
NDSA PDF/A-3 Scoping Project&lt;br /&gt;
&lt;br /&gt;
==One Sentence Description:==&lt;br /&gt;
NDSA PDF/A-3 Scoping Project working group members will research the pros and cons of using the PDF/A-3 standard as an all-purpose wrapper for various digital asset/media types including: textual, audio, video, photo, and GIS data.&lt;br /&gt;
&lt;br /&gt;
==Statement of the Problem and Goals for Addressing the Problem:==&lt;br /&gt;
It is unclear whether PDF/A-3, which was designed to accommodate supplementary media files for text documents, is appropriate as a de facto normalization wrapper format for all media types. The goal is to develop guidelines for the appropriate use of PDF/A-3 with respect to different media types that include both detailed technical information and a practical quick reference guide for end-users.&lt;br /&gt;
&lt;br /&gt;
==Strategic Value of Activity:==&lt;br /&gt;
* Improve understanding of best practices for using PDF/A-3 in digital preservation activities&lt;br /&gt;
* Enhance consistency and improve long-term viability of digitally preserved content &lt;br /&gt;
* Provide guidance to those considering PDF/A-3 as a long-term archiving format&lt;br /&gt;
&lt;br /&gt;
==Required Resources:==&lt;br /&gt;
* Time of working group members&lt;br /&gt;
* Publishing venue(s)&lt;br /&gt;
* Communication channels&lt;br /&gt;
&lt;br /&gt;
==Roadmap:==&lt;br /&gt;
# Hold regular working group conference calls (monthly, between NDSA Standards WG calls) &lt;br /&gt;
# Draft document and review&lt;br /&gt;
# Invite broader NDSA member feedback&lt;br /&gt;
# Publish document (digitalpreservation.gov, others?)&lt;br /&gt;
&lt;br /&gt;
==Dissemination of Knowledge:==&lt;br /&gt;
* Publish report on digitalpreservation.gov&lt;br /&gt;
* Write a blog post&lt;br /&gt;
* Announce on NDSA member organization communication channels&lt;br /&gt;
* Present at conferences that members (and non-members?) are attending&lt;br /&gt;
&lt;br /&gt;
==Signifiers of Success and Outcomes:==&lt;br /&gt;
* Completed guidelines document published on digitalpreservation.gov&lt;br /&gt;
* Guidelines document referenced on related Wikipedia pages&lt;br /&gt;
* Guidelines in use or recommended by NDSA participating organizations or others&lt;br /&gt;
* Publication at other conferences/other journals&lt;br /&gt;
&lt;br /&gt;
==Questions to Ask and Answer==&lt;br /&gt;
*Talk about background (what is pdf/a-3 and how is it different from earlier versions of PDF/A)&lt;br /&gt;
*Iterate categories of materials/use cases/concrete examples where it makes sense to use A-3 and other categories where it doesn&#039;t make sense. Example: if you&#039;re sending a video file don&#039;t put it in a PDF! If you had a certain kind of a journal article that had a static version of the spreadsheet in the doc but a malleable version embedded perhaps that argues for it. &lt;br /&gt;
*Risks to the format (scenarios in why this might be bad and why)&lt;br /&gt;
*Possibilities of the format (scenarios in why this might be good and why)&lt;br /&gt;
*Have list of defined terms in our document. How do these relate to the terms in the ISO spec? Leverage NDSA Levels of Preservation glossary. Link to glossary.&lt;br /&gt;
&lt;br /&gt;
==PDF/A-3 Use Case Scenarios==&lt;br /&gt;
Add them here! We can create a separate page as necessary. &lt;br /&gt;
----&lt;br /&gt;
Example:  Federal agency with a document management system puts an MPEG video file (and nothing else) into a PDF/A-3 file to store and then, later, to submit as an SIP (Submission Information Package) to NARA for long-term management.&lt;br /&gt;
&lt;br /&gt;
Example: Publisher has a text-only article and puts it into a PDF/A-3 file, even though, in the past, the publisher used PDF/A-2.  The article is then sent to library where it will be preserved for the long term.&lt;br /&gt;
&lt;br /&gt;
Example: Publisher has an article that includes a complicated table, &amp;quot;frozen&amp;quot; in place, and puts it into a PDF/A-3 file, along with the Excel file from which the table was generated, in order to make it easier for a future researcher to have a malleable version of the table for use when writing another article on the same subject.&lt;br /&gt;
&lt;br /&gt;
==Members==&lt;br /&gt;
*Caroline Arms, Library of Congress (caar@loc.gov)&lt;br /&gt;
*Don Chalfant, NARA (Donald.Chalfant@nara.gov)&lt;br /&gt;
*Kevin DeVorsey, NARA (Kevin.DeVorsey@nara.gov)&lt;br /&gt;
*Chris Dietrich, National Park Service (chris_dietrich@nps.gov)&lt;br /&gt;
*Carl Fleischhauer, Library of Congress (cfle@loc.gov)&lt;br /&gt;
*Butch Lazorchak, Library of Congress (wlaz@loc.gov)&lt;br /&gt;
*Sheila Morrissey, Ithaka (Sheila.Morrissey@ithaka.org)&lt;br /&gt;
*Kate Murray, NARA (Kate.Murray1@nara.gov)&lt;br /&gt;
&lt;br /&gt;
==Calls and Notes==&lt;br /&gt;
&lt;br /&gt;
Call information:&lt;br /&gt;
&lt;br /&gt;
*Call-in toll-free number (US/Canada):    866-469-3239 &lt;br /&gt;
*Participant access code:          	21408589 &lt;br /&gt;
&lt;br /&gt;
Next call: Tuesday Jan. 22, 2013, 2:00 P.M.&lt;br /&gt;
&lt;br /&gt;
==Background Materials==&lt;br /&gt;
&lt;br /&gt;
*[http://www.digitalpreservation.gov:8081/formats/fdd/fdd000360.shtml Library of Congress Sustainability of Digital Formats DRAFT PDF/A-3 site]&lt;br /&gt;
*[http://blogs.loc.gov/digitalpreservation/2012/11/all-in-embedded-files-in-pdfa/ Blog Post on PDF/A-3 on the Signal]&lt;br /&gt;
*[[NDSA:Media: TheNetworkIsTheFormat.pdf | Sheila M. Morrissey, The Network is the Format: PDF and the Long-term Use of Digital Content, Archiving 2012, pg. 200-203 (2012)]]&lt;br /&gt;
*[[NDSA:Media: CommentsOnISO19005-3_smorrissey.pdf | Ithaka comments on ISO 19005-3 draft]]&lt;br /&gt;
*Caroline&#039;s document &lt;br /&gt;
*In future set up calls with Steve Levenson (U.S. Courts) and Leonard Rosenthal (Adobe)&lt;/div&gt;</summary>
		<author><name>Chris Dietrich</name></author>
	</entry>
</feed>