<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.diglib.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=SheilaM</id>
	<title>DLF Wiki - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.diglib.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=SheilaM"/>
	<link rel="alternate" type="text/html" href="https://wiki.diglib.org/Special:Contributions/SheilaM"/>
	<updated>2026-05-10T18:55:13Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.44.0</generator>
	<entry>
		<id>https://wiki.diglib.org/index.php?title=NDSA:PDF_Exploration&amp;diff=5005</id>
		<title>NDSA:PDF Exploration</title>
		<link rel="alternate" type="text/html" href="https://wiki.diglib.org/index.php?title=NDSA:PDF_Exploration&amp;diff=5005"/>
		<updated>2013-03-24T21:39:36Z</updated>

		<summary type="html">&lt;p&gt;SheilaM: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Back to [[NDSA:Standards_and_Best_Practices_Working_Group | Standards Working Group Main Page]]&lt;br /&gt;
&lt;br /&gt;
==Title of Activity or Project==&lt;br /&gt;
NDSA PDF/A-3 Scoping Project&lt;br /&gt;
&lt;br /&gt;
==One Sentence Description:==&lt;br /&gt;
NDSA PDF/A-3 Scoping Project working group members will research the pros and cons of using the PDF/A-3 standard as an all-purpose wrapper in different preservation scenarios, including use as an extension to PDF/A-1 and PDF/A-2 in circumstances for which those formats have been adopted or recommended and use as a wrapper for various digital asset/media types, such as textual, audio, video, photo, and GIS data.&lt;br /&gt;
&lt;br /&gt;
==Statement of the Problem and Goals for Addressing the Problem:==&lt;br /&gt;
The single extension to PDF/A-2 in PDF/A-3 is the ability to embed files of any type within a PDF/A document.  &lt;br /&gt;
PDF/A-3 was designed to accommodate supplementary media files for text documents. Issues raised by this extension include:&lt;br /&gt;
&lt;br /&gt;
* Is PDF/A-3 appropriate as a de facto normalization wrapper format for some or all media types or in particular circumstances?&lt;br /&gt;
* For circumstances where PDF/A-2 has already been deemed an appropriate preservation format (primarily for textual documents), what are the risks and opportunities offered by the ability to embed content in non-PDF formats?&lt;br /&gt;
&lt;br /&gt;
The goal is to develop guidelines for the appropriate use of PDF/A-3 with respect to different scenarios that include both detailed technical information and a practical quick reference guide for end-users.&lt;br /&gt;
&lt;br /&gt;
==Strategic Value of Activity:==&lt;br /&gt;
* Improve understanding of best practices for using PDF/A-3 in digital preservation activities&lt;br /&gt;
* Enhance consistency and improve long-term viability of digitally preserved content &lt;br /&gt;
* Provide guidance to those considering PDF/A-3 as a long-term archiving format&lt;br /&gt;
&lt;br /&gt;
==Required Resources:==&lt;br /&gt;
* Time of working group members&lt;br /&gt;
* Publishing venue(s)&lt;br /&gt;
* Communication channels&lt;br /&gt;
&lt;br /&gt;
==Roadmap:==&lt;br /&gt;
# Hold regular working group conference calls (monthly, between NDSA Standards WG calls) &lt;br /&gt;
# Draft document and review&lt;br /&gt;
# Invite broader NDSA member feedback&lt;br /&gt;
# Publish document (digitalpreservation.gov, others?)&lt;br /&gt;
&lt;br /&gt;
==Dissemination of Knowledge:==&lt;br /&gt;
* Publish report on digitalpreservation.gov&lt;br /&gt;
* Write a blog post&lt;br /&gt;
* Announce on NDSA member organization communication channels&lt;br /&gt;
* Present at conferences that members (and non-members?) are attending&lt;br /&gt;
&lt;br /&gt;
==Signifiers of Success and Outcomes:==&lt;br /&gt;
* Completed guidelines document published on digitalpreservation.gov&lt;br /&gt;
* Guidelines document referenced on related Wikipedia pages&lt;br /&gt;
* Guidelines referenced in FDD (format description document) for PDF/A-3 [http://www.digitalpreservation.gov/formats/fdd/fdd000360.shtml]&lt;br /&gt;
* Guidelines in use or recommended by NDSA participating organizations or others&lt;br /&gt;
* Publication at other conferences/other journals&lt;br /&gt;
&lt;br /&gt;
==Questions to Ask and Answer==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
A [https://docs.google.com/document/d/1cZ1x2jzaoVzclqV0nkdHoX5LtQJVr3o2ni-fwi86LHM/edit Google doc] has been set up to provide an environment for shared work. All group members should be able to edit the document, but if you have trouble drop a note to Butch. March 22 has been set as the deadline for the first draft of each person&#039;s section.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Talk about background (what is pdf/a-3 and how is it different from earlier versions of PDF/A)(Butch, plus Caroline&#039;s 2-pager)&lt;br /&gt;
*Iterate categories of materials/use cases/concrete examples where it makes sense to use A-3 and other categories where it doesn&#039;t make sense. Example: if you&#039;re sending a video file don&#039;t put it in a PDF! If you had a certain kind of a journal article that had a static version of the spreadsheet in the doc but a malleable version embedded perhaps that argues for it. (Don, Kevin, Kate)&lt;br /&gt;
*Risks to the format (scenarios in why this might be bad and why) (Sheila)&lt;br /&gt;
*Possibilities of the format (scenarios in why this might be good and why) (Chris)&lt;br /&gt;
*Have list of defined terms in our document. How do these relate to the terms in the ISO spec. Leverage NDSA Levels of Preservation glossary. Link to glossary.&lt;br /&gt;
&lt;br /&gt;
==PDF/A-3 Use Case Scenarios==&lt;br /&gt;
&lt;br /&gt;
A Template might include:&lt;br /&gt;
*Actors&lt;br /&gt;
*Actions&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Example:  Federal agency with a document management system puts an MPEG video file (and nothing else) into a PDF/A-3 file to store and then, later, to submit as an SIP (Submission Information Package) to NARA for long-term management.&lt;br /&gt;
&lt;br /&gt;
*Example: Publisher has a text-only article and puts it into a PDF/A-3 file, even though, in the past, the publisher used PDF/A-2.  The article is then sent to library where it will be preserved for the long term.&lt;br /&gt;
&lt;br /&gt;
*Example: Publisher has an article that includes a complicated table, &amp;quot;frozen&amp;quot; in place, and puts it into a PDF/A-3 file, along with the Excel file from which the table was generated, in order to make it easier for a future researcher to have a malleable version of the table for use when writing another article on the same subject.&lt;br /&gt;
&lt;br /&gt;
*Example: Data creator has a digital map, a report, a database, digital photos, and detailed metadata that comprise a whole and wants to archive these together for the long-term.&lt;br /&gt;
&lt;br /&gt;
*Example from Luratech Webinar used to show primary intent of PDF/A-3:  PDF/A document with diagram based on data, with embedded spreadsheet associated with diagram, metadata associated with subsection of document, source word-processing file, and audio rendering of the document (perhaps for accessibility).&lt;br /&gt;
&lt;br /&gt;
*See case #1 from Luratech Webinar:  Scanned documents, with the scanned image as the main PDF/A content, with native metadata in XML embedded.  &lt;br /&gt;
&lt;br /&gt;
*Use case #2 from Luratech Webinar:  &amp;quot;Hybrid archiving&amp;quot; used when document in its active life cycle, further versions might be created.  Create PDF/A-3 for archive-ready rendition and embed the document in its native (e.g., word-processor) format.  Built in to a standard workflow, this would leave documents &amp;quot;archive ready&amp;quot; at all times.&lt;br /&gt;
&lt;br /&gt;
*Use case #3 from Luratech Webinar: Human-readable invoice with embedded data marked up in CEN Core Invoice Standard (XML).&lt;br /&gt;
&lt;br /&gt;
==Members==&lt;br /&gt;
*Caroline Arms, Library of Congress (caar@loc.gov)&lt;br /&gt;
*Don Chalfant, NARA (Donald.Chalfant@nara.gov)&lt;br /&gt;
*Kevin DeVorsey, NARA (Kevin.DeVorsey@nara.gov)&lt;br /&gt;
*Chris Dietrich, National Park Service (chris_dietrich@nps.gov)&lt;br /&gt;
*Carl Fleischhauer, Library of Congress (cfle@loc.gov)&lt;br /&gt;
*Butch Lazorchak, Library of Congress (wlaz@loc.gov)&lt;br /&gt;
*Sheila Morrissey, Ithaka (Sheila.Morrissey@ithaka.org)&lt;br /&gt;
*Kate Murray, NARA (Kate.Murray1@nara.gov)&lt;br /&gt;
&lt;br /&gt;
==Calls and Notes==&lt;br /&gt;
&lt;br /&gt;
Call information:&lt;br /&gt;
&lt;br /&gt;
*Call-in toll-free number (US/Canada):    866-469-3239 &lt;br /&gt;
*Participant access code:          	21408589 &lt;br /&gt;
&lt;br /&gt;
Next call: March 25, 2013 at 11:00 a.m. ET&lt;br /&gt;
&lt;br /&gt;
Notes:&lt;br /&gt;
&lt;br /&gt;
[[NDSA:February 19, 2013 Call]]&lt;br /&gt;
&lt;br /&gt;
[[NDSA:January 22, 2013 Call]]&lt;br /&gt;
&lt;br /&gt;
==Background Materials==&lt;br /&gt;
&lt;br /&gt;
*[http://www.digitalpreservation.gov:8081/formats/fdd/fdd000360.shtml Library of Congress Sustainability of Digital Formats DRAFT PDF/A-3 format description document (FDD)]  COMMENTS PLEASE to caar@loc.gov and cfle@loc.gov&lt;br /&gt;
*[http://blogs.loc.gov/digitalpreservation/2012/11/all-in-embedded-files-in-pdfa/ Blog Post on PDF/A-3 on the Signal]&lt;br /&gt;
*[http://www.portico.org/digital-preservation/wp-content/uploads/2012/12/Archiving2012TheNetworkIsTheFormat.pdf Sheila M. Morrissey, The Network is the Format: PDF and the Long-term Use of Digital Content, Archiving 2012, pg. 200-203 (2012)]&lt;br /&gt;
*[[NDSA:Media: CommentsOnISO19005-3_smorrissey.pdf | Ithaka comments on ISO 19005-3 draft]]&lt;br /&gt;
*[[NDSA:Media: PDFA3-crathoughts_20121126.doc | Caroline&#039;s thoughts on PDF/A-3 circulated in late November, 2012]]&lt;br /&gt;
*[http://www.youtube.com/watch?v=g-tJRSsZHyc Video of Webinar by Luratech on PDF/A-3]   Nov 8, 2012.  Includes uses cases and demos.&lt;br /&gt;
*[[NDSA:Media: Luratech-PDFA3-Webinar-ENG.pdf | Slides used for Luratech Webinar]]   Nov 8, 2012.  Includes uses cases and demos.  Do not distribute.&lt;br /&gt;
*[http://www.dpconline.org/events/details/55-DPC_PDFA3_briefing?xref=58 Digital Preservation Coalition (DPC) 2013-03-13 Workshop on PDF/A-3] Includes links to pressenters slides and to William Kilbride&#039;s comments&lt;br /&gt;
&lt;br /&gt;
*Unofficial XMP notes from 2011 explorations by Caroline Arms -- Please do not distribute&lt;br /&gt;
**[[NDSA:Media: XMPbackground_20111130_cra.pdf‎ | Notes on XMP and tools available to LC]]&lt;br /&gt;
**[[NDSA:Media: XMPexplore_20111209_cra.pdf‎ | Summary of exploration of XMP use external to LC]]&lt;br /&gt;
&lt;br /&gt;
==Possible future actions==&lt;br /&gt;
*Once charter is reviewed by main NDSA Standards Group, extend participation call to &lt;br /&gt;
*Set up calls with Steve Levinson (U.S. Courts) and Leonard Rosenthal (Adobe)&lt;br /&gt;
*Extend invitation to join beyond active NDSA participants, e.g. to LC staff involved in Best Edition choices.&lt;/div&gt;</summary>
		<author><name>SheilaM</name></author>
	</entry>
</feed>