« 10 delicious links | Main | Fax signatures - how secure are they? »

May 05, 2008

Comments

Feed You can follow this conversation by subscribing to the comment feed for this post.

Bryan Sims

Why would anyone convert a document to PDF simply to print it and then scan it in again.

Scanning a paper document would likely eliminate the metadata concerns that the person has. A metadata removal program will do the same. Also, it will do it much more easily.

Converting to PDF (without scanning) will likely remove most of the metadata concerns (depending on what your concerns are). Keep in mind, however, that, depending on your settings, you can convert things such as comments in a Word document into the final PDF that you create.

Curtis Carmack

One thing you can do to appease her is to have her use a metadata stripping tool first before converting the doc to PDF. Try Doc Scrubber for this (http://www.javacoolsoftware.com/docscrubber/index.html). It is quite effective.

Leonard Rosenthol

Actually, the whether metadata is included in the conversion process of Word->PDF depends on what software you use and its settings.

For example, the PDFMakers of Adobe Acrobat (the toolbar buttons that we add to Office) will, by default, COPY the metadata from the Word file to the PDF - because that's what the average user expects - the highest fidelity conversion. However, you can certainly turn that option off in its settings.

In addition, Adobe Acrobat includes an Examine Document feature that enables you to check a PDF for any metadata AND other potentially "problematic" things (hidden text, scripts, etc.) and remove them before sending the document out of house.

Leonard Rosenthol
Adobe Systems

yclipse

I did some analysis of this issue some time back. The idea that "metadata" can be transmitted from Word to PDF when the PDF is created has some basis in truth, but the danger (as described by some authors) is overstated. It boils down to what is meant by the word.

Essentially, the standard identification metadata that appears in MS documents created in Word and Excel (and perhaps in other apps) can appear in the PDF file after it is created. This is the information that appears when you choose File | Properties, and includes such things as Title, Author, Subject, Keywords. These data can even be passed along when the PDF is created by a third-party program like pdfFactory. (It apparently only happens with MS apps, by the way.) This "metadata" is relatively innocuous, and can be changed at any rate if there is something unwanted before the PDF is generated.

The problem is that the word "metadata" is also used to refer to such undesirable stuff as deleted text, file comments, etc. There is no evidence that this type of data is passed along to the PDF when it is created.

Where I think some authors have gone wrong is in failing to make the distinction between these different concepts of "metadata" when discussing this issue. As a result they have unnecessarily injected a lot of fear into this area.

Johnette Hassell

There is a tool named Metadata Assistant that will remove metadata from any Office document, that is the Office metadata. It does not remove the system metadata such as the access, creation, and modification times-but neither does creating a .pdf version.

The idea of creating a .pdf document, printing it, and scanning in the printed document is not a good one. First of all it wastes resources. Secondly, if this is in a document production and the second .pdf document is not searchable, it will be in violation of the e-Discovery amendments to the FRCP.

Finally, again in document production, the courts are beginning to hold that the metadata is discoverable under the e-Discovery amendments.

jason

just instead of printing and scanning it again, just print it to a new pdf from adobe, saves a step and no metadata ........... someone please verify

Verify your Comment

Previewing your Comment

This is only a preview. Your comment has not yet been posted.

Working...
Your comment could not be posted. Error type:
Your comment has been saved. Comments are moderated and will not appear until approved by the author. Post another comment

The letters and numbers you entered did not match the image. Please try again.

As a final step before posting your comment, enter the letters and numbers you see in the image below. This prevents automated programs from posting comments.

Having trouble reading this image? View an alternate.

Working...

Post a comment

Comments are moderated, and will not appear until the author has approved them.

Twitter Updates

    follow me on Twitter