Mark McKinney
President, LuraTech, Inc.
www.pdfa.org
PDF/A for Scanned Documents
Paper Becomes Digital
Webinar
Mark McKinney, LuraTech, Inc., President
Armin Ortmann, LuraTech, CTO
© 2009  PDF/A Competence Center, www.pdfa.org
Existing Solutions for Scanned Documents
Black & White: TIFF G4
Color: Mostly JPEG, but sometimes PNG, BMP and other raster graphics formats
Often special variants such as “JPEG in TIFF”
Disadvantages:
Several formats already in use for scanned documents
Even more formats for born-digital documents
Loss of information, e.g. with TIFF G4
Bad image quality or huge file size, e.g. with JPEG
No standardized metadata across the formats
No full-text search (OCR) inside the files
Black/White: TIFF FAX G4
Color: TIFF, TIFF LZW, JPEG, PDF
Existing Solutions for Scanned Documents
Bad image quality vs. file size
TIFF/BMP: 23.8 MB
JPEG: 180 kB
TIFF G4: 60 kB
Alternative Solution: PDF
PDF is already widely used to:
Unify file formats
Image → PDF
“Office” Documents → PDF
Other sources → PDF
Create full-text searchable files
Apply modern compression technology (e.g. the JPEG2000 family of file formats)
Harmonize metadata
Conclusion:
PDF avoids the disadvantages of the legacy formats
“So if you are already using PDF as archival 
format, why not use PDF/A with its many 
advantages?”
PDF/A
What is PDF/A?
• ISO 19005-1, Document Management
• Electronic document file format for 
long-term preservation
Goals of PDF/A:
• Maintain static visual representation 
of documents
• Consistent handling of metadata
• Option to maintain structure and 
semantic meaning of content
• Transparency to guarantee access
• Limit the number of restrictions
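To make the metadata goal concrete: PDF/A stores its metadata as XMP, and a file declares its PDF/A part and conformance level through the `pdfaid:part` and `pdfaid:conformance` properties. The following is a minimal sketch that builds only that identification fragment with the Python standard library. The function name `pdfa_identification` is hypothetical, and a real XMP packet needs more than this (the packet wrapper and the document's other metadata schemas), so treat it as an illustration rather than a complete implementation.

```python
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
PDFAID_NS = "http://www.aiim.org/pdfa/ns/id/"


def pdfa_identification(part: int, conformance: str) -> str:
    """Build the RDF fragment that identifies a file as PDF/A.

    Hypothetical helper for illustration: it returns only the
    identification properties, not a full XMP packet.
    """
    ET.register_namespace("rdf", RDF_NS)
    ET.register_namespace("pdfaid", PDFAID_NS)

    rdf = ET.Element(f"{{{RDF_NS}}}RDF")
    description = ET.SubElement(rdf, f"{{{RDF_NS}}}Description")
    description.set(f"{{{RDF_NS}}}about", "")

    # pdfaid:part is the PDF/A part number, pdfaid:conformance the level.
    ET.SubElement(description, f"{{{PDFAID_NS}}}part").text = str(part)
    ET.SubElement(description, f"{{{PDFAID_NS}}}conformance").text = conformance

    return ET.tostring(rdf, encoding="unicode")


# Example: identification for PDF/A-1 at conformance level B.
print(pdfa_identification(1, "B"))
```

Because every PDF/A file carries the same identification properties, an archive can check them uniformly regardless of whether the file started as a scan or as a born-digital document.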
PDF/A – Full-Text Searchability (OCR)
Benefit: Searchable at the File Level
Digital Library - “after book download”
Large Manuals / Multi-Page Construction Documents
Downloaded Documents from Archive Databases
Documents sent to customers, suppliers, lawyers, 
etc. as email attachments
PDF/A – Enhanced Compression
For Black & White Documents
JBIG2 - ISO/IEC 14492
Used as an alternative to TIFF G4
Fully lossless and visually lossless modes
Embedded in PDF/A, available in Acrobat Reader
FAX G4: 60 kB
JBIG2/lossless: 46 kB
JBIG2/lossy: 29 kB
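The size comparison above can be turned into relative savings with a few lines of arithmetic. This sketch uses only the three file sizes quoted on the slide; the helper name `reduction_percent` is an assumption made for illustration.

```python
# File sizes in kB as quoted on the slide for the same black-and-white page.
sizes_kb = {"FAX G4": 60, "JBIG2/lossless": 46, "JBIG2/lossy": 29}


def reduction_percent(baseline_kb: float, size_kb: float) -> float:
    """Return how much smaller size_kb is than baseline_kb, in percent."""
    return (baseline_kb - size_kb) / baseline_kb * 100.0


baseline = sizes_kb["FAX G4"]
for name, size in sizes_kb.items():
    saving = reduction_percent(baseline, size)
    print(f"{name}: {size} kB ({saving:.1f}% smaller than FAX G4)")
```

For this sample page, lossless JBIG2 comes out roughly a quarter smaller than FAX G4, and lossy JBIG2 roughly half the size. These figures describe one example document and will vary with content.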
PDF/A – Enhanced Compression
For Color Documents
MRC (Mixed Raster Content) compression, as used in JPEG2000 Part 6 (JPM)
Splits documents into three layers that are compressed independently and stored in PDF/A
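The three-layer idea can be illustrated with a toy sketch. It assumes the usual MRC layer model (a binary mask, a foreground layer and a background layer) and uses a plain brightness threshold to find text pixels. That threshold, the function names and the single-color foreground are simplifications chosen for illustration; production codecs use far more sophisticated segmentation and then compress each layer with a coder suited to it.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]
Image = List[List[Pixel]]


def luminance(p: Pixel) -> float:
    """Approximate perceived brightness of an RGB pixel (0..255)."""
    r, g, b = p
    return 0.299 * r + 0.587 * g + 0.114 * b


def mean_color(pixels: List[Pixel], default: Pixel) -> Pixel:
    """Integer mean of a list of pixels, or default if the list is empty."""
    if not pixels:
        return default
    n = len(pixels)
    return tuple(sum(p[i] for p in pixels) // n for i in range(3))


def split_layers(image: Image, threshold: float = 128.0):
    """Split an image into (mask, foreground, background).

    mask       : 1 where the pixel is dark enough to count as text, else 0
    foreground : the text color (here one mean color for the whole page)
    background : the page with text pixels filled in by the mean paper color
    """
    mask = [[1 if luminance(p) < threshold else 0 for p in row] for row in image]
    text = [p for row, mrow in zip(image, mask) for p, m in zip(row, mrow) if m]
    paper = [p for row, mrow in zip(image, mask) for p, m in zip(row, mrow) if not m]
    fg_color = mean_color(text, (0, 0, 0))
    bg_fill = mean_color(paper, (255, 255, 255))
    foreground = [[fg_color for _ in row] for row in image]
    background = [
        [bg_fill if m else p for p, m in zip(row, mrow)]
        for row, mrow in zip(image, mask)
    ]
    return mask, foreground, background


def recombine(mask, foreground, background) -> Image:
    """Rebuild the page: take the foreground where mask is 1, else background."""
    return [
        [f if m else b for m, f, b in zip(mrow, frow, brow)]
        for mrow, frow, brow in zip(mask, foreground, background)
    ]


# Tiny 2x3 "scan": light paper with a few dark text pixels.
W, K = (250, 250, 245), (20, 20, 30)
page = [[W, K, W],
        [W, K, K]]
mask, fg, bg = split_layers(page)
restored = recombine(mask, fg, bg)
```

The point of the split is that each layer becomes easy to compress on its own: the mask is a crisp bitonal image that keeps text sharp, while the foreground and background contain only smooth color and can be stored at low resolution and quality without harming legibility.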
PDF/A – Enhanced Compression
For Color Documents
Extreme compression, fully legible
Saves the color and the visual quality
TIFF: 23.8 MB
JPEG: 180 kB
TIFF G4: 60 kB
PDF/A: 65 kB
PDF Compressor Basics: How it works
Convert scanned documents
Batch conversion (“unattended”)
Fully automated
[Diagram: Conversion and Optimization Process. Labels: Paper, Scanner, TIFF / JPEG / PDF, LuraDocument PDF Compressor, Storage / ECM, Network / Workflow]