44
PDF Compresor Enterprise – Manual
www.luratech.com
info@luratech.com
Setting up Job Entries: 61
c. high quality switches foreground sub sampling off. Additionally a higher foreground
image compression ratio is used to limit the increase in file size. Color reproduction
for text and lines is further improved. This option may be used to work around
problems with PDF Viewer applications that do not render the standard sub sampled
foreground in full quality.
d. max quality results in maximum text and line color quality at significantly increased
output file size. (No foreground sub sampling and standard compression ratio)
3. There are two JBIG2 options: The first one selects from two different compression schemes:
a. High compression efficiency uses a sophisticated symbol matching algorithm that
obtains very high compression results, especially if the input image contains a lot of
similar patterns (like characters of a font). This option should normally be used.
b. Fast encoding disables the symbol matching, resulting in a slightly faster
compression, but also in larger file sizes.
The second option is only available for high compression efficiency:
c. Lossy encoding gives the smallest file sizes.
d. Lossless encoding gives larger file sizes, but does not change the text mask. Use this
option for 100% quality and to prevent symbol mismatches in case of low resolution
or low symbol quality.
4. Image Coder Options determine which image codec is used for color images embedded into
the layers of the created PDF documents (e.g. text color and background image layer when
page segmentation is applied):
a. JPEG 2000 selects the newer JPEG 2000 compression.
b. JPEG selects the older DCT-based JPEG compression.
c. Automatic determines the selection of JPEG 2000 or JPEG compression based on the
size (in pixels) of the encoded image. Using Adobe™ Reader™, the display of
document pages with large JPEG 2000 images can become very slow or even faulty,
depending on the user interaction such as zooming or panning. To overcome this
problem, the Automatic mode uses JPEG instead of JPEG 2000 compression
whenever the width or height of an image is greater than 5120 pixels.
Note: This option is not available for Reader™ 5 or PDF/A-1 compatibility. These
compatibility modes must use JPEG compression by definition.
Note: This option is not available for Reader™ 6, Reader™ 7, PDF/A-2, or PDF/A-3
compatibility when the Document Class is set to Photo with Quality 11 (lossless). This special
setting features a lossless compression for the page image. Since lossless compression is not
possible with JPEG, this mode always uses lossless JPEG 2000 compression.
5. Page Segmentation Options let you configure advanced options for the LuraTech page
segmentation. These options are only available when page segmentation is actually part of
the processing, i.e. for document classes MRC Standard or MRC Check.
Enable Use layout analysis to detect images on a page to improve the visual quality of a
page. Detected images are excluded from the segmentation process and stored in a single
37
PDF Compresor Enterprise – Manual
www.luratech.com
info@luratech.com
Setting up Job Entries: 62
layer. This gives images a better appearance without any disturbances by segmentation
artifacts. The resolution of the detected images can be controlled as follows:
a. Regular image resolution sets the image resolution identical to the resolution used
for the background image. This obtains the highest compression rates.
b. Increased image resolution sets the image resolution slightly higher than the regular
background resolution. Images become sharper, but file sizes slightly increase.
c. High image resolution sets the image resolution to the maximum possible value to
obtain the best image quality. Compared to increased image resolution the file sizes
are even larger, but still smaller compared to the document class Photo, where no
segmentation is used.
This option is only available in either of the MRC compression modes. The detection of
images on a page does not increase the processing time if OCR is enabled. If the compression
is done without OCR, the detection of images adds a certain amount to the overall
processing time. Enable activate only when using OCR to only use this feature when OCR is
enabled. (So there is no increase in processing time, but advanced image detection will not
be used for job list entries without OCR.)
Note: The PDF layers option present in older versions of PDF Compressor has been moved to
the Advanced Output Options dialog described under Configuring Advanced Output Options,
p. 30.
Note: The PDF input rasterization option present in older versions of PDF Compressor has
been moved to the Advanced Input Options dialog described under Configuring Advanced
PDF Input Options, p. 19.
Setting up the Default Properties
Each job list entry has its own properties settings. There is one additional set of properties that
is not related to any specific job list entry. This set is called Default Properties. They are used as a
template whenever you create a new entry in the list. You can open the Default Properties
dialog by any of the following actions:
Use the toolďar͛s default properties ďuttoŶ .
Use the menu Entry Edit Default Properties.
WheŶ aŶ eŶtrLJ͛s properties dialog is already opened: Unselect the currently selected
entry by clicking outside any entries region within the list window.
The Default Properties dialog is similar to an entries properties dialog. Only the few differences
are described here.
24
PDF Compresor Enterprise – Manual
www.luratech.com
info@luratech.com
Setting up Job Entries: 63
Default Properties has no name that can be edited.
The Start and Stop buttons are missing (there is no related job to run). Instead you can
reset all parameters to the factory default values.
Template String Syntax Description
Template strings are used for PDF metadata (see PDF Metadata, p. 33), headers and footers
(see Configuring Header and Footer, p. 54), input file renaming (see also Configuring Post-
Processing, p. 38), output file naming (see Configuring Data Output, p. 25), thumbnail file
naming (see Thumbnails, p. 34), data embedding (Configuring Data Embedding, p. 49), and file
embedding (Configuring File Embedding, p. 53).
Restrictions
Used in
Restriction
PDF Metadata
Must not contain %Q or its versions with modifiers.
Header and Footer
Must not contain %L, %N, %C, %V or its versions with modifiers.
Input file renaming
Must contain %F or its version with modifiers, or regular expression
substitution for F.
Must not contain %P, %Q, %L, %N, %C, %V or its versions with
66
PDF Compresor Enterprise – Manual
www.luratech.com
info@luratech.com
Setting up Job Entries: 64
modifiers.
Must not contain any characters disallowed for file names.
XMP metadata,
PDF bookmarks,
XML OCR results,
file attachments:
Must not contain %Q or its versions with modifiers. Must not contain
any characters disallowed for file names. May be constant strings
without any % escape sequences.
Output file naming
Must at least contain one of %P, %L, %C or its versions with modifiers
when page based output splitting is enabled.
Must at least contain one of %P, %L, %C, %V or its versions with
modifiers when barcode triggered output splitting is enabled.
Must not contain %Q or its versions with modifiers.
Must not contain any characters disallowed for file names.
Thumbnail
file
naming
Must not contain %L, %N or its versions with modifiers.
Must not contain any characters disallowed for file names.
Escape Sequences
Escape
Substitution
%%
Percent sign
%F
Input file (or directory) name.
File name for single file input processing, directory name when merging files
from a directory into a single PDF output file. The file name is printed without file
extension for PDF
%P
First page number that is output to the PDF file (metadata or output file name)
Current page number in output document (header, footer, or thumbnail output)
%Q
Current page number in output chunk
%L
Last page number that is output to the PDF file
%N
Number of pages output to the PDF file
%C
Number of PDF file when using output splitting
%V
Value of the last detected barcode
%a
Abbreviated weekday name
%A
Full weekday name
%b
Abbreviated month name
%B
Full month name
%c
Date and time representation appropriate for locale
%d
Day of month as decimal number (01 – 31)
%H
Hour in 24-hour format (00 – 23)
%I
Hour in 12-hour format (01 – 12)
%j
Day of year as decimal number (001 – 366)
71
PDF Compresor Enterprise – Manual
www.luratech.com
info@luratech.com
Setting up Job Entries: 65
%m
Month as decimal number (01 – 12)
%M
Minute as decimal number (00 – 59)
%p
urreŶt loĐale͛s A͘D͘/P͘D͘ iŶdiĐator for ϭϮ-hour clock
%S
Second as decimal number (00 – 59)
%U
Week of year as decimal number, with Sunday as first day of week (00 – 53)
%w
Weekday as decimal number (0 – 6; Sunday is 0)
%W
Week of year as decimal number, with Monday as first day of week (00 – 53)
%x
Date representation for current locale
%X
Time representation for current locale
%y
Year without century, as decimal number (00 – 99)
%Y
Year with century, as decimal number
%z, %Z
Time-zone name or abbreviation; no characters if time zone is unknown
%#c
Long date and time representation, appropriate for current locale. For example:
͞TuesdaLJ, DarĐh ϭϰ, ϭϵϵϱ, ϭϮ͗ϰϭ͗Ϯϵ͘͟
%#x
Long date representation, appropriate to current locale. For example: ͞TuesdaLJ,
DarĐh ϭϰ, ϭϵϵϱ͘͟
%[, %],
%?, %:
Used for conditional and regular expression substitutions. See
Conditional Substitution and Regular Expression Substitution, p. 66.
Moreover addiŶg the ĐharaĐter ͚#͛ to shortĐuts that output Ŷuŵďers, ǁill reŵoǀe leadiŶg zeros
in most cases.
Modifiers
Escape
Modifier
Function
%P
%Q
%L
%N
%C
%0<number>P
%0<number>Q
%0<number>L
%0<number>N
%0<number>C
Minimal number of output characters.
This modifier defines the number of preceding zeros
used to format a page number. The modifier starts
ǁith the digit ͞Ϭ͟ aŶd is folloǁed ďLJ oŶe or ŵore
digits, representing the minimal number of output
characters as a decimal number.
%F
%<number>-
<number>F
Defines a substring of the given file name
This modifier is composed by two integer numbers
separated ďLJ a ŵiŶus sigŶ ͚-͚͘ If the ŵodifier is preseŶt,
only a substring of the filename will be substituted.
The numbers represent the first and the last character
index (counting from 0) of the substring. Any number
greater or equal to the string length will be adjusted to
represent the last character of the string.
How to C#: Special Effects Erase. Set the image to current background color, the background color can be set by:ImageProcess.BackgroundColor = Color.Red. Encipher.
delete text pdf; delete text pdf document
62
PDF Compresor Enterprise – Manual
www.luratech.com
info@luratech.com
Setting up Job Entries: 66
Modifier Examples
Escape
Page Number
Substitution
%02P
1
01
%02P
9
09
%02P
10
10
%02P
100
100
Escape
File name
Substitution
%3-10F
01_myfile01_xyz.tif
myfile01.pdf
%3-99F
01_myfile01_xyz.tif
myfile01_xyz.pdf
%50-60
01_myfile01_xyz.tif
z.pdf
%10-3F
considered as an error
Examples
WheŶ a ϭϬ pages doĐuŵeŶt ͞DLJ&ile͘tif͟ is output iŶto ĐhuŶks of ϰ pages, the output file
template
͞%&, Part % , Pages %ϬϮP - %ϬϮL͟
will output the files
͞DLJ&ile, Part ϭ, Pages Ϭϭ - Ϭϰ͘pdf͞
͞DLJ&ile, Part Ϯ, Pages Ϭϱ - Ϭϴ͘pdf͞
͞DLJ&ile, Part ϯ, Pages 09 - ϭϬ͘pdf͞
Conditional Substitution
The following syntax can be used to make a template string substitution dependent from a
condition:
Condition Syntax
Substitution
%[<condition>%?<string1>%:<string2>%]
<string1> if <condition> is true, <string2>
otherwise
Currently the following conditions are defined:
<condition>
Description
C
True if the number of output chunks is greater than 1 or unknown. False
otherwise.
V
True if the value of the last detected barcode (%V) is not empty (a barcode
was detected).
Examples
A) Output chunks: The template
"%F, %[C%?Part %C, %:%]Pages %02P - %02L"
41
PDF Compresor Enterprise – Manual
www.luratech.com
info@luratech.com
Setting up Job Entries: 67
outputs the 10 pages document "MyFile.tif" with a chunk size of 5 pages to
"MyFile, Part 1, Pages 01 - 05.pdf"
"MyFile, Part 2, Pages 06 - 10.pdf"
and the 5 pages document "MyShortFile.tif" with the same chunk size to
"MyShortFile, Pages 01 - 05.pdf"
B) Barcode renaming: The template
"%[V%?%V%:%F%]"
names the output file according to the value of the barcode found in the document (%V). If
barcode detection fails for a document however, the output is named like the input file (%F) as a
fallback.
C) Barcode splitting: The template
"%F - %[V%?%V%:PREFIX%] (%02P - %02L)"
outputs for a doĐuŵeŶt ͞DLJ&ile͘tif͟ ǁith ďarĐodes Η hapter ϭΗ oŶ page ϯ aŶd Η hapter ϮΗ oŶ
page 7 the following chunks:
"MyFile - PREFIX (01 - 02).pdf"
"MyFile - Chapter 1 (03 - 06).pdf"
"MyFile - Chapter 2 (07 - 10).pdf"
Regular Expression Substitution
The regular expression substitution syntax is defined by:
%[<input>%?<regexp>%:<replacement>%]
<input> identifies the input string the regular expression substitution is applied to:
<input>
Description
F
Input file (or directory) name.
File name for single file input processing, directory name when merging files
from a directory into a single PDF output file. The file name is printed
without file extension for PDF metadata, input file renaming, and output file
renaming.
<regexp> is the regular expression. It implements the commonly used syntax for regular
expressions defined as follows:
A regular expression is zero or more branches, separated by '|'. It matches anything that
matches one of the branches.
A branch is zero or more pieces, concatenated. It matches a match for the first, followed by a
match for the second, etc.
A piece is an atom possibly followed by '*', '+', or '?'. An atom followed by '*' matches a
sequence of 0 or more matches of the atom. An atom followed by '+' matches a sequence of 1
36
PDF Compresor Enterprise – Manual
www.luratech.com
info@luratech.com
Setting up Job Entries: 68
or more matches of the atom. An atom followed by '?' matches a match of the atom, or the null
string.
An atom is a regular expression in parentheses (matching a match for the regular expression), a
range (see below), '.' (matching any single character), '^' (matching the null string at the
beginning of the input string), '$' (matching the null string at the end of the input string), a '\'
followed by a single character (matching that character), or a single character with no other
significance (matching that character).
A range is a sequence of characters enclosed in '[]'. It normally matches any single character
from the sequence. If the sequence begins with '^', it matches any single character not from the
rest of the sequence. If two characters in the sequence are separated by '-', this is shorthand for
the full list of ASCII characters between them (e.g. '[0-9]' matches any decimal digit). To include
a literal ']' in the sequence, make it the first character (following a possible '^'). To include a
literal '-', make it the first or last character.
<replacement> is the replacement string to be used to build up the output string if the regular
expression found a successful match. (If the regular expression has no match, the output string
will be a copy of the input string.) In case of a successful match, each character in the
replacement string will be copied to the output string except for the following special
characters:
Sequence
Output
&
The complete matched string
\1
Matched sub-string 1
aŶd so oŶ uŶtil …
\9
Matched sub-string 9
Example: The template
"%[F%?(.*)_From_[0-9]+_To_[0-9]+$%:\1%]_%P_To_%L"
outputs the ϭϬ pages doĐuŵeŶt ͞DLJ&ile_&roŵ_ϭ_To_ϭϬ͘tif͟ ǁith a ĐhuŶk size of ϱ pages to
"MyFile_ 1_To_5.pdf"
"MyFile_ 6_To_10.pdf"
34
PDF Compresor Enterprise – Manual
www.luratech.com
info@luratech.com
Managing the Job List: 69
6. Managing the Job List
This section describes in detail all operations that can be performed on the job list.
Adding, Deleting and Copying Entries
New entries can be added by any of the following actions:
Using the context menu within an area of the list that is outside any existing entry.
Use the toolďar͛s Ŷeǁ ďuttoŶ .
Use the menu Entry
Add New Entry.
Drag & drop files and/or folders from the Windows™ Explorer to the list window.
When a new entry is created, all settings from the Default Properties are copied to the new
eŶtrLJ͘ The Ŷaŵe is set to ͞EŶtrLJ фŶuŵх͟, ǁhere фŶuŵх is the first aǀailaďle Ŷuŵďer͘ WheŶ the
entry is created by drag & drop the input path is set accordingly. The output path defaults to the
input path in this case.
Entries can be deleted by any of the following actions:
Use the context menu on top of the entry to be deleted. Choose Delete.
SeleĐt aŶ eŶtrLJ aŶd use the toolďar͛s delete ďuttoŶ .
Select an entry and use the menu Entry
Delete.
All entries can be deleted by use of the menu File
Delete All Job Entries.
Sometimes it is useful to create a copy of an existing entry (e.g. you might only want to change
the compression quality and output folder for comparison reasons). An entry can be copied by
any of the following actions:
Use the context menu on top of the entry to be deleted. Choose Insert Copy.
Select an entry and use the menu Entry
Insert Copy.
48
PDF Compresor Enterprise – Manual
www.luratech.com
info@luratech.com
Managing the Job List: 70
Changing the Sequence of Job Entries
To change the order of the list, you can move entries up and down:
Select an entry and use the toolďar͛s up and down buttons.
Select an entry and use the menu Entry
Move Up or Entry
Move Down.
See Priority Processing, p. 81, for details on the processing order of the job list entries.
Starting and Stopping Job Entries
Individual Entries
A selected entry can be started and stopped (aborted) by any of the following actions:
Use the context menu of the entry to be started or stopped. Choose Start or Stop.
Select an entry and use the toolďar͛s start and stop buttons.
Select an entry and use the menu Entry
Start or Entry
Stop.
Use the Start and Stop buttons of the entries Properties dialog.
All Entries
All entries can be started or stopped simultaneously by any of the following actions:
Use the toolbar buttons or .
Use the menu File
Start All Job Entries or File
Stop All Job Entries.
Use the ĐoŶtedžt ŵeŶu of the PD& oŵpressor͛s taskďar iĐoŶ ;right ŵouse ĐliĐkͿ aŶd Đhoose Start
All or Stop All.
Monitoring Jobs
The status of all jobs is listed within the job list. It can be one of the following:
Status
Description
no service
The underlying PDF Compressor service is stopped or has not
been installed. See Managing the PDF Compressor Service, p. 85,
for details.
stopped
The job is stopped. No file conversion will take place.
starting
A signal has been sent from the GUI to the PDF Compressor
service to start the job. The status will soon change to started or
working.
Documents you may be interested
Documents you may be interested