46
3-Heights™ Document Converter, Version 4.6
Page 72 of 90
January 25, 2016
© PDF Tools AG - Premium PDF Technology
The hardware resources available impose certain limits. Office applications remain
loaded in memory; they can consume considerable amounts of memory while
processing large documents.
The computer on which a worker session executes can be individually configured. It
could be a virtual computer hosted in Microsoft Virtual PC or Virtual Server.
If your installation needs to process Outlook .MSG files, follow the following rules:
If the .MSG files constitute a small fraction of processed documents, a single worker
session will be sufficient. This is the default unless you set the SingleOutlook setting to
false in the O2PSRV.exe.config file.
If .MSG files are frequent, you should configure distinct user accounts for worker
sessions running on the same computer. Also make sure that each session will be using
its own pool of PDF Producer printer drivers (no issue with a standard installation).
If either Outlook or VISIO documents must be processed, make sure to configure at
least two PDF Producer printer instances for each worker session.
5.11.1 TIFF Output Format
When TIFF is selected as the output format, office applications will print to the TIFF
Printer driver instead of the PDF Printer driver. Built-in formats like PDF or image
formats will be converted directly.
The TIFF format actually offers many options. The choice between color and gray level
output is supported by the “PRINTCOLOR” option, but many other parameters are left to
be tuned and can be inspected or changed in the Printing Defaults page of the
corresponding TIFF Printers, e.g.
The resolution for color TIFF output is controlled by the DPI setting on the ~T1A and
~T1B etc. printer entries
The resolution for monochrome TIFF output is controlled by the settings on the ~T1N
and ~T1O etc. printer entries
Similarly, the image compression type can be configured via these settings
Note: if TIFF (and PDF) printer settings are changed, this should be done consistently
for all entries. Values are propagated from the system defaults to the user printing
defaults; changing the system defaults immediately after installation will have the effect
that all worker sessions will inherit these values. However, once a worker session has
used a printer entry, it will continue to use the copy of the printer settings of the user
account under which the worker session is running.
The Document Converter selects a TIFF Printer based on the Color setting only. To
change the settings for black and white conversion e.g. from ‘bi-tonal’ to ‘gray scale’ or
change the DPI setting, you set the printing defaults of all non-color TIFF Printer entries
accordingly. These would be the entries carrying “T1N”, “T1O”, “T2N”, etc. in their
name.
As built-in conversions do not make use of the TIFF Printers, they use their own options,
which can be set at the document or job level:
TIFF.COMPR: the compression type (“raw”, “JPEG”, “flate”, “LZW”, “Group3”,
“Group3_2D”, “Group4”, “TIFFJPEG”)
43
3-Heights™ Document Converter, Version 4.6
Page 73 of 90
January 25, 2016
© PDF Tools AG - Premium PDF Technology
Compression types can be specified selectively also for bi-tonal, indexed and continuous
color images using the following settings:
TIFF.COMPR.BITONAL (default Fax Group 4)
TIFF.COMPR.INDEXED (default JPEG)
TIFF.COMPR.CONTINOUS (default LZW)
The TIFF.COMPR.PDF applies for conversions of PDF files to TIFF, and also for the
conversion of images to PDF. The default is JPEG for color output, and Fax Group 4
otherwise.
CMPRQUAL: this setting can be used to specify the compression quality for JPEG
compressed pages. The default is 75 (on a scale from 1 to 100).
TIFF.BPI: the number of bits per pixel to be used by the specified compression type.
Defaults are: 1 for Group3, Group3_2D and Group4 compression (bi-level), 8 for gray
level, and 24 for color.
TIFF.XDPI, TIFF.YDPI: the horizontal and vertical resolution in dots per inch. Default is
150. To set the same resolution for both, you can use the option name TIFF.DPI
TIFF.RENDERINGMODE: the rendering mode to be applied for PDF to TIFF conversion.
Default: 1 (accurate) for color conversions, 0 (fast) for black and white (Group3,
Group4, JBIG2 compression.
TIFF.DITHERINGMODE: the dither mode to be applied during PDF to TIFF conversion
with bi-level compression. Default: 1 (Floyd Steinberg) for JBIG2, 4 (Group3 optimized)
for Group3, 5 (Group4 optimized) for Group4 compression.
TIFF.ROTATE controls how pages from PDF will be rotated when converted to TIFF;
possible values are: NONE, PORTRAIT, LANDSCAPE, AUTO (according to the page
rotation attribute in PDF, which is default)
TIFF.UNPACK controls the conversion of indexed images having less than 8 bits per
sample. When this option is set to true, samples will be expanded to 8 bits. This permits
re-compression using JPEG.
The built-in defaults depend on the color (PRINTCOLOR option) setting. Color output has
the defaults 24 bit JPEG, black and white Fax Group 4 (bi-level).
The Document Converter produces multipage TIFF files. Applications which need single
page TIFF files use a TIFF split/extraction tool.
5.11.2 Conversion of Internet Mail Messages
MIME compliant mail messages are converted as follows:
The HTML body part is extracted, as well as any images that are included in the
message and referenced from the HTML page.
If no HTML body is present, the text part is extracted
From the mail header, the Subject and From fields are extracted and inserted into the
top of the extracted message
All headers are optionally appended to the end of the message
44
3-Heights™ Document Converter, Version 4.6
Page 74 of 90
January 25, 2016
© PDF Tools AG - Premium PDF Technology
All above parts are passed to the application configured for “.HTM” documents for
rendering to PDF. The default application is Microsoft Word; however, it is possible to
configure Internet Explorer alternatively.
Attachments of the mail message are extracted and also converted to PDF
The following options are recognized during mail conversion:
MAILPARTS=AB/B/A
Select attachments and/or body of the mail message for conversion. Default: body +
attachments (AB)
TEXTBODY =true
Values “true” or “false”; if “true”, use text representation of mail body for conversion.
Default: “false” (use HTML, if present)
MAILHEADER=true
Values “true” or “false”; if “true”, add full mail headers to converted message.
The layout and styles used to produce the intermediate HTML representation of the mail
message can be configured via a style sheet file and a header template file (see 4.1,
Eml section).
Encrypted mails can be converted, if appropriate certificates and private keys are
installed on the server.
Signatures of signed mails are verified, and the certificate status found during
verification is added to the “From” information. The Windows certificate store is used to
locate trusted issuer certificates.
All store types are searched that are accessible to the converter process performing the
conversion. Therefore, you may want to store relevant certificates either in the
“Computer” store, or in each worker user store.
5.11.3 HTML Conversion
HTML documents can be converted either via MS Word or via Internet Explorer. Either
way has its advantages and disadvantages:
MS Word can preserve links when using the ExportAsFixedFormat feature; Internet
Explorer is used with the printing function, and link functionality is lost
Page breaks are handled differently, and MS Word will probably give better results
with images
MS Word has very limited support for styles handling. HTML documents using such
features need to be converted via Internet Explorer for acceptable results
To configure Internet Explorer for HTML and HTM documents, you need to overwrite the
Extensions entry in both the MSWord and the HTML section of the O2PWSC.ini file (this
can be performed also via the “Edit Configuration Settings” tool), e.g. as this:
[HTML]
Extensions = “URL:MHT:HTMZIP:HTML:HTM”
[MSWord]
Extensions = “DOC:DOCX:RTF:WPD:WPC:WS:”
34
3-Heights™ Document Converter, Version 4.6
Page 75 of 90
January 25, 2016
© PDF Tools AG - Premium PDF Technology
5.12 Platform Issues
When choosing the Windows platform for hosting the Document Converter, the following
issues are relevant:
The platform should be suited for hosting the relevant office applications (Microsoft
Office, OpenOffice.org, etc.). These applications are usually 32-bit applications. Please
refer to the appropriate manufacturer’s publications to verify if a particular platform is
supported.
Printing from 32-bit applications on 64-bit platforms involves an overhead due to
architectural reasons.
PDF and TIFF processing including supporting functions like OCR is generally not
sensitive to platform issues.
Most components of the Document Converter run in 32-bit mode. Hence, you should not
expect a significant advantage when installing on a 64-bit platform.
Newer Windows Server platforms require more care with configuring for use with
terminal services. This is relevant because the Document Converter makes use of
terminal server sessions for controlling office applications. With Windows Server 2008,
starting a specific program is no longer supported in administration mode – a feature
that is crucial for the Converter Service.
It is therefore recommended that you enable application mode for terminal services
(requires additional licenses).
As a work-around, the converter service will create a batch script in the worker’s startup
folder, named O2PWSC-Start.bat. This will be executed after a normal login. Note that
the script will not be created as soon as there is a file (shortcut or script) having a name
that starts with “O2P”.
When operating in application mode, make sure the security settings are configured
appropriately to
• permit connections appropriately (e. g. do not require client certificates)
• permit starting the work session control program O2PWSC.exe
Configurations can be made using the Terminal Server RemoteApp Console (RDP
Settings, Terminal Server tab). See figure below.
7
3-Heights™ Document Converter, Version 4.6
Page 76 of 90
January 25, 2016
© PDF Tools AG - Premium PDF Technology
5.13 Configuration of Terminal Services on Windows 2008 Server
Go through the following configuration dialogues to adjust the settings. Start with
“Terminal Services Configuration” (Start->Administrative Tools->Terminal Services):
6
3-Heights™ Document Converter, Version 4.6
Page 77 of 90
January 25, 2016
© PDF Tools AG - Premium PDF Technology
With a right click on the “RDP-Tcp” connection line (selected above), open the properties
dialogue and go through the tabs as shown on the subsequent screen shots:
4
3-Heights™ Document Converter, Version 4.6
Page 78 of 90
January 25, 2016
© PDF Tools AG - Premium PDF Technology
6
3-Heights™ Document Converter, Version 4.6
Page 79 of 90
January 25, 2016
© PDF Tools AG - Premium PDF Technology
The Color Depth setting is only relevant if you intend to use the conversion of web
pages via Internet Explorer. Otherwise, you can leave the default “16 bits per pixel”.
10
3-Heights™ Document Converter, Version 4.6
Page 80 of 90
January 25, 2016
© PDF Tools AG - Premium PDF Technology
You need to create and configure one or several user accounts for use with the
Document Converter. Make sure these accounts have sufficient rights to log on via
Remote Desktop.
When Remote Desktop Services are configured in application mode, start “RemoteApp”
manager and enter O2PWSC.exe as program that is allowed to be started with any
parameter value (see figure below).
Documents you may be interested
Documents you may be interested