42
ABBYY FineReader 12 User‘s Guide
2
Information in this document is subject to change without notice and does not bear any commitment on the part of
ABBYY.
The software described in this document is supplied under a license agreement. The software may only be used or
copied in strict accordance with the terms of the agreement. It is a breach of the "On legal protection of software and
databases" law of the Russian Federation and of international law to copy the software onto any medium unless
specifically allowed in the license agreement or nondisclosure agreements.
No part of this document may be reproduced or transmitted in any from or by any means, electronic or other, for any
purpose, without the express written permission of ABBYY.
© 2013 ABBYY Production LLC. All rights reserved.
ABBYY, ABBYY FineReader, ADRT are either registered trademarks or trademarks of ABBYY Software Ltd.
© 1984-2008 Adobe Systems Incorporated and its licensors. All rights reserved.
Protected by U.S. Patents 5,929,866; 5,943,063; 6,289,364; 6,563,502; 6,185,684; 6,205,549; 6,639,593; 7,213,269; 7,246,748;
7,272,628; 7,278,168; 7,343,551; 7,395,503; 7,389,200; 7,406,599; 6,754,382 Patents Pending.
Adobe® PDF Library is licensed from Adobe Systems Incorporated.
Adobe, Acrobat®, the Adobe logo, the Acrobat logo, the Adobe PDF logo and Adobe PDF Library are either registered trademarks or
trademarks of Adobe Systems Incorporated in the United States and/or other countries.
Portions of this computer program are copyright © 2008 Celartem, Inc. All rights reserved.
Portions of this computer program are copyright © 2011 Caminova, Inc. All rights reserved.
DjVu is protected by U.S. Patent № 6,058,214. Foreign Patents Pending.
Powered by AT&T Labs Technology.
Portions of this computer program are copyright © 2013 University of New South Wales. All rights reserved.
© 2002-2008 Intel Corporation.
© 2010 Microsoft Corporation. All rights reserved.
Microsoft, Outlook, Excel, PowerPoint, SharePoint, SkyDrive, Windows Server, Office 365, Windows Vista, Windows are either registered
trademarks or trademarks of Microsoft Corporation in the United States and/or other countries.
© 1991-2013 Unicode, Inc. All rights reserved.
JasPer License Version 2.0:
© 2001-2006 Michael David Adams
© 1999-2000 Image Power, Inc.
© 1999-2000 The University of British Columbia
This product includes software developed by the OpenSSL Project for use in the OpenSSL Toolkit. (http://www.openssl.org/). This
product includes cryptographic software written by Eric Young (eay@cryptsoft.com).
© 1998-2011 The OpenSSL Project. All rights reserved.
©1995-1998 Eric Young (eay@cryptsoft.com) All rights reserved.
This product includes software written by Tim Hudson (tjh@cryptsoft.com).
Portions of this software are copyright © 2009 The FreeType Project (www.freetype.org). All rights reserved.
Apache, the Apache feather logo, and OpenOffice are trademarks of The Apache Software Foundation. OpenOffice.org and the seagull
logo are registered trademarks of The Apache Software Foundation.
EPUB®, is a registered trademark of the IDPF (International Digital Publishing Forum)
All other trademarks are the sole property of their respective owners.
57
ABBYY FineReader 12 User‘s Guide
3
Contents
Introducing ABBYY FineReader 12
.......................................................................................... 6
What's New in ABBYY FineReader 12
..................................................................................... 8
Quick Start
...................................................................................................................................... 10
Microsoft Word Tasks
............................................................................................................................. 13
Microsoft Excel Tasks
............................................................................................................................. 14
Adobe PDF Tasks
..................................................................................................................................... 14
Tasks for Other Formats
........................................................................................................................ 15
Adding Images Without Processing
................................................................................................... 16
Creating Custom Automated Tasks
.................................................................................................... 16
Integration with Other Applications
.................................................................................................. 18
Scanning Paper Documents
................................................................................................................. 20
Photographing Documents
................................................................................................................... 22
Opening an Image or PDF Document
............................................................................................... 25
Scanning and Opening Options
........................................................................................................... 26
Image Preprocessing
............................................................................................................................. 28
Recognizing Documents
............................................................................................................. 31
What Is a FineReader Document?
...................................................................................................... 31
Document Features to Consider Prior to OCR
................................................................................ 35
OCR Options
............................................................................................................................................. 37
Working with Complex–Script Languages
........................................................................................ 38
Tips for Improving OCR Quality
.............................................................................................. 42
If the Complex Structure of a Paper Document Is Not Reproduced
........................................ 42
If Areas Are Detected Incorrectly
...................................................................................................... 42
If You Are Processing a Large Number of Documents with Identical Layouts
...................... 45
If a Table Is Not Detected
.................................................................................................................... 46
If a Picture Is Not Detected
................................................................................................................. 47
If a Barcode Is Not Detected
............................................................................................................... 47
56
ABBYY FineReader 12 User‘s Guide
4
Adjusting Area Properties
..................................................................................................................... 48
Incorrect Font Is Used or Some Characters Are Replaced with "?" or "□"
............................. 49
If Your Printed Document Contains Non–Standard Fonts
........................................................... 49
If Your Text Contains Too Many Specialized or Rare Terms
........................................................ 52
If the Program Fails to Recognize Some of the Characters
........................................................ 52
If Vertical or Inverted Text Is Not Recognized
............................................................................... 54
Checking and Editing Texts
...................................................................................................... 55
Checking Texts in the Text Window
................................................................................................... 55
Using Styles
.............................................................................................................................................. 57
Editing Hyperlinks
................................................................................................................................... 58
Editing Tables
........................................................................................................................................... 59
Removing Confidential Information
................................................................................................... 59
Copying Content from Documents
......................................................................................... 61
Saving OCR Results
..................................................................................................................... 62
Saving an Image of a Page
.................................................................................................................. 75
E–mailing OCR Results
.......................................................................................................................... 76
Working with Online Storage Services and Microsoft SharePoint
.............................. 78
Working with Online Storage Services
.............................................................................................. 78
Saving Results to Microsoft SharePoint
............................................................................................ 79
Group Work in a Local Area Network
.................................................................................... 80
Automating and Scheduling OCR
............................................................................................ 82
Automated Tasks
..................................................................................................................................... 82
ABBYY Hot Folder
.................................................................................................................................... 83
Customizing ABBYY FineReader
.............................................................................................. 87
Main Window
............................................................................................................................................ 87
Toolbars
..................................................................................................................................................... 89
Customizing the Workspace
................................................................................................................. 90
34
ABBYY FineReader 12 User‘s Guide
5
Options Dialog Box
................................................................................................................................. 91
Changing the User Interface Language
............................................................................................ 92
Installing, Activating, and Registering ABBYY FineReader
........................................... 93
Installing and Starting ABBYY FineReader
....................................................................................... 93
Activating ABBYY FineReader
.............................................................................................................. 95
Registering ABBYY FineReader
............................................................................................................ 96
Privacy Policy
........................................................................................................................................... 96
ABBYY Screenshot Reader
........................................................................................................ 98
Appendix
....................................................................................................................................... 102
Glossary
................................................................................................................................................... 102
Shortcut Keys
......................................................................................................................................... 106
Supported Image Formats
.................................................................................................................. 110
Supported Saving Formats
................................................................................................................. 112
Required Fonts
....................................................................................................................................... 112
Regular Expressions
............................................................................................................................. 114
Technical Support
...................................................................................................................... 116
How to C#: Special Effects Erase. Set the image to current background color, the background color can be set by:ImageProcess.BackgroundColor = Color.Red. Encipher.
remove text from pdf; how to delete text from a pdf document
52
ABBYY FineReader 12 User‘s Guide
6
Introducing ABBYY FineReader 12
ABBYY FineReader is an optical character recognition (OCR) system that converts
scanned documents, PDF documents, and image files (including digital photos) into
editable formats.
ABBYY FineReader 12 advantages
Fast and accurate recognition
The OCR technology used in ABBYY FineReader quickly and accurately recognizes and
retains the original formatting of any document.
Thanks to ABBYY's Adaptive Document Recognition Technology (ADRT®), ABBYY
FineReader can analyze and process a document in its entirety, rather than one page at a
time. This approach retains the source document's structure, including formatting,
hyperlinks, e–mail addresses, headers and footers, image and table captions, page
numbers, and footnotes.
ABBYY FineReader is largely immune to printing defects and can recognize texts printed in
virtually any font.
ABBYY FineReader can recognize text photos obtained with a regular camera or a mobile
phone. Additional image preprocessing can greatly improve the quality of your photos,
resulting in more accurate OCR.
For faster processing, ABBYY FineReader makes efficient use of multi–core processors and
offers a special black–and–white processing mode for documents where colors need not be
preserved.
Supports most of the world's languages*
ABBYY FineReader can recognize texts written in any of the 190 languages that it supports,
or in a combination of those languages. Among the supported languages are Arabic,
Vietnamese, Korean, Chinese, Japanese, Thai, and Hebrew. ABBYY FineReader can
automatically detect the language of a document.
Ability to check OCR results
ABBYY FineReader has a built–in text editor which allows you to compare recognized texts
against their original images and make any necessary changes.
If you are not satisfied with the results of automatic processing, you can manually specify
image areas to capture and train the program to recognize less common or unusual fonts.
Intuitive user interface
The program comes with a number of preconfigured automated tasks that cover the most
common OCR scenarios and enable you to convert scans, PDFs, and image files into
editable documents with a click of a button. Integration with Microsoft Office and Windows
Explorer means that you can recognize documents directly from within Microsoft Outlook,
Microsoft Word, Microsoft Excel or simply by right–clicking a file on your computer.
The program supports the usual Windows shortcut keys and touchscreen swipes, e.g. to
scroll or zoom in and out of images.
Quick quoting
37
ABBYY FineReader 12 User‘s Guide
7
You can easily copy and paste recognized fragments into other applications. Page images
will open instantly, and will be available for viewing, selection, and copying before the
entire document has been recognized.
Recognition of digital photos
You can take a picture of a document with your digital camera, and ABBYY FineReader 12
will recognize the text just as if it was an ordinary scan.
PDF archiving
ABBYY FineReader can convert your paper documents or scanned PDFs into searchable
PDF and PDF/A documents.
MRC compression can be applied to reduce the size of PDF files without impairing their
visual quality.
Supports multiple saving formats and cloud storage services
ABBYY FineReader 12 can save recognized texts in Microsoft Office formats (Word, Excel,
and PowerPoint), in searchable PDF/A and PDF for long–term storage, and in popular e–
book formats.
You can save results either locally or in cloud storage services (Google Drive, Dropbox, and
SkyDrive) and access them from anywhere in the world. ABBYY FineReader 12 can also
export documents directly to Microsoft SharePoint Online and Microsoft Office.
Includes two bonus applications — ABBYY Business Card Reader and ABBYY
Screenshot Reader
ABBYY Business Card Reader (available only with ABBYY FineReader 12 Corporate) is a
handy utility that captures data from business cards and saves them directly to Microsoft®
Outlook®, Salesforce, and other contact management software.
ABBYY Screenshot Reader is an easy–to–use program that can take screenshots of whole
windows or selected areas and recognize the text inside.
Free technical support for registered users
* The set of supported languages may vary in different editions of the product.
46
ABBYY FineReader 12 User‘s Guide
8
What's New in ABBYY FineReader 12
Below follows a brief overview of the major new features and improvements that have been
introduced in ABBYY FineReader 12.
Improved recognition accuracy
The new version of ABBYY FineReader delivers more accurate OCR and better recreates the
original formatting of your documents thanks to improvements in ABBYY's proprietary
Adaptive Document Recognition Technology (ADRT). The program now better detects
document styles, headings, and tables, so that you don't have to fix the formatting of your
documents once they are recognized.
Recognition languages
ABBYY FineReader 12 can now recognize Russian texts with stress marks. OCR quality has
been improved for Chinese, Japanese, Korean, Arabic, and Hebrew.
Faster and friendlier user interface
Background processing
It may take quite some time to recognize very large documents. In the new version, time–
consuming processes run in the background, allowing you to continue working on those
parts of the document which have already been recognized. Now you don't have to wait for
the OCR process to complete before you can adjust image areas, view non–recognized
pages, force–start the OCR of a particular page or image area, add pages from other
sources, or change the order of pages in the document.
Faster image loading
Page images will appear in the program as soon as you scan the paper originals, so that
you can immediately see the scanning results and select pages and image areas to
recognize.
Easier quoting
Any image area containing text, pictures or tables can be easily recognized and copied to
the Clipboard with a click of the mouse.
All the basic operations, including scrolling and zooming, are now also supported on
touchscreens.
Image preprocessing and camera OCR
The improved image preprocessing algorithms ensure better recognition of photographed
texts and produce text photos that look as good as scans. The new photo correction
capabilities include automatic cropping, correction of geometrical distortions, and evening
out of brightness and background colors.
ABBYY FineReader 12 allows you to select the preprocessing options you wish to apply to
any newly added image, so that you won't need to correct each image separately.
Better visual quality for archived documents
ABBYY FineReader 12 includes new PreciseScan technology, which smoothes characters to
improve the visual quality of scanned documents. As a result, characters do not look
pixelated even when you zoom in on the page.
22
ABBYY FineReader 12 User‘s Guide
9
New tools for manual editing of recognition output
Verification and correction capabilities have been expanded in the new version. In ABBYY
FineReader 12, you can format recognized texts in the verification window, which now also
includes a tool for inserting special symbols not available on standard keyboards. You can
also use keyboard shortcuts for the most frequent verification and correction commands.
In ABBYY FineReader 12, you can disable recreation of such structural elements as
headers, footers, footnotes, tables of contents, and numbered lists. This may be necessary
if you want these elements to appear as normal text for better compatibility with other
products, e.g. translation software and e–book authoring software.
New saving options
When saving OCR results to XLSX, you can now save pictures, remove text formatting, and
save each page on a separate Excel worksheet.
ABBYY FineReader 12 can create ePub files compliant with the EPUB 2.0.1 and EPUB 3.0
standards.
Improved integration with third–party services and applications
Now you can export your recognized documents directly to SharePoint Online and Microsoft
Office 365, and the new opening and saving dialog boxes provide easy access to cloud
storage services, such as Google Drive, Dropbox, and SkyDrive.
22
ABBYY FineReader 12 User‘s Guide
10
Quick Start
ABBYY FineReader converts scanned documents, PDF documents, and image files (including
digital photos) into editable formats.
To process a document with ABBYY FineReader, you need to complete the following four
steps:
Acquire an image of the document
Recognize the document
Verify the results
Save the results in a format of your choice
If you need to repeat the same steps over and over again, you can use an automated task,
which will execute the required actions with just one click of a button. To process
documents with complex layouts, you can customize and run each step separately.
Built–in automated tasks
When you start ABBYY FineReader, the Task window is displayed, listing the automated
tasks for the most common processing scenarios. If you can't see the Task window, click
the Task button on the main toolbar.
Documents you may be interested
Documents you may be interested