54
The Administration Module
64
Kofax Capture Administrator's Guide
Zone Type
Description
Toolbar Icon
Form
identification zone
A form identification zone can be used to assist the Kofax
Capture page-level form identification feature. It is typically
used to help the form identification engine distinguish
among forms that are very similar. For example, a form
identification zone could be useful for identifying different
revisions of the same form.
You can specify more than one form identification zone for
a form type. For multipage form types, the zone must be on
the first page.
Group zone
Many documents consist of related items. For example,
you might find fields for first and last name. With group
zones, you can take advantage of the logical relationship
among items by creating a group with one or more index
zones in it. This capability eliminates the need to use a
special script. With groups, you can combine data from
separate fields, control how many fields can be filled in for
the group, or find conflicts when more than one choice has
(inappropriately) been made.
You create index zones on your sample pages. To display a sample page, do one of the
following:
▪ From the Document class tree view tab, double-click the document class associated with
the form type. Select a form type, and select the sample page. The sample page appears.
▪ From the Batch class tree view tab, double-click the document class associated with the
form type. Select a form type, and select the sample page. The sample page appears.
When creating zones on your sample page, use the following guidelines.
In General
▪ If all index data is separated from other information on your sample page by at least .25",
draw your index zones to include a moderate amount of white space (about .25") around
the index data. The white space around the data allows for horizontal or vertical offsets
that might be introduced by a scanner.
▪ If at least some of the index data is close together (that is, not separated from other
information on your sample page by at least .25"), draw your index zones with only a small
amount of white space. This prevents unwanted data from appearing in a zone, which
could reduce the accuracy of character recognition. Because a small amount of white space
does not allow for horizontal or vertical offsets that might be introduced by a scanner, a
registration zone may be required.
For OCR Zones
▪ In general, characters and numbers must reside completely within the zone boundaries to
be read.
▪ Characters or numbers that do not reside completely within the boundaries are not
reflected in the captured data.
For shape zones, Kofax Capture examines the content of the zone and attempts to find a
corresponding shape. If a shape is found, its ID number is placed in the associated field. The
results are available in the Validation and Verification modules. They can also be used in your
custom recognition scripts or custom modules. If you assign a shape recognition profile to
32
The Administration Module
Kofax Capture Administrator's Guide
65
an index zone, you cannot specify a specific search shape. All shapes are examined and the
closest match is used. Your ability to use this capability is potentially unlimited.
For Bar Code Recognition
▪ Include at least a 0.5” boundary around the bar code when drawing your zone. If this is not
possible, ensure that a minimum 0.5” margin exists on the left side of the bar code.
▪ Make sure you include the entire bar code within the zone.
For OCR Registration Zones
▪ If possible, include at least .25" of white space around the registration data to allow for
horizontal or vertical offsets that might be introduced by a scanner. The registration
process measures and corrects any scan offset by shifting the index zones into place.
▪ Make sure the border of the zone does not touch other text or elements on the image.
▪ When manually resetting a registration point, select it carefully. Position the cursor at
the lower-left corner of an imaginary box around the first character of the text in the
OCR registration zone. If the first character has a descender (for example, a lowercase
"y"), use the baseline of the character, not the bottom of the character, when selecting the
registration point.
▪ Test whether a registration point is properly defined: Create a test batch by importing the
sample image file used to define your OCR index zones. If the registration point is properly
defined, your OCR index zones contain the expected data. If the registration point is not
properly defined, your OCR index zones are shifted and contain unexpected data.
▪ For best results, keep the search text as short as possible.
For Shape Registration Zones
Include only the portion of the shape you are trying to match. If your zone includes excess
components, the recognition engine will probably return ID number 100 (general shape),
thereby leading to incorrect registration.
The following illustration displays the portions of a shape to include in the zone. When
searching for a particular shape, enclose only that portion of the graphic within the zone
boundary.
Figure 2-18. Line Intersections that Kofax Capture Recognizes
C# Excel - Excel Conversion & Rendering in C#.NET Excel documents in .NET class applications independently, without using other external third-party dependencies like Adobe Acrobat. Excel to PDF Conversion.
delete text pdf acrobat; how to delete text in pdf file online
44
The Administration Module
66
Kofax Capture Administrator's Guide
For OMR Zones
Each OMR object (for example, a check box) must be zoned separately. You cannot draw one
OMR zone around multiple OMR items. (You can draw another type of zone around multiple
OMR zones. Then, you can use scripts or SmartGroups to process the results from the zones.)
All OMR zones should be consistent in size. If the OMR shapes are different sizes, create
different groups of OMR zones. Make the zones for each group consistent within that group.
Use a different recognition profile for each group of OMR zones.
To allow for potential registration offsets, the zones should include the OMR shape and as
much white space around the shape as prudent to ensure that all of the zone is included.
Do not create zones whose boundaries touch or get near any other text or unrelated OMR
shapes on the page.
For square or rectangular shapes, you can use an aggressive line removal scheme to remove
the shapes without destroying the inner marks. Once the shapes are removed, you can use the
despeckle feature to clean up extraneous noise. Then, set your pixel threshold to a low value
(for example, 0 or 1). During processing, “empty” zones return 0s (the shape is not filled in) or
1s (the shape is filled in).
The confidence level is a number returned by a Kofax Capture recognition engine to
indicate the level of certainty in the reported results. The exact method used to calculate the
confidence varies with the type of index zone and the options you set for it. In general, the
higher the number, the greater the confidence.
The confidence returned from an OMR zone is really the percentage of black pixels found in
the zone. Therefore, you can use the confidence returned from a zone to determine the most
appropriate threshold.
Double Zoning
You can draw multiple zones for the same field. The zones can be in different locations, or
even one on top of the other. Then, you can capture the data from each zone with a different
set of recognition settings. For example, you can use Kofax Advanced OCR with aggressive
image cleanup for one zone and Kofax Advanced OCR with no image cleanup for another
zone. The zone data captured with the highest confidence level is returned for the field. Your
validation and verification operators can see the zone that returned the highest confidence.
Note Using multiple zones could adversely affect performance.
Tips for Drawing Zones
When defining zones on your sample page, follow these guidelines:
▪ If all index data is separated from other information on your sample page by at least .25",
draw your index zones to include a moderate amount of white space (about .25") all
around the index data. The white space around the data allows for horizontal or vertical
offsets that might be introduced by a scanner.
▪ If at least some index data is close together (that is, not separated from other information
on your sample page by at least .25"), draw your index zones with only a small amount of
white space. This prevents unwanted data from displaying in a zone, which could reduce
the accuracy of character recognition. But, because a small amount of white space does
35
The Administration Module
Kofax Capture Administrator's Guide
67
not allow for horizontal or vertical offsets that might be introduced by a scanner, an OCR
registration zone may be required.
For OCR Zones
▪ In general, characters and numbers must reside completely within the zone boundaries to
be read.
▪ Characters or numbers that do not reside completely within the boundaries are not
reflected in the captured data.
For Bar Code Recognition
▪ Include at least a .5" boundary around the bar code when drawing your zone.
▪ Make sure you include the entire bar code within the zone.
Make sure you include the entire bar code within the zone
▪ If possible, include at least .25" of white space around the registration data to allow for
horizontal or vertical offsets that might be introduced by a scanner. The registration
process measures and corrects any scan offset by shifting the index zones into place.
▪ Make sure the border of the zone does not touch other text or elements on the image.
▪ When manually resetting a registration point, select it carefully. Position the cursor at
the lower-left corner of an imaginary box around the first character of the text in the
OCR registration zone. If the first character has a descender (for example, a lowercase
“y”), use the baseline of the character, not the bottom of the character, when selecting the
registration point.
▪ Test whether a registration point is properly defined: Create a test batch by importing the
sample image file used to define your OCR index zones. If the registration point is properly
defined, your OCR index zones contain the expected data. If the registration point is not
properly defined, your OCR index zones are shifted and contain unexpected data.
▪ For best results, keep the search text as short as possible.
For Shape Registration Zones
▪ Include only that portion of the shape that you are trying to match. If your zone includes
excess components, the recognition engine will probably return code 100 (general
shape). For example, if you are searching for a cross shape, center the zone only on the
cross portion of the graphic.
▪ For best results, keep the shape limitations
in mind.
47
The Administration Module
68
Kofax Capture Administrator's Guide
For OMR Zones
▪ Each OMR shape must be zoned separately. You cannot draw one OMR zone around
multiple OMR shapes. (Note that you can draw another type of zone around multiple
OMR zones. Then you can use scripts or SmartGroups
to process the results from the
zones.)
▪ All your OMR zones should be consistent in size. If the OMR shapes are different sizes,
create different groups of OMR zones. Make the zones for each group consistent within
that group. Use a different recognition profile for each group of OMR zones.
▪ To allow for potential registration offsets, the zones should include the OMR shape and as
much white space around the shape as prudent.
▪ Do not create zones whose boundaries touch or get near any other text or unrelated OMR
shapes on the page.
▪ For square or rectangular shapes, you can use an aggressive line removal scheme to
remove the shapes without destroying the inner marks. Once the shapes are removed, use
the despeckle feature to clean up extraneous noise. Then, set your pixel threshold to a low
value (for example, 0 or 1). During processing, “empty” zones return 0s (the shape was not
filled in) or 1s (the shape was filled in).
▪ The confidence returned from an OMR zone is really the percentage of black pixels found
in the zone. Therefore, you can use the confidence returned from a zone to determine the
most appropriate threshold.
Double Zoning
You can draw multiple zones for the same field. The zones can be in different locations, or
even one on top of the other. Then, you can capture the data from each zone with a different
set of recognition settings. For example, you can use Kofax Advanced OCR with aggressive
image cleanup for one zone and Kofax Advanced OCR without image cleanup for another
zone. The zone data captured with the highest confidence level is returned for the field. Your
validation and verification operators can see the zone that returned the highest confidence.
Note Using multiple zones could adversely affect performance.
Creating an Index Zone
In addition to the standard index zones described here, Kofax Capture also supports the
following special types of index zones:
▪ Group Zones (SmartGroups)
▪ OCR Registration Zones
▪ Shape Registration Zones
▪ Separator Zones
▪ Form ID Zones
Create index zones on the sample pages you scan for form types. To display a sample page,
do one of the following:
▪ From the Document class tree view tab, double-click the document class associated with
the form. Select a form type, and select the sample page. The sample page appears.
▪ From the Batch class tree view tab, double-click on the document class with which the form
type is associated. Select a form type, and select the sample page. The sample page appears.
1 Select a sample page.
55
The Administration Module
Kofax Capture Administrator's Guide
69
2 On the Insert tab, in the Zone group, click the appropriate button for the type of zone you
want.
Your cursor changes to a cross hair.
3 Draw a rectangle around the area on your sample page that you want to use as the zone.
A window appears.
4
Set options as appropriate for your zone.
5
Click Test to apply your settings to this zone and check the results.
The Test button is not available if you display the sample page from the Document class
tree view tab. To use QuickZones to verify your recognition and cleanup settings, you
must display the sample page from the Batch class tree view tab.
6
Select OK.
Tip Once your zone is drawn on the sample page, you can resize the zone.
Copying an Index Zone
Use Copy to duplicate an existing index zone. When copying and pasting an index zone, a
new index field is also created. The copy method is useful for adding an index zone that is
similar to an existing one. You can modify the copy of the index zone as necessary to make it
unique.
When copying an index zone, certain items associated with the index zone are copied, and
others are linked. The following table shows the items that are copied and linked.
Copied Items
Linked Items
Index zone
None
Index field
None
OMR settings
None
Table index field
None
Custom storage string
None
If you paste a table index field on the same sample page, a copy of the index field is created in
the same table. Otherwise, the index field is pasted as a new index field that is not in a table.
1
Select the index zone you to copy. You select an index zone from the Batch class tree view
tab, the Document class tree view tab, or the sample page.
2
On the Home tab, in the Clipboard group, click Copy.
3
To paste the copied index zone, do one of the following:
▪ From the Batch class tree view tab or the Document class tree view tab, select a sample
page node. On the Home tab, in the Clipboard group, click the arrow under Paste, and
then click Paste.
When you paste from the sample page node, the copied index zone is placed on the
sample page offset from the original index zone's top left location. It is also added to
the list of index zones in the index zone tree. If you paste a zone multiple times, the
copied zone placement uses the same offset as the last zone pasted.
▪ From the Batch class tree view tab or the Document class tree view tab, right-click the
sample page image. On the context menu, click Paste.
44
The Administration Module
70
Kofax Capture Administrator's Guide
If you perform a copy and paste on a previously copied index zone such as Name2, the
new index zone is created with a unique name such as Name21, Name22, and Name23.
The Export Setup window does not appear when an index zone is created using copy and
paste. Instead, the Export Setup window appears upon publishing the associated batch
class.
A copy of the index zone and corresponding index field is created with a unique name.
4 Edit the copied index zone from the Index Zone Properties window.
Editing an Index Zone
You edit index zones with the Index Zone Properties window.
1 Click an index zone on a sample page.
2 On the Home tab, in the Editing group, click Properties.
3 From the Index Zone Properties window, edit the properties.
4 Click Apply to save the changes without closing the window.
5 Click Test to apply your settings to this zone and check the results.
The Test button is unavailable if you display the sample page from the Document class
tree view tab. You must display the sample page from the Batch class tree view tab to use
QuickZones to verify recognition and cleanup settings.
6 Click OK to save the changes and close the window.
▪ Once you select the zone, you can resize it.
▪ You can select a zone to edit on the sample page or from the Definitions tree view. Just
expand the tree under the sample page and select a zone.
Creating a Shape Registration Zone
Use this procedure if you want the page registration zone to use shape recognition.
1 Select a sample page to display it.
2 On the Insert tab, in the Zones group, click the arrow under Registration, and then click
Create Shape Registration Zone.
Your cursor changes from a pointer to a cross hair.
3
Draw a rectangle around the area of your sample page to use as a shape registration
zone.
The Create Registration Zone window appears.
4 Select a shape recognition profile from the list. If none of the options in the list is
appropriate, click Edit and define a custom shape recognition profile.
5 The appropriate shape is automatically selected in the Search shape list.
6 Click Test to apply your settings to this zone and check the results.
The Test button is unavailable if you display the sample page from the Document class
tree view tab.
7 Click OK.
The new shape registration zone appears on the sample page. The registration point for
the zone appears as a bold cross hair, with the intersection of the cross hair denoting the
registration point location, which varies according to the pixel density of the shape.
Documents you may be interested
Documents you may be interested