Visioneer PROOCR100 User manual

Pro OCR User’s Guide

Chapter 1
Introducing Visioneer Pro OCR 100
This chapter introduces you to the Pro OCR application and to the concept of
optical character recognition (OCR).
Why Pro OCR
Pro OCR is an Optical Character Recognition (OCR) application. An OCR
application converts images of text, such as those obtained from scanning a
document or receiving a fax through your fax-modem, into editable text. For
example, when a scanner scans a page of text, it sees black and white areas on the
page. The scanner converts what it sees into an image and stores the image on the
computer. To transform a scanned text image into something a word processing or
spreadsheet application can recognize as characters, you need an OCR application,
such as Pro OCR.
Every day you may spend a lot of time retyping printed text or numbers from hard
copy documents. By using Pro OCR and a scanner as an input device, you can
eliminate much of this retyping.
Features and Highlights of Pro OCR
Many existing OCR products can recognize only 200–300 plain, nonstylized
typefaces. Using its recognition technology, Pro OCR can recognize over 2,000
typefaces.
Most basic OCR applications inspect the scanned page image, attempt to recognize
the dots on the page as characters, and transform the image into a plain text file. Pro
OCR does all of these basic tasks, but it can also bring the entire page into your word
processor or spreadsheet as is, retaining the shape, form, type, and spacing, as well
as the content, of the input page.
Pro OCR provides:
■ The ability to read one or more pages of text including graphics. Pro
OCR reads pages directly from your scanner, or it reads TIFF, PCX, and
DCX files. Pro OCR can automatically locate pictures and embed them in
your document. You can also export pictures separately in a number of file
formats.
■ Speed and accuracy of recognition. With most documents, Pro OCR is
faster than, and as accurate as, a good typist.
■ Numeric regions. You can specify that a given region on a page can contain
only numbers. Numeric regions help Pro OCR make sure that numbers are
always recognized as numbers and never mistakenly identified as letters.
■ Recognition and retention of fonts, characters, styles, and page
formatting. Pro OCR recognizes and retains the differences between serif
and sans-serif fonts, styles such as bold, underline, and subscript, and
formatting such as columns, tables, and indents.
■ Deferred and batch processing. You can perform procedures that need
your attention or interaction (for example, locating), and then do the
time-consuming steps that don’t need interaction (for example, recognizing) at
another time.
■ Internet readiness. Pro OCR supports the HTML export format. You can
convert an image file directly to an HTML page and upload it to your Web site.
■ Proofing options. Pro OCR has a number of proofing options. You can also
send recognized text directly to your word processor.
■ Save features. With Pro OCR you can save recognized text in a wide
variety of word processor and spreadsheet file formats. Pro OCR works with
imperfect input pages that may have skewed lines of text, touching or broken
characters, and fuzzy characters.
© Copyright 1998 Visioneer, Inc. Reach us at www.visioneer.com.

Copyright Information
Pro OCR User’s Guide for Windows. Copyright ©1998 Visioneer, Inc. All rights
reserved.
Reproduction, adaptation, or translation without prior written permission is
prohibited, except as allowed under the copyright laws.
AnyPort, AutoFix, AutoLaunch, FormTyper, MicroChrome, PaperEnable,
PaperLaunch, PaperPort, PaperPort Deluxe, PaperPort ix, PaperPort Links,
PaperPort mx, PaperPort PowerBar, PaperPort 3000, PaperPort 6000, PaperPort vx,
PaperPortation, PaperPort Strobe, Pro OCR, ScanDirect, SimpleSearch, SharpPage,
and Visioneer are trademarks of Visioneer, Inc. PaperPort, Paper-driven, and the
Visioneer logo are registered trademarks of Visioneer, Inc.
Microsoft is a U.S. registered trademark of Microsoft Corporation. Windows is a
trademark of Microsoft Corporation. TextBridge is a registered trademark of Xerox
Corporation. ZyINDEX is a registered trademark of ZyLAB International, Inc.
ZyINDEX toolkit portions, Copyright © 1990–1996, ZyLAB International, Inc. All
Rights Reserved. All other products mentioned herein may be trademarks of their
respective companies.
Information is subject to change without notice and does not represent a
commitment on the part of Visioneer, Inc. The software described is furnished
under a licensing agreement. The software may be used or copied only in
accordance with the terms of such an agreement. It is against the law to copy the
software on any medium except as specifically allowed in the licensing agreement.
No part of this document may be reproduced or transmitted in any form or by any
means, electronic or mechanical, including photocopying, recording, or information
storage and retrieval systems, or translated to another language, for any purpose
other than the licensee’s personal use and as specifically allowed in the licensing
agreement, without the express written permission of Visioneer, Inc.
Part Number: 05-0340-000
Restricted Rights Legend
Use, duplication, or disclosure is subject to restrictions as set forth in contract
subdivision (c)(1)(ii) of the Rights in Technical Data and Computer Software
Clause 52.227-FAR14. Material scanned by this product may be protected by
governmental laws and other regulations, such as copyright laws. The customer is
solely responsible for complying with all such laws and regulations.
Visioneer’s Limited Product Warranty
If you find physical defects in the materials or the workmanship used in making the
product described in this document, Visioneer will repair, or at its option, replace,
the product at no charge to you, provided you return it (postage prepaid, with proof
of your purchase from the original reseller) during the 12-month period after the
date of your original purchase of the product.
THIS IS VISIONEER’S ONLY WARRANTY AND YOUR EXCLUSIVE
REMEDY CONCERNING THE PRODUCT. ALL OTHER
REPRESENTATIONS, WARRANTIES OR CONDITIONS, EXPRESS OR
IMPLIED, WRITTEN OR ORAL, INCLUDING ANY WARRANTY OF
MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE OR NON-
INFRINGEMENT, ARE EXPRESSLY EXCLUDED. AS A RESULT, EXCEPT
AS SET OUT ABOVE, THE PRODUCT IS SOLD “AS IS” AND YOU ARE
ASSUMING THE ENTIRE RISK AS TO THE PRODUCT’S SUITABILITY TO
YOUR NEEDS, ITS QUALITY AND ITS PERFORMANCE.
IN NO EVENT WILL VISIONEER BE LIABLE FOR DIRECT, INDIRECT,
SPECIAL, INCIDENTAL OR CONSEQUENTIAL DAMAGES RESULTING
FROM ANY DEFECT IN THE PRODUCT OR FROM ITS USE, EVEN IF
ADVISED OF THE POSSIBILITY OF SUCH DAMAGES.
All exclusions and limitations in this warranty are made only to the extent permitted
by applicable law and shall be of no effect to the extent in conflict with the express
requirements of applicable law.
FCC Radio Frequency Interference Statement
This equipment has been tested and found to comply with the limits for a Class B
digital device, pursuant to part 15 of the FCC Rules. These limits are designed to
provide reasonable protection against interference in a residential installation. This
equipment generates, uses, and can radiate radio frequency energy and, if not
installed and used in accordance with the instructions, may cause harmful
interference to radio communications. However, there is no guarantee that
interference will not occur in a particular installation. If this equipment does cause
harmful interference to radio or television reception, which can be determined by
turning the equipment off and on, the user is encouraged to try to correct the
interference by one or more of the following measures:
■ Reorient or relocate the receiving antenna.
■ Increase the separation between the equipment and receiver.
■ Connect the equipment into an outlet on a circuit different from that to
which the receiver is connected.
■ Consult the dealer or an experienced radio/TV technician for help.
This equipment has been certified to comply with the limits for a class B computing
device, pursuant to FCC Rules. In order to maintain compliance with FCC
regulations, shielded cables must be used with this equipment. Operation with non-
approved equipment or unshielded cables is likely to result in interference to radio
and TV reception. The user is cautioned that changes and modifications made to the
equipment without the approval of the manufacturer could void the user's authority to
operate this equipment.
This device complies with part 15 of the FCC Rules. Operation is subject to the
following two conditions: (1) This device may not cause harmful interference, and
(2) this device must accept any interference received, including interference that
may cause undesired operation.

Contents
Chapter 1: Introducing Visioneer Pro OCR 100
    Why Pro OCR
    Features and Highlights of Pro OCR
Chapter 2: Learning Pro OCR Basics
Chapter 3: Getting Documents
Chapter 4: Locating Text and Graphics
Chapter 5: Setting Recognize Options and Proofing a Recognized Document
Chapter 6: Saving and Printing Documents
Chapter 7: Creating and Processing Deferred and Batch Jobs
Chapter 8: Tips for Getting the Best Results
Glossary

Glossary
A4 Letter page size
accelerator key
ADF
alphanumeric word
ASCII
As Single Column locating method
Auto OCR
Auto brightness
automatic document feeder (ADF)
automatic processing
background noise
backup
backwards compatible
bit image
bitmap
bitmapped character
bold text
brightness
broken character
built-in dictionary
CCITT
character
character format
character identification error
character image
character recognition
character style
clipboard
column information
compression
confidence
consistent document
copyrighted document
deferred job
deferred processing
degraded image
dialog box
desktop
document area
dots per inch (dpi)
dpi
draft quality text
driver
exporting
export format
file extension
file formats
file type
fine resolution
flatbed scanner
font
font family
font mapping
format retention
Gallery
Get Page
grayscale image
hard page breaks
heavy character
I-beam pointer
icon
illegible character
illegible character symbol
image view
input file formats
insertion point
italic text
justification
kerning
landscape orientation
layout
layout analysis error
Legal page size
Lenient suspect threshold
letter quality text
line break
Locate
locate region
locating
locating method
menu
menu bar
multi-column text
monospaced font
monospaced font mapping
newspaper style columns
Normal locating method
Normal suspect threshold
numeric region
OCR
On-Screen Verifier™
Optical Character Recognition (OCR)
order of text regions
orientation
output file formats
page controls
page format
page image
page number box
page orientation
page size
page source
PCX
picture element
picture region
pixel
pixel-for-pixel
plain text
portrait orientation
printer font
Pro OCR Deferred format
Pro OCR format
Pro OCR process
Pro OCR window
Proof
proportionally spaced font
recognition accuracy
Recognize
recognized text
recognizing
region style
resolution
Rich Text Format (RTF)
RTF
sans serif
sans serif font mapping
scanner
scanner driver
scanning
screen font
scroll bars
serif
serif font mapping
settings file
sheetfed scanner
side-by-side columns
single-bit image
single-step processing
skewed text
spell checking
standard resolution
status bar
status display area
Stringent suspect threshold
stroke weight
Style ribbon
stylized font
subscript text
superscript text
supplementary dictionaries
suspect character
suspect threshold
Tag Image File Format
template
template matching
Template locating method
text quality
text region
text style
text view
throughput
TIFF
touching characters
typeface
type quality
type size
type style
underline text
User Defined page size
user dictionary
view selector
window
Windows
word wrap
zoom controls