Omnipage Pro 14 - How do I save OCR data in TIFF so I can search document with Home XP Documents sea

RJ

I want to scan documents, save the image for legal reasons but be able to
search the documents with the Start/Search/Documents utility.

I have Omnipage Pro 14 on a Home XP system. Omnipage OCR is much more
accurate than the MS Document Imaging so I'd like to use it.

However, when I scan a document, OCR it and save it to TIFF format the file
does not show up as being OCR'd by XP and documents scanned with Omnipage
don't show up in Start/Search/Documents search. Documents scanned and OCR'd
with Office Document Imaging are found correctly by the
Start/Search/Documents.

How do I save my OCR'd Omnipage documents in a TIFF so the data is available
to MS Document search?

I realize I could save as TIFF and a searchable Word document, but I'd rather
have a single source picture/OCR data file. I could save as PDF, but I
believe XP cannot search a PDF file. I'd appreciate any advice.

thanks
rj
 
joust in jest

ScanSoft OmniPage Pro 14 is neither a part of MS Office, nor is it a
Microsoft product. If you bothered to read ScanSoft OmniPage Pro 14's
excellent Help files, you would know that....

TIFF is a GRAPHICS file format -- when you save your OCR as a TIFF, you are
not saving it as editable / searchable text -- you are saving it as a
PICTURE of text. Since there is, by definition, no text in a graphic file,
neither MS Windows, nor any other program, can find text within a graphic
file.
 
T Shadow

TIFF is an image (picture) format. It is not searchable; it's just dots. I
think you mean .RTF, a text format that allows different fonts and is
searchable. You could open the files in Word and save them as .DOC. It would
be easier to search for the text using *.rtf from the Start button. Having
the original image (picture) and being able to search it requires two files:
the text/doc file and the image file.
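
A rough sketch of that two-file arrangement, in Python: each scan is kept as an image plus a text file with the same base name, and a search reads only the text files but reports the matching images. The function name `find_scans`, the `.txt`/`.tif` extensions and the demo file names are all assumptions made for illustration; nothing here is part of OmniPage or Windows.

```python
import tempfile
from pathlib import Path

def find_scans(folder, phrase, text_ext=".txt", image_ext=".tif"):
    """Return images whose same-named text file contains phrase (case-insensitive)."""
    needle = phrase.lower()
    hits = []
    for text_file in sorted(Path(folder).glob(f"*{text_ext}")):
        content = text_file.read_text(encoding="utf-8", errors="ignore")
        if needle in content.lower():
            image = text_file.with_suffix(image_ext)
            if image.exists():  # only report scans that still have their image
                hits.append(image)
    return hits

# Demo with made-up files: two scans, each with an OCR text sidecar.
with tempfile.TemporaryDirectory() as tmp:
    folder = Path(tmp)
    (folder / "contract.tif").write_bytes(b"placeholder image data")
    (folder / "contract.txt").write_text("Lease agreement for the office", encoding="utf-8")
    (folder / "invoice.tif").write_bytes(b"placeholder image data")
    (folder / "invoice.txt").write_text("Invoice number 17", encoding="utf-8")
    demo_hits = [p.name for p in find_scans(folder, "lease")]

print(demo_hits)
```

The image stays untouched as the record of the original page, and only the text file has to be in a format the search tool can read.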
HTH
 
Anders Thulin

RJ said:
How do I save my OCR'd Omnipage documents in a TIFF so the data is available
to MS Document search?

You can't. OCR takes you from an image file (like TIFF) to a text file or
word processor file. That kind of file can be word-indexed for searches. Going back
to an image file loses that benefit: you convert words to pixels, and lose the words.
And you need to OCR it over again to get back at the words.
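
To make the "words to pixels" point concrete, here is a toy Python sketch. It draws the word "HI" with a made-up 3x5 bitmap font (the `FONT` table and both helper functions are invented for this illustration and cover only those two letters), then looks for the characters in the resulting pixel samples.

```python
# Hypothetical 3x5 bitmap font; '#' is ink, '.' is paper.
FONT = {
    "H": ["#.#", "#.#", "###", "#.#", "#.#"],
    "I": ["###", ".#.", ".#.", ".#.", "###"],
}

def render(text):
    """Rasterize text into rows of 0/1 pixels, with one blank column after each glyph."""
    rows = []
    for y in range(5):
        row = []
        for ch in text:
            row.extend(1 if c == "#" else 0 for c in FONT[ch][y])
            row.append(0)  # spacing column
        rows.append(row)
    return rows

def to_samples(rows):
    """Flatten the grid into raw 8-bit samples (255 for ink, 0 for paper)."""
    return bytes(255 if pixel else 0 for row in rows for pixel in row)

pixels = to_samples(render("HI"))
print(len(pixels))      # 5 rows x 8 columns = 40 samples
print(b"HI" in pixels)  # False: the samples are only 0s and 255s
```

A text search over those samples finds nothing, because the letters exist only as a pattern of dots; getting the word back means recognizing the shapes again, which is what OCR does.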

If you want to keep the original image, but with the text for searches,
PDF (as you note) or perhaps DjVu would be the right choices.

RJ said:
I could save as PDF but I
believe XP cannot search a PDF file.

Adobe Reader can -- and you're going to need it to access the PDF file anyway.
If you have very many PDF files, talk to Adobe -- I'm almost sure they have
a solution. It may not fit your budget, though.

I believe DjVu now is capable of something similar, but I don't think
many OCR programs can save to that format directly.
 
Graham Jones

Anders Thulin said:
Adobe Reader can -- and you're going to need for accessing the PDF file
anyway.
If you have very many PDF files, talk to Adobe -- I'm almost sure they have
a solution. It may not fit your budget, though.

For a cheap way to search PDF files, you could try SearchWithin from
http://www.software995.com/
 
CSM1

Well, in the first place, TIFF is an image-only format (a picture).

To have a searchable document, it must be OCR'ed to a text format such as
Microsoft Word or a plain TXT file. You can also save a searchable PDF from
Omnipage Pro 14.

To search a PDF you need Adobe Acrobat Reader.
http://www.adobe.com/products/acrobat/readstep2.html
 
friend®

*I want to scan documents, save the image for legal reasons but be able to
*search the documents with the Start/Search/Documents utility.

Images and RTF or DOC documents are inherently unsafe: they can be easily
manipulated afterwards. PDF files can be manipulated too, but all versions
are saved; the file can be amended, not deleted and replaced, and
all versions can be checked for changes. The best solution is to use
Adobe Acrobat 6. It scans the document and can OCR within the
program, giving you several options for saving: as an image, as image
over text, or as text. For legal reasons, saving image over text is
best. Acrobat 6 can search multiple files in a subdirectory for
a string of characters without creating indices. Its OCR is not as good as
FineReader 7, but it all depends on type size and face. Get a demo
version of Acrobat 6 and try it; if it is work-related, claim it against taxes.
Images saved by Acrobat 6 can use JPEG, JPEG 2000 and probably other
compression methods as well. JPEG is not good for text, but JPEG 2000
is OK, with smaller file size for equal quality.
As a matter of fact, winxp is piece of shit, get 2000 and use 3rd
party real programs, not cut down bloatware from µshit.
*
*I have Omnipage Pro 14 on a Home XP system. Omnipage OCR is much more
*accurate than the MS Document Imaging so I'd like to use it.

Omnipage is extremely expensive.
 
