Free download PDF to TXT OCR Converter

PDF to TXT OCR Converter

2.0
VeryPDF PDF to TXT OCR Converter is a Command Line application uses OCR technology to OCR PDF documents to editable TXT
Free Download
User rating
0
0 votes
License
Shareware
OS
Windows
Developer
Version
2.0
Language
English
Release date
5 September 2010

Editor's review

This is a command line application that lets you convert scanned PDF documents into editable text through OCR.

VeryPDF PDF to TXT OCR Converter is a Command Line application. It uses Optical Character Recognition technology to convert PDF documents to editable TXT files. There is no need for Adobe Acrobat software. A range of image formats including TIFF, BMP, PNG, JPG, PCX, TGA, etc. are supported. It is possible to specify a single page, a range of pages or even the complete document. The tool can also handle several other languages besides English. These include German, French, Spanish, Italian and others. You can handle encrypted and password protected PDF files also quite easily. The original layout available in the source document is maintained after conversion. The quality of the OCR conversion process depends largely on the quality of the scanned image and the clarity of the characters of that image.

Thus some amount of image preprocessing is essential before submitting to the recognition process. De-speckling and de-skewing are essential processes that need to be done. General enhancement of contrast and brightness goes a long way to improve the recognition rate. This is significant as even at 5% failure the amount of editing that`ll be required builds up substantially when the document is large in volume. Some filters also may be effective, particularly the edge enhancement types. These additional processing will call for a suitable editor and you need to keep that in mind when planning your workflow. This is a handy tool if you need to carry out large amounts of character recognition often.

Publisher's description

VeryPDF PDF to TXT OCR Converter is a Command Line application uses Optical Character Recognition technology to OCR PDF documents to editable TXT files, PDF to TXT OCR Converter needn't Adobe Acrobat software.

PDF to TXT OCR Converter Command Line has following features:
1. PDF to TXT OCR Converter converts scanned PDF files to editable text files;
2. PDF to TXT OCR Converter converts scanned image files (TIFF, BMP, PNG, JPG, PCX, TGA, etc.) to editable text files;
3. PDF to TXT OCR Converter has a fast OCR engine, 92% faster than other OCR software;
4. PDF to TXT OCR Converter supports page selection, OCR single, range or all pages at a time;
5. PDF to TXT OCR Converter supports over 10 Languages, Besides English, PDF to TXT OCR Converter also supports German, French, Spanish, Italian and many Languages else;
6. PDF to TXT OCR Converter converts text based PDF documents to text format, Fast, Accurate, Free Trial;
7. PDF to TXT OCR Converter supports command line operation (for manual use or inclusion in scripts);
8. PDF to TXT OCR Converter does NOT need Adobe Acrobat or free Acrobat Reader software;
9. PDF to TXT OCR Converter supports all Windows platforms;
10. PDF to TXT OCR Converter supports extract text from encrypted PDF files and password protected documents;
11. PDF to TXT OCR Converter able to convert PDF file to text file and maintain original physical layout;
12. PDF to TXT OCR Converter able to convert PDF file to text file with reading order layout;
PDF to TXT OCR Converter
PDF to TXT OCR Converter
Version 2.0
Free Download

User comments

Rate this program