VeryPDF PDF to Text OCR SDK for .NET converts scanned PDF files to text files.

Press release: VeryPDF PDF to Text OCR SDK for NET


Publisher: VeryPDF.com Inc

VeryPDF PDF to Text OCR SDK for .NET can recognize text from scanned documents with Optical Character Recognition technology. It can extract text from scanned PDF and even images. As a command line tool, users can implement batch process with batch scripts. Features of VeryPDF PDF to Text OCR SDK for .NET * Support command line operation which is useful for batch process. * Convert scanned PDF to editable textual files. * Recognize characters from images, such as TIFF, BMP, PNG, JPG, PCX, and TGA. * Convert specified pages of source files. * No need for a third-party PDF reader application. * Support more than ten languages (download language packages here). * Convert textual PDF to plain text file. * Extract text from encrypted PDF. * Able to retain original layouts of PDF source files (Physical Layout). * Able to convert PDF to text with reading order layout (Reading Layout). * Able to insert or remove page break characters (0x0C) between pages in text files. * Able to add additional information, such as page number, to the end of each text page. * Convert scanned PDF and image files (TIFF, BMP, PNG, JPG, PCX, TGA, etc.) to editable text files. * Able to convert scanned PDF and image files to searchable PDF files. * Create searchable PDF with original color retained, insert a hidden text layer into resultant PDF file. * Create searchable black-and-white PDF without image, contain pure text layer in PDF file. * Create searchable black-and-white PDF with image, insert a hidden text layer into resultant PDF file. * Create searchable PDF with specific color depth of image layer, e.g., Ture Color Image Layer, Grayscale Image Layer, or Black and White Image Layer. * Create TXT file containing the coordination information of text in original PDF, [X, Y, Width, Height].

Source: http://www.verypdf.com/app/pdf-to-text-ocr-converter/sdk-for-net.html