VeryPDF PDF to Text OCR SDK for .NET can recognize text from scanned documents with Optical Character Recognition technology. It can extract text from scanned PDF and even images. As a command line tool, users can implement batch process with batch scripts. Features of VeryPDF PDF to Text OCR SDK for .NET * Support command line operation which is useful for batch process. * Convert scanned PDF to editable textual files. * Recognize characters from images, such as TIFF, BMP, PNG, JPG, PCX, and TGA. * Convert specified pages of source files. * No need for a third-party PDF reader application. * Support more than ten languages (download language packages here). * Convert textual PDF to plain text file. * Extract text from encrypted PDF. * Able to retain original layouts of PDF source files (Physical Layout). * Able to convert PDF to text with reading order layout (Reading Layout). * Able to insert or remove page break characters (0x0C) between pages in text files. * Able to add additional information, such as page number, to the end of each text page. * Convert scanned PDF and image files (TIFF, BMP, PNG, JPG, PCX, TGA, etc.) to editable text files. * Able to convert scanned PDF and image files to searchable PDF files. * Create searchable PDF with original color retained, insert a hidden text layer into resultant PDF file. * Create searchable black-and-white PDF without image, contain pure text layer in PDF file. * Create searchable black-and-white PDF with image, insert a hidden text layer into resultant PDF file. * Create searchable PDF with specific color depth of image layer, e.g., Ture Color Image Layer, Grayscale Image Layer, or Black and White Image Layer. * Create TXT file containing the coordination information of text in original PDF, [X, Y, Width, Height].
Source: http://www.verypdf.com/app/pdf-to-text-ocr-converter/sdk-for-net.html