Scanned PDF to text for .NET
.NET OCR plug-in that extracts text from scanned PDFs or converts them into searchable documents from which you can select and copy text. Advanced machine learning algorithms accurately identify the structure of text and tables. The original images are placed in the background so you do not lose the content.
Buy for $99
Features and benefits
- Supports European, Cyrillic and Asian languages, including Chinese and Hindi.
- Superior recognition speed and accuracy.
- Can be used to build desktop, cloud and web applications for a number of platforms.
- Detects and recognizes all popular typefaces and font styles.
- Processes rotated, distorted and noisy images.
- Reads all image formats you can get from a scanner or camera.
- Bulk recognition of multiple images in a folder or archive.
- Recognizes the whole image or selected areas only.
- Detects and automatically corrects spelling errors.
Converting a scanned PDF to text
Digitize scanned documents or convert scans into indexable, searchable PDFs with fewer than 10 lines of code. Forget about manual retyping, save precious time and focus on the task at hand.
How to use the plug-in
- Install the Aspose.OCR package from NuGet or from a locally downloaded file.
- Load a scanned PDF into an `OcrInput` object.
- Create an instance of the Aspose.OCR recognition engine.
- Extract text from the scanned PDF.
- Output the recognized text or save it to a file.
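The steps above can be sketched roughly as follows. This is a minimal illustration, not the vendor's reference sample: the type and member names (`AsposeOcr`, `OcrInput`, `InputType.PDF`, `Recognize`, `RecognitionText`) reflect the Aspose.OCR for .NET API as commonly documented and may differ between package versions, and the file names `scan.pdf` and `scan.txt` are placeholders.

```csharp
// Assumed install step: dotnet add package Aspose.OCR
using System;
using System.IO;
using System.Text;
using Aspose.OCR;

class Program
{
    static void Main()
    {
        // Load a scanned PDF into an OcrInput object.
        // "scan.pdf" is a placeholder path.
        var input = new OcrInput(InputType.PDF);
        input.Add("scan.pdf");

        // Create an instance of the recognition engine.
        var engine = new AsposeOcr();

        // Extract text; the result is assumed to hold one entry per page.
        var results = engine.Recognize(input);

        // Output the recognized text and collect it for saving.
        var text = new StringBuilder();
        foreach (var page in results)
        {
            Console.WriteLine(page.RecognitionText);
            text.AppendLine(page.RecognitionText);
        }

        // Save the combined text to a file (placeholder name).
        File.WriteAllText("scan.txt", text.ToString());
    }
}
```

Check the API reference for the installed package version before relying on these names, since method signatures and return types have changed across releases.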
System requirements
To use the Scanned PDF to text for .NET plug-in, make sure your system meets the following requirements: