As we know, the Text in PDF or photos for user's eyes to read and not for editing or indexing. However, this situation is changed and now it is possible to convert text in PDF file to plain content using free tools. This tutorial explains how to convert words written on PDF document and image to plain text. I use free OCR tools to perform this conversion. Full form of OCR is Optical Character Recognition and this technology is used to identify image text and convert it to machine-encoded text. There are many OCR services available and some handpicked services are introduced in this guide.
Here I take the screenshot of "about me" page of CoreNetworkZ.com and try to take words in the screenshot to real words automatically. I have saved the screenshot in different formats like JPEG, GIF, BMP, TIFF, and PNG. The sample document is uploaded to Free OCR services.
I have the same document saved in different formats and each one is shown exactly as after the conversion.
The tool I have used for the above experiment is: http://www.free-ocr.com/
Now let us check how to copy the sentences displayed in a PDF file to notepad. Here I am using http://www.ocrconvert.com/ to perform this task. Just like the previous tool, this one too a free online service. Steps to extract content from PDF version to notepad are given below.
though these tools are helpful, some data entry job centers prefer not to use them because of the high percentage of error while copying letters from PDF files. So until a high accuracy program is developed, they need a workforce to complete data entry works.
Convert Image to Text
Here I take the screenshot of "about me" page of CoreNetworkZ.com and try to take words in the screenshot to real words automatically. I have saved the screenshot in different formats like JPEG, GIF, BMP, TIFF, and PNG. The sample document is uploaded to Free OCR services.
I have the same document saved in different formats and each one is shown exactly as after the conversion.
- Result: Convert bmp image to plain text
Check the result and I must say the result would be better if I used a picture with higher quality.
Though this tool failed to convert with 100 percent accuracy, it has satisfactory output.
- Convert JPEG to real words
This time I have uploaded a JPEG file to this free OCR service. Look at the output.
- Converting GIF to Plain Content
For some unknown reasons, I have failed to get a converted file. I tried 3 times with this gif version but it didn't work.
- Convert PNG File
After running PNG file on this free OCR tool, I received following output.
- Convert TIF file
This time, I have run the file in TIF format using this service. See the notepad version of the output.
The tool I have used for the above experiment is: http://www.free-ocr.com/
Convert PDF Text to Notepad Content
Now let us check how to copy the sentences displayed in a PDF file to notepad. Here I am using http://www.ocrconvert.com/ to perform this task. Just like the previous tool, this one too a free online service. Steps to extract content from PDF version to notepad are given below.
- Visit http://www.ocrconvert.com/
- Click on the browse button to upload PDF file
- Conversion starts once we click Process button
though these tools are helpful, some data entry job centers prefer not to use them because of the high percentage of error while copying letters from PDF files. So until a high accuracy program is developed, they need a workforce to complete data entry works.
Recent Topics
- Tor Proxy Review
- Hide specific computer Hard Disk Drive
- How to Turn off IE default browser setting message
its great pretty useful i guess
ReplyDeletethanks for sharing...
Thanks Admin, for your valuable addition. I read your blog and it is full of posts regarding text conversions using OCR services.
ReplyDelete