Convert PDF & Image Text to Notepad Format

Published by: Alex George on August 30, 2010
As we know, the Text in PDF or photos for user's eyes to read and not for editing or indexing. However, this situation is changed and now it is possible to convert text in PDF file to plain content using free tools. This tutorial explains how to convert words written on PDF document and image to plain text. I use free OCR tools to perform this conversion. Full form of OCR is Optical Character Recognition and this technology is used to identify image text and convert it to machine-encoded text. There are many OCR services available and some handpicked services are introduced in this guide.


Convert Image to Text

Here I take the screenshot of "about me" page of CoreNetworkZ.com and try to take words in the screenshot to real words automatically. I have saved the screenshot in different formats like JPEG, GIF, BMP, TIFF, and PNG. The sample document is uploaded to Free OCR services.
CoreNetworkZ

I have the same document saved in different formats and each one is shown exactly as after the conversion.

  1. Result: Convert bmp image to plain text

    Check the result and I must say the result would be better if I used a picture with higher quality.
    bmp

    Though this tool failed to convert with 100 percent accuracy, it has satisfactory output.

  2. Convert JPEG to real words

    This time I have uploaded a JPEG file to this free OCR service. Look at the output.
    jpeg


  3. Converting GIF to Plain Content
    For some unknown reasons, I have failed to get a converted file. I tried 3 times with this gif version but it didn't work.
    changed


  4. Convert PNG File

    After running PNG file on this free OCR tool, I received following output.
    PDF to Notepad


  5. Convert TIF file


    This time, I have run the file in TIF format using this service. See the notepad version of the output.


The tool I have used for the above experiment is: http://www.free-ocr.com/


Convert PDF Text to Notepad Content

Now let us check how to copy the sentences displayed in a PDF file to notepad. Here I am using  http://www.ocrconvert.com/ to perform this task. Just like the previous tool, this one too a free online service. Steps to extract content from PDF version to notepad are given below.
  1. Visit http://www.ocrconvert.com/

  2. Click on the browse button to upload PDF file
    Changing

  3. Conversion starts once we click Process button
    test

though these tools are helpful, some data entry job centers prefer not to use them because of the high percentage of error while copying letters from PDF files. So until a high accuracy program is developed, they need a workforce to complete data entry works.

Recent Topics
  1. Tor Proxy Review

  2. Hide specific computer Hard Disk Drive

  3. How to Turn off IE default browser setting message

No: Recent Posts
This Device is Not Configured Correctly. (Code 1)
Getting Device Manager Error Code 10
Getting 169.254.X.X (APIPA) Windows Automatic Private IP Address
How to Setup MTNL Broadband ADSL Modem
Setup Idea 4G on Android Phone
Bypass Windows Admin Account Password Of Vista, XP, Windows 8
Error 1747: The Authentication Service is Unknown
Reasons for Unexpected Automatic Reboot of Your Computer
Use Flash Memory (USB) as Virtual RAM in Windows
Setup Log Files for IP Messenger
Adobe Flash Player has Stopped a Potentially Unsafe Operation
FTP Error 503 Login Authentication Failed
Best LAN Messengers
How to Reset BSNL WiFi Modem
How to Check Your Tata Photon Plus Internet Usage
Destination Net Unreachable

2 comments:

  1. its great pretty useful i guess

    thanks for sharing...

    ReplyDelete
  2. Thanks Admin, for your valuable addition. I read your blog and it is full of posts regarding text conversions using OCR services.

    ReplyDelete

Newer Post Older Post Home