Tesseract OCR (Image to Text) using Python

Python-tesseract is an optical character recognition (OCR) tool for python. That is, it will recognize and “read” the text embedded in images.

Python-tesseract is a wrapper for Google’s Tesseract-OCR Engine. It is also useful as a stand-alone invocation script to tesseract, as it can read all image types supported by the Pillow and Leptonica imaging libraries, including jpeg, png, gif, bmp, tiff, and others. Additionally, if used as a script, Python-tesseract will print the recognized text instead of writing it to a file.

import pytesseract
from pytesseract import Output
import cv2
import numpy as np
import pandas as pd
import io
from io import BytesIO
import sys
import os
import tempfile
import csv
import json
from csv import writer

def main(filename):

    #for windows user
    pytesseract.pytesseract.tesseract_cmd = r'C:\Program Files\Tesseract-OCR\tesseract'
    img = cv2.imread(filename)
    d = pytesseract.image_to_data(img, output_type=Output.DICT)  
    json_format=json.dumps(d)    
    print(json_format)
    #with open("sample.json", "w") as outfile: 
    #json.dump(d, outfile)


if __name__ == "__main__":
    #id=2
    image_path = 'image.png'
    main(image_path )

1 Comment

Jaclyn

October 7, 2021 at 2:30 pm

Hi! You need to post even more pictures right into
your short articles. That would certainly be better.

Tesseract OCR (Image to Text) using Python

1 Comment

Leave a Reply Cancel reply

About

Satish Sharma

Recent Posts

Tesseract OCR (Image to Text) using Python

Share this post:

Related Posts:

You might also like

Building a Chat with PDF App Using LLMs

Unlocking the Power of DeepSeek: A Python Guide to Enhanced Chat Applications

Revolutionizing Business with DeepSeek: AI-Driven Solutions for the Future

Simulating 3D Point Clouds from 2D KITTI LiDAR Dataset using Open3D

Join our newsletter

1 Comment

Leave a Reply Cancel reply

About

Satish Sharma

Subscribe and Follow

Recent Posts