Wednesday, November 13, 2024

More results...

Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages

Smart Glass That Can Read Books

efy tested sani theoWe have seen many smart glasses to detect the distance between objects or help visually challenged persons to recognise people’s faces. Here we extend it to the next level using optical character recognition (OCR). Expanding the DIY project to its next version, we are now giving details to make a smart glass that can read text using OCR and computer vision. A camera mounted on the front of the glass for OCR and computer vision will thus allow visually challenged people also to read. In this version, you will also need a pair of goggles.

Prerequisites

First, we need to install some of the required modules and libraries for the project. As the smart glass is based on real-time video processing, by using OCR we can extract the text from the pages that may include images. Here, a module is needed that can capture images from a camera. By using a speech synthesis module, the text in those captured images can be converted into speech.

First, we need to install the following libraries:

- Advertisement -
  • OpenCV
  • PyTesseract
  • eSpeak

To install them, use the commands below:
sudo apt-get install espeak
sudo apt-get install espeak python-espeak
sudo pip3 install opencv2

Coding

Import the required libraries to the code and then set the path where the video frames are to be saved for text extraction.

- Advertisement -
Code snippet importing Python modules
Fig. 1: Code snippet importing Python modules
Code snippet for capturing video and processing it for OCR
Fig. 2: Code snippet for capturing video and processing it for OCR
Testing code
Fig. 3: Testing code

Create a while loop in the code, which will capture real-time video from the camera. Using cv2, convert the image into BGR and save it to the path previously set. Then call PyTesseract that will open the saved video frame for processing the image and extracting text from it. By using eSpeak, the speech engine will convert all that text into audio and read it.

Author’s device attached with the camera on eyeglass
Fig. 4: Author’s device attached with the camera on eyeglass
Author testing the device
Fig. 5: Author testing the device

Note

For a clear voice output that does not sound robotic you can use paid or other text-to-speech services like gtts, watson speech, etc.
Testing

Fix the camera onto the eyeglass and run the code. On putting a book in front of the camera and waiting for a few minutes without any movement, it will automatically start reading the book. To hear it, connect your earphones to the Raspberry Pi headphone TRRS jack or any speaker with amplifier. You can also connect any Bluetooth earphone.

Download Source Code


Ashwini Kumar Sinha is an electronics hobbyist and tech journalist at EFYi

Ashwini Sinha
Ashwini Sinha
A tech journalist at EFY, with hands-on expertise in electronics DIY. He has an extraordinary passion for AI, IoT, and electronics. Holder of two design records and two times winner of US-China Makers Award.

SHARE YOUR THOUGHTS & COMMENTS

EFY Prime

Unique DIY Projects

Electronics News

Truly Innovative Electronics

Latest DIY Videos

Electronics Components

Electronics Jobs

Calculators For Electronics