Transliteration of English text in the image to kannada
Task here is to transliteration of english words present in images.
It consists of three sub-task:-
Step 1
The first step is to identify the location of English text in the image. Here I have used detectron for text detection. It should not recognize any other language text like Hindi etc. I have trained a model with a dataset of two languages (English and Hindi) with nearly 200 images. The model accurately identifies between English and Hindi. Model tries to put identified text into these classes if text in any other language is detected.
The structure of the project is lil messed up, initially, I was training using my laptop, but for object detection, I had to move to collab, since detectron is not supported for windows. I didn’t want to face any trouble later.
Step 2
This model’s architecture is taken from https://arxiv.org/pdf/1507.05717.pdf.
Once the english text is detected, the model will extract the features from the image, Either pretrained model or custom feature architecture can be used. Details are available in the paper.
Step 3
Sequence-to-Sequence model with Attention Mechanism are used for Transliteration. Two layers of GRU are used in this model. Thanks to padhAI online course for providing the starter code.
Once all the models have been trained, it can be used for Transliteration.
Limitations
Models - https://drive.google.com/drive/folders/19q-89TjeyaM2d8KbWdjWOjJV1Ow1ARi2?usp=sharing
Datasets link -
Credits.