NLTM OCR

Publications

Handwritten

2024

Towards Digitizing Filled Indic Handwritten Forms

Shaon Bhattacharyya, Ajoy Mondal, and C. V. Jawahar

CVIP, 2024. [ PDF ]

Advancing Question Answering on Handwritten Documents

Aniket Pal, Ajoy Mondal, and C. V. Jawahar

CVIP, 2024. [ PDF ]

LineTR: Unified Text Line Segmentation for Challenging Palm Leaf Manuscripts

Vaibhav Agrawal, Niharika Vadlamudi, Muhammad Waseem, Amal Joseph, Sreenya Chitluri, and Ravi Kiran Sarvadevabhatla

International Conference on Pattern Recognition (ICPR 2024). [ PDF ]

Unconstrained Camera Captured Indic Offline Handwritten Dataset

Ajoy Mondal and C. V. Jawahar

International Conference on Pattern Recognition (ICPR), 2024. [ PDF ]

Competition on Recognition and VQA on Handwritten Documents

Ajoy Mondal, Vijay Mahadevan, R. Manmatha, and C. V. Jawahar

International Conference on Document Analysis and Recognition (ICDAR), 2024. [ PDF ]

Bridging the Gap in Resource for Offline English Handwritten Text Recognition

Ajoy Mondal, Krishna Tulsyan, and C. V. Jawahar

International Conference on Document Analysis and Recognition (ICDAR), 2024. [ PDF ]

Enhancing Accuracy in Indic Handwritten Text Recognition

Evani Lalitha, Ajoy Mondal, and C. V. Jawahar

International Conference on Computer Vision & Image Processing (CVIP), 2024. [ PDF ]

Semantic Labels-Aware Transformer Model for Searching over a Large Collection of Lecture-Slides

K. V. Jobin, Anand Mishra, and C. V. Jawahar

IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024. [ PDF ]

2023

Robust OCR Pipeline for Automated Digitization of Mother and Child Protection Cards in India

Devesh Pant, Dibyendu Talukder, Aaditeshwar Seth, Dinesh Pant, Rohit Singh, Brejesh Dua, Rachit Pandey, Srirama Maruthi, Mira Johri, and Chetan Arora

ACM Journal on Computing and Sustainable Societies (JCSS) [ PDF ]

ICDAR 2023 Competition on Indic Handwriting Text Recognition

Ajoy Mondal and C. V. Jawahar

17th International Conference on Document Analysis and Recognition (ICDAR), Springer, 2023. [ PDF ]

SeamFormer: High Precision Text Line Segmentation for Handwritten Documents

Niharika Vadlamudi, Rahul Krishna, and Ravi Kiran Sarvadevabhatla

17th International Conference on Document Analysis and Recognition (ICDAR), Springer, 2023. [ PDF ]

2022

Enhancing Indic Handwritten Text Recognition using Global Semantic Information

Ajoy Mondal and C. V. Jawahar

18th International Conference on Frontiers in Handwriting Recognition (ICFHR) 2022. [ PDF ]

Use of Metric Learning for the Recognition of Handwritten Digits, and its Application to Increase the Outreach of Voice-based Communication Platforms

Devesh Pant, Dibyendu Talukder, Deepak Kumar, Rachit Pandey, Aaditeshwar Seth, and Chetan Arora

ACM SIGCAS/SIGCHI Conference on Computing and Sustainable Societies (COMPASS '22), 2022. Association for Computing Machinery, New York, NY, USA, 364–374. [ PDF ]

Automatic Annotation of Handwritten Document Images at Word Level

Ajoy Mondal, Krishna Tulsyan, and C. V. Jawahar

13th Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP) 2022. [ PDF ]

Towards Robust Handwritten Text Recognition with On-the-fly User Participation

Ajoy Mondal, Rohit Saluja, and C. V. Jawahar

13th Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP) 2022. [ PDF ]

Printed

2025

IndicDLP: A Foundational Dataset for Multi-Lingual and Multi-Domain Document Layout Parsing

Oikantik Nath, Sahithi Kukkala, Mitesh Khapra, and Ravi Kiran Sarvadevabhatla

ICDAR, 2025. [ PDF ]

TexTAR: Textual Attribute Recognition in Multi-domain and Multi-lingual Document Images

Rohan Kumar, Jyothi Swaroopa Jinka, and Ravi Kiran Sarvadevabhatla

ICDAR, 2025. [ PDF ]

Adapting Vision-Language Models for Hindi OCR

Shaon Bhattacharyya, Souvik Ghosh, Prantik Deb, Ajoy Mondal, and C. V. Jawahar

ICDAR, 2025. [ PDF ]

SynSlideGen: AI-Generated Lecture Slides for Improving Slide Element Detection and Retrieval

Suyash Maniyar, Vishvesh Trivedi, Ajoy Mondal, Anand Mishra, and C. V. Jawahar

ICDAR, 2025. [ PDF ]

UniLayDet: Simple Multi-dataset Document Layout Analysis

Prasidh Srikumar, Ajoy Mondal, and C. V. Jawahar

ICDAR, 2025. [ PDF ]

EviFiVQA: A Benchmark for Evidence-Grounded Multi-Hop Reasoning in Financial VQA

Sachin Raja, Ajoy Mondal, and C. V. Jawahar

ICDAR, 2025. [ PDF ]

Attend to What I Say: Highlighting Relevant Content on Slides

Megha Mariam KM and C. V. Jawahar

ICDAR, 2025. [ PDF ]

Treading Towards Privacy-Preserving Table Structure Recognition

Sachin Raja, Ajoy Mondal, and C. V. Jawahar

WACV, 2025. [ PDF ]

2024

Enhancing Sindhi (Devanagari) OCR Performance Through MLM-BERT-Based Error Correction Model

Arvind Kaur and G. S. Lehal

Nanotechnology Perceptions 20(6), pp. 4441-4459 (2024). [ PDF ]

Layout Analysis of Punjabi Newspapers Using Contour Detection and Deep Learning-Based Model

A. Kumar and G. S. Lehal

Advances in Networks, Intelligence and Computing, pp. 121-134 (2024).

Faster CNN-Based Layout Analysis of Punjabi Newspapers Using the Custom Dataset

A. Kumar and G. S. Lehal

Smart Innovation, Systems and Technologies, 376, pp. 123–137 (2024).

CHART-Info 2024: A dataset for Chart Analysis and Recognition

Kenny Davila, Rupak Lazarus, Fei Xu, Nicole Rodriguez Alcántara, Srirangaraj Setlur, Venu Govindaraju, Ajoy Mondal, and C. V. Jawahar

ICPR, 2024. [ PDF ]

Competition on Reading Documents Through Aria Glasses

Soumya Shamarao Jahagirdar, Ajoy Mondal, Yuheng Ren, Omkar M. Parkhi, and C. V. Jawahar

International Conference on Document Analysis and Recognition (ICDAR), 2024. [ PDF ]

Towards Deployable OCR Models for Indic Languages

Minesh Mathew, Ajoy Mondal, and C. V. Jawahar

International Conference on Pattern Recognition (ICPR), 2024. [ PDF ]

SPRINT: Script-agnostic Structure Recognition in Tables

Dhruv Kudale, Badri Vishal Kasuba, Venkatapathy Subramanian, Parag Chaudhuri, and Ganesh Ramakrishnan

International Conference on Document Analysis and Recognition (ICDAR), 2024.

Printed OCR for Extremely Low-resource Indic Languages

Alik Sarkar, Ajoy Mondal, Gurpreet Singh Lehal, and C. V. Jawahar

International Conference on Computer Vision & Image Processing (CVIP), 2024. [ PDF ]

A Pipeline for Recognizing Printed Documents for Indian Languages

Krishna Tulsyan, Tessy Flemin, Ajoy Mondal, and C. V. Jawahar

CODS-COMAD, 2024. [ PDF ]

Eigen: Expert-Informed Joint Learning Aggregation for High-Fidelity Information Extraction from Document Images

Subramanian and Venkatapathy

Machine Learning for Health (ML4H), 2024. [ PDF ]

2023

UDAAN: Machine Learning based Post-Editing tool for Document Translation

Ayush Maheshwari, Ajay Ravindran, Venkatapathy Subramanian, and Ganesh Ramakrishnan

Proceedings of the 6th Joint International Conference on Data Science & Management of Data (10th ACM IKDD CODS and 28th COMAD). 2023. [ PDF ]

A Benchmark and Dataset for Post-OCR text correction in Sanskrit

Ayush Maheshwari, Nikhil Singh, Amrith Krishna, and Ganesh Ramakrishnan

EMNLP, 2023. [ PDF ]

Towards Making Flowchart Images Machine Interpretable

Shreya Shukla, Prajwal Gatti, Yogesh Kumar, Vikash Yadav, and Anand Mishra

ICDAR, 2023. [ PDF ]

TACTFUL: A Framework for Targeted Active Learning for Document Analysis

Venkatapathy Subramanian, Sagar Poudel, Parag Chaudhuri, and Ganesh Ramakrishnan

ICDAR, 2023. [ PDF ]

UTRNet: High-Resolution Urdu Text Recognition In Printed Documents

Abdur Rahman, Arjun Ghosh, and Chetan Arora

17th International Conference on Document Analysis and Recognition (ICDAR), Springer, 2023. [ PDF ]

Scene Text

2024

Visual Text Matters: Improving Text-KVQA with Visual Text Entity Knowledge-aware Large Multimodal Assistant

Abhirama Subramanyam Penamakuri and Anand Mishra

In Proceedings of the 19th Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024. [ PDF ]

Competition on Word Image Recognition from Indic Scene Images

Harsh Lunia, Ajoy Mondal, and C. V. Jawahar

International Conference on Pattern Recognition (ICPR), 2024. [ PDF ]

Indic Scene Text on the Roadside

Ajoy Mondal, Krishna Tulsyan, and C. V. Jawahar

International Conference on Document Analysis and Recognition (ICDAR), 2024. [ PDF ]

Show Me the World in My Language: Establishing the First Baseline for Scene-Text to Scene-Text Translation

Shreyas Vaidya, Arvind Kumar Sharma, Prajwal Gatti, and Anand Mishra

International Conference on Pattern Recognition (ICPR), 2024. [ PDF ]

2023

IndicSTR12: A Dataset for Indic Scene Text Recognition

Harsh Lunia, Ajoy Mondal, and C. V. Jawahar

10th International Workshop on Camera-Based Document Analysis and Recognition (CBDAR), ICDAR, Springer, 2023. [ PDF ]