Handwritten
2026
Learning Beyond Labels: Self-Supervised Handwritten Text Recognition
Shree Mitra, Ajoy Mondal, and C V Jawahar
In Winter Conference on Applications of Computer Vision (WACV), 2026. [ PDF ]
2025
NoTeS-Bank: Benchmarking Neural Transcription and Search for Scientific Notes Understanding
Aniket Pal, Sanket Biswas, Alloy Das, Ayush Lodh, Priyanka Banerjee, Soumitri Chattopadhyay, Dimosthenis Karatzas, Josep Llados, and C V Jawahar
In International Conference on Document Analysis and Recognition (ICDAR), 2025. [ PDF ]
HW-MLVQA: A Novel Handwritten Multilingual Dataset for Visual Question Answering and Evaluation
Aniket Pal, Ajoy Mondal, and C. V. Jawahar
International Journal of Document Analysis and Recognition (IJDAR), Springer, 2025. [ PDF ]
SemiHastakshar: Generalizable Indic Handwritten OCR through Semi-Supervised Learning
Lalitha Evani, Ajoy Mondal, and C V Jawahar
In Indian Conference on Vision Graphics and Image Processing (ICVGIP), 2025. [ PDF ]
Unveiling Text in Challenging Stone Inscriptions: A Character-Context-Aware Patching Strategy for Binarization
Pratyush Jena, Amal Joseph, Arnav Sharma, and Ravi Kiran Sarvadevabhatla
In Indian Conference on Vision Graphics and Image Processing (ICVGIP), 2025. [ PDF ]
FormLens: From Ink to Insight with Adapting Vision-Language Models for Handwritten Form Digitization
Shaon Bhattacharya, Ajoy Mondal, and C V Jawahar
In Indian Conference on Vision Graphics and Image Processing (ICVGIP), 2025. [ PDF ]
Handwritten Notes Understanding Challenge
Aniket Pal, Sanket Biswas, Alloy Das, Ayush Lodh, Priyanka Banerjee, Ajoy Mondal, Dimosthenis Karatzas, Josep Lladós, and C. V. Jawahar
ICDAR 2025. [ PDF ]
2024
Towards Digitizing Filled Indic Handwritten Forms
Shaon Bhattacharyya, Ajoy Mondal, and C. V. Jawahar
CVIP, 2024. [ PDF ]
Advancing Question Answering on Handwritten Documents
Aniket Pal, Ajoy Mondal, and C. V. Jawahar
CVIP, 2024. [ PDF ]
LineTR: Unified Text Line Segmentation for Challenging Palm Leaf Manuscripts
Vaibhav Agrawal, Niharika Vadlamudi, Muhammad Waseem, Amal Joseph, Sreenya Chitluri, and Ravi Kiran Sarvadevabhatla
International Conference on Pattern Recognition (ICPR 2024). [ PDF ]
Unconstrained Camera Captured Indic Offline Handwritten Dataset
Ajoy Mondal and C. V. Jawahar
International Conference on Pattern Recognition (ICPR), 2024. [ PDF ]
Competition on Recogntion and VQA on Handwritten Documents
Ajoy Mondal, Vijay Mahadevan, R. Manmatha, and C. V. Jawahar
International Conference on Document Analysis and Recognition (ICDAR) , 2024. [ PDF ]
Bridging the Gap in Resource for Offline English Handwritten Text Recognition
Ajoy Mondal, Krishna Tulsyan, and C V Jawahar
International Conference on Document Analysis and Recognition (ICDAR) , 2024. [ PDF ]
Enhancing Accuracy in Indic Handwritten Text Recognition
Evani Lalitha, Ajoy Mondal, and C. V. Jawahar,
International Conference on Computer Vision & Image Processing ( CVIP ), 2024 [ PDF ]
Semantic Labels-Aware Transformer Model for Searching over a Large Collection of Lecture-Slides
K. V. Jobin, Anand Mishra, and C. V. Jawahar
IEEE CVF Winter Conference on Applications of Computer Vision ( WACV 2024 ) [ PDF ]
2023
Robust OCR Pipeline for Automated Digitization of Mother and Child Protection Cards in India
Devesh Pant, Dibyendu Talukder, Aaditeshwar Seth, Dinesh Pant, Rohit Singh, Brejesh Dua, Rachit Pandey, Srirama Maruthi, Mira Johri, and Chetan Arora
ACM Journal on Computing and Sustainable Societies (JCSS) [ PDF ]
ICDAR 2023 Competition on Indic Handwriting Text Recognition
Ajoy Mondal, and C. V. Jawahar
17th International Conference on Document Analysis and Recognition (ICDAR), Springer, 2023. [ PDF ]
SeamFormer: High Precision Text Line Segmentation for Handwritten Documents
Niharika Vadlamudi, Rahul Krishna, and Ravi Kiran Sarvadevabhatla
17th International Conference on Document Analysis and Recognition (ICDAR), Springer, 2023. [ PDF ]
2022
Enhancing Indic Handwritten Text Recognition using Global Semantic Information
Ajoy Mondal and C. V. Jawahar
18th International Conference on Frontiers in Handwriting Recognition (ICFHR) 2022. [ PDF ]
Use of Metric Learning for the Recognition of Handwritten Digits, and its Application to Increase the Outreach of Voice-based Communication Platforms
Devesh Pant, Dibyendu Talukder, Deepak Kumar, Rachit Pandey, Aaditeshwar Seth, and Chetan Arora
In ACM SIGCAS/SIGCHI Conference on Computing and Sustainable Societies, 2022, (COMPASS) (COMPASS '22). Association for Computing Machinery, New York, NY, USA, 364–374 [ PDF ]
Automatic Annotation of Handwritten Document Images at Word Level,
Ajoy Mondal, Krishna Tulsyan, and C. V. Jawahar
13th Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP) 2022. [ PDF ]
Towards Robust Handwritten Text Recognition with On-the-fly User Participation
Ajoy Mondal, Rohit Saluja, and C. V. Jawahar
13th Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP) 2022. [ PDF ]
Printed
2026
UniTabBank: A Large Scale Multi-Lingual, Multi-Layout, Multi-Type, Multi-Format Dataset for Table Detection
Ajoy Mondal, Saumya Mundra, Avijit Dasgupta, and C V Jawahar
In Winter Conference on Applications of Computer Vision (WACV), 2026. [ PDF ]
Printed
2025
Treading Towards Privacy-Preserving Table Structure Recognition
Sachin Raja, Ajoy Mondal, and C V Jawahar
In Winter Conference on Applications of Computer Vision (WACV), 2025. [ PDF ]
EviFiVQA: A Benchmark for Evidence-Grounded Multi-hop Reasoning in Financial VQA
Sachin Raja, Ajoy Mondal, and C V Jawahar
International Conference on Document Analysis and Recognition (ICDAR), Springer, Wuhan, 2025. [ PDF ]
Label-Free Adaptation of Indic Printed OCR
Akash Manna, Radha Krishna Deshpande, Ajoy Mondal, and C V Jawahar
Indian Conference on Computer Vision, Graphics, and Image Processing (ICVGIP) , 2025. [ PDF ]
Are We There Yet? Assessing the Capabilities of MLLMs in Assistive AI Applications
Shayon Dasgupta, Avijit Dasgupta, and C V Jawahar
Indian Conference on Vision Graphics and Image Processing (ICVGIP), 2025. [ PDF ]
From Words to Paragraphs: Hierarchical Dense Text Detection with SAM-Adaptive Backbone
Shashank Krishna Vempatti , Gaurav Talebailkar , Sai Prabhath Bogam Bhaskar Arun , Kayalvizhi Ganesan, and Chetan Arora
Indian Conference on Vision Graphics and Image Processing (ICVGIP), 2025. [ PDF ]
When Big Models Train Small Ones: Label-Free Model Parity Alignment for Efficient Visual Question Answering using Small VLMs
Abhirama Subramanyam Penamakuri*, Navlika Singh*, Piyush Arora*, and Anand Mishra (*: Equal contribution)
Conference on Empirical Methods in Natural Language Processing (EMNLP 2025). [ PDF ]
From Pixels to Tables: Reconstructing Complex Tables from Document Images
Sachin Raja, Ajoy Mondal, and C. V. Jawahar
IJDAR, 2025. [ PDF ]
IndicDLP : A Foundational Dataset for Multi-Lingual and Multi-Domain Document Layout Parsing
Oikantik Nath, Sahithi Kukkala, Mitesh Khapra, and Ravi Kiran Sarvadevabhatla
ICDAR, 2025. [ PDF ]
TexTAR - Textual Attribute Recognition in Multi-domain and Multi-lingual Document Images
Rohan Kumar , Jyothi Swaroopa Jinka, and Ravi Kiran Sarvadevabhatla
ICDAR, 2025. [ PDF ]
Adapting Vision-Language Models for Hindi OCR
Shaon Bhattacharyya, Souvik Ghosh, Prantik Deb, Ajoy Mondal, and C. V. Jawahar
ICDAR, 2025. [ PDF ]
SynSlideGen: AI-Generated Lecture Slides for Improving Slide Element Detection and Retrieval
Suyash Maniyar, Vishvesh Trivedi, Ajoy Mondal, Anand Mishra, and C.V. Jawahar
ICDAR, 2025. [ PDF ]
UniLayDet: Simple Multi-dataset Document Layout Analysis
Prasidh Srikumar, Ajoy Mondal, and C. V. Jawahar
ICDAR, 2025. [ PDF ]
EviFiVQA: A Benchmark for Evidence-Grounded Multi-Hop Reasoning in Financial VQA
Sachin Raja, Ajoy Mondal, and C. V. Jawahar
ICDAR, 2025. [ PDF ]
Attend to What I Say: Highlighting Relevant Content on Slides
Megha Mariam KM ,and C.V.Jawahar
ICDAR, 2025. [ PDF ]
Treading Towards Privacy-Preserving Table Structure Recognition
Sachin Raja, Ajoy Mondal, and C. V. Jawahar
WACV, 2025. [ PDF ]
2024
Enhancing Sindhi (Devanagari) OCR Performance Through MLM-BERT-Based Error Correction Model
Arvind Kaur and G. S. Lehal
Nanotechnology Perceptions 20(6), pp. 4441-4459 (2024). [ PDF ]
Layout Analysis of Punjabi Newspapers Using Contour Detection and Deep Learning-Based Model
A. Kumar and G.S. Lehal
Advances in Networks, Intelligence and Computing, pp. 121-134 (2024).
Faster CNN-Based Layout Analysis of Punjabi Newspapers Using the Custom Dataset
A. Kumar and G.S. Lehal
Smart Innovation, Systems and Technologies, 376, pp. 123–137 (2024).
CHART-Info 2024: A dataset for Chart Analysis and Recognition
Kenny Davila, Rupak Lazarus, Fei Xu , Nicole Rodriguez Alcántara, Srirangaraj Setlur, Venu Govindaraju, Ajoy Mondal, and C. V. Jawahar
ICPR, 2024. [ PDF ]
Competition on Reading Documents Through Aria Glasses
Soumya Shamarao Jahagirdar, Ajoy Mondal, Yuheng Ren, Omkar M. Parkhi, and C. V. Jawahar
International Conference on Document Analysis and Recognition (ICDAR), 2024 [ PDF ]
Towards Deployable OCR Models for Indic Languages
Minesh Mathew, Ajoy Mondal, and C. V. Jawahar
International Conference on Pattern Recognition (ICPR), 2024 [ PDF ]
SPRINT: Script-agnostic Structure Recognition in Tables
Dhruv Kudale, Badri Vishal Kasuba, Venkatapathy Subramanian, Parag Chaudhuri and Ganesh Ramakrishnan
International Conference on Document Analysis and Recognition (ICDAR), 2024
Printed OCR for Extremely Low-resource Indic Languages
Alik Sarkar, Ajoy Mondal, Gurpreet Singh Lehal, and C. V. Jawahar
International Conference on Computer Vision & Image Processing ( CVIP ), 2024 [ PDF ]
A Pipeline for Recognizing Printed Documents for Indian Languages
Krishna Tulsyan, Tessy Flemin, Ajoy Mondal, and C V Jawahar
CODS-COMAD 2024 [ PDF ]
Eigen: Expert-Informed Joint Learning Aggregation for High-Fidelity Information Extraction from Document Images
Subramanian and Venkatapathy
Machine Learning for Health (ML4H) , 2024 [ PDF ]
2023
UDAAN-Machine Learning based Post-Editing tool for Document Translation
Maheshwari Ayush, Ajay Ravindran, Venkatapathy Subramanian and Ganesh Ramakrishnan
Proceedings of the 6th Joint International Conference on Data Science & Management of Data (10th ACM IKDD CODS and 28th COMAD). 2023. [ PDF ]
A Benchmark and Dataset for Post-OCR text correction in Sanskrit
Ayush Maheshwari, Nikhil Singh, Amrith Krishna, and Ganesh Ramakrishnan
EMNLP 2023 [ PDF ]
Towards Making Flowchart Images Machine Interpretable
Shreya Shukla, Prajwal Gatti, Yogesh Kumar, Vikash Yadav, and Anand Mishra
ICDAR 2023 [ PDF ]
TACTFUL A Framework for Targeted Active Learning for Document Analysis
Venkatapathy Subramanian, Sagar Poudel, Parag Chaudhuri and Ganesh Ramakrishnan
ICDAR 2023 [ PDF ]
UTRNet: High-Resolution Urdu Text Recognition In Printed Documents
Abdur Rahman, Arjun Ghosh, and Chetan Arora
17th International Conference on Document Analysis and Recognition (ICDAR), Springer, 2023. [ PDF ]
Scene Text
2026
MIST Multilingual Incidental Dataset for Scene Text Detection
Saumya Mundra, Ajoy Mondal and C. V. Jawhar
In Winter Conference on Applications of Computer Vision (WACV), 2026 [ PDF ]
2024
Visual Text Matters: Improving Text-KVQA with Visual Text Entity Knowledge-aware Large Multimodal Assistant
Abhirama Subramanyam Penamakuri, and Anand Mishra
In proceedings of 19th Conference on Empirical Methods in Natural Language Processing (EMNLP) , 2024 [ PDF ]
Competition on Word Image Recognition from Indic Scene Images
Harsh Lunia, Ajoy Mondal, and C. V. Jawhar
International Conference on Pattern Recognition (ICPR), 2024 [ PDF ]
Indic Scene Text on the Roadside
Ajoy Mondal, Krishna Tulsyan, and C. V. Jawahar
International Conference on Document Analysis and Recognition (ICDAR), 2024 [ PDF ]
Show Me the World in My Language: Establishing the First Baseline for Scene-Text to Scene-Text Translation
Shreyas Vaidya, Arvind Kumar Sharma, Prajwal Gatti and Anand Mishra
International Conference on Pattern Recognition ( ICPR) , 2024 [ PDF ]
2023
IndicSTR12: A Dataset for Indic Scene Text Recognition
Harsh Lunia, Ajoy Mondal, and C. V. Jawahar
10th International Workshop on Camera-Based Document Analysis and Recognition (CBDAR), ICDAR, Springer, 2023. [ PDF ]