Uddipan Basu Bir: From Pixels to Structure: Analysis of Lightweight Vision-Language Models for Document OCR and Structured Output Generation [MT Intro]