Article "Finetuning olmOCR to be a faithful OCR-Engine"

We recently created a fine-tune of an Optical Character Recognition (OCR) AI model based on olmOCR to help us automate our internal document processing workflows. In our new Hugging Face blog post “Finetuning olmOCR to be a faithful OCR-Engine”, we discuss how we trained the Vision Language Model (VLM) to reliably transcribe headers and footers of invoices. The final model weights are available on Hugging Face as well.