Gradio Demo for Donut, an instance of VisionEncoderDecoderModel fine-tuned on CORD (document parsing). To use it, simply upload your image and click 'submit', or click one of the examples to load them. Read more at the links below.
VisionEncoderDecoderModel
Donut: OCR-free Document Understanding Transformer | Github Repo