- Classified documents esp. invoices into types using OCR. Extended Chargrid model to Japanese language and additional field detection.
- Reduced model’s training, inference time and increased accuracy of character detection.
- Experimented on using named entity recognition for extracting single-word fields in place of annotated approach.
- Automated sales data entry from documents and cross-validation of sales and purchase related documents.
Document Classification, Detail Extraction and Processing
This post is licensed under CC BY 4.0 by the author.