Abstract
Font family and font size recognition has become an essential step in document analysis. Font recognition helps identify the proper segmentation method to use before passing the document to optical character recognition (OCR). In this paper, we review some of the previous techniques used for font family and font size recognition, then present the proposed approach, which is based on deep learning. Two methods are presented: 1) a method for font family recognition that is font size invariant, and 2) a method for font size recognition. Both methods use deep convolutional neural networks (D-CNN). We evaluated the proposed methods on the Arabic Printed Text Image Database (APTI) [7] and on a document generated from APTI word images and then scanned with a scanner.
Original language | English |
---|---|
Number of pages | 6 |
Journal | International Journal of Computer Applications |
Volume | 176 |
Issue number | 4 |
Publication status | Published - 31 Oct 2017 |
Keywords
- Font family recognition
- Font size recognition
- Optical character recognition (OCR)
- Document layout analysis (DLA)
- Deep learning
- Deep convolutional neural network (D-CNN)