Deep Arabic document layout analysis

Ibrahim M. Amer, Salma Hamdy, Mostafa G. M. Mostafa

Research output: Chapter in Book/Report/Conference proceedingConference contribution

6 Citations (Scopus)
505 Downloads (Pure)


Document layout analysis (DLA) is an essential step for Optical Character Recognition Systems (OCR). The text of the document fed to the OCR must be extracted first and isolated from images if exist. The DLA task is difficult as there is no fixed layout for all documents, but instead, there are several layouts. There are various approaches for DLA for various different languages. In this paper, some of the previous techniques used in this field will be listed and then we will discuss the proposed method that depends on deep learning for documents’ text localization. We used Arabic Printed Text Image database (APTI [19]), ImageNet [18] and a dataset collected from different Arabic newspapers for training and evaluation.
Original languageEnglish
Title of host publicationProceedings of the 2017 IEEE Eighth International Conference on Intelligent Computing and Information Systems (ICICIS 2017)
Place of PublicationCairo, Egypt
Number of pages8
ISBN (Electronic)9781538608210
ISBN (Print)9781538608227, 9772371723
Publication statusPublished - 5 Dec 2017
Externally publishedYes
Event8th International Conference on Intelligent Computing and Information Systems - Ain Shams University, Cairo, Egypt
Duration: 5 Dec 20177 Dec 2017
Conference number: 8

Publication series

ISSN (Print)1687-1103


Conference8th International Conference on Intelligent Computing and Information Systems
Abbreviated titleICICIS 2017
Internet address


  • Optical character recognition
  • Document layout analysis
  • Font type recognition
  • Font size recognition
  • Deep learning
  • Deep Convolutional Neural Networks (DCNN)
  • Transfer learning (TL)


Dive into the research topics of 'Deep Arabic document layout analysis'. Together they form a unique fingerprint.

Cite this