The Special Issue "Document Image Processing" in the Journal of Imaging aims at presenting approaches which contribute to access the content of document images.These approaches are related to low level tasks such as image preprocessing, skew/slant corrections, binarization and document segmentation, as well as high level tasks such as OCR, handwriting recognition, word spotting or script identification.This special issue brings together 12 papers that discuss such approaches.The first three articles deal with historical document preprocessing.The work by Hanif et al. [1] aims at removing bleed-through using a non-linear model, and at reconstructing the background by an inpainting approach based on non-local patch similarity.The paper by Almeida et al. [2] proposes a new binarization approach that includes a decision-based process for finding the best threshold for each RGB channel.In the paper by Kavallieratou et al. [3], a segmentation-free approach based on the Wigner-Ville distribution is used to detect the slant of a document and correct it.Once a document image is preprocessed, a next step described in the paper by Ghosh et al.[4] consists in separating text components from non-text ones, using a classifier based on LBP features.Following steps may consist in recognizing text components or searching from word queries.In the paper by Nashwan et al.[5] a holistic-based approach for the recognition of printed Arabic words is proposed, coupled with an efficient dictionary reduction.In the work by Nagendar et al. [6]itis shown that using a query specific fast Dynamic Time Warping distance, improves the Direct Query Classifier (DQC) word spotting system.Deep neural network-based approaches are now widely used in the domain of document image processing, especially for the recognition of textual elements.The following papers also follow this trend.In the work by Jangid and Srivastava [7], deep convolutional networks trained layer-wise, are applied to the recognition of Devanagari characters.The paper by Kesiman et al. [8] is dedicated to southeast Asian scripts written on palm leafs.Character and word images are recognized by CNNs (Convolutional Neural Networks) and RNNs (Recurrent Neural Networks), respectively.Several binarization and text-line segmentation approaches are also benchmarked on these specific documents.The work by Granell et al. [9] describes an efficient text-line recognition system, based on CNN and stacks of RNNs, that has been developed for the recognition of historical Spanish documents.These documents include out-of-vocabulary ancient words which are handled by a language model based on sub-lexical units.Annotated datasets are necessary to train systems or to evaluate the various tasks related to document image processing.In several papers published in this special issue, new datasets are released as well as open-source tools that are able to generate synthetic images.A dataset of indic scripts is released in the paper by Mukhopadhyay et al. [10] and first results are provided with this