Abstract: This paper presents a fast and efficient method for extracting target text areas from images. First, edge detection and mathematical morphology are used to coarsely locate text and graphic areas. Several window features are then extracted to distinguish text areas from graphic areas, and a set of rules is applied to extract the target text areas. Because the interference of background areas is weakened, the algorithm achieves high location accuracy with low time complexity. In addition, Otsu's algorithm is applied during binarization to improve adaptability to images with complex backgrounds. Test results on several types of samples indicate that the algorithm generalizes well, and its high accuracy confirms its validity.
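
The binarization step named in the abstract can be illustrated with a minimal sketch of the standard Otsu thresholding method, which picks the gray level that maximizes the between-class variance of the histogram. This is not the paper's implementation: the function names `otsu_threshold` and `binarize`, the 8-bit gray range, and the flat list input are all assumptions made for illustration.

```python
def otsu_threshold(pixels):
    """Return the gray level t (0..255) that maximizes between-class variance.

    Assumption: `pixels` is a flat list of 8-bit integer gray values.
    Pixels <= t form the background class, pixels > t the foreground class.
    """
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    w0 = 0        # number of pixels with value <= t
    sum0 = 0      # sum of gray values of those pixels
    best_t = 0
    best_var = -1.0

    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue          # no pixels in the lower class yet
        w1 = total - w0
        if w1 == 0:
            break             # upper class is empty from here on
        mu0 = sum0 / w0
        mu1 = (sum_all - sum0) / w1
        # Between-class variance, up to a constant factor of 1 / total**2.
        var_between = w0 * w1 * (mu0 - mu1) ** 2
        if var_between > best_var:
            best_var = var_between
            best_t = t
    return best_t


def binarize(pixels, t):
    """Map each pixel to 1 if it is above the threshold t, else 0."""
    return [1 if p > t else 0 for p in pixels]


# Hypothetical sample: a dark background and a bright foreground.
sample = [10] * 50 + [200] * 50
threshold = otsu_threshold(sample)
mask = binarize(sample, threshold)
```

Because the threshold is derived from each image's own histogram rather than fixed in advance, this kind of binarization can adapt to varying backgrounds, which is the property the abstract relies on.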