• Article
  • | |
  • Metrics
  • |
  • Reference [15]
  • |
  • Related [20]
  • |
  • Cited by [7]
  • | |
  • Comments
    Abstract:

    This paper gives a novel approach to recognize Chinese fonts based on Empirical Mode Decomposition (EMD). By analyzing and comparing a great number of Chinese characters, 8 basic strokes are selected to characterize the structural attributes of Chinese fonts. Based on them, stroke feature sequences of each text block are calculated. Once decomposed by EMD, their first two intrinsic mode functions (IMFs), which are of the highest frequencies, are used to calculate the stroke energy of all the 8 basic strokes, forming the average of the energy of the two IMFs over the length of the sequence. To distinguish bold fonts from their regular fonts, average of the pixel's gray levels of the text is calculated and appended to the feature vector to form a 9 dimensional feature. Finally, the minimum distance classifier is used to recognize the fonts. Experiments show encouraging recognition rates.

    Reference
    [1]Khoubyari S, Hull JJ. Font and function word identification in document recognition. Computer Vision and Image Understanding,1996,63(1):66-74.
    [2]Shi H, Pavlidis T. Font recognition and contextual processing for more accuratetext recognition. In: Proc. of the ICDAR'97. ULm:IEEE Computer Society Press, 1997.39-44.
    [3]Zramdini A, Ingold R. Optical font recognition using typographical features. IEEE Trans on Pattern Analysis and Machine Intelligence, 1998,220(8):877-882.
    [4]Jung MC, Shin YC, Srihari SN. Multifont classification using typographical attributes. In: Proc. of the ICDAR'99. Bangalore: IEEE Computer Socety Press, 1999. 353-356.
    [5]Zhu Y, Tan TN, Wang YH. Font recognition based on global texture analysis. IEEE Trans. on Pattern Analysis and Machine Intelligence, 2001,23 (10): 1192-1200.
    [6]Zeng L, Tang YY, Chen TH. Multi-Scale wavelet texture-based script identification method. Chinese Journal of Computers,2000,23(7):699-704 (in Chinese with English abstract).
    [7]Chen L, Ding XQ. Font recognition of single Chinese character based on wavelet feature. Acta Electronica Sinica,2004,32(2): 177-180 (in Chinese with English abstract).
    [8]Huang NE, Shen Z, Long SR. The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proc. of the Royal Society of London, 1998,A(454):903-995.
    [9]Flandrin P, Rilling G, Goncalves P. Empirical mode decomposition as a filter bank. IEEE Signal Processing Letters, 2004,11(2): 112-114.
    [10]Deng YJ, Wang W, Qian CC, Dai DJ. Boundary-Processing technique in EMD method and Hilbert transform. Chinese Science Bulletin, 2001,46(3):954-961 (in Chinese with English abstract).
    [11]Yang ZH, Huang D, Yang LH. A novel pitch period detection algorithm based on Hilbert-Huang transform. LNCS 3338, 2004.586-593.
    [12]Yang ZH, Qi DX, Yang LH. Signal period analysis based on Hilbert-Huang transform and its application to texture analysis. In:Proc. of the 3rd Int'l Conf. on Image and Graphics. Hong Kong: IEEE Computer Society Press, 2004. 430-433.
    [6]曾理,唐远炎,陈廷槐.基于多尺度小波纹理分析的文字种类自动识别.计算机学报,2000,23(7):699-704.
    [7]陈力,丁晓青.基于小波特征的单字符汉字字体识别.电子学报,2004,32(2):177-180.
    [10]邓拥军,王伟,钱成春.EMD方法及Hilbert变换中边界点问题的处理.科学通报,2001,46(3):257-263.
    Comments
    Comments
    分享到微博
    Submit
Get Citation

杨志华,齐东旭,杨力华,吴立军.基于经验模式分解的汉字字体识别方法.软件学报,2005,16(8):1438-1444

Copy
Share
Article Metrics
  • Abstract:4238
  • PDF: 6506
  • HTML: 0
  • Cited by: 0
History
  • Received:September 07,2004
  • Revised:March 11,2005
You are the first2038652Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address:4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code:100190
Phone:010-62562563 Fax:010-62562533 Email:jos@iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063