Chinese Book Cover Text Location and Chinese Book Retrieval
Author:
Affiliation:

  • Article
  • | |
  • Metrics
  • |
  • Reference [11]
  • |
  • Related [20]
  • | | |
  • Comments
    Abstract:

    As an important part of book covers, characters contain rich semantic information. By extracting accurate information from complex color images, and combining it with content-based image retrieval technology, it is possible to further improve the accuracy of book retrieval. According to the characteristics of text information in Chinese book covers, this paper proposes connected components methods to locate the text regions. At first, the grayscale image is decomposed to a series of binary images and merged to connect components in each image, according to the structures of Chinese characters, generating candidate text regions. Additionally, text verification is used to rule out non-text regions. The result regions are regarded as the prominent regions of book cover, further this paper use Hu moment invariant to extract features for image matching. Experiments show the results of this method are fairly good, proving the importance of text information to book retrieval.

    Reference
    [1] Hasan YMY, Karam LJ. Morphological text extraction from images. IEEE Trans. on Pattern Analysis and Machine Intelligence, 2000,9(11):1978-1983.
    [2] Dinh VC, Chun SS, Cha S, Ryu H, Sull S. An efficient method for text detection in video based on stroke width similarity. In: Proc. of the ACCV Part I, LNCS 4843, 2007. 200-209.
    [3] Mao WG, Chung FL, Lam KKM, Siu WC. Hybrid Chinese/English text detection in images and video frames. In: Proc. of the ICPR, Vol.3. 2002. 1015-1018.
    [4] Epshtein B, Ofek E, Wexler Y. Detecting text in natural scenes with stroke width transform. In: Proc. of the CVPR. 2010. 2963-2970.
    [5] Chen C, Tsai SS, Schroth G, Chen DM, Grzeszczuk R, Girod B. Robust text detection in natural images with edge-enhanced maximally stable extremal regions. In: Proc. of the Conf. on Image Processing. 2011.
    [6] Li XJ, Wang WQ, Jiang SQ, Huang QM, Gao W. Fast and effective text detection. In: Proc. of the IEEE Int'l Conf. on Image Processing. 2008.969-972.
    [7] Chen XR, Yuille AL. Detecting and reading text in natural scenes. In: Proc. of the Int'l Conf. on Computer Vision and Pattern Recognition, Vol.2. 2004. 366-373.
    [8] Liu J, Zhang SW, Li HP, Liang W. A Chinese character localization method based on intergrating structure and CC-clustering for advertising images, In: Proc. of the Conf. on Document Analysis and Recognition (ICDAR 11). 2011.
    [9] Lowe DG. Distinctive image features from scale-invariant keypoints. Int'l Journal of Computer Vision, 2004,60(2):91-110.
    [10] Blei DM, Ng AY, Jordan MI. Latent Dirichlet allocation. Journal of Machine Learning Research, 2003,3:993-1022.
    [11] Lu Y, Tan CL, Shi PF, Zhang KH. Segmentation of handwritten Chinese characters from destination addresses of mail pieces. Int'l Journal of Pattern Recognition and Artificial Intelligence, 2002,16(1):85-96.
    Cited by
    Comments
    Comments
    分享到微博
    Submit
Get Citation

刘玉杰,李峰,李宗民,李华,林茂.中文图书封面文本定位及中文图书检索.软件学报,2012,23(zk2):77-84

Copy
Share
Article Metrics
  • Abstract:2512
  • PDF: 5344
  • HTML: 0
  • Cited by: 0
History
  • Received:May 30,2012
  • Revised:September 29,2012
  • Online: December 29,2012
You are the first2038312Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address:4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code:100190
Phone:010-62562563 Fax:010-62562533 Email:jos@iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063