A Web Bibliographies Retrieval Structure Based on the Longest Sequential Frequent Phrases

Volume 17, Issue 10, 2006, pp. 2096-2105

Authors: WANG Da-Ling, YU Ge, BAO Yu-Bin

Abstract:

Most Web bibliographies cannot meet the retrieval needs of researchers at different academic levels. This paper analyzes the cause of the problem and proposes constructing an auxiliary Web bibliography retrieval structure that helps users obtain more suitable bibliographies. Based on this idea, an algorithm for mining the longest sequential frequent phrases is designed to extract features from the bibliographies. An extended feature hierarchical tree is then presented, together with its construction; it describes the relationships among features, among bibliographies, and between features and bibliographies. Experiments show that the new method outperforms the popular TFIDF method in feature extraction. Theoretical analysis shows that the extended feature hierarchical tree has a convergent structure, reveals the relationships between phrases and bibliographies, and provides better retrieval assistance.

Key words: longest sequential frequent phrases; extended feature hierarchical tree; feature extraction; text mining; information retrieval
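
The abstract names the technique but does not give the algorithm, so the following Python sketch is only an illustration of the general idea of "longest sequential frequent phrases", not the authors' method. It assumes a phrase is a contiguous word sequence, that "frequent" means appearing in at least `min_support` documents, and that "longest" means not contained in any longer frequent phrase. The function name, the parameters `min_support` and `max_len`, and the sample titles are all hypothetical.

```python
from collections import defaultdict


def longest_frequent_phrases(docs, min_support=2, max_len=6):
    """Return maximal contiguous word sequences found in >= min_support docs.

    Assumption: whitespace tokenization and document-level support counts.
    """
    # For every n-gram (n = 1..max_len), record which documents contain it.
    doc_ids = defaultdict(set)
    for i, doc in enumerate(docs):
        words = doc.lower().split()
        for n in range(1, max_len + 1):
            for start in range(len(words) - n + 1):
                doc_ids[tuple(words[start:start + n])].add(i)

    # Keep only the phrases that reach the support threshold.
    frequent = {p for p, ids in doc_ids.items() if len(ids) >= min_support}

    def contained_in(short, long_):
        # True if `short` occurs as a contiguous run inside `long_`.
        n = len(short)
        return any(long_[k:k + n] == short for k in range(len(long_) - n + 1))

    # A phrase is "longest" if no longer frequent phrase contains it.
    maximal = [
        p for p in frequent
        if not any(len(q) > len(p) and contained_in(p, q) for q in frequent)
    ]
    # Longest phrases first, ties broken alphabetically.
    return sorted(maximal, key=lambda p: (-len(p), p))


# Hypothetical bibliography titles, used only to exercise the sketch.
titles = [
    "mining frequent phrases from web documents",
    "frequent phrases for text mining",
    "web documents clustering with frequent phrases",
]
phrases = longest_frequent_phrases(titles, min_support=2)
for phrase in phrases:
    print(" ".join(phrase))
```

Under these assumptions, shorter phrases such as "frequent" or "web" are dropped because a longer frequent phrase already covers them, which is one plausible reading of why such features could be more descriptive than single TFIDF-weighted terms.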

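Citation:

WANG Da-Ling, YU Ge, BAO Yu-Bin. A Web Bibliographies Retrieval Structure Based on the Longest Sequential Frequent Phrases. Journal of Software (软件学报), 2006, 17(10): 2096-2105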

History

Received: May 5, 2005
Revised: December 13, 2005
