Using Classifiers to Find Domain-Specific Online Databases Automatically


Volume 19, Issue 2, 2008, pp. 246-256

Author: WANG Hui, LIU Yan-Wei, ZUO Wan-Li


Abstract:

General-purpose search engines (e.g., Google and Yahoo) fall short in the hidden Web domain. They cover less than one-third of the data stored in document databases. Unlike on the surface Web, combining them does not help much, since they cover roughly the same data. The hidden Web is nonetheless a highly important information source, because the content provided by many hidden Web sites is often of very high quality. This paper proposes a three-step framework to automatically identify domain-specific hidden Web entries. The query interfaces obtained can then be integrated into a single unified interface through which users submit queries. Eight large-scale experiments demonstrate that the technique finds domain-specific hidden Web entries accurately and efficiently.

Key words: deep Web; hidden Web; surface Web; hidden Web entry; searchable form

Citation:

WANG Hui, LIU Yan-Wei, ZUO Wan-Li. Using classifiers to find domain-specific online databases automatically. Journal of Software (软件学报), 2008, 19(2): 246-256


History

Received: August 2, 2007
Revised: November 6, 2007

