nii.ac.jp

Discovering Relationships Among Catalogs

Authors: 
Ichise, R.; Hamasaki, M.; Takeda, H.
Year: 
2004
Venue: 
DS, 2004

When we have a large amount of information, we usually use categories with a hierarchy, in which all information is assigned. The Yahoo! Internet directory is one such example. This paper proposes a new method of integrating two catalogs with hierarchical categories. The proposed method uses not only the contents of information but also the structures of both hierarchical categories. In order to evaluate the proposed method, we conducted experiments using two actual Internet directories, Yahoo! and Google. The results show improved performance compared with the previous approaches.

Integrating Multiple Internet Directories by Instance-based Learning

Authors: 
Ichise, R.; Takeda, H.; Honiden, S.
Year: 
2003
Venue: 
IJCAI, 2003

Finding desired information on the Internet is becoming increasingly difficult. Internet directories such as Yahoo!, which organize web pages into hierarchical categories, provide one solution to this problem; however, such directories are of limited use because some bias is applied both in the collection and categorization of pages. We propose a method for integrating multiple Internet directories by instance-based learning. Our method provides the mapping of categories in order to transfer documents from one directory to another, instead of simply merging two directories into one.

Syndicate content