:: Snapshots ::
Click here to view more photos
| Demos | Papers | Conferences | Seminars |

 Research Goal  
While small-scale search engines in specific domains and languages are increasingly desired by Web users, most existing search engine development tools do not support the development of search engines in languages other than English, cannot be integrated with other applications, or rely on proprietary software. A tool that supports search engine creation in multiple languages is thus highly desired. To study the research issues involved, we designed and implemented a toolkit, called SpidersRUs, for multilingual search engine creation. The toolkit consists of a Spider module, an Indexer module, a Search module, a Graphical User Interface module, and an Index Structure. This study demonstrates that the proposed architecture is feasible in effectively and efficiently developing search engines in different language such as Chinese, Spanish, Japanese, and Arabic.

 Funding  
This project has been supported in part by the following grants:

IIS-9817473 April 1999 – March 2002
NSF Digital Library Initiative-2  
High-performance Digital Library Systems: From Information Retrieval to Knowledge Management
DUE-0121741 September 2001 – August 2003
NSF National SMETE Digital Library  
Intelligent Collection Services for and about Educators and Students: Logging, Spidering, Analysis and Visualization

 Acknowledgements  
We would like to thank Chia-Jung Hsu for his contribution to this project. We would also like to thank other members of the Artificial Intelligence Lab at the University of Arizona who have tested the toolkit and shared with us their ideas and comments.

 Approach & Methodology  
In this study, we reviewed related literature and suggested the criteria for an ideal search tool. We proposed an architecture for a multilingual search engine building tool and implemented it in Java programming language. The design and implementation of the tool consists of a Spider module, an Indexer module, a Search module, a Graphical User Interface module, and an Index Structure. We also conducted a case study on using the tool to develop a medical search engine in Chinese and demonstrated the effectiveness and efficiency of the toolkit.

 Team Members  
Developers 
Michael Chau
mchau@business.hku.hk
Chunju Tseng
chunju@u.arizona.edu
Jialun Qin
qin@email.arizona.edu
Yilu Zhou
yilu@email.arizona.edu
Chia-Jung Hsu hsuc@email.arizona.edu
   
Advisor
Hsinchun Chen hchen@eller.arizona.edu

 Publications  

Currently no publications.

top 

 

| Contact Us | Site Map | Search |
Copyright © 2004 Eller College of Management. All Rights Reserved.
All trademarks mentioned herein belong to their respective owners