|
| Research
Goal |
|
 |
While small-scale search engines in specific domains and languages
are increasingly desired by Web users, most existing search
engine development tools do not support the development of search
engines in languages other than English, cannot be integrated
with other applications, or rely on proprietary software. A
tool that supports search engine creation in multiple languages
is thus highly desired. To study the research issues involved,
we designed and implemented a toolkit, called SpidersRUs, for
multilingual search engine creation. The toolkit consists of
a Spider module, an Indexer module, a Search module, a Graphical
User Interface module, and an Index Structure. This study demonstrates
that the proposed architecture is feasible in effectively and
efficiently developing search engines in different language
such as Chinese, Spanish, Japanese, and Arabic. |
|
| Funding |
|
| Acknowledgements |
|
 |
We would like to thank Chia-Jung Hsu for his contribution to
this project. We would also like to thank other members of the
Artificial Intelligence Lab at the University of Arizona who
have tested the toolkit and shared with us their ideas and comments. |
|
| Approach
& Methodology |
|
 |
In this study, we reviewed related literature and suggested
the criteria for an ideal search tool. We proposed an architecture
for a multilingual search engine building tool and implemented
it in Java programming language. The design and implementation
of the tool consists of a Spider module, an Indexer module,
a Search module, a Graphical User Interface module, and an Index
Structure. We also conducted a case study on using the tool
to develop a medical search engine in Chinese and demonstrated
the effectiveness and efficiency of the toolkit. |
|
| Team
Members |
|
| Publications |
|
 |
Currently no publications. |
|
|