The Interspace Project
http://ai.bpa.arizona.edu/Interspace

Project Goal
To develop innovative methods for information analysis and visualization.
Grant Support
Department of Defense, Advanced Research Projects Agency (ARPA), "The Interspace Prototype: An Analysis Environment based on Scalable Semantics."
Subcontract of University of Illinois Digital Library Initiative, NSF/ARPA/NASA, "Building the Interspace: Digital Library Infrastructure for a University Engineering Community."
Researchers/Collaborators
Hsinchun Chen, Associate Professor, Management Information Systems, University of Arizona, Tucson, AZ 85721, (520) 621-4153, hchen@bpa.arizona.edu
Bruce R. Schatz, Director of the Digital Library Research Program, University of Illinois at Urbana-Champaign, Urbana, IL 61801, (217) 244-0651, schatz@csl.ncsa.uiuc.edu
Research Assistants:
- Chienting Lin
- Michael McQuaid
- Kristin Tolle
- Dmitri Roussinov
- Marshall Ramsey
- Harry Li
- Thian-Huat Ong
- Elvina Hendrata

Key Technical Summary
CSQuest is a collection of 280,000 computer engineering terms and 10M links. The associated concept space allows users to identify other relevant search terms for searching Internet computer engineering servers or homepages such as the CS Technical Report Sites.
The GS-Map analyzes 200 raw Group Systems (GS) electronic meeting comments, identifies key terms and organizes them in 1-D, 2-D, and 3-D (VRML) displays.
The Arizona Noun Phraser required 17 serial hours to generate noun phrases and 2 hours and 40 minutes to create the concept space.
Techniques:
- Arizona Noun Phraser
- Automatic Indexing
- Concept Space Generation: Batch, Incremental, Real-time
- Kohonen Self-Organization Map (SOM) Algorithms:
* 1-D, 2-D, 3-D (VRML) displays
- Multi-dimensional Scaling: 1-D, 2-D, 3-D
- Ward's Hierarchical Clustering
- Visualization: Fisheye view, Fractal view
- Intelligent Agents: Java-based genetic algorithm spiders
Key Results Summary
The following studies are currently underway:
MDS-categorization of GS comments is being compared to human categorization.
GS-Map-user studies are being conducted on the 1-D, 2-D, and 3-D Kohonen-based maps.
SOM vs. Ward's comparison study
Genetic-algorithm based spider
Publications
H. Chen, Y. Chung, M. Ramsey, and C. Yang. A Smart Itsy Bitsy Spider for the Web, Journal of the American Society for Information Science, 1998, forthcoming.
H. Chen, A. Houston, R. Sewell, and B. Schatz. Internet Browsing and Searching: User Evaluations of Category Map and Concept Space Techniques, Journal of the American Society for Information Science, 1998, forthcoming.
D. Roussinov and H. Chen. A Scalable Self-Organizing Map Algorithm for Textual Classification: A Neural Network Approach to Thesaurus Generation, accepted for CCAI Communication Cognition and Artificial Intelligence, Spring 1998.
Demo Sites/Information
CSQuest is an automatically generated thesaurus of 280,000 computer engineering terms and 10M links. (http://ai.bpa.arizona.edu/html/mcsquest/)
GS-Map is a concept-based categorization and visual tool for raw comments from a GroupSystem (GS) electronic meeting session. (http://ai.bpa.arizona.edu/gsmap/)
Itsy Bitsy Spider is a dynamic internet agent. (http://ai.bpa.arizona.edu/~mramsey/SPIDER/itsy.html)
ET-Map is a set of concept-based search tools for searching entertainment related web pages (110,000+ URLs). (http://ai.bpa.arizona.edu/ent)
Arizona Noun Phraser is able to isolate valid noun phrases from text to use in document retrieval.
- HTML, http://ai.bpa.arizona.edu/cgi-bin/ktolle/interface/nlpi
- Java, http://ai.bpa.arizona.edu/~elvina/npigraph.html


Copyright © 1990-1997, Artificial Intelligence Group, The University of Arizona.