Ongoing Projects


Project: "Building the National Cancer Information Infrastructure: Phase II Project"
Support:
National Cancer Institute, 1997-1998 ($300,000)
Collaborators: Bruce R. Schatz and Susan M. Hubbard

Description:
Continuing to build upon the progress we made in Phase I, "Information Analysis and Visualization for Cancer Literature," we will test the Second Prototype concept space which utilizes 500,000 CANCERLIT abstracts and includes noun parsing, automatic indexing, and UMLS metathesaurus components. We will also explore the integration of a Biosys based concept space, integration of category map generation into the CancerLit concept space, and vocabulary switching across the CancerLit space, UMLS meta-thesaurus, and the Biosys concept space.


Project: "Interspace Analysis Environment on Scalable Semantics" (a joint project with NCSA)
Support: Department of Defense, Advanced Research Projects Agency
(DARPA), 1997-2000 ($1.1 Million)
Collaborators: Bruce R. Schatz

Description:
The University of Arizona Artificial Intelligence Group will be involved with the development of algorithms, system prototyping and user evaluation of various multimedia analysis, advanced visualization, and information management techniques for digital libraries, multimedia databases, and the Internet.


Project: "COPLINK: Database Integration and Access for a Law Enforcement Intranet" (a joint project with the Tucson Police Department)
Support:
National Institute of Justice, 1997-1999 ($1,200,000)
Collaborators: Tucson Police Chief Douglas F. Smith

Description:
The integration and access of criminal information databases is a problem faced by many law enforcement agencies. The University of Arizona's Artificial Intelligence Lab will work with the Tucson Police Department to find a solution to this problem by developing an integrated database solution based on secured Intranet technologies. This will include an Intranet-based universal graphical interface for multimedia justice information, information containing both text and images.


Project: "Supplement to Alexandria DLI Project: A Semantic Interoperability Experiment for Spatially-Oriented Multimedia Data"
Funding: NSF, CISE, 1996-1998 ($100K)
Collaborators: T. Smith, B. Manjunath, L. Carver, M. Larsgaard, L. Hill, M. Ramsey, K. Tolle

Description:
This research aims to examine semantic interoperability issues related to spatially-oriented, multimedia geographic information access. Based on the concept space approach developed by the Illinois Digital Library Initiative (DLI) project and the Alexandria (University of California at Santa Barbara) geo-referenced collections, this research proposes to develop knowledge representations and structures to capture concepts of relevance to spatial and multimedia information (natural language phrases and geo-related textures). Selected machine learning techniques and general Artificial Intelligence (AI) graph traversal algorithms will also be adopted to assist in semantic, concept-based spreading activation in integrated knowledge networks.


Project: "Semantic Retrieval for Toxicology and Environmental Health Databases"
Support: National Library of Medicine,
Specialized Information Services, 1996-1997. ($50k)
Collaborators: T. Doszkocs, K. Tolle, and M. Ramsey.

Description:
This project aims to develop concept spaces (networks of vocabularies) to assist in toxicology and environmental health information access. Based on selected automatic indexing, linguistic parsing, and cluster analysis techniques, the system-generated concept spaces will be integrated with the NLM's fine-grained UMLS (Unified Medical Language System) knowledge sources.


Project: "Information Analysis and Visualization for Cancer Literature"
Support:
National Cancer Institute, 1996-1997. ($150K)
Collaborators: B. Schatz, S. Hubbard, A. Houston, T. Ng, R. Sewell, and M. Ramsey.

Description:
This project aims to develop concept spaces (networks of vocabularies) and category maps for cancer-related literature and documents. Based on selected automatic indexing, linguistic parsing, cluster analysis, neural networks, and information visualization techniques, the system-generated concept spaces and category maps will be used to enhance cancer-related information retrieval and knowledge sharing.


Project: "Concept-based Categorization and Search on Internet: A Machine Learning, Parallel Computing Approach"
Funding: NSF, CISE, IRIS, 1995-1998 ($200K).
Collaborators: C. Lin,T. Ng, J. Martinez, C. Schuffels, Y. Chung, C. Yang.

Description:
Our proposed approach, which is grounded on automatic textual analysis of Internet documents (homepages), attempts to address the Internet search problem by first categorizing the content of Internet documents and subsequently providing semantic search capabilities based on a concept space and a genetic algorithm spider (agent). The categorization, concept space, and agent components would be developed sequentially over three years. As Internet information space continues to grow at the present (exponential) pace, we believe this research would shed light on potentially robust and scalable solutions to the increasingly more complex and urgentinformation access and sharing problems.

Publications:


Project: "Semantic Retrieval and User Customization for the Illinois DLI Project"
Funding: NSF/ARPA/NASA, IRI 9411318, 1994-1998 ($600K).
Collaborators: B. Schatz, J. Hardin, A. Bishop, W. Mischo, L. Star, T. Ng, J. Martinez, C. Schuffels.

Description :
The project goal is to develop a large-scale testbed for building the next-generation digital libraries for the National Information Infrastructure (NII). The testbed collections will be in the engineering domains, to be contributed by major engineering societies and publishers e.g., IEEE, John Wiely & Sons, etc. Project researchers mainly are information science and computer science faculty from the University of Illinois at Urbana-Champaign and research scientists from Illinois's National Center for Supercomputer Applications. The initial platform will be based on the NCSA Mosaic. The AI Group is responsible for developing semantic retrieval, customized IR, and content-based information routing capabilities for digital libraries.

Publications:


Project:"Information Analysis and Knowledge discovery: A Parallel Computing Approach"
Support: National Center for Supercomputing Applications, 1994-1998
Collaborators: B. Schatz, T. Ng, C. Schuffels, Y. Yang.

Description:
The project aims to develop and experiment with use of various information analysis and machine learning algorithms for various knowledge discovery (data mining) and digital library applications. Thinking Machine's CM-5 (512 nodes), SGI's Power Challenge Array (48 nodes), Cray's CS6400 (8 nodes), and Convex's Exemplar (64 nodes) supercomputers provided by NCSA have been used in this project.

Publications:


Project: "Building a Concept Space for an Electronic Community System"
Funding: NSF, CISE/IRIS, IRI-9211418, 1992-1995.
Collaborators:B. Schatz, B. Schatz, S. Ward, T. Yim, D. Fye, J. Martinez, T. Ng, K. Powell, E. Grossman, T. Friedman, J. Calley

Description:
The project goal is to design a concept-based information retrieval and information sharing software for molecular biologists whose work is related to the Human Genome Initiative. A (nematode) worm concept space which can assist in concept exploration and term suggestion during IR has been created and is in use by worm biologists. We are in the process of creating a fly (Drosophila) concept space and a human concept space for assisting automatic, cross-domain, concept-based IR.

Publications:


Project: "An Artificial Intelligence Approach to Creating an Intelligent GroupSystems Environment"
Funding: AT&T Foundation Special Purpose Grants in Science and Engineering, 1994-1996 and Army Corps of Engineering contracts, 1993-1996
Collaborators: J. Nunamaker, D. Vogel, R. Orwig, P. Hsu, L. Hoopes, J. Kim, L. She, G. Shankar, A. Iyer

Description:
Artificial intelligence and information science techniques for assisting concept classification and content-based information retrieval that have been tested include: neural network classification, genetic algorithms, inductive learning algorithms, vector space modeling, and parallel search and clustering algorithms. We have successfully developed an AI tool(agent) to automatically extract and classify topics (concepts) from the electronic brainstorming comments generated by meeting participants (using GroupSystems). We are in the process of using similar techniques to develop software which could automatically classify email and assist in email routing and filtering. We are also designing an "intelligent" (concept-based) meeting memory system.

Publications:


Project: "Creating an Intelligent, Multilingual Intelligence Analysis Environment"
Funding: International Program Development Funds, University of Arizona, 1992-1993 and Small Grants Program, University of Arizona, 1994.
Collaborators: S. Goodman, W. McHenry, P. Wolcott, K. Lynch, A. Himler, R. Orwig, T. Ng, K. Basu H. Saldana, C. Lin, K. Leung, P. Rodriguez

Description:
The project develops an intelligence analysis and retrieval system, which supports intelligence analysts who study information technology policy, manufacturing, and proliferation in the (former) USSR countries. Extensive user requirements studies were conducted. A content-based intelligence analysis and retrieval system was developed in ANSI C and runs on VAX/VMS and DECStation (UNIX based). It includes a system-generated Russian computing "concept space" and a system-assisted relevance feedback and query refinement process. The system is currently in use by the intelligence analysts.

Publications:


Project: "A Neural Network Approach to the Estimation of the Aqueous Activity Coefficients (Solubility)"
Funding: Biomedical Research Support Grant Program, Division of Research Resources, National Institutes of Health (NIH), 1992-1994
Collaborators:S. Yakowski, P. Myral, H. Chow, T. Ng, P. Buntin, L. She, S. Sutjahjo, C. Sommer, and D. Neely

Description:
This NIH-funded project investigates a neural network approach to pharmaceutical applications. Sample applications include drug solubility prediction and non-linear pharmacokinetics functions approximation. 1992-present.

Publications:


[ Ongoing Project List | AI Group ]