File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/97/a97-1049_intro.xml

Size: 2,172 bytes

Last Modified: 2025-10-06 14:06:17

<?xml version="1.0" standalone="yes"?>
<Paper uid="A97-1049">
  <Title>An Intelligent Multilingual Information Browsing and Retrieval System Using Information Extraction</Title>
  <Section position="2" start_page="0" end_page="0" type="intro">
    <SectionTitle>
1 Introduction
</SectionTitle>
    <Paragraph position="0"> More and more multilingual information is available on-line every day. The World Wide Web (WWW), for example, is becoming a vast depository of multilingual information. However, monolingual users can currently access information only in their native language. For example, it is not easy for a monolingual English speaker to locate necessary information written in Japanese. The users would not know the query terms in Japanese even if the search engine accepts Japanese queries. In addition, even when the users locate a possibly relevant text in Japanese, they will have little idea about what is in the text. Outputs of off-the-shelf machine translation (MT) systems are often of low-quality, and even &amp;quot;high-end&amp;quot; MT systems have problems particularly in translating proper names and specialized domain terms, which often contain the most critical information to the users.</Paragraph>
    <Paragraph position="1"> In this paper, we describe our multilingual (or cross-linguistic) information browsing and retrieval system, which is aimed at monolingual users who are interested in information from multiple language sources. The system takes advantage of information extraction (IE) technology in novel ways to improve the accuracy of cross-linguistic retrieval and to provide innovative methods for browsing and exploring multilingual document collections. The system indexes texts in different languages (e.g., English and Japanese) and allows the users to retrieve relevant texts in their native language (e.g., English). The retrieved text is then presented to the users with proper names and specialized domain terms translated and hyperlinked. The system also allows the user in their native language to browse and discover information buried in the database derived from the entire document collection.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML