File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/96/x96-1018_metho.xml

Size: 5,808 bytes

Last Modified: 2025-10-06 14:14:27

<?xml version="1.0" standalone="yes"?>
<Paper uid="X96-1018">
  <Title>Management of Free Text for NDIC: An Overview of the FTM Project</Title>
  <Section position="1" start_page="0" end_page="77" type="metho">
    <SectionTitle>
SUMMARY: The FTM Project is a joint
</SectionTitle>
    <Paragraph position="0"> project to build a prototype which retrieves and extracts against free (unstructured) text and to evaluate the prototype's usefulness for the National Drug Intelligence Center (NDIC).</Paragraph>
    <Paragraph position="1"> NDIC has strong needs for retrieval and extraction from large multi-source textual databases.</Paragraph>
    <Paragraph position="2"> SPONSORS: FFM is jointly sponsored by the Intelligence Systems Support Office (ISSO) of OASD/C3I, the National Drug Intelligence Center (NDIC) and the Office of Research and Development (ORD). The prototype is being built and evaluated at the Federal Intelligent</Paragraph>
    <Section position="1" start_page="0" end_page="0" type="sub_section">
      <SectionTitle>
Document Understanding Laboratory (FIDUL)
</SectionTitle>
      <Paragraph position="0"> in Tyson's Comers, VA, in an unclassified environment which replicates the hardware and software environment at NDIC.</Paragraph>
      <Paragraph position="1"> PARTICIPANTS: As of April 15, contractors include BETAC, PRC, IDI, HNC and Lockheed Martin. BETAC and IDI are providing the technical leadership, the integration and the GUI. PRC is providing TIPSTER expertise (from the TIPSTER SE/CM), and the testing and evaluation leadership (through FIDUL). HNC and InQuery (via Sovereign Software) are the first detection tools being integrated. Lockheed Martin is supplying the TIPSTER compliant document manager, document viewer and extraction system used on other TIPSTER demonstration projects. Information on lessons learned is shared among all the participants and sponsors. OBJECTIVES: The project objectives (from the Memorandum of Agreement) are to: Prototype and demonstrate new technology which might be incorporated into production systems at NDIC. The NDIC OASIS/IDEF model and the current NDIC free text management architecture development will guide project activity.</Paragraph>
      <Paragraph position="2"> Results will be reported in monthly updates as well as in a final report.</Paragraph>
      <Paragraph position="3"> Assemble a prototype which includes at least two TIPSTER detection tools working on source material which represents NDIC data (unclassified).</Paragraph>
      <Paragraph position="4"> Expand the prototype to include at least two TIPSTER extraction tools, and use the extraction tools to load a database. The database will serve as the basis for analytical tools.</Paragraph>
      <Paragraph position="5"> Expand the prototype to include conversion of hard copy documents (or the simulation of them), including degraded OCR data.</Paragraph>
      <Paragraph position="6"> Develop (or acquire) a minimal user interface and other interfaces which are necessary to demonstrate integrated use of detection and extraction tools, and any other analysis tools to be demonstrated.</Paragraph>
      <Paragraph position="7"> Develop test measures appropriate for testing these tools in an end-to-end system; test and evaluate the tools.</Paragraph>
      <Paragraph position="8"> Use the TIPSTER Architecture in the development, and work closely with the</Paragraph>
    </Section>
    <Section position="2" start_page="0" end_page="77" type="sub_section">
      <SectionTitle>
TIPSTER Architecture Working Group
</SectionTitle>
      <Paragraph position="0"> (AWG) to improve the Architecture.</Paragraph>
      <Paragraph position="1"> Explore the implications of the prototype for an end-to-end architecture, i.e., expanding the TIPSTER architecture to include end-to-end text processing (e.g., OCR, machine translation and database loading).</Paragraph>
    </Section>
  </Section>
  <Section position="2" start_page="77" end_page="77" type="metho">
    <SectionTitle>
CONCEPT OF OPERATIONS: The
</SectionTitle>
    <Paragraph position="0"> prototype will allow users to choose among detection tools, do sophisticated searches on particular topic s , maintain chosen references to documents in project files, view those documents either within the detection tool or through.a common viewer, &amp;quot;copy and paste&amp;quot; from full text information into word processing documents while retaining the source information; and extract particular fields and relationships from project files or any other TIPSTER collection.</Paragraph>
  </Section>
  <Section position="3" start_page="77" end_page="77" type="metho">
    <SectionTitle>
THE HARDWARE AND
SOFTWARE ENVIRONMENT: The
</SectionTitle>
    <Paragraph position="0"> hardware environment is a DEC ALPHA 2100 server and two Celefis 5100DP (dual Pentium) workstations. The software environment includes the Digital OSF Operating Environment on the server, Windows NT on the workstations, eXceed (an x-windows emulator for Windows NT), Oracle, and MS Word. The GUI is being developed in Visual Basic.</Paragraph>
    <Paragraph position="1"> SCHEDULE: The prototype will be completed by September 30. Usability testing, some modifications, and testing against degraded OCR data will occur by January, 1997.</Paragraph>
  </Section>
  <Section position="4" start_page="77" end_page="78" type="metho">
    <SectionTitle>
POSSIBLE FUTURE ADDITIONS:
</SectionTitle>
    <Paragraph position="0"> Once the prototype is complete, other detection and extraction tools, GUI's, or alternative TIPSTER-compliant document managers could easily be added to the FTM prototype, providing an excellent unclassified environment for continuing to evaluate and demonstrate TIPSTER technology. Other possible modifications include fine-tuning the GUI to the NDIC users, and integrating the system into the NDIC environment.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML