File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/97/a97-2001_metho.xml
Size: 3,662 bytes
Last Modified: 2025-10-06 14:14:34
<?xml version="1.0" standalone="yes"?> <Paper uid="A97-2001"> <Title>Syntactic Structures of Sentences from Large Corpora</Title> <Section position="1" start_page="0" end_page="0" type="metho"> <SectionTitle> 1 The Demonstration </SectionTitle> <Paragraph position="0"> The demonstration will consist in displaying the syntactic structure of sentences from novels, scientific texts, newspapers, ... The syntactic structures are computed by our syntactic parser ad the output shows in a human-friendly graphic interface (1) word features (as computed by POS tagger) (2) non-recursive phrases (as computed by shallow parser) and (3) their relations (the functional structure). It will be a closed demo. The outputs will be split into sets in order to focus on the resolution of different kinds of problems (i.e. coordination, long distance subject-verb relation, preposition/determiner resolution). A large scale corpus will be available to prove that the results are not hand-made and open demos will be possible during informal demo sessions. null</Paragraph> </Section> <Section position="2" start_page="0" end_page="0" type="metho"> <SectionTitle> 2 Demo Sessions </SectionTitle> <Paragraph position="0"> Schedule of the intended demo sessions: Other informal sessions can be scheduled in order to allow people to parse their own sentences or texts.</Paragraph> </Section> <Section position="3" start_page="0" end_page="0" type="metho"> <SectionTitle> 3 The Graphic Output </SectionTitle> <Paragraph position="0"> Our viewer, developped with Java, allows to display in a graphical way (see Fig. 1) the dependency tree between non-recursive phrases and the tags of words. Figure 1: &quot;A GRAPHIC OUTPUT</Paragraph> </Section> <Section position="4" start_page="0" end_page="0" type="metho"> <SectionTitle> 4 The Control Panel </SectionTitle> <Paragraph position="0"> The control panel (see Fig. 2) still in progress allows the choice of the corpus, the sentence to be displayed. Filters can be applied in order to show or to hide specified dependency and coordination relations. The tags of the words can also be shown or hidden. This principle allows to concentrate on word-level or on dependency-level and allows to check precisely the behavior of our parser on specific relation types. A trace mode enables the step by step display of the construction of the analysis.</Paragraph> <Paragraph position="1"> In a near future, a graphic comparison between two outputs will be available to control the modification of the rule base and to point out easily the differences. This comparison between two outputs can also be used to compare an output and an expected output.</Paragraph> </Section> <Section position="5" start_page="0" end_page="2" type="metho"> <SectionTitle> 5 Web Site Opening </SectionTitle> <Paragraph position="0"> After this conference, the same demonstration will be available on Internet at the following URL.</Paragraph> <Paragraph position="1"> http://www.info.unicaen.fr/~iguet This website will required a browser running Java and is platform independent (It has been tested with Netscape on Linux, Solaris2.5, MacOS, Windows NT and Windows 95).</Paragraph> </Section> <Section position="6" start_page="2" end_page="2" type="metho"> <SectionTitle> 6 Getting Annotated Corpora </SectionTitle> <Paragraph position="0"> People who are interested in getting annotated corpora are invited to contact the authors.</Paragraph> </Section> <Section position="7" start_page="2" end_page="2" type="metho"> <SectionTitle> 7 References </SectionTitle> <Paragraph position="0"> No references are available yet.</Paragraph> </Section> class="xml-element"></Paper>