XML Viewer - c96-1024

File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/96/c96-1024_metho.xml
Size: 18,298 bytes
Last Modified: 2025-10-06 14:14:06
<?xml version="1.0" standalone="yes"?>
<Paper uid="C96-1024">
  <Title>Compositional Semantics in Verbmobil</Title>
  <Section position="3" start_page="131" end_page="131" type="metho">
    <SectionTitle>
2 The Verbmobil Project
</SectionTitle>
    <Paragraph position="0"> The project Verbmobil funded by the German Federal Ministry of Research and Technology (BMBF) combines speech technology with machine translation techniques in order to develop a system for translation in face-to-face dialogues. The overall project is described in (Wahlster, 1993); in this section we will give a short overview of the key aspects.</Paragraph>
    <Paragraph position="1"> The ambitious overall objective of the Verbmobil project is to produce a device which will provide English translations of dialogues between German and Japanese businessmen who only have a restricted active, but larger passive knowledge of English. The domain is the scheduling of business appointments. The major requirement is to provide translations as and when users need them, and do so robustly and in (near) real-time.</Paragraph>
    <Paragraph position="2"> In order to achieve this, the system is composed of time-limited processing components which on the source language (German or Japanese) side perform speech recognition, syntactic, semantic and pragmatic analysis, as well as dialogue management; transfer on a semantic level; and on the target language (English) side generation and speech synthesis. When the users speak English, only keyword spotting for the dialogue management is undertaken.</Paragraph>
    <Paragraph position="3"> At any moment in the dialogue, a user may activate the Verbmobil device and start speaking his/her native language. The speech recognition component then processes the input and produces a word lattice representing the speech hypotheses and their corresponding prosodic information. The parsing component processes the lattice and assigns each well-formed path through it one or several syntactic and (compositional) semantic representations. Ambiguities introduced by thesc may be resolved by a resolution component. The representations produced are then assigned dialogue acts and used to update the model of the discourse, which in turn may be used by the speech recognizer to choose the current language model. The transfer component takes the (possibly resolved) semantic analysis of the input and builds a target language representation. The generator then constructs the corresponding English expression. For robustness, this deep-level processing strategy is complemented with a shallow analysis-and-transfer component.</Paragraph>
  </Section>
  <Section position="4" start_page="131" end_page="132" type="metho">
    <SectionTitle>
3 Underspecified Representations
</SectionTitle>
    <Paragraph position="0"/>
    <Section position="1" start_page="131" end_page="131" type="sub_section">
      <SectionTitle>
3.1 Theoretical Background
</SectionTitle>
      <Paragraph position="0"> Since the Verbmobil domain is related to discourse rather than isolated sentences, a variant of Kamp's Discourse Representation Theory, DRT (Kamp and Reyle, 1993) has been chosen as the model theoretic semantics. Itowcver, to allow for underspecification of several linguistic phenomena, we have chosen a formalism that is suited to represent underspecified structures: LUD, a description language for underspecified discourse representations (Bos, 1995). The basic idea is the one given in Section 1, namely that natural language expressions are not directly translated into Discourse Representation Structures (DRSs), but into a representation that describes several DRSs.</Paragraph>
      <Paragraph position="1"> Representations in LUD have the following distinct features. Firstly, all elementary semantic &amp;quot;bits&amp;quot; (conditions, entities, and events) are uniquely labeled. This makes them easy to refer to and results in a very powerful description language. Secondly, meta variables over DRSs (which we call holes) allow for the assignment of under-specified scope to a semantic operator. Thirdly, a subordination relation on the set of holes and labels constrains the number of interpretations of the LUD-representation in the object language: DRSs.</Paragraph>
    </Section>
    <Section position="2" start_page="131" end_page="132" type="sub_section">
      <SectionTitle>
3.2 LUD-Representations
</SectionTitle>
      <Paragraph position="0"> A LUD-representation U is a triple &lt; Hu,Lu,Cu &gt; where Hu is a set of holes (variables over labels), Lu is a set of labeled (LUD) conditions, and Cu is a set of constraints. A plugging is a bijective function from holes to labels. For each plugging there is a corresponding DRS. The syntax of LUD-conditions is formally defined as follows:  1. If x is a discourse marker (i.e., entity or event), then din(x) is a LUDcondition; null 2. If R is a symbol for an n-place relation, xl,..., xn are discourse markers, then pred(R, xl,...,x,~) is a LUD-condition; 3. If I is a label or hole for a LUD-condition, then -~l is a LUDcondition; null 4. If 11 and 12 are labels (or holes) for LUD-conditions, then 11 --+ 12, 11AI2 and 11 V 12 are LUD-conditions; 5. Nothing else is a LUD-condition.</Paragraph>
      <Paragraph position="1">  There are three types of constraints in LUDrepresentations. There is subordination (&lt;_), strict subordination (&lt;), and fimdly presupposition (c~). These constraints are syntactically defined as: If/i, l.~ are labels, h is a hole, then It &lt; h, 11 &lt; 12 and l~ ~ 12 are LUD-constraints. The interpretation of a LUD-representation is the interpretation of top, the label or hole of a LUi)-representation tbr which tt,ere exists no label thai; subordinates it. ~ The interpretation fnnction I is a function from a labeled condition to a DRS. This hmction is defined with respect to a plugging P. We represent a I)RS ~ a box ~DI~ , where D is the set of discourse markers and C is the set of conditions. The mappings between LUD-conditions and I)RSs are then detiued in (2)-(9) where l is a label or hole and ~b is a labeled condition.</Paragraph>
      <Paragraph position="3"> DRSs K1 and K= and returns a I)RS which domain is the nnion of the set of the domains of K1 and K2, and which conditions form the union of the set of the conditions of K1 and K2.</Paragraph>
      <Paragraph position="4"> 2q.'he reade.r interested in a more detailed discus-SiGn of the iul;erl)retation of underspccified semanti(: representations is referred to (Bos, 19{)5).</Paragraph>
    </Section>
    <Section position="3" start_page="132" end_page="132" type="sub_section">
      <SectionTitle>
3.3 Lexical Entries and Composition
</SectionTitle>
      <Paragraph position="0"> For building LUD-reprcsentations we use a lambda-operator and functional application in order to compositionally combine simple LUD-representations to complex ones. In addition, we have two functions that help us to keep track of the right labels. These are top, as described above, and main, the label of the semantic head of a LU1)-reprcsentation. Further, we have an operation that combines two LUD-representations into one: q) (merge for LUl)-representations). Some sample lexical entries for German as well as a sample derivation, are shown in Figure 1.</Paragraph>
    </Section>
  </Section>
  <Section position="5" start_page="132" end_page="132" type="metho">
    <SectionTitle>
4 Related Work
</SectionTitle>
    <Paragraph position="0"> The LUD representation is quite closely related to UDI{Ss, underspecified l)t{Ss (Reyle, 1993). The main difference is that the I,UI) description language in principle is independent of the object langnage, thus not only DI{T, but also or(tinary predicate logic, as well as a Dynamic Predicate Logic (Groenendijk and Stokhof, 1991) can be used as the object language of LUI), as shown in (Bos, 1995). Compared to UDRS, LUD also has a stronger descriptive power: Not DRSs, but the smallest possible semantic components are uniquely labeled.</Paragraph>
    <Paragraph position="1"> The Verbmobil system is a translation system built by some 30 different groups in three countries. The semantic \[brmalism used on the English generation side has been developed by CSLI, Stanford and is called MRS, Minimal Recursion Semantics (Copest;ake eL al., 1995). The deep-level syntactic and semalttic German processing of Verbmobil is also done along two parallel paths.</Paragraph>
    <Paragraph position="2"> The other path is developed by IBM, lleidelberg and uses a variant of MRS, Underspecified Minimal Recnrsion Semantics (UMRS) (Egg and Lebeth, 1995). All the three formalisms LUD, MRS, and UMRS have in common that they use a fiat, nco-Davidsoniau representation and allow for the nndcrspecification o\[&amp;quot; functor-argmnent relations. In MRS, this is done by unification of the relations with unresolved dependencies. This, however, results in structures which cannot be fltrther resolved. In UMRS this is modified by expressing the scoping possibilities directly as disiunctions.</Paragraph>
    <Paragraph position="3"> The main difference between both types of MRSs and LUI) is that the interpretation of LUI) in an object language other than ordinary predicate logic is well delined, as described in Section 3.2.</Paragraph>
    <Paragraph position="4"> The translation task of the SICS-SRI l:/ilin-gnal Conversation Interpreter, BCI (Alshawi et al., 1991) is quite similar to that of Verbmobil.</Paragraph>
    <Paragraph position="5"> The BCI does translation at the level of Quasi-</Paragraph>
    <Paragraph position="7"> Logical Form, QLF which also is a monotonic representation language for compositional semantics as discussed in (Alshawi and Crouch, 1992).</Paragraph>
    <Paragraph position="8"> The QLF formalism incorporates a Davidsonian approach to semantics, containing underspecified quantifiers and operators, as well as 'anaphoric terms' which stand for entities and relations to be determined by reference resolution. In these respects, the basic ideas of the QLF formalism are quite similar to LUD.</Paragraph>
  </Section>
  <Section position="6" start_page="132" end_page="135" type="metho">
    <SectionTitle>
5 Syntax-Semantics Interface and
</SectionTitle>
    <Paragraph position="0"/>
    <Section position="1" start_page="132" end_page="134" type="sub_section">
      <SectionTitle>
Implementation
5.1 Grammar
</SectionTitle>
      <Paragraph position="0"> The LUD semantic construction component has been implemented in the grammar formalism TUG, Trace and Unification Grammar (Block and Schachtl, 1992), in a system called TrUG (in cooperation with Siemens AG, Munich, who provided the German syntax and the TrUG system). TUG is a formalism that combines ideas from Government and Binding theory, namely the use of traces, with unification in order to account for, for example, the free word order phenomena found in German.</Paragraph>
      <Paragraph position="1">  A TUG grammar basically consists of PATR-II style context free rules with feature annotations. Each syntactic rule gets annotated with a semantic counterpart. In this way, syntactic derivation and semantic construction are fully interleaved and semantics can further constrain the possible readings of the input.</Paragraph>
      <Paragraph position="2"> In order to make our formalisation executable, we employ the TrUG system, which compiles our rules into an efficient Tomita-style parser. In addition TrUG incorporates sortal information, which is used to rank parsing results.</Paragraph>
      <Paragraph position="3"> Consider a simplified example of a syntactic rule annotated with a semantic functor-argument application. null</Paragraph>
      <Paragraph position="5"> In this example, a sentence s consists of an np and a vp. The first feature equation annotated to this rule says that the value of the feature agr (for agreement) of the np equals that of the respective feature value of the vp.</Paragraph>
      <Paragraph position="6">  A category symbol like np in the rule above also stands for the entry node of its associated feature structure. This property is used for the semantic counterpart of the rule: lud..fun_.arg is a call to a semantic rule, a macro in the TUG notation, which defines functor-argument application. Since the macro gets the entry nodes of the fea4 ture structures as arguments, all the information present in the feature structures can be accessed within the macro which is defined as</Paragraph>
      <Paragraph position="8"> lud_ cont ext_equal (Fun, Result ), context (Fun, FunContext), context (hrg, ArgCont ext ), subcat (Result, ResultSc), subcat (Fun, \[hrgContext \] ResultSc\] ).  The functor-argument application is based on the notion of the context of a LUD-representation. The context of a LUD-representation is a three-place structure consisting of the LUDrepresentation's main label and top hole (as described in Section 3.3) and its main instance, which is a discourse marker or a lambda-bound variable. A LUD-representation also has a semantic subcategorization list under the feature subcat which performs the same function as a A-prefix. This list consists of the contexts of the arguments a category is looking for.</Paragraph>
      <Paragraph position="9"> The functor-argumcnt application macro thus says the following. The context of the result is the context of the functor. The functor is looking for the argument as the first element on its subcat list, while the result's subcat list is that of the functor minus the argument (which has been bound in the rule). The binding of variables between functor and argument takes place via the subcat list, through which a functor can access the main instance and thc main label of its arguments and state relations between them.</Paragraph>
      <Paragraph position="10"> Note that the only relevant piece of information contained in a LUD-representation for the purpose of composition is its context. Its content in terms of semantic prcdicates is handled differently. The predicates of a LUD-representation are stored in a special slot provided for each category by the TrUG systcm. The contents of this slot is handed up the tree from the daughters to the mother completely monotonically. So the predicates introduced by some lexical entry percolate up to the topmost node automatically.</Paragraph>
      <Paragraph position="11"> These two restrictions, the use of only a LUDrepresentation's context in composition and the monotonic percolation of semantic predicates up the tree, make the system completely compositional in the sense defined in Section 1.</Paragraph>
      <Paragraph position="12">  To see how the composition interacts with the lexicon, consider the following lexical macro defining the semantics of a transitive verb  subcat(Cat,\[lud(Argl .... ), lud(Arg2 .... )\] ).</Paragraph>
      <Paragraph position="13">  The macro states that a transitive verb introduces a basic predicate of a certain relation with an instance and a label. The instance is related to its two arguments by argument roles. The arguments' instances are accessed via the verb's subcat list (and get bound during functor-argument application, cf. above). The labels introduced are grouped together; the group label is the main label of the LUD-representation, the instance its main instance. Another property of the verb's semantics is that it introduces the top hole of the sentence.</Paragraph>
    </Section>
    <Section position="2" start_page="134" end_page="135" type="sub_section">
      <SectionTitle>
5.2 Interfaces to Other Components
</SectionTitle>
      <Paragraph position="0"> As sketched in Section 2, our semantic construction component delivers output to the components for semantic evaluation and transfer. The paragraphs that follow describe the common interface to these two components.</Paragraph>
      <Paragraph position="1">  Generating a scopally resolved LUD-representation from an underspecified one is the process which we referred to as plugging in Section 3.2. It aims at making the possibly ambiguous semantics captured by a LUD unique. Obviously, purely mathematical approaches for transforming the partial ordering encoded in the leq constraints into a total ordering may yield many results.</Paragraph>
      <Paragraph position="2"> Fortunately, linguistic constraints allow us to reduce the effort that has to be put into the computation of pluggings. An example is the linguistic observation that a predicate that encodes sentence mood in many cases modifies all of the remainder of the proposition for a sentence. Thus, pluggings where the predicate for sentence mood is subject to a leq constraint should not be considered. They would result in a resolved structure expressing that the mood-predicate does not have scope over the remaining proposition. This would be contrary to the linguistic observation.</Paragraph>
      <Paragraph position="3">  As a supplement to semantic predicates, our output contains various kinds of additional information. This is caused by the overall architecture of the Verbmobil system which does not provide for fully-interconnected components. There is, e.g., no direct connection between the speech recognizer and the component for semantic evaluation. Thus, our component has to pipe certain kinds of information (like prosodic values). Accordingly, our output consists of &amp;quot;Verbmobil Interface Terms&amp;quot; (VITs), which differ slightly from the LUD-terms described above mainly in that they include non-semantic information.</Paragraph>
    </Section>
    <Section position="3" start_page="135" end_page="135" type="sub_section">
      <SectionTitle>
5.3 Implementation Status
</SectionTitle>
      <Paragraph position="0"> Currently, the lexicon of the implemented system contains about 1.400 entries (full forms) and the grammar consists of about 400 syntactic rules, of which about 200 constitute a subgrammar for temporal expressions. The system has been tested on three simplified dialogues from a corpus of spoken language appointment scheduling dialogues collected for the project and processes about 90% of the turns the syntax can deal with.</Paragraph>
      <Paragraph position="1"> The system is currently being extended to cover nine additional dialogues from the corpus completely. The size of the lexicon will then be about 2500 entries, which amounts to about 1700 lemmata. null</Paragraph>
    </Section>
  </Section>
class="xml-element"></Paper>
Download Original XML