File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/95/p95-1008_metho.xml

Size: 21,297 bytes

Last Modified: 2025-10-06 14:14:01

<?xml version="1.0" standalone="yes"?>
<Paper uid="P95-1008">
  <Title>DATR Theories and DATR Models</Title>
  <Section position="3" start_page="55" end_page="55" type="metho">
    <SectionTitle>
2 DATR Theories
</SectionTitle>
    <Paragraph position="0"> Let NODE and ATOM be disjoint sets of symbols (the nodes and atoms respectively). Nodes are denoted by N and atoms by a. The set DESC of DATR value descriptors (or simply descriptors) is built up from the atoms and nodes as shown below. Descriptors are denoted by d.</Paragraph>
    <Paragraph position="2"> Value descriptors are either atoms or inheritance descriptors, where an inheritance descriptor is further distinguished as either local (unquoted) or global (quoted). There is just one kind of local descriptor (node/path), but three kinds of global descriptor (node/path, path and node) 1 A path (al...an) is a (possibly empty) sequence of atoms enclosed in angle brackets. Paths are denoted by P. For N a node, P a path and a * ATOM* a (possibly empty) sequence of atoms, an equation of the form N : P = a is called an extensional sentence.</Paragraph>
    <Paragraph position="3"> Intuitively, an extensional sentence N : P = a states that the value associated with the path P at node N is a. For C/ a (possibly empty) sequence of value descriptors, an equation of the form N : P == C/ is called a definitional sentence. A definitional sentence N : P --- C/ specifies a property of the node N, namely that the path P is associated with the value defined by the sequence of value descriptors C/.</Paragraph>
    <Paragraph position="4"> A collection of equations can be used to specify the properties of different nodes in terms of one another, and a finite set of DATR sentences 7- is called a DATR theory. In principle, a DATR theory 7&amp;quot; may consist of any combination of DATR sentences, either definitional or extensional, but in practice, DATR theories are more restricted than this. The theory 7- is said to be definitional if it consists solely of definitional sentences and it is said to be functional if it meets the following condition: 1The syntax presented in (Evans and Gazdar, 1989a; Evans and Gazdar, 1989b) permits nodes and paths to stand as local descriptors. However, these additional forms can be viewed as conventional abbreviations, in the appropriate syntactic context, for node/path pairs N : P == ~b and N : P == C/ E 7&amp;quot; implies ~b = C/ There is a pragmatic distinction between definitional and extensional sentences akin to that drawn between the language used to define a database and that used to query it. DATR interpreters conventionally treat all extensional sentences as 'goal' statements, and evaluate them as soon as they are encountered. Thus, it is not possible, in practice, to combine definitional and extensional sentences within a theory 2. Functionality for DATR theories, as defined above, is really a syntactic notion. However, it approximates a deeper, semantic requirement that the nodes should correspond to (partial) functions from paths to values.</Paragraph>
    <Paragraph position="5"> In the remainder of this paper we will use the term (DATR) theory always in the sense functional, definitional (DATR) theory. For a given DATR theory 7&amp;quot; and node N of 7&amp;quot;, we write 7&amp;quot;/N to denote that subset of the sentences in 7&amp;quot; that relate to the node N. That is:</Paragraph>
    <Paragraph position="7"> The set TIN is referred to as the definition of N (in 7-).</Paragraph>
  </Section>
  <Section position="4" start_page="55" end_page="57" type="metho">
    <SectionTitle>
3 An Overview of DATR
</SectionTitle>
    <Paragraph position="0"> An example of (a fragment of) a DATR theory is shown in figure 1. The theory makes use of some standard abbreviatory devices that enable nodes and/or paths to be omitted in certain cases. For example, sets of sentences relating to the same node are written with the node name implicit in all but the first-given sentence in the set. Also, we write See : 0 == Verb to abbreviate the definitional sentence See: 0 == Verb : 0, and similarly elsewhere. null The theory defines the properties of seven nodes: an abstract Verb node, nodes EnVerb, Aux and Modal, and three abstract lexemes Walk, Mow and Can. Each node is associated with a collection of definitional sentences that specify values associated with different paths. This specification is achieved either explicitly, or implicitly. Values given explicitly are specified either directly, by exhibiting a particular value, or indirectly, in terms of local and/or global inheritance. Implicit specification is achieved via DATR's default mechanism.</Paragraph>
    <Paragraph position="1"> For example, the definition of the Verb node gives the values of the paths (syn cat) and (syn type) directly, as verb and main, respectively. Similarly, the definition of Walk gives the value of (mor root / directly as walk. On the other hand, the value of</Paragraph>
    <Paragraph position="3"> (mor form)== &amp;quot;(mor &amp;quot;(syn form)&amp;quot;)&amp;quot; (mor pres) == &amp;quot;(mor root)&amp;quot; (mor past) == &amp;quot;(mor root)&amp;quot; ed (mor pres part) --= &amp;quot;(mor root)&amp;quot; ing (mor pres sing three) == &amp;quot;(mor root)&amp;quot;</Paragraph>
    <Paragraph position="5"> the empty path at Walk is given indirectly, by local inheritance, as the value of the empty path at Verb.</Paragraph>
    <Paragraph position="6"> Note that in itself, this might not appear to be particularly useful, since the theory does not provide an explicit value for the empty path in the definition of Verb. However, DATR's default mechanism permits any definitional sentence to be applicable not only to the path specified in its left-hand-side, but also for any rightward extension of that path for which no more specific definitional sentences exist. This means that the statement Walk : 0 == Verb : 0 actually corresponds to a class of implicit definitional sentences, each obtained by extending paths on the left- and the right-hand-sides of the equation in the same manner. Examples include the following: Walk: (mor) == Verb: (mor) Walk: (mor form) =- Verb: (mor form) Walk : (syn cat) == Verb : (syn cat) Thus, the value associated with (syn cat) at Walk is given (implicitly) as the value of (syn cat) at Verb, which is given (explicitly) as verb. Also, the values of (mor) and (mor form), amongst many others, are inherited from Verb. In the same way, the value of (syn cat) at Mow is inherited locally from EnVerb (which in turn inherits locally from Verb) and the value of (syn cat) at Can is inherited locally from Modal (which ultimately gets its value from Verb via Aux). Note however, that the following sentences do not follow by default from the specifications given at the relevant nodes: Walk: (mor root) == Verb: (mor root) Can: (mor past) == Modal: (mor past) Aux: (syn type) == Verb: (syn type) In each of the above cases, the theory provides an explicit statement about the value associated with the indicated path at the given node. As a result the default mechanism is effectively over-ridden.</Paragraph>
    <Paragraph position="7"> In order to understand the use of global (i.e. quoted) inheritance descriptors it is necessary to introduce DATR's notion of a global context. Suppose then that we wish to determine the value associated with the path (mor pres) at the node Walk. In this case, the global context will initially consist of the node/path pair Walk//mor pres). Now, by default the value associated with (mor pres) at Walk is inherited locally from (mor pres) at Verb. This, in turn, inherits globally from the path (mor root).</Paragraph>
    <Paragraph position="8"> That is: Verb: (mor pres) == &amp;quot;(mor root)&amp;quot; Consequently, the required value is that associated with (mor root) at the 'global node' Walk (i.e.</Paragraph>
    <Paragraph position="9"> the node provided by the current global context), which is just walk. In a similar fashion, the value  walk ed (i.e. the string of atoms formed by evaluating the specification &amp;quot;(mor root)&amp;quot; ed in the global context Walk/(mor past)).</Paragraph>
    <Paragraph position="10"> More generally, the global context is used to fill in the missing node (path) when a global path (node) is encountered. In addition however, the evaluation of a global descriptor results in the global context being set to the new node/path pair. Thus in the preceding example, after the quoted descriptor &amp;quot;(mor root)&amp;quot; is encountered, the global context effectively becomes Walk / (mor root) (i.e. the path component of the global context is altered). Note that there is a real distinction between a local inheritance descriptor of the form N : P and it's global counterpart &amp;quot;N : P'. The former has no effect on the global context, while the latter effectively overwrites it.</Paragraph>
    <Paragraph position="11"> Finally, the definition of Verb in the theory of figure 1 illustrates a use of the 'evaluable path' construct: null Verb: (mor form) == &amp;quot;(mor &amp;quot;(syn form)&amp;quot;)&amp;quot; This states that the value of (mot form) at Verb is inherited globally from the path (mor...), where the dots represent the result of evaluating the global path &amp;quot;(syn form)&amp;quot; (i.e. the value associated with (syn form) in the prevailing global context). Evaluable paths provide a powerful means of capturing generalizations about the structure of lexical information. null</Paragraph>
  </Section>
  <Section position="5" start_page="57" end_page="59" type="metho">
    <SectionTitle>
4 DATR Models
</SectionTitle>
    <Paragraph position="0"> To a first level of approximation, the DATR theory of figure 1 can be understood as a representation of an inheritance hierarchy (a 'semantic network') as shown in figure 2. In the diagram, nodes are written as labelled boxes, and arcs correspond to (local) inheritance, or isa links. Thus, the node Can inherits from Modal which inherits from Aux which in turn is a Verb. The hierarchy provides a useful means of visualising the overall structure of the lexical knowledge encoded by the DATR theory. However, the semantic network metaphor is of far less value as a way of thinking about the DATR language itself.</Paragraph>
    <Paragraph position="1"> Note that there is nothing inherent in DATR to ensure that theories correspond to simple isa hierarchies of the kind shown in the figure. What is more, the DATR language includes constructs that cannot be visualized in terms of simple networks of nodes connected by (local) inheritance links. Global inheritance, for example, has a dynamic aspect which is difficult to represent in terms of static links. Similar problems are presented by both string values and evaluable paths. Our conclusion is that the network metaphor is of primary value to the DATR user. In order to provide a satisfactory, formal model of how the language 'works' it is necessary to adopt a different perspective.</Paragraph>
    <Paragraph position="2"> DATR theories can be viewed semantically as collections of definitions of partial functions ('nodes' in DATR parlance) that map paths onto values. A model of a DATR theory is then an assignment of func- null tions to node symbols that is consistent with the definitions of those nodes within the theory. This picture of DATR as a formalism for defining partial functions is complicated by two features of the language however. First, the meaning of a given node depends, in general, on the global context of interpretation, so that nodes do not correspond directly to mappings from paths to values, but rather to functions from contexts to such mappings. Second, it is necessary to provide an account of DATR's default mechanism. It will be convenient to present our account of the semantics of DATR in two stages.</Paragraph>
    <Section position="1" start_page="58" end_page="58" type="sub_section">
      <SectionTitle>
4.1 DATR Interpretations
</SectionTitle>
      <Paragraph position="0"> This section considers a restricted version of DATR without the default mechanism. Section 4.2 then shows how implicit information can be modelled by treating value descriptors as families of values indexed by paths.</Paragraph>
      <Paragraph position="2"> to U*.</Paragraph>
      <Paragraph position="3"> 3. F is a valuation function assigning to each node N and atom a an element of U, such that distinct atoms are assigned distinct elements. Elements of the set U are denoted by u and elements of U* are denoted by v. Intuitively, U* is the domain of (semantic) values/paths. Elements of the set C = (U x U*) are called contexts and denoted by c. The function tC/ can be thought of as mapping global contexts onto (partial) functions from local contexts to values. The function F is extended to paths, so that for P = (ax.-.a,~) (n &gt; 0) we write F(P) to denote Ul ...un E U*, where ui = F(ai) for each i (1 &lt; i &lt; n).</Paragraph>
      <Paragraph position="4"> Intuitively, value descriptors denote elements of U* (as we shall see, this will need to be revised later in order to account for DATR's default mechanism). We associate with the interpretation I = (U, t:, F) a partial denotation function D : DESC -'-+ (C -+ U*) and write \[d\], to denote the meaning (value) of descriptor d in the global context c. The denotation function is defined as shown in figure 3. Note that an atom always denotes the same element of U, regardless of the context. By contrast, the denotation of an inheritance descriptor is, in general, sensitive to the global context c in which it appears. Note also that in the case of a global inheritance descriptor, the global context is effectively altered to reflect the new local context c'. The denotation function is extended to sequences of value descriptors in the obvious way. Thus, for C/ =dl .. &amp;quot;dn (n &gt;_ 0), we write \[C/\],todenotevl.-.vn E U* ifvi = \[di\]c (1 &lt; i &lt; n) is defined (and \[C/\], is undefined otherwise).</Paragraph>
      <Paragraph position="5"> Now, let I = (U, s, F) be an interpretation and 7&amp;quot; a theory. We will write \[T/N\]c to denote that partial function from U* to U* given by</Paragraph>
      <Paragraph position="7"> It is easy to verify that \[T/N\], does indeed denote a partial function (it follows from the functionality of the theory 7-). Let us also write \[N\], to denote that partial function from U* to U* given by \[N\],(v) = ~(c)(F(N),v), for all v e U*. Then, I models 7just in case the following containment holds for each node N and context c: \[N\], _.D \[T/N\], That is, an interpretation is a model of a DATR theory just in case (for each global context) the function it associates with each node respects the definition of that node within the theory.</Paragraph>
    </Section>
    <Section position="2" start_page="58" end_page="59" type="sub_section">
      <SectionTitle>
4.2 Implicit Information and Default
Models
</SectionTitle>
      <Paragraph position="0"> The notion of a model presented in the preceding section is too liberal in that it takes no account of information implicit in a theory. For example, consider again the definition of the node Walk from the theory of figure 1, and repeated below.</Paragraph>
      <Paragraph position="1"> Walk: 0==Verb (mor root) == walk According to the definition of a model given previously, any model of the theory of figure 1 will associate with the node Walk a function from paths to values which respects the above definition. This means that for every global context c, the following containment must hold3: \[Walk\], ~ {(0, \[Verb: 0\]*), ((mor root), walk)} On the other hand, there is no guarantee that a given model will also respect the following containment: null \[Walk\]e _D {((mor), \[Verb: (mor)\],), ((mor root root),walk)} In fact, this containment (amongst other things) should hold. It follows 'by default' from the statements made about Walk that the path (mor) inherits locally from Verb and that the value associated with any extension of (mor root) is walk.</Paragraph>
      <Paragraph position="2">  if vi = ~di\]c is defined for each i (1 &lt; i &lt; n), then = t~(c)(F(N),vl.. &amp;quot;vn) undefined otherwise if vi = \[di\]e is defined for each i (1 &lt; i &lt; n), then iC/(c')(d) where d = (F(N), vl ... vn) undefined otherwise if vi = \[di\]e is defined for each i (1 &lt; i &lt; n), then = ~C/(d)(d) where c = (u, v) and d = (u, Vl..-v,) undefined otherwise = iC/(d)(e) where c = (u, v) and d= (F(N), v)  There have been a number of formal treatments of defaults in the setting of attribute-value formalisms (Carpenter, 1993; Bouma, 1992; Russell et al., 1992; Young and Rounds, 1993). Each of these approaches formalizes a notion of default inheritance by defining appropriate operations (e.g. default unification) for combining strict and default information. Strict information is allowed to over-ride default information where the combination would otherwise lead to inconsistency (i.e. unification failure). In the case of DATR however, the formalism does not draw an explicit distinction between strict and default values for paths. In fact, all of the information given explicitly in a DATR theory is strict. The non-monotonic nature of DATR theories arises from a general, default mechanism which 'fills in the gaps' by supplying values for paths not explicitly specified in a theory. More specifically, DATR's default mechanism ensures that any path that is not explicitly specified for a given node will take its definition from the longest prefix of that path that is specified. Thus, the default mechanism defines a class of implicit, definitional sentences with paths on the left that extend paths found on the left of explicit sentences. Furthermore, this extension of paths is also carried over to paths occurring on the right. In effect, each (explicit) path is associated not just with a single value specification, but with a whole family of specifications indexed by extensions of those paths. This suggests the following approach to the semantics of defaults in DATR. Rather than interpreting node definitions (in a given global context) as partial functions from paths to values (i.e. of type U* --+ U*) we choose instead to interpret them as partial functions from (explicit) paths, to functions from extensions of those paths to values (i.e. of type U* -+ (U* --+ U*)). Now suppose that f : U* --~ (U* --~ U*) is the function associated with the node definition T/N in a given DATR interpretation. We can define a partial function A(f) : U* --~ U* (the default interpretation of T/N) as follows. For each</Paragraph>
      <Paragraph position="4"> where v = vlv2 and vx is the longest prefix of v such that f(vl) is defined. In effect, the function A(f) makes explicit that information about paths and values that is only implicit in f, but just in so far as it does not conflict with explicit information provided by f.</Paragraph>
      <Paragraph position="5"> In order to re-interpret node definitions in the manner suggested above, it is necessary to modify the interpretation of value descriptors. In a given global context c, a value descriptor d now corresponds to a total function \[d\]~ : U* --+ U* (intuitively, a function from path extensions to values). For example, atoms now denote constant functions: \[a\]c(v) = F(a) for all v G U&amp;quot; More generally, value descriptors will denote different values for different paths. Figure 4 shows the revised clause for global node/path pairs, the other definitions being very similar. Note the way in which the 'path' argument v is used to extend Vl ...vn in order to define the new local (and in this case also, global) context c ~. On the other hand, the meaning of each of the di is obtained with respect to the 'empty path' e (i.e. path extension does not apply to subterms of inheritance descriptors).</Paragraph>
      <Paragraph position="6"> As before, the interpretation function is extended to sequences of path descriptors, so that for C/ = dl...d, (n &gt;_ o) we have \[C/\]~(v) = Vl...v, G V*, if vi = Idil(v) is defined, for each i (1 &lt; i &lt; n) (and \[C/\],(v) is undefined otherwise). The definition of the interpretation of node definitions can be taken over unchanged from the previous section. However, for a theory T and node N, the function \[T/N\]e is now of type U* --+ (U* ~ U*). An interpretation I = (U, x, F) is a default model for theory T just in case for every context c and node N we have: IN\], _~ A(IT&amp;quot;/NI,) As an example, consider the default interpretation of the definition of the node Walk given above. By  definition, any default model of the theory of figure 1 must respect the following containment: \[W kL ((mor root), Av.walk)} /,From the definition of A, it follows that for any path v, if v extends (mor root), then it is mapped onto the value walk, and otherwise it is mapped to the value given by \[Verb : 0It(v). We have the following picture: \[Walklc _D {(0, \[Verb: Oft(O)), ((mor), \[Verb: Olc((mor))), ((mor root), walk), ((mor root root), walk), * . .} The default models of a theory 7&amp;quot; constitute a proper subset of the models ofT: just those that respect the default interpretations of each of the nodes defined within the theory.</Paragraph>
    </Section>
  </Section>
class="xml-element"></Paper>
Download Original XML