XML Viewer - c02-1156

File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/02/c02-1156_metho.xml
Size: 19,220 bytes
Last Modified: 2025-10-06 14:07:50
<?xml version="1.0" standalone="yes"?>
<Paper uid="C02-1156">
  <Title>Putting Frames in Perspective</Title>
  <Section position="3" start_page="0" end_page="0" type="metho">
    <SectionTitle>
2 The FrameNet COMMERCE frame
</SectionTitle>
    <Paragraph position="0"> The FrameNet project has thus far produced two databases: a collection of approximately 80 frames with frame descriptions, chosen to cover a broad range of semantic domains; and a hand-annotated dataset of about 50,000 sentences from the British National Corpus (Baker et al., 1998). The databases  wide variety of lexical items (or lemmas) and thus have the potential to allow corpus-based techniques to be applied to semantically oriented tasks.2 The current release of the FrameNet databases3 defines a COMMERCE frame with frame elements including the familiar Buyer, Seller, Payment and Goods, along with several other FEs needed to cover the data. The frame includes 10 verbs relevant to commercial transactions, for a total of 575 annotated sentences. Figure 1 shows a sampling of data annotated with respect to the COMMERCE frame.</Paragraph>
    <Paragraph position="1"> Considerable research has been devoted to explicating the connections among frames, perspective, and argument structure; see Gawron (ms.) and Hudson (2002). But there has been relatively less work that addresses inferential issues related to perspective. The COMMERCE frame, for example, is implicitly associated with a complex, dynamic network of interrelated events, actions and participants. Our proposal is that perspectival effects may be best understood in terms of subtle inferential effects on interpretation licensed by this network.</Paragraph>
    <Paragraph position="2"> Our task, then, is to make this inferential structure 2See (Gildea and Jurafsky, 2000) for some promising initial work in applying statistical techniques to the FrameNet database to automatically label frame elements.</Paragraph>
    <Paragraph position="3"> 3We refer to data from FrameNet I; an interim release of FrameNet II is expected soon.</Paragraph>
    <Paragraph position="4"> explicit. We take the original COMMERCE frame as our starting point and define the interrelationships present among its FEs. The additional structure we impose on the COMMERCE frame allows us to distinguish a perspective-neutral description of a commercial transaction from the perspectivized situations described by particular verbs. The resulting event representation can be integrated with a simulation-based inference engine to account for differences in the interpretation of sentences like those in the annotated FrameNet data.</Paragraph>
  </Section>
  <Section position="4" start_page="0" end_page="0" type="metho">
    <SectionTitle>
3 Structured event representations
</SectionTitle>
    <Paragraph position="0"> In this section, we present a formal specification used for mapping the flat set of FEs in COMMERCE onto explicitly structured event representations based on the Embodied Construction Grammar (ECG) formalism. ECG is a constraint-based formalism similar in many respects to other unification-based linguistic formalisms, such as HPSG (Pollard and Sag, 1994).4 It differs from other lingustically motivated proposals in that it is 4ECG includes formalisms for both schemas (conceptual representations) and constructions (conventionalized pairings of form and meaning), described in (Bergen and Chang, 2002). We refer here only to the schema formalism in a simplified form. See (Chang et al., 2002) for a more complete version that has been extended to accommodate additional cognitive linguistic primitives.</Paragraph>
    <Paragraph position="1"> designed to support a model of language understanding in which utterances evoke a complex network of conceptual schemas that are then mentally simulated in context to produce a rich set of inferences. It is thus ideally suited for our current goal of translating frames to conceptual representations.</Paragraph>
    <Paragraph position="2"> Figure 2 presents the ECG schema definition language. The indented block labeled roles lists and constrains the schema's local roles, which are equivalent to features (or in this case, frame FEs).</Paragraph>
    <Paragraph position="3"> Roles are declared with a local name (local-role) and may be accompanied by type restrictions (indicated with ':'). Identification (or binding) constraints (indicated with 'a0a2a1 ') may appear in either the roles or the constraints block; these cause roles and constraints to be shared between its arguments, similar to unification or coindexation.5 The subcase relation defines a schema inheritance lattice, with the local schema inheriting all roles and constraints.</Paragraph>
    <Paragraph position="4">  are shown in bold; a left square bracket ([) marks optional blocks; and curly braces (a21a23a22 ) enclose a set of optional statements. See text for details.</Paragraph>
    <Paragraph position="5"> The formalism also has several novel features that we will exploit in representing commercial transactions. The most important of these are: (1) the ability to flexibly evoke and relate multiple schemas, due mainly to the evokes relation; and (2) the ability to assert dynamic conditions that apply to specific event stages, through the use of simulation constraints. We will describe each of these briefly, deferring details to the example schemas below.</Paragraph>
    <Paragraph position="6"> Schemas listed in the evokes block are instantiated locally (as local-name), but the relationship be5Constraints may refer to locally declared roles, inherited roles, and evoked schemas, as well as any roles available through these structures. Standard slot-chain notation is used to refer to role y of a structure x as x.y.</Paragraph>
    <Paragraph position="7"> tween the defined schema and the evoked schema is underspecified. This underspecification allows one schema to be defined in terms of another schema without implying either full inheritance of the evoked schema's roles or containment in either direction. In some cases, the evoked schema corresponds to a subpart of the evoking schema; alternatively, the evoked schema may serve as a background schema against which the evoking schema is defined. We will see examples of each below.</Paragraph>
    <Paragraph position="8"> Simulation constraints use the '::' notation to assert some condition on a particular phase of simulation - either a relation that must hold or an event or action that must take place during that phase. Simulation phases correspond to event stages; these constraints serve as the bridging connection to previous work on modeling event structure and linguistic aspect using active representations (Narayanan, 1997; Chang et al., 1998).</Paragraph>
    <Paragraph position="9"> We now show how the ECG formalism can be used to define more complex schemas that provide the underlying structure we need to tackle the COMMERCE frame; the key schemas for the current discussion are shown in Figure 3.6 The Event schema is of primary importance: it appears directly or indirectly in the rest of the schema definitions, and it serves as the crucial link to simulation. The definition given here is not intended to capture the full complexity of the most generalized event, which may have complex internal structure (start and finish subevents, ongoing period, etc.).</Paragraph>
    <Paragraph position="10"> At a coarser granularity, however, it may also be viewed as a discrete temporal chunk that takes place between two time slices. The schema as shown reflects this coarser view, which is sufficient for current purposes: its roles include before, after, and transition, all referring to simulation phases. Another role, the nucleus, is constrained only to hold or take place during the transition phase. Together these roles anchor the event to the passage of time.</Paragraph>
    <Paragraph position="11"> The other schemas are more complex. The Transfer schema corresponds to an event in which an agent causes a theme to be transferred from the source to the recipient. It is defined as evoking two other schemas: an Action schema (with an actor role) and a Receive schema (in which a receiver comes into possession of the received entity). (These are not shown, nor is the causal relation between them.) Note that both act and rec are conceptually distinct from the  nucleus role inherited from Event, although all are constrained to take place during the event's transition phase. The agent role is constrained to be the same entity as the actor of act. Importantly, the Transfer event schema makes no commitment as to whether its agent - the entity seen as causing the overall event - is the source, recipient or even theme.</Paragraph>
    <Paragraph position="12"> It is in this respect that the Transfer schema can be considered neutral in perspective.</Paragraph>
    <Paragraph position="13"> The Exchange schema is structurally similar to the Transfer schema and provides most of the relevant constraints needed for commercial transactions. It includes two transfer events that occur during the transition phase and are parameterized straightforwardly in the constraints block by two human participants and two entities. An additional agent role is not bound to any particular entity; this schema is thus also perspective-neutral, since either participant (or both) might be viewed as active.</Paragraph>
  </Section>
  <Section position="5" start_page="0" end_page="0" type="metho">
    <SectionTitle>
4 Commercial transaction schemas
</SectionTitle>
    <Paragraph position="0"> We are now in a position to return to the commerce domain and put our inventory of domain-general schemas to use. We first define the Commercial-Transaction (CT) schema as a subcase of the Exchange schema with appropriate role identifications and an additional type restriction on entity1. The role names in this schema differ slightly from those in FrameNet's COMMERCE, reflecting its perspective-neutral status. But given the obvious mapping to the FrameNet FEs, the CT schema fulfills part of our original objective: based on its inherited and evoked schemas and constraints, it concisely and precisely states the conceptual underpinnings of the basic commercial transaction.</Paragraph>
    <Paragraph position="1">  The CT schema provides the underlying infrastructure against which various perspectivized schemas can be defined. As shown in Figure 5, we treat Buy, Sell and Pay as schemas that evoke the CT schema and identify their roles with specific participants and event stages of the evoked CT schema. Note the use of the keyword self (which we treat as a special kind of role) to refer to the schema being defined: Buy and Sell schemas each identify self with the ct.nucleus role (that is, the nucleus of its evoked commercial transaction), and is thus constrained to take place during the evoked CT's transition phase.</Paragraph>
    <Paragraph position="2"> In contrast, since Pay identifies itself with ct.moneytransfer.nucleus, it refers specifically to a subpart of the overall commercial transaction, such that its execution does not necessarily entail the execution of the goods-transfer in the event (i.e., you don't always get what you pay for).</Paragraph>
    <Paragraph position="3"> The three schemas also differ in their participant role bindings: all are defined as subcases of  Transitive-Action (not shown), which corresponds to a prototypical situation in which an actor entity affects or manipulates an undergoer entity. The Buy and Sell schemas both identify the undergoer with ct.goods, and the actor with ct.agent. But the two schemas impose different views on the same situation by virtue of a single additional constraint on this latter role (which corresponds to the active participant in the overall CT), binding it to either the ct.customer (Buy) or the ct.vendor (Sell). The bindings in the Pay schema assert that its actor is the ct.customer, as well as the agent of the money-transfer. Other schemas associated with the CT schema lend themselves to similar analyses, though they draw on additional schemas not defined here. For example, the Spend schema evokes a schema for resource consumption (as in (Hudson, 2002)); Charge involves the vendor's communication of the price to the customer as a prerequisite to the overall exchange of goods and money. In general, the CT schema explicitly specifies the internal event structure of a commercial transaction but remains noncommittal about which of its participants is seen as active. This flexibility in representation allows other schemas to effect the bindings that make appropriate commitments on an individual basis.</Paragraph>
  </Section>
  <Section position="6" start_page="0" end_page="0" type="metho">
    <SectionTitle>
5 Simulation semantics
</SectionTitle>
    <Paragraph position="0"> The structured event formalism we have described allows us to translate FrameNet descriptions into a representation suitable for simulative inference.</Paragraph>
    <Paragraph position="1"> Central to the representation is an event model called executing schemas (or x-schemas), motivated by research in both sensorimotor control and cognitive semantics (Narayanan, 1997). X-schemas are active structures that cleanly capture sequentiality, concurrency and event-based asynchronous control. They thus provide a cognitively motivated basis for modeling diverse linguistic phenomena, including aspectual inference (Chang et al., 1998), metaphoric inference (Narayanan, 1999a) and event-based reasoning in narrative understanding (Narayanan, 1999b). In this paper, we focus on the problem of frame-based inference and the attendent problem of modeling perspectival effects.</Paragraph>
    <Paragraph position="2"> The event model is based on the Petri net, which in its basic form is a weighted, bipartite graph consisting of places (shown as circles) and transitions (shown as rectangles) connected by directed input and output arcs (Murata, 1989; Narayanan, 1997). Places may contain tokens (i.e., they may be marked), and they typically represent states, resources or conditions that apply. Transitions typically represent actions or events. X-schemas extend the basic Petri net to include typed arcs, hierarchical control, durative transitions, parameterization, typed (individual) tokens and stochasticity.</Paragraph>
    <Paragraph position="3"> The most relevant property of the x-schema for this paper is its well-specified execution semantics: a transition is enabled when all its input places are marked, such that it can fire by moving tokens from input to output places. The active execution semantics serves as the engine of context-sensitive inference in the simulation-based model of language understanding mentioned earlier.</Paragraph>
    <Paragraph position="4"> The ECG formalism is designed to allow constraints on x-schema simulation to be expressed.</Paragraph>
    <Paragraph position="5"> In particular, the Event schema in Figure 3 has roles that refer to event phases; these correspond to x-schema places and transitions. Other schema roles specify x-schema parameters, which allow x-schemas to give rise to different execution traces through the network with different parameters.</Paragraph>
    <Paragraph position="6"> The Commercial-Transaction schema has been implemented in the KarmaSIM x-schema simulation environment (Narayanan, 1997); Figure 6 shows part of the network. The phase roles from the schemas in Section 3 have been mapped onto the  ated with the Pay schema, corresponding to the money-transfer event. fine-grained temporal structure of each event, corresponding to the various control nodes in the network (ready, ongoing, finish, done, etc.); the transition phase referenced in the schemas is expanded as the start, ongoing and finish nodes. As shown, execution of the overall CT schema comprises the execution of two subsidiary events, the goods-transfer and the money-transfer. These need not be synchronized, but both must complete for the overall commercial transaction to complete (enforced by the arcs from ongoing(money-transfer) and ongoing(goods-transfer) to finish(transfers)). All the frame-based inferences of the CT frame (e.g., the seller (buyer) has the goods (money) until the goods-transfer (money-transfer) is completed, and the seller (buyer) has the money (goods) when the money-transfer (goods-transfer) is completed) come from simulating the CT frame.</Paragraph>
    <Paragraph position="7"> In the simulation framework, perspectival effects come in at least three flavors. First, the frame element binding patterns may differ among perspectives, as illustrated by Figure 5, in which the lexical item buy identifies the actor of the transitive-action with both the customer of the CT and the agent of the money-transfer. This issue of binding has been the focus of previous work (see Section 2); our approach is similar to construction-based proposals that explicitly represent the binding constraints for different frame element binding patterns.</Paragraph>
    <Paragraph position="8"> Second, some perspectives specify the specific subevents (or collection of subevents) to simulate while others require simulating the entire event frame. An example of this is shown in Figure 6, where the highlighted money-transfer portion of the network corresponds to a simulation of the Pay schema. The token in ongoing(ct) shows that there is an ongoing transaction, but the finish(transfers) transition is not enabled. Technically, the done(ct) place is not reachable (absent other information), since the simulation of Pay does not provide direct evidence for the occurrence of a goods-transfer.7 In contrast, both Buy and Sell involve simulating the entire transaction, include both transfers as well as the done(ct) node. (Thus, the entire network in Figure 6 can be considered an expansion of the CT schema's transition phase.) A third, more subtle aspect of perspective is related to the problem of linguistic focus. The perspectival difference between Buy and Sell, for instance, is only partially captured by their different FE bindings to the CT frame. Another difference stems from the foregrounding of specific relations: buy foregrounds the interaction between the Buyer and the Goods (including the eventual possession of the Goods), while sell foregrounds the interaction between the Seller and the Goods. Work in progress suggests that many foregrounding cases can be handled by simulating different parts of the event at varying degrees of detail. For example, the simulation for Buy could execute x-schemas in which the Buyer interacts with the Goods - such as the goods-transfer and its resulting possession (abbreviated as has(Chuck, car) in Figure 6) - at the default granu7Contextual or background knowledge could provide evidence for the other transfer or allow it to be inferred by default. larity, while other x-schemas are collapsed into less detailed simulations. (See (Narayanan, 1997) for a detailed model of simulation at multiple levels of granularity.) While the model is able to handle some of the issues pertaining to foregrounding and focus, a full account remains a topic of ongoing research.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML