File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/92/c92-4184_metho.xml

Size: 19,970 bytes

Last Modified: 2025-10-06 14:13:00

<?xml version="1.0" standalone="yes"?>
<Paper uid="C92-4184">
  <Title>JAUNT: A Constraint Solver for Disjunctive Feature Structures</Title>
  <Section position="3" start_page="0" end_page="0" type="metho">
    <SectionTitle>
2. Explicit disjunction + constraints
</SectionTitle>
    <Paragraph position="0"> Let us emmlder sentence (1). In order to simplify the following dlscussimt, we iL~stlnle that the sentence is prepmcessed ms in Figure 1. This preprocessing can be done \[W a simple context-free grammar that does not determhm PP-attachments. In the figur% \[...\] is a featnre structure, {...} is a llst, and {~{...Y,} is a disjunction. &amp;quot;Phus~ the variable S represents the (packed) structure of sentence (1) as a list of five eom\[}mmnts 1 each of whldl corresl)onds to a V: an NP, or a PP. The grammatical relation (gr=) and the modiflee (meal=) of the three PPs are disjunctions, meaning that one of the wdues shouhl be selected, but that the correct candidate has not yet beat determined. For example, the first PP &amp;quot;on the floor&amp;quot; has {~0,1~,} .'~s its and= vahm, which means it can modify either phrase 0 (the verb &amp;quot;put&amp;quot;) or phrase 1 (the NP &amp;quot;the block&amp;quot;). Not all the value combhtations of the disjunctions are allowed. In the above example, if a PP modifies the main verb, the grammatical relation should be loc.</Paragraph>
    <Paragraph position="1"> In JAUNT, constraints are introduced by addc statemeats. The program fragment (2) applies constraints between the moditlee and the grammatical relation of a pp.</Paragraph>
    <Paragraph position="2"> for W in S begin</Paragraph>
    <Paragraph position="4"> \[\]ere~ dots and question marks are operators for me: cessing components of bsts or feltture structurest. '\].'he ?The difference between a (lot and a question mark is that a Ac~.s DE COLING-92, NAIVI~S, 23-28 Ao(rr 1992 1 1 6 2 PRec. OF COLING-92, NANIES, AUG. 23-28, 1992 symbols t~ (logical and) an,I =&gt; (iml,ly) h~ve their ordinary logical meanings. In geuernl, ;tny first-order logical formula without qnantifieation is allowed as a ctmstraint. 'Fhe variable W is bonnd to ea~zh V~ NP~ and PP while the addc statelnen|,s lmtweee begin and end are executed. Tim lirst addc statement reaxls ~s ft)ll(,ws: If tile category of W is PP aud the category of the modifiee of W is either PP or NP, then the grammatictd relation of W should be postmod.</Paragraph>
    <Paragraph position="5"> The applied constraints are represented implicitly by an internal data structure called a consltvdnt oeln,o'rk (described t;tter).</Paragraph>
    <Paragraph position="6"> in axlrlititlu~ tam p,'ojeclivity constrzdut, that modifi~ cation liuks do unt crossover, can be progranmmd ~s h)llows:</Paragraph>
    <Paragraph position="8"> We have now obtained a packed representation that consists of explicit disjunctions, as in Figure 1, and constraints attached behind them. Each value conl-bination of the disjunctions that globally sat, isfies tim constraints exactly corresponds to one of the 14 parses of sentence (1).</Paragraph>
    <Paragraph position="9"> Every context-free parsing ,~lgorit.hnl timt ha.s a polynomial time bound prodnees a pu~cketl represen tation of the parsing results (for example, * chart ill chart parsing (Kaplan 1973), a pa,'si.g mat,'ix in the CKY nmthod (Yonnger 1967), and a .sha,~d-packedforest in qbmita's algoritlun (Tomita 1987)). These representations take advantage of tile regularities of syntactic ambiguities in context-free parslog. For example, sillce it is known that 1~ consecutive P\]'s ilave Cutalan(n) different p~zrses, it is possible to encode all PP-attachment ambiguities by renlemberiug only n and the position of tile PPs (Church ~ Patil 1982).</Paragraph>
    <Paragraph position="10"> However, once we try to extract ~ single illterpre tation item these representations, we face a prubhml, because such regularities may be vnid when new cun.straints ~re introduced for disnmbiguati,nl. Consider the application of constraint (4): A verll cannot have two h)eatives. (4) Tiffs constraint viohttes the regularity of the PI' att;miunent ambiguity and tl,ereh)r,~, the Cb'G be-led packed representations nlentioned ailove cannot hall-tile this new int~rmation properly without modifyiug the grammar significantly. Ill JAUNT~ this constraint is ~pplied by a simple addc statement (5).</Paragraph>
    <Paragraph position="12"> lebrrnaliy~ it ha.s beea proven that Constraint Depeudency Grammars: whose rules Ca~ b(! written a~s q.estion mrtrk allows ~ disjunction as its value, w\]mre~ a (lot does not. The cllrretlt inl|llelllentatiOll generates more e|flclent code for dots than for question m~rks.</Paragraph>
    <Paragraph position="14"> restricted f(irms of JAUNT program: have ~ weak generativr power strictly greater than that of CI&amp;quot;G (Maruy~ma 1991), This implies that certain types of pa~rsing results can be represenu..d by constraint networks but not by CFG based represmttatioos.</Paragraph>
    <Paragraph position="15"> Sen and Simmons (1988) proposed syntactic 9~phs and discussed the axlvantagos of having explicit, dis junctions in a packed data structure. Their represeu tation is similar to ours in tile seuse th;tt they have con~trahlts attached to the explicit disjnnctive data structure. However, they d. uot diseusa how to ~rp.</Paragraph>
    <Paragraph position="16"> ply disam\[liguation knowledge in order tn reduce the ambiguity effectiwqy, lu JAUNT, the underlying constraint saris\[action algorithm removes im:onsistmdl val-IteS ~cnd keeps tim constrai/it uetwork locally consistent. Consider, lot example~ the application of the new con straint (6): An object {:annot In! on two distinct objects a,t the same tin,e. (6) This constraint is written a~s follows:</Paragraph>
    <Paragraph position="18"> After this coustraint \[ta.s beet, evahl~ted~ tile and attribute of the t'P &amp;quot;on the t~ble&amp;quot; becomes {~0,2Z}, n~ strewn it, Figure 2, because the vMue 1 is locally inconsisteut ;mcnrtling to the coostraints applied su far, and central, pneti( il,ate ill any of the remaining seveu re;tdings.</Paragraph>
    <Paragraph position="19"> There },ave been several ~Lttenlpts to incorporate dis junctions in uniiicatinu-ba.sed grammars re.g. Karl tuoen 1984). Constr.'tints ;ere introduced by ~t unification between two disjuuctiw.' feature structures. A nnificatio, succeeds only if there are combinations (ff wducs of the disjunctions that s~tisfy tile equality con straints implied by the u,lificatio.. It, order to clarify the exl,ressiw~ power of fe~ture structures with general disjunctions, Kasper ~ Rounds (1986) defined a logic-be-led notation called FM1, A fornlula in FMI, can be rewritteu as an addc statement in JAUNT, and hence, constraints expre~ed hy a unification can also be expr~ssed in JAUNT. In ~|dition, in unification-based grammars, the nnly basic predicate is equality, aud other useful predicates, such em inequalities and set inclusion/membership, are diflicuh to represent. In ~If the secottd PP &amp;quot;on tile table&amp;quot; modifies the NP &amp;quot;the block,&amp;quot; the first PP &amp;quot;on tim riot,r&amp;quot; ha.s no legal modifiee~. ACRES DE COLING-92, Nam'l~s, 23-28 AOUX' 1992 1 1 6 3 I'koc. OV COLING-92, NANTES, AU(L 23-28, 1992 JAUNT, inequalities and set operations are built-in, and user-defined predicates are also allowed.</Paragraph>
  </Section>
  <Section position="4" start_page="0" end_page="0" type="metho">
    <SectionTitle>
3. Constraint-satisfaction algorithm
</SectionTitle>
    <Paragraph position="0"> Since every disjunction in a JAUNT program has a finite number of choices, its satisfiability problem can be formulated as a constraint satisfaction problem over a finite donlain (sometimes called a consi.~tenl-htbeling problem (Men\[snarl 1974)). Much effort has been de voted to developing efficient algorithms for this prob lea.</Paragraph>
    <Paragraph position="1"> Two such algorithms are employed in JAUNT. Ore, is the constrainl propagation algorithm (Mackworth 1977), which is activated when a new constraint is added by addc statements. The constraint propagation algorithm runs in polynomial time, and eliminates locally inconsistent vMues from the choice points and propagates the results to the neighboring constraints.</Paragraph>
    <Paragraph position="2"> The constraint propagation algorithm usually reduces the size of the search space significantly.</Paragraph>
    <Paragraph position="3"> The other algorithm used in JAUNT is the forward-checking Mgorithm (Haralick &amp;Elliott 1980), which is triggered by the execution of a special find stat~ meat. It is essentially a back-tracking algorithm, but it prunes unpromising branches whenever temporal choices are made, thus significantly reducing the size of the remaining search space.</Paragraph>
    <Paragraph position="4"> This section describes in detail the constraint propagation Mgorithm used in JAUNT. Re'0ders are referred to Hentenryck (1989) for the forward-checking algorlthm. null</Paragraph>
    <Section position="1" start_page="0" end_page="0" type="sub_section">
      <SectionTitle>
3.1 Internal representation of constraints
</SectionTitle>
      <Paragraph position="0"> Bob)re describing the Mgorithm in detail, let us explain the internal representation of the constraints, hi a compiled \[nodule of a JAUNT program, a disjunction is represented by a data structure called a Choice Point (CP). A CP maintains a list of ptJssible values (called a domain) at the time of program execution.</Paragraph>
      <Paragraph position="1"> When a new constraint is added by a addc statement, the constraint is represented internally ms a conslrrint ms\[lisa For example~ assume that W is bound to \[gr={~loc ,posttaod~}, mod--{Y,0, lY,}\].</Paragraph>
      <Paragraph position="2"> W?gr and W?mod are represented internMly a.~ CPs whose domain size is two. Then, when the constraints (2) are evaluated, a new two-dlmensional constraint matrix is created between the two CPs, as shown in  Each dimension of the constraint matrix corresponds to a CP. The elements indicate whether the particnlar combination of the CP vMues is legal (1) or illegal (0). For example, W?gr=loc and W?mod=O satisfies the constraint and hence the corresponding element in the matrix is 1.</Paragraph>
      <Paragraph position="3"> If another adds statement is then executed declaring that ttm value combination of W?gr=pontraod and W?mod=l is illegal, the corresponding element in the matrix is changed to 0, yielding the matrix shown in  Suppose that the executioll of art adde statement referring to 7t different CPs XhX2,...,Xn reveals that the value combhtation &lt; xl,x~,...:x,~ &gt; is illegal. JAUNT first locates an n dimensional con straint matrix connected to X1,X2,...,X=, and set its element corresponding to the value combination &lt; xi, x2, ..., x,~ &gt; to 0. If there is no such constraint matrix, JAUNT creates a new one whose elements are all 1 except for the element of &lt; xl,x~,...~x,, &gt; that is set to 0.</Paragraph>
    </Section>
    <Section position="2" start_page="0" end_page="0" type="sub_section">
      <SectionTitle>
8.2 Constraint propagation
</SectionTitle>
      <Paragraph position="0"> The ba.sie idea of constraint propagation is to re mow~' locally inconsistent values from the, choice points and to reduce their domain size before a back tracking search is performed.</Paragraph>
      <Paragraph position="1"> \[n the example ~d)ove, let us consider the row af W?gr=postraod in the constrah\[t matrix. When i~?gr=postmod~ the elements of the matrix are zero, whatew~r value W?mod Lakes. This means that there are no glnbal solutions with W?gr=postmod, and therefore this value can be safely removed fronl tim domain of the CP W?gr. Similarly, Id?rnod=l ca.n be removed from the domain of the CP W?raod.</Paragraph>
      <Paragraph position="2"> In general, when a particular row or column (or plane or hyperplaue, if the dimension is greater th~n two) contains all zero elements, the corresponding vMne zl of CP X can never participate in ~ solutimt (see Figure 5). Therefore, a'i can be eliminated frmn the domaitt of X. Whenever a constraint matrix is updated~ JAUNT searches for a. hyperplane whose ele~ ale/Its are all zero aud relnoves the corresponding v~thle from its domain. This may updrrte other constraint matrices conllected to l he C.P~ and may cause rabies in other CPs to be elhninated. Thus, updates are propagated ow~r the network of constraints until the entire network reaches a stable state.</Paragraph>
      <Paragraph position="3"> For every hyperplane in a constraint matrix, JAUNT ACTUS DE COLING-92. NAbrrES, 23-28 AO(Yr 1992 1 l 6 4 PROC. OF COLING-92, NANIES, AUG. 23-28, 1992</Paragraph>
      <Paragraph position="5"/>
      <Paragraph position="7"> keeps the current number of t's on that plane, called the support (see Figure 6). When a certain element of a constraint matrix apl)ears to be inconsistent a,s a result of the evaluation of addc statement, the curre sponding support in each dimensiun is decremented.</Paragraph>
      <Paragraph position="8"> When a value in a CP is removed by constraint prop station, the carrespondlng hyperplaue of every con straint matrix connected to the (11' is removed, attd the result is reflected i~( all tt~e support values in the matrix. This algorithm is a uatnral extension of Mohr and Henderson's arc-c(msistency algorithm (Mohr &amp; Henderson 1986) for allowing n:ary constraints.</Paragraph>
      <Paragraph position="9"> The cmnputathmal complexity of our constralut propagation algorithm is hounded by O(eIMD, where IMI is the siz,~ of the constraint matrices and e is the number of the cunstraint matrices, becattse at lemst oue element in st)me matrix is changed to 0 from I for every iteration of constraint propagatiom If the con str~ints are Iocal~ that is, if the arity of each ennstraint is bounded by a small integer, this time bound is a polynomial of the number of disjunctions.</Paragraph>
      <Paragraph position="10"> Our algorlthnr tries to maintain h,cal consistency ix( the sense that it runs(tiers only one eonstr+dnt ma trix at. +t time. This is a generalization of the notion called am consistency (Mackworth 1977) or pair-wise cousisteucy, and is equlva\]ent tn the flrst two steps of Ka.sper's (1987) successive aptnvximatimL Algo rithms for achieving more global consistency by look: ins at mnltlple constraint matrices are possibh+, but as Carter (1990) argues in his paper on the experimen.</Paragraph>
      <Paragraph position="11"> tal Propane parser, once pair-wise consistencies have been achieved, peffurnling a backtrack search is usually more efficient than using higher-level consistency algorithms. In JAUNT, a forward:checking algorithm, which is far better than the traditional backtracking algorithms (Haraliek &amp; Elliot 1980), is provided for generating global solutions, if necessary, although th\[~ intended use of JAUNT is to combine constraint prop agation with the recta-inference described in the uexl section~ rather than t() perfornl a search.</Paragraph>
      <Paragraph position="12"> There hay,' t~en attl!nlpts to formulate natural language pro,:essing as a cunstraiut satisfaction problenl with broader don~ains (fl~r example, the Herbraud domain). CIL (Mukai 1988) and cu=Pmh)g (Tsuda, ltasida &amp; Sirai 1(.189) are examples of such atteuipts.</Paragraph>
      <Paragraph position="13"> There is a trade-off between the expressive power and the COmlmtatiunal complexity, aml we argue that linite donlaius have sutticient expressive power while retain: ing the couqmtational eflicieucy implied by the algu+ rithms described above.</Paragraph>
    </Section>
  </Section>
  <Section position="5" start_page="0" end_page="0" type="metho">
    <SectionTitle>
4. Meta:inference
</SectionTitle>
    <Paragraph position="0"> A cunsiM,(~nt-\]at)elliLg pr()bl(!nl uLay or nlay not have a solution. If it ha_s role, it is most probable that there are multiple solutions. In fax:l, it+ the glven constraints are lint 'tight' enough tu narrow down the uumber uf sohltitlliS to (Hit? or a few! the prohhml Ill~l,y have an eX ponential number of solntions. This situatlon is common hL natural language processing. Strict grammars canse analysis failures for grammatical sentelt{;(~s~ i)n the other hand, lnose graulmars pruduce a combinato rially explosive number oF parse trees fin' certain types of sentence. 'lb avoid this situation, cnnstrahits shouhl be dynamically added aud remuw~d according to the size of the stdul.iut( space+ hi uther words, a constraint solver shunhl tm provided with a means of watching its own infl~rence process and changing its strategy ac= cord(us to tim observati&lt;m.</Paragraph>
    <Paragraph position="1"> To set&gt;purl the metaAnference capability~ JAUNT provides the following built in functions:  1. incousistentp() ... Non-NULl, wilen JAUNT detects incDltslstencies bf~tweell coltstraints 2. saveS(ate() ... Save the current status of constraint sat(slant(us g. loadState() ... lLestore the saved status of con-.</Paragraph>
    <Paragraph position="2">  straint satisfactiuu.</Paragraph>
    <Paragraph position="3"> lu JAUN'I'~ tire state of the constraint:satisfaction process is deJined ms the set of all choice points and all cosstraint nlatrices. Oth,~r statuses bUlC\]I mS global aud local variables, the prograln couuter+ ;utd the coutrol stack are lint saved I sn applications (If cmlstraints nan be uadone without distilrbing the c&lt;uttrul ll.w.</Paragraph>
    <Paragraph position="4"> Meta inh~renc,~ is nonletlntes perh~rmed in an exter hal nlodule. JAUNT has interqm)cess crassus(cation primitiw~s hmm~d on UNIX so&lt;:kets. With these met;uinference capalfilities, an independent inference process timing ext.ernal knuwh~dge can tilt)ill(or and iIiterveneln a JAUNT progra.nt. If it detects an inconsistency, it instructs the JAUNT i&gt;rogram to go b.'u:k tu the previous inferenc\[~ state and try another set of constraints; if it finds thai the solution spa~:e is not small enough+ it may giw~ new constraints from its own knowledge source. By separating the rneta-inference module from tile object-level JAUNT program, modularity \[)f knowledge is ;whieved.</Paragraph>
    <Paragraph position="5"> As an application &lt;&gt;f the meta:inference capability, let us describe the interactive Japanesp parser of the Japanese t&lt;FEnglish m;u:hine translation system JETS (Maruyama, Watanabe, &amp; Oginn 1990). The systmn structure is shown in Figure 7. Tim morphological A(:rl~s BE COL1NG+92, NANTES, 23-28 ^O(JT 1992 1 1 6 S Pron. oi: COLING-92, Nm,n'zs, AUG. 23-28, 1992  analyzer analyzes an input sentence using a type-3 grarl'lln&amp;r and creates a feature structure that COlltalus disjunctions for lexieal and attachment ambiguities (Figure 8). The syntactic analysis program written in JAUNT applies grammatical constraints based on Constraint Dependency Grammar to these choice points and sends the result to a user-interlace running on a separate machine. The amblguons choice points (those with domain size&gt; 1) are highlighted on the screen, and the end user can select an appropriate value for some of them. This information is sent back to the JAUNT program through the inter-process communication channel and applied in the form of new constraints. This iteration is written in JAUNT as fed-</Paragraph>
    <Paragraph position="7"> Thus, h* JETS, the end nser acts as an external know\[ edge source to guide the inference process of the program. null SHALT2, an experimental English-to-Japanese machine translation system currently being developed at IBM's Tokyo Research Laboratory, has a similar system structure (Nagao 1990). Instead of user interaction, an external example ba.~e built from an existing corpus is used for resolving attachment amblguities in SHAUF2. Thus, clear modularization of general syntactic/semantic knowledge from domain-dependent example-based knowledge is achieved.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML