File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/93/h93-1076_intro.xml
Size: 3,609 bytes
Last Modified: 2025-10-06 14:05:29
<?xml version="1.0" standalone="yes"?> <Paper uid="H93-1076"> <Title>Speech and Text-Image Processing in Documents</Title> <Section position="3" start_page="0" end_page="376" type="intro"> <SectionTitle> 2. TEXT IMAGE EDITING: IMAGE EMACS </SectionTitle> <Paragraph position="0"> Image EMACS is an editor for scanned documents in which the inputs and outputs are binary text images\[I\]. The primary document representation in Image EMACS is a set of image elements extracted from scanned text through simple geometrical analysis. These elements consist of groupings of connected components (i.e., connected regions of black pixels)\[2\] that correspond roughly to character images. Editing is performed via operations on the connected components, using editing commands patterned after the text editor EMACS \[11\].</Paragraph> <Paragraph position="1"> Image EMACS supports two classes of editing operations.</Paragraph> <Paragraph position="2"> The first class is based on viewing text as a linear sequence of characters, defined in terms of connected components. Traditional &quot;cut and paste&quot; functionality is enabled by a selection of insertion and deletion commands (e.g., delete-character, kill-line, yank region). As with text, editing is typically performed in the vicinity of a cursor, and operations to adjust cursor position are provided (e.g., forward-word, end-of-buffer). Characters can also be inserted by normal typing. This is accomplished by binding keyboard keys to character bitmaps from a stored font or, alternatively, to user-selected character images in a scanned document. Correlation-based matching of the character image bound to a given key against successive connected-component groupings allows for image-based character search.</Paragraph> <Paragraph position="3"> The second class of operations supported by Image EMACS is based on viewing text as a two-dimensional arrangement of glypbs on an image plane\[l\]. These operations provide typographic functionality, such as horizontal and vertical character placement, interword spacing, vertical line spacing, indentation, centering and line justification. Placement of adjacent characters is accomplished using font metrics estimated for each character directly from the image\[5\]. These metrics allow for typographically acx.eptable character spacing, ineluding character overlap where appropriate.</Paragraph> <Paragraph position="4"> Taken together, Image EMACS commands are intended to convey the impression that the user is editing a text-based document. In actuality, the system is manipulating image components rather than character codes. Moreover, while the user is free to assign character labels to specific image components, editing, including both insertion and search, is accomplished without explicit knowledge of character identity. This approach enables interactive text-image editing and reproduction, independent of font or writing system.</Paragraph> <Paragraph position="5"> Figures 1 and 2 show an example of a scanned mnltilingual document before and after editing with Image EMACS. The example demonstrates the results of image-based insertion, deletion, substitution and justification, as well as intermingling of text in several writing systems and languages (paragapbs 4 through 7). 2 Such capabilities are potentially achievable using a format-conversion paradigm; however, this would require more sophisticated OCR functionality than currently exists, as well as access to fonts and stylistic information used in rendering the original document.</Paragraph> </Section> class="xml-element"></Paper>