| Table of contents | Indexes | XML and Topic Maps | ||||
Case Study: Boeing Intelligent Graphics for Airplane Operations and Maintenance |
Robinson, Molly L. ![]() Seattle ![]() The Boeing Company ![]() Washington ![]() | Molly L.
Robinson
Advanced Computing Technologist, The Boeing Company
Biographical notice Molly Robinson is part of a team that is researching large-scale airplane maintenance systems. The maintenance systems integrate leading edge electronic capabilities such as: intelligent graphics, intelligent navigation, multimedia and visualization. Her specific areas of expertise include: intelligent graphics recognition, digital data delivery, object-oriented programming and the Standard Generalized Markup Language. Molly is currently working with Boeing Commercial Airplane Group and the B52 and AWACS programs to develop production Intelligent Graphic systems for deployment in 1999. She is also leading a software project that is building an internet based retrieval system for airplane part and engineering data. |
Baum, Larry S. Seattle ![]() The Boeing Company ![]() Washington ![]() | Larry S.
Baum
Associate Technical Fellow, The Boeing Company
Biographical notice Larry Baum's current research involves the development of image recognition technology for converting wiring diagrams, schematics and other engineering drawings into intelligent graphics. He is part of a team developing large-scale airplane information systems, leading to products such as Boeing's Portable Maintenance Aid. His specialties include intelligent graphics, structured authoring systems, digital data delivery, and distributed artificial intelligence. Larry was the system architect for the Boeing Standards Knowledge and Delivery System, a structured authoring system for Boeing's electrical standards documents. He has published numerous book chapters, journal articles, conference papers, and technical reports. |
Boose, John H. ![]() Seattle ![]() The Boeing Company ![]() Washington ![]() | John H.
Boose
Technical Fellow, The Boeing Company
Biographical notice John Boose's current work includes projects to acquire knowledge from vector and raster engineering drawings and other airplane documentation using knowledge-based and image processing techniques. He is part of a team that has built a hyper-linked airplane manual system that contains more than four million pages. The results appear in Boeing digital data products such as the Portable Maintenance Aid. His specialties include knowledge acquisition (from both human experts and graphics), intelligent graphics, digital delivery of Boeing airplane and engineering information, knowledge-based systems, group decision support, and corporate knowledge capture and reuse. John introduced new knowledge acquisition technology that reduces the development and delivery time of knowledge-based systems. These tools contributed to systems used in Boeing and NASA. He contributed to NASA's Design Knowledge Capture Technical Plan for the space station. He has published over 160 book chapters, journal articles, conference papers, and technical reports, including a monograph on knowledge acquisition. |
Seattle ![]() Shema, David B. ![]() The Boeing Company ![]() Washington ![]() | David B.
Shema
Associate Technical Fellow, The Boeing Company
Biographical notice Dave Shema is currently working in the areas of raster and vector graphics symbol recognition as applied to acquiring knowledge from engineering drawings. He is also involved in the production of SGML and XML autotaggers for graphics and for text, with an interest in Quality Checking tools that are built into the autotaggers. Results appear in a number of Boeing digital data products, including the Portable Maintenance Aid and military Interactive Electronic Technical Manuals (IETMs). Dave has written a number of articles and papers on knowledge Acquisition and graphics recognition. |
Chew, Susan C. ![]() Seattle ![]() The Boeing Company ![]() Washington ![]() | Susan C.
Chew
Advanced Computing Technologist, The Boeing Company
Biographical notice Susan Chew is part of the team that is developing an internet-based retrieve system for airplane part and engineering data. She is doing work with intelligent graphics and with the databases behind the graphics. She is knowledgeable on SGML and DTD development, having built tools to automatically generate autotagger programs. Susan's work is being applied to Boeing's digital data products, both on the commercial and military side. |
Introduction |
| The Boeing Company maintains tens of millions of pages of information associated with the manufacture and delivery of its products. Much of this information must be made available electronically. We have developed tools to automatically convert and integrate electronic data into industry standard formats. Some of the technical challenges include |
| Our system contains over four million pages of text including tens of thousands of graphics. |
| In this paper we describe tools that recognize and use information within airplane-related vector and raster images. Such images include troubleshooting charts, fault reporting diagrams, component location diagrams, component index tables, wiring diagrams, system schematics, parts illustrations, standards tables, and structural and tooling drawings. Each airplane requires conversion of over 20,000 graphics, including over 900,000 pieces of cross-referenced information. We are also exploring visual information retrieval strategies, including content-based and similarity-based methods for both vector and raster graphics. |
Four Million Pages and Growing |
| The Boeing Company is committed to Air Transport Association Specification 2100 [1] which requires the use of the Standard Generalized Markup Language (SGML) [2]. To comply with these specifications Boeing has converted millions of document pages to SGML. Document sources include commercial tools (e.g., Microsoft Word, InterLeaf, FrameMaker), internal proprietary tools, and data from external vendors. |
| Our team built a series of text autotaggers that automatically convert documents into SGML. From 1990 to the present, we developed autotaggers to create a testbed hypermedia system containing data from over four million pages. |
Graphic Navigation and Functionality |
| The autotaggers receive text in various formats. They receive graphics in vector formats (usually CGM) and raster formats (usually TIFF) and originally did little more than pass the graphics to viewers in DynaText. The graphics lacked information for navigation and retrieval. Users often needed to examine many images to find the right one, including multiple panning and zooming operations. Hyperlinking from a text reference in a graphic was not possible, nor was full text search. The graphics also lacked useful functional information -- What is the logic in this troubleshooting diagram? What are the relationships between items in this table? What lines in this wiring diagram represent continuous circuits? |
| We built a series of graphic recognition tools that use vector and raster graphics to produce navigational and functional information represented in SGML. We then integrated this information into the testbed. |
| Because each recognizer is unique and depends upon knowledge about how the diagrams are constructed and the constraints they should comply with, each involves algorithms specific to that class of drawings. Thus, the algorithms themselves are not generalizable. However, the methodology has proven successful over a large variety of drawing classes. As a result, we have enabled mechanics and other users of the data to quickly and accurately navigate through and more easily access the technical information embedded in the tens of thousands of illustrations in an on-line document set. |
| For example, suppose that there is a problem in the air conditioning system in the crew rest area of a 757. The mechanic uses the system to go directly from the fault code for that problem to the fault isolation procedure in the Fault Isolation Manual, via a text hyperlink. The procedure references a four-sheet figure containing a fault isolation decision tree. By clicking on hotspots in the graphic, he quickly gets to the corrective action, which is a removal and installation procedure in the Airplane Maintenance Manual. There is a hotspot on that reference, so he quickly navigates to the correct location in the Maintenance Manual. The procedure requires inspection of a wiring diagram, found in the Wiring Diagram Manual. There is a hyperlink in the Maintenance Manual which takes the mechanic to the correct diagram. Because the graphic viewer traces electrical circuits in the diagram, the mechanic can readily determine the affected components and the associated pin numbers so that removal/installation is done properly. The desired component is not in stock, however, so the mechanic needs to consult the Illustrated Parts Catalog. The component number in the wiring diagram also has a hotspot, so he can go directly from the Wiring Diagram Manual to the desired location in the Illustrated Parts Catalog. The illustrations there are also populated with hotspots so that the mechanic can easily pull up the information as to part number and supplier, so that the part can be obtained as quickly as possible. |
| This scenario illustrates the enormous benefit of using graphic recognition tools to automatically populate our large data sets with the hundreds of thousands of hotspots. Typically the graphics alone from a single set of manuals contain over 900,000 reference links. |
Graphics Recognition Tools |
Figure 1. The fault diagram recognizer finds arrows, diamonds, and other information, then uses knowledge about diagram layout to produce the application's flow of logic.
|
Figure 2. The component location recognizer uses page layout logic to induce groups and attach callouts.
|
| We used a similar approach for each graphic recognition problem: |
| Fault reporting diagrams are vector images that pilots use to determine what messages to record in the flight logbook when a malfunction occurs. We have been able to accurately determine the decision networks in these diagrams and generate over 20,000 hot spots in over 1,600 diagrams (Figure 1). To illustrate our methodology, we will discuss the processing of these drawings. |
| The 2 diagrams in our example are very different so we construct an individual recognizer for each class of drawing we analyze, using heuristic knowledge of how that particular class is constructed. Thus the recognizer for fault reporting diagrams is a different application than that for component location diagrams (Figure 2.) To illustrate our approach, we will outline the basic steps in the recognition of fault reporting diagrams. |
| Each diagram consists of header art, below which is a series of questions. Beneath each question are the various responses for that question. These answers are connected through a series of horizontal arrows and decision diamonds to comprise a decision tree. A path through the diagram will ultimately lead to a fault code which should be entered in the flight logbook. |
| We exploit the power of SGML by developing a Document Type Definition (DTD) which reflects the intelligence we are recognizing in the graphics. Because fault reporting diagrams are graphical decision trees, the DTD specifies objects such as decisions, questions, and choices; and since the goal is to enable interactive use of the illustrations, the DTD also specifies hotspots: |
Figure 3. Intelligent Graphics DTD
|
| Each decision is linked to the question above it via qid and consists of one or more choices. Each choice links (via refoid) to another decision or an action. All of these have a set of hotspots (i.e. an hslist). Typically an hslist will have a single hotspot in it but occasionally there is a need for multiple disconnected regions for a decision, choice or action. Hence the need for the hslist element. |
| For the Fault Diagram in Figure 1, the resulting markup includes: |
Figure 4. Intelligent Graphics Markup
|
| The recognizer's task, then, is to analyze the graphical primitives in the CGM file to produce such markup. It performs these steps in building a complete and accurate internal representation of the decision tree: |
| Using this recognizer, we are able to convert a complete set of drawings for a Fault Reporting Manual into different styles of interactive electronic books. For example, using one style, the system presents the user with each question on a separate screen and, depending on the response, will present the next set of question and responses in the decision tree. Another presents the original graphic with hot spots that are automatically generated. The user can click on the appropriate response and the system will automatically zoom in on the next question/response area. |
| By devising additional heuristics and recognition algorithms, we have adapted this methodology for many other drawing classes. |
| Fault trees are vector drawings that provide mechanics with decision trees for fault isolation. One drawing can consist of as many as 30 individual sheets with intersheet connections linked by references. Our software analyzes the layout of the boxes and the arrows connecting them and builds an internal representation of the decision tree. It then generates hot spots so that users can easily navigate within a sheet and among sheets as well as follow links to other information. This manual set contains over 750 charts containing over 16,000 hot spots. |
| Component location diagrams provide exploded views of aircraft and their components. Typically one diagram consists of multiple sheets, each of which has one or more subpictures (insets) with internal callout references. Our software must reliably subdivide each image into subpictures and generate the hot spots that link the callouts to the correct subpicture, including references to other sheets. In addition, the software generates hot spots around each equipment number, so that mechanics can easily navigate from the picture to information about that piece of equipment. This manual set contains over 650 vector component location diagrams; the system produced over 3200 subpictures containing over 25,000 hot spots (Figure 2). |
| (This recognizer also populates a graphics database with all the subpictures. This database provides the testbed for the visual information retrieval research mentioned at the end of this paper.) |
| Component index tables are vector drawings that tell the mechanic where to go to find the maintenance procedures for the equipment illustrated in the component location drawings. The index table recognizer determines the individual table cells and relationships among cell contents. It generates over 14,000 hot spots in over 300 tables linking the drawings to component location diagrams, other component index tables and to maintenance procedures. |
| Wiring diagrams are vector drawings extracted from computer-aided design data sets. The intelligent graphics software analyzes both the vector image and the data set to determine how wires are laid out in the drawings so that circuit tracing is enabled in the hypermedia viewer. In addition, there are hot spots providing hyperlinks to equipment lists and wire lists as well as other diagrams. This manual set contains over 1400 diagrams with over 135,000 hot spots (Figures 3, 4). |
Figure 5. A wiring diagram recognizer finds connected circuits and links references together.
|
Figure 6. Users follow hotlinks to retrieve equipment information.
|
| System Schematics are similar in functionality and recognition requirements to wiring diagrams. This manual set contains over 1300 system schematics with over 461,000 hot spots. |
| Parts illustrations are raster images that provide exploded views of the airplane much like component location diagrams. They contain callouts and item numbers that must be linked to other pages. Because they are raster images we must perform more complex analysis than with the vector images. We have developed routines to quickly process callouts and item numbers with high accuracy. This manual set contains over 13,000 sheets with more than 241,000 hot spots (Figure 7). |
Figure 7. A raster parts illustration recognizer finds callouts and reference numbers, using symbol finding and optical character recognition.
|
| Standards tables contain information about Boeing engineering practices. The sources for many of these tables are paper copies that were several generations old, made with typewriters and ruled with ball point pens. The conversion team scanned these images for on-line use and was planning to manually re-enter the information. We developed recognition software to extract the logic in the tables. Together with a commercial optical character recognition package we automatically converted over 6600 tables to InterLeaf format (Figures 8, 9). Later these will also be converted to SGML. |
Figure 8. This original standards table was made using a typewriter and ball-point pen.
|
Figure 9. After scanning the table in Figure 8, the table recognizer automatically finds cells, performs character recognition, then produces standard mark-up files.
|
Evaluating Recognition Results |
| To evaluate the results of text tagging our team used commercial SGML verifiers and specialized custom systems. We developed tools that analyze the text output for consistency and completeness, including statistical measures and link verification. Often these tools find authoring errors as well as tagging errors. |
Figure 10. Custom graphics analysis tool.
|
| It is more difficult to build such tools to check graphic recognition accuracy. We built a custom graphic analysis tool that simultaneously shows recognized objects and the corresponding SGML (Figure 8). For example, if the recognizer finds a reference, the viewer displays the hot spot along with the information as to what type of reference it is and to what it refers. Using this viewer we manually sampled thousands of images to assess recognition accuracy. Recognition accuracy ranged from about 97% for component location graphics to 99.9% for troubleshooting charts. These results exceeded our performance goals. |
| We can also find some types of graphics authoring errors. For example we can automatically detect incorrect references that point to non-existent locations. Previously, such errors could only be found by manual inspection. |
Future Challenges - Work in Progress |
| We are currently working with Boeing's library of millions of structural and tooling raster drawings. These drawings have been scanned from aperture cards and placed on-line. We would like to recognize objects such as title blocks, part numbers, dimension lines, materials lists, and parts tables. Much of the text is hand written and many of the scanned images are skewed or noisy, making this work very challenging (Figure 11). |
Figure 11. Legacy raster structural and tooling drawings
|
| We are also working on visual information retrieval strategies, including content-based and similarity-based methods (Figure 12). Our prototype visual search engine finds similar vector images in a database of over 6,000 drawings. |
Figure 12. Visual Information Retrieval
|
Summary |
| We have successfully added functionality to tens of thousands of vector and raster graphics, integrating them with text in on-line navigation systems, addressing the problems of scale and accuracy. We hope to continue to add functionality to graphics to enhance the overall utility of Boeing's digital data. |
References |
| [1] ATA (1995). Air Transport Association Specification 2100 - Digital Data Standards for Aircraft Support, 1995, Order code A090 |
| [2] Goldfarb, Charles F. (1990). The SGML Handbook, Oxford University Press, ISBN 0-19-853737-9 |
| Table of contents | Indexes | XML and Topic Maps | ||||