An SGML-based Office Document Exchange and Management |
|
Shy-Ming Ju |
| Professor |
| National Institute of Technology at Kaohsiung Dept. of Information Management 1 University Road, Yenchao Kaohsiung Taiwan Republic of China Phone: +886-7-6011000 ext. 4100 Fax: +886-7-6011042 Email: smju@ccms.nitk.edu.tw |
Biographical notice: |
Dr. Shy-Ming Ju |
ABSTRACT: |
| SGML-based facility specification |
A major thrust in the government of the Republic of China is to computerize the exchange and management of office documents. The Research Development and Evaluation Commission is about to promulgate a specification describing how office document exchange and management should be done. The author describes the development of the specification and a conforming SGML-based facility that could be used by an estimated 7,000 government offices. |
As a true SGML application for a real and profound case, it demonstrates the extensibility of SGML and the ease of conversion from SGML to XML. |
Introduction |
|
Therefore, RDEC can only promulgate a specification describing how office document exchange and management should be done. Software vendors can then develop their products according to this specification. A prototype office document creation and exchange facility conforming to this specification will be very helpful to demonstrate the feasibility of this specification and to highlight relevant technical issues. |
When office documents are exchanged electronically in an open systems environment, the process must meet the following conditions:
|
In essence, the above requirements mean that the specification must describe: |
SGML has the mechanism to support all of the above functionality. |
The Specification |
|
Through a project funded by RDEC , we have developed a specification based on SGML that contains:
|
In the following discussion, a particular type of document, the Meeting Notice, will be used as an example to highlight the various aspects of document exchange and management. Figure is a typical printed meeting notice that consists of the following fields: |
A typical printed meeting notice |
![]() |
| structure DTD |
The Structure DTDs |
To enhance readability and modifiability, as well as to simplify the design of style DIs, we decided to provide a separate structure DTD for each document type. The structure DTD for Meeting Notice is shown in Figure . Even though this approach facilitates application software development, it does make the maintenance of the DTDs tedious. In the next edition, we plan to use parameter entities for common constructs to reduce prominent repetitions. |
Structure DTD for meeting notices |
![]() |
Based on the structure DTD in Figure , the structure DI for the particular meeting notice in Figure is shown in Figure . Note the way an attachment is referenced with an external entity name in an "ENTITIES" attribute. The entity name is then associated with a file name in the local file system through an "ENTITY" declaration, and the file type is defined with an "NDATA" construct in the declaration. Processing of the file type is further explained with a "NOTATION" declaration. |
Structure DI for the particular meeting notice |
![]() |
| style DTD |
The Style DTD |
In closer observation we found that the styles of all office documents are made of line segments, literal strings, and invisible rectangles that are to be filled with document contents. Therefore, we have designed a style DTD as shown in Figure . |
Style DTD |
![]() |
This style DTD provides syntax for describing:
|
Based on the style DTD, a style DI for the meeting notice in Figure is shown in Figure . It should be noted that the style DTD is applicable to any type of form such as that shown in Figure . In fact, the style DTD can describe a blank form pretty accurately. However, the style DTD is not meant for sophisticated typesetting. When placing contents into a blank field, it does not have sufficient expressive power to reproduce exact appearance as that of the original document. For office document exchange, at the receiving end, machine processing of documents does not care about style, and human cognition can be satisfied if the structure and content of a document can be correctly deduced from the hints of the style. This is the case when "approximate rendering" is acceptable instead of perfect reproduction. |
A style DI for the meeting notices |
![]() |
A blank form |
![]() |
| packing scheme |
The Packing Scheme |
We have used a profile of the ISO standard, SDIF (SGML Document Interchange Format) , for packing various entities for exchange. The packing scheme is shown in Figure . Selecting a transport mechanism is beyond the scope of this specification, but TCP/IP, SMTP or HTTP will be adequate. |
The packing scheme |
![]() |
The 25 structure DTDs and style DIs will be published as public identifiers, so both the sender and receiver during document exchange will have access to them. Therefore, the structure DTD and style DI associated with an office document need not to be packed in, unless they are created by the sender. |
| conforming facilty |
A Conforming Facility |
We have developed a prototype of conforming document creation and exchange facility that consists of:
|
First-level tree structure |
![]() |
Merging and rendering process |
![]() |
An approximate rendering for the particular meeting notice |
![]() |
Conclusions |
|
The SGML-based specification allows users to define new document types and styles using the predefined tag and attribute set, thus demonstrates the exten-sibility of SGML. The style DTD can describe any layout that is constructed from line segments, literal strings and invisible rectangles for character string positioning. This means that the specification can also support electronic form exchange and management. |
The specification is a true SGML application in the sense that it completely separates document structure from document style. Because of its judicial use of SGML syntactic constructs, the specification can be easily converted to an XML-based one. If adopted by the government, it is estimated that 7,000 government offices and numerous private offices will use this specification and similar conforming facility. Therefore it is safe to say that this is one SGML application with profound consequence. |
Additional references in Chinese are listed in Figure |
Additional references in Chinese |
![]() |
Bibliography
|
| Enabling Everyday Business Applications to Work with Structured Information by using the Associative Model | Table of contents | Indexes | Defining Reusable, Distributable Information Objects Using XML-Data Schemas | |||