| Style Sheets: I18N aspects | Table of contents | Indexes | Managing information networks with Topic Maps | |||
Development of SGML/XML Middleware Component |
|
Kunio Ohno |
| Chief Scientist |
| INS Engineering Corporation 4-31-18 Nishi-Gotanda, Shinagawa-Ku Tokyo Japan 141 Phone: +81 3 3490 6104 Fax: +81 3 3490 6155 Email: ohno@inse.co.jp Web: www.inse.co.jp |
Biographical notice: |
Kunio Ohno |
| Approach Incorporated Beyer, Mortren Japan ![]() Tokyo ![]() |
Kunio Ohno is The Chief Scientist of INS Engineering (INS-E) Corporation. He is engaged in the research and development of distributed multimedia systems at INS-E's Business Development Headquarters. His interests include agent technology and distributed object. He is also a member of OMG Japan SIG. Prior to joining INS Engineering in 1994, Kunio worked for NTT and its Human Interface Laboratories. |
Mortren Beyer |
| System Engineer |
| Approach Incorporated Takahashi Bldg. 3F, 3-2-7 Azabudai, Minato-Ku Tokyo Japan 106-0041 Phone: +81 3 5572 6152 Fax: +81 3 5572 6153 Email: morten@approach.co.jp Web: www.approach.co.jp |
Biographical notice: |
Morten Beyer |
Morten Beyer is a system engineer with Approach Incorporated, a Japanese company specializing in SGML system integrations and consulting services. |
ABSTRACT: |
API, Application Programming Interface ![]() Business applications CORBA IDL Custom API Document Parsing Element API Middleware ![]() Parse Relational database ![]() Security ![]() Source Document API Three-tier client/server system Version Control XML DOM client/server system |
SGML/XML document management architecture based on relational database has been developed. The core of the system is implemented as middleware components within the three-tier client/server systems. The target of the middleware is the realization of the interfaces for business applications. |
Introduction |
|
Access control CALS ![]() Database ![]() Document Integration ISO9000 Multimedia Security ![]() Workflow ![]() document management ![]() document management environment workflow management |
About five years ago, office workers in Japan wrote documents using Ichitaro or OASYS. Ichitaro is a word processor package software, which runs on NEC's PC9800 while OASYS is a dedicated word processor hardware. At that time, Japanese organizations' document management practices were mainly centered around the traditional paper based approach with a majority of the business critical information existing primarily in paper form. After this, the document management environment has gradually changed. The change was partly caused by the organizational requirement for restructuring and the introduction of CALS and ISO9000 . Another factor that should be mentioned as a driving force behind this change is the continued proliferation of PC's and the improvement seen in their networking capabilities based on the Internet and LANs. Rapid development of computer technology and the shift in business information management and maintenance requirements have affected the organizational information systems in Japan. Some of the requirements are as follows: |
Focus on Industrial Challanges |
|
Market Needs for Document Management in Japan |
|
| De-facto standard Departmental solutions |
The key to open and flexible document management is seen in the fundamental architecture of the document management effort. As can be seen from various departmental solutions and practices within an organization, no de-facto standard exists today. |
Technological Possibilities |
|
API for Documents |
|
| Compound Document Distributed Networks OMG ![]() OpenDoc W3C ![]() |
OMG's compound document standard [1] defines a set of API's to manipulate component of hierarchically structured documents. We doubt, however that this will become popular since it is based on Apple's OpenDoc. Microsoft's OLE, which was a rival of OpenDoc has only functions for documents to link and embed applications, and does not have expandability for distributed networks and other systems except windows. In short, a satisfactory architecture does not exist today. Even so, the W3C's XML (described later) holds a promising future. |
Three Tier Client/Server System |
|
DTD, Document Type Definition ![]() LISP Object Oriented Database Three Tier Client/Server System Transaction Processing system middleware ![]() |
When incorporating a database system and a transaction processing system as part the document management framework with the back bone being a client/server system, it is important to provide an API not only for the management of documents and business work-flow, but also on a broader business application level. SGML is not simply a framework to define document structure, but a framework for API of business applications. From another point of view, document information is nothing but a display of printed rendition of hierarchically arranged elements as defined with a DTD. |
Three-tier Client/Server Network |
![]() |
Distributed System |
|
CORBA, Common Object Request Broker Architecture ![]() DataCartridge ![]() Distributed System Middleware ![]() OMG ![]() OODB ![]() OQL RDB ![]() SQL ![]() heterogeneous environment |
In cases where system operation is required in distributed network environments, the key to realize this will actually be to use a system using CORBA of OMG [2]. The CORBA standard is an interface specification when producing client programs that provides interoperability between objects in a heterogeneous distributed environment. to, and does not define actual installation of a server. For the above three-tier system, a function that references the RDB is required. A Query function is defined in the CORBA service [3], but it is based on a query to OODBs (OQL), and is not adequate for existing RDBs. Therefore, hybrid API fusing CORBA and SQL should be considered. Oracle calls such middleware "DataCartridge [4]. |
Object Relational Approach for Multimedia management |
|
| Datablade Illustra Multimedia management SQL3 |
To integrate middleware and RDB, an object relational approach such as SQL3 is also possible. This approach enables direct query to multimedia. ORDB system can define new data types with its API functions like class definition with its methods in object oriented languages. |
Our Solution |
|
SGML Datablade |
|
Architecture |
|
| Illustra Datablade |
We have tried to develop a document management system that handles multimedia as its element on Illustra. This system is implemented as Illustra Datablade. SGML, which is a language to describe logical structure of a document, is applied as the framework of the Datablade. The system architecture is shown in Figure 2. |
SGML Datablade |
![]() |
Basic Features |
|
The basic features can be summarized as follows: |
At present, the server is implemented on Solaris environment with clients of Windows 95/NT installed with Netscape. |
Characteristics |
|
An Illustra system using this Datablade will have the following advantages compared with conventional SGML databases: |
SGML Data Cartridge |
|
Market Reaction |
|
System Architecture and difference from Datablade |
|
Fig 3. Shows the system architecture of the SGML DataCartridge. The following functions were changed at the time of transition from the Datablade to the DataCartridge: |
SGML/XML DataCartridge |
![]() |
Data Cartridge APIs |
|
DataCartridge ![]() |
The APIs of the data cartridge are classified into three groups as shown in Fig.4. The first group - "Source Document API" - manages the SGML/XML document instances directly. The second group - "Element API" - manages the parsed element of the document instance. Navigation of the SGML/XML document and selection, control, and information update of the element is made through this API. The third API is "Custom API" which mainly depends on a certain application with related DTDs, and extends and customizes the functions of the various business applications that may include access control, security, workflow, etc. |
Source API |
|
| Source API |
The Source API's function is to store the SGML instance source files in the database. The source file can be parsed to an element tree or a set of element trees according to its appropriate SGML declaration and DTD. |
APIs of DataCartridge |
![]() |
Element API |
|
The deletion of an element is made by giving the value of DEL attribute. |
Custom API |
|
Vertical Industrial API |
|
The Custom API is very important and useful when we want to develop dedicated document databases for certain vertical industries. The conceptual architectural is shown in Fig.5. |
Architecture of Custom APIs for Vertical Industries |
![]() |
We have already developed a prototype API for the medical industry and the consumer electronics Industry. The Custom API can be something like plug-ins for creating certain database packages. |
Next Development |
|
CORBA based API |
|
XML |
|
After the announcement of the W3C's XML1.0 Recommendation, XML is being given much attention in Japan. When we look at XML, we are most interested in DOM (Document Object Model)[7]. |
Conceptual Architecture based on CORBA IDL |
![]() |
The specification of this model were described in W3C's home page, and found to be very close to our middleware concept in terms of forming API to document. |
We have planned to implement those functions as a Custom API with an XML Parser, but the specification on XSL and XLL is not clear. Another approach to develop a new cartridges is under way. |
Discussion |
|
Response of Customers |
|
Market and Technological Trend |
|
To evaluate the future market and the technology around the DataCartridge, three key fields seem to be important: Network, Digital documents, and Object Technology. |
Network System |
|
Digital Document |
|
Object Technology |
|
Relationship of Integrated Information Techologies in Future |
![]() |
Potential of SGML/XML DataCartridge |
|
Conclusions |
|
Acknowledgments |
| Style Sheets: I18N aspects | Table of contents | Indexes | Managing information networks with Topic Maps | |||