| Implementing a viable architecture for standardized Intelligent Graphics | Table of contents | Indexes | XML::DT - a Perl down translation module | |||
PRISMA: The new publishing process at Samsom Publishers |
| Jan-Willem de Koning |
| Information Manager |
| Samsom (Wolters Kluwer)
P.O. Box 4 2400 MA Alphen aan den Rijn The Netherlands Phone: +31 172 466358 Fax: +31 172 421426 Email: j.koning@samsom.nl Web: www.samsom.nl |
Biographical notice: |
| CMG Trade, Transport & Industry Rotterdam The Netherlands ![]() van Gool, Joris |
From 1992 Jan-Willem has had various managerial functions at Samsom (and subsidiaries) all aimed at realizing media neutral databases. In 1998 Jan-Willem became Information Manager for Samsom; responsible for introducing new production processes based on SGML, aimed at cheaper, faster and better production to help create value for our professional customers. |
| Joris van Gool |
| SGML/OmniMark consultant |
| CMG Trade, Transport & Industry
P.O. Box 8566 3009 AN Rotterdam The Netherlands Phone: +31 10 2537000 Fax: +31 10 2537035 Email: Joris.van.Gool@cmg.nl Web: www.cmg.nl |
Biographical notice: |
Since joining CMG Trade, Transport and Industry in Rotterdam, he has mainly worked with publisher Samsom as an SGML/OmniMark consultant. |
ABSTRACT: |
We based it on the following, |
A sound neutral concept, not just "go, go SGML (Standard Generalized Markup Language) "! Our answer: market driven SGML |
A good set of standards, SGML alone is not good enough. Our answer: simple and sound attitude, aimed at process and techniques |
An organization to back it up. Our answer: A pragmatic mix of make and buy |
In this presentation I want to share our experiences with you. |
| Market driven SGML |
Market driven SGML |
Samsom |
The media we publish on are mostly magazines and loose-leafs. The last couple of years we have witnessed the coming of new product types on CD-ROM and Internet. |
The drive behind neutral storage |
| Time-to-market |
This demand for fit-for-use products is mentioned in every market survey and customer panel we do. And it is shown in the steady decline of the subscriptions on our loose-leafs. Time-to-market is a new phenomenon for us. |
|
The old process
|
|||||||||||||||||||||||||||
Our concept is as follows. |
|
The new process
|
|||||||||||||||||||||||||||
PRISMA is the project that aims to implement this new concept at Samsom. PRISMA is an acronym for PRoduction of Information at Samsom on a Media neutral basis. |
The aim of PRISMA is to fully automate the production of loose-leafs at our main supplier; AlfaBase. |
The Overall Business Process |
The use of SGML in this concept |
| Information types |
First of all we defined Information types: |
While designing the information type DTD's the following questions are kept in mind: |
Building blocks Granularity ![]() Micro Document Architecture |
Of course the granularity here is as small as you need. We call it the smallest sell-able unit. This concept gained popularity under the name of Micro Document Architecture. |
| Product-DTD |
The maintenance of our texts is based on these Information types. But of course we want sell our products, so we have to take parts of the texts stored as Information types and assemble to products. Via several standardized steps, all necessary information like Table of Contents is added and the product is composed into its final format. The steps regarding structure and adding information (automatic numbering for instance) are done at Samsom, the remaining composing steps and addition of page numbers are done at our strategic partner AlfaBase. There is a very sharp line between the tasks at the respective sides and a Product-DTD aids to enforce that separation. |
By this separation in Information types and Product-DTDs, we are able to keep our DTDs clean with the possibility to grow complex, while keeping our applications simple, sound, fast and manageable. |
A Good Set of Standards |
Time-out |
So we had to establish standards, sell our ideas to management and move ahead implementing them. In this we succeeded beginning of 1998. |
Areas for Standards |
Standardize the business process: concentrate on the common denominators, not the (few) exceptions. Build fully automated ‘conversion belts’ with 100% functionality for 60% of your publications. |
Every publication used its own stylebook, defined either by the publisher or by the printer. To create a standard process, these instructions to the printer had to be standardized. |
Stand firm for your standards! In order to keep the process fast, robust and simple, compromises need to be avoided. The process is complex enough by itself. |
Process: Introducing ‘Conversion’ Belts |
| Standard output processes |
To make the new process work and get all possible benefits from the new approach, standard output processes must be designed. In 2 ways. |
First function: a new workflow in a new organization with new roles for people. A radical change with the past: linear one-way flows with clearly defined half-products. |
After each step, working towards that particular final product, a decision about the next step can be taken. For instance, a new branch can be chosen to choose another medium, other software-package, or other stylesheet. In the long run this will make you less dependent on providers and software packages or versions. |
Design philosophies:
|
This is a radical change with the old way, no major compromises to the ‘old feelings’ based on print were allowed. |
Step 1. Define publication |
In this step, the publication is defined: which Information Types and instances from the storage will be incorporated in what order in the publication and what extra information (e.g. colofon, extra titles, TOC, numbers) will be added. |
| Configuration-file |
All this information is assembled using a configuration-file, which defines the publication and is edited by the publisher. At the same time, it removes hidden intelligence from the translation applications of step one and two. |
| Configuration-DTD |
To control the integrity of this file, and to make sure all necessary information is incorporated, we use a configuration-DTD. |
Step 2. Choose medium and product-type |
Although the order of instances has already been chosen – and this is often medium-dependent –, in this step the medium and product-type is chosen. For this, medium/publication-neutral structures are translated into publication-type specific styles, according to some Product-DTD. Also, medium-neutral structures (“<link>”) are translated into medium-specific ones (“<footnote>”, “<hyperlink>”). |
For instance, both hierarchies “<article> - <part> - <chapter>” and “<law> - <section> - <sub>” are translated into “<level1> - <level2> - <level3>” with the possible indication of the stylesheet-type that will be used. By removing information that is not needed any more, the number of styles for the formatting process is reduced considerably. |
Step 3. Choose formatting software product |
In this step, hierarchic styles (“<title> in <level1> in <part>“) are translated into non-hierarchical styles (“<title1c>”). If for the next step a formatting software product with its own styles is used, the hierarchical styles are translated into product-specific styles. |
Xyvision ![]() |
In Prisma-Folio Xyvision is used to produce Postscript-files for printing. For instance, “<title> within <level1>” is translated into “{Title1}”, “<title> within <level2>” is translated into “{Title2b}”. |
Step 4. Choose stylesheet |
Stylesheet ![]() |
For the publication, a specific stylesheet is used and the final product is composed. All stylesheets for a specific publication type use the same styles, making them interchangeable. In Prisma-Folio, Xyvision is used as composition application, so all stylesheets are proprietary Xyvision ones. |
Step 5. Xydiff |
| SGdiff Supplements Xydiff |
For folio loose-leafs, the final step is used to let Xydiff create supplements. This is done by presenting Xydiff the output from Xyvision of the current and an earlier run. Xydiff will find the differences and output only the changed leafs. |
SGML & DTDs |
| DTD Manual DTD modules |
DTD modules and the DTD Manual |
The concept of information types results in a lot of DTD's that have to be developed and managed. In order to support this, Samsom developed a set of standard DTD modules. These modules are included in every DTD. They implement standards in the area of inline elements, hyperlinking, titles, text hierarchy, etc. By using standard modules for DTD's the maintenance and development of DTD's is easier and “outsource-able”. Furthermore a DTD-manual is written in which all the standards are documented. |
Product DTDs |
From the information types within the MNO-database products have to be generated. To shorten the ‘time-to-market’ for a product, several standard product-types were defined for which standard production processes are implemented. The goal of the PRISMA project was to implement standard production processes for the annual, newsletters and loose-leaf products. For each product type a DTD was designed. This product DTD describes the structure of the specific product, thus it stores no semantic information. When information for a certain product has to be communicated from the publisher Samsom to, for example, a typesetter, the information is marked-up according to the product DTD by an automatic conversion process. |
So for all the information-block types there is a conversion to the product DTD's. And the conversion from the product DTD to the final product, for example the conversion from the loose-leaf product DTD to Xyvision, is a standard, only once developed conversion. |
Through stylebooks the layout of the products can be configured. |
Configuration DTDs |
| Publication definition |
If you want to assemble products from information types that are stored in the MNO-database, you need some format to store publication-specific information. This includes which information types to use and in what order. For instance: [“Preface; 1 May 1999”, “TOC”, “Article; About XML”]. This is your publication definition for a specific publication on a specific date. Examples of information that is written in this definition: new titles for articles, type of numbering to be generated, etceteras. |
To control the validity of the publication definition, a ‘Configuration DTD' has been developed. A publisher will write his publication definition according to this DTD, and the validated SGML-file controls the translation process: extraction from the MNO-database and generation of redundant data. |
| OmniMark |
OmniMark |
This powerful tool gave us the opportunity to get on track very fast, without giving up flexibility in future. All the general steps - 1, 2, 3 - are done with OmniMark, each step requiring 2-4 OmniMark scripts. Other conversion processes can use generic scripts or customized scripts within one step. The concept allows for these customized scripts. This results in faster development of new conversion processes and easier maintenance. Software-libraries (in OmniMark) were built for all standard conversions (storage to output). |
Organization |
| Outsourcing |
What do you do yourself, and what do you outsource? |
Management of a publishing house had the tendency to ask: "Can't you just outsource this?" |
First of all, there is no company that you can go to. In our experience you have to teach your suppliers. Of course, there are a lot of good companies, but they are all small. |
Secondly, this is your primary business process - you better get involved heavily. So, start convincing your manager to invest in people first. |
However, the labor market is not favorish for our line of work these days. Let's face it, a good programmer, analyst, or projectmanager can earn more at a softwarehouse. |
Internals |
The organization question doesn't stop at your standard make or buy decision: your also have to define the split of responsibilities in your own company. For this we defined functions and tasks to be performed in a media neutral storage project. |
Our team |
There are certain tasks always to be performed: |
On top of this you want to |
Combining these you get: |
What you must do yourself is maintenance of and the advise about the use of your DTD's (gray); in general you have to bridge the gap between business demands and the translation of these demands in your datastructuring (= DTD). This is the heart of your knowledge and is of strategic value for us. |
Also, you will want to coordinate all the work. This you have to do yourself, and don't think it doesn't take time. |
What you definitely can outsource is development; the projectleader, the SGML-developer, the conversion programmer are all external employees. Better still; it is a good thing to outsource this. Fresh views and a flexible organization favor external employees on this. |
We choose to hire personnel from a limited number of companies, our strategic partners. |
Team spirit |
It is important to think about the fun in the work of the people that work for you. SGML can be boring and/or monotonous. A few things I learned:
|
Concluding remarks |
What is it that we do? We publish! We are not interested in the difference between SGML and XML. We are interested in making a good profit based on high quality publications. We value our content more than our SGML standards. But of course our content is based on SGML now. |
What makes this approach and this project different and interesting?
|
Acknowledgments |
Author wishes to thank the strategic media neutral partners of Samsom:
|
| Implementing a viable architecture for standardized Intelligent Graphics | Table of contents | Indexes | XML::DT - a Perl down translation module | |||