| TMS' History in Web Syndication | Table of contents | Indexes | The Use of Profiling in XML Documents | |||
Brown, Steve ![]() POET Software San Mateo ![]() USA ![]() | Steve Brown |
| Product Manager |
| POET Software |
| 999 Baker Way Suite 100
San Mateo
(California)
USA
(94404)
Email: sbrown@poet.com |
| Biography |
Scope |
| content reuse | This paper discusses the emerging use of "content reuse", or the use of the same content in several publications, with SGML and XML. The discussion primarily deals with "complex content", the area of the information spectrum traditionally addressed by SGML which includes maintenance manuals, documentation, and the like, as opposed to traditional "data" applications (e.g., HR, accounting, and customer data). Next the paper will cover a definition of "content" will be defined, content paradigms that are changing as the result of new technologies, and why content must to be reused in commercial settings. Then I will go into various reuse strategies using the standards and tools of today and tomorrow. Finally, I will address what needs to be done to help publishers facilitate maximum content reuse. (Note that "repurposing", or the delivery of the same publication to different audiences or on different media, which has its own distinct set of issues and solutions, is not covered here.) |
Content; What is it? |
Why Reuse Content? |
The Content Revolution |
publishing ![]() | Increasingly, new publications will consist of interchageable parts. Publications will be open-ended, liberally reusing content from other sources and hyperlinking to related texts. They will separate content and formatting, enabling many different kinds of presentation on different delivery media (e.g. print, CD, Web, handheld devices). Most importantly, this content will be as much "assembled" as it is created from existing sources, and personalized for various audiences and purposes. |
A scenario |
Content reuse: Examples |
| The following examples from the aerospace, automotive, telecommunications, and manufacturing industries demonstrate how content is being reused in XML and SGML applications today: |
Assembled Content; Models |
| The following technologies and standards are paving the way for the future of reusable content: |
| - SGML / XML markup |
| - SGML / XML entities |
| - Database-based content management systems |
| - XLink and XPointer |
How not to do it: Cut and Paste |
| XML tools | Using traditional, proprietary tools (as well as XML tools) publishers can easily "cut and paste" content from one document to another. But this method only works until the reused content changes. If a section copied into another document changes, the publisher has to manually find every place where content is reused, and make manual updates to each of these places. Obviously, this process is labor-intensive and error-prone. Instead of cutting and pasting, it makes sense to maintain reused content in a single place, and reference it from wherever it is reused. SGML and XML are the optimal foundation for this kind of reuse. |
SGML and XML |
SGML ![]() | SGML and XML enforce the structure necessary to enable reuse of content. Through structural enforcement, XML can ensure interoperability between XML-ready systems. Structural enforcement also means reused content can be validated to make sure that it won't break the new document's structure. Basically, structural validation ensures that the applications designed to work with a certain type of content-in authoring, management, or delivery-will work the way they're supposed to. |
XML/SGML Entities |
| entities | Entities, defined as part of the XML and SGML specifications, provide a standard, vendor-neutral means of reusing content. Entities are XML or SGML fragments that can be stored either inside larger documents or as external objects (e.g. files or objects in a database). XML / SGML publications use entity references to point to entities. The XML or SGML application resolves these references, enabling content to be reused on paper or in electronic form. Entity-based reuse is widespread in many industries, including aerospace, pharmaceudical, manufacturing, and reference publishing, among others. |
Database-based content management systems |
Content object sharing | ||||||
Content sharing in RDBMSs | ||||||
Content sharing in ODBMSs | ||||||
Proprietary methods | ||||||
Reusing versions | ||||||
Mapping objects to new DTDs | ||||||
Document assembly | ||||||
The near future: XLink and XPointer |
Conclusion: it's the tools |
| TMS' History in Web Syndication | Table of contents | Indexes | The Use of Profiling in XML Documents | |||