![]() |
Using Topic Maps | Table of contents | Indexes | Answer is just a question [of matching Topic Maps] | ![]() |
|||
Towards knowledge organization with Topic Maps |
| Sigel, Alexander |
| Alexander Sigel |
| M.A. |
| Researcher |
Bonn Germany ![]() Informationszentrum Sozialwissenschaften | Informationszentrum Sozialwissenschaften,
Lennéstr. 30 Bonn Germany D-53225 Phone: +49 228 2281 170 Fax: +49 228 2281 120 email: sigel@bonn.iz-soz.de web site: index.bonn.iz-soz.de/~sigel/ |
| Biography |
| Abstract |
Introduction |
TMs ![]() | Browsing through the advance material of the Metastructures 1999 conference , I stumbled acrossTMs and immediately got hooked. My background in information science and conceptual knowledge organization made me, of course, wonder about the potential of this technology forKO andKM . After all,KO is much about knowledge structures and their interrelations, andTMs promise to provide a standardized technical means to build and share such. Naturally, I wanted to learn more and to discuss implications with myKO colleagues after having read all theTM material I could get hold of. |
Information Science KM ![]() KO ![]() Knowledge Management Knowledge Organization, conceptual Knowledge Structures, conceptual TM ![]() TMs ![]() | However, at that time none of them had even heard about theTM standard, despite its long history. Consequently, my request for discussion initiated no response. This is probably just another instance of the widespread phenomenon of separated communities. What are some of the obstacles in this case? First, it always takes some time for an innovation to diffuse to other fields. Second, theTM standard is not really intended to be read and understood by scholars of the humanities. Third, differing terminology is most hindering (cf. e.g. "facets"). Forth, there existed no tutorial material that was excellent, affordable and easily available. Furthermore, no large-scale real-world applications and examples could be reviewed. (I want to add: In spite of the rising marketing hype, there is still much room for improvement). |
KM ![]() KO ![]() TMs ![]() | To overcome this situation, I imagine an interdisciplinary team working towards a high-quality white paper on the relationship betweenKO
/KM
andTMs
(working title: "How to manage knowledge withTMs
"). It may eventually evolve into a tutorial booklet. In my opinion, it should:
|
GUI, Graphical User Interface ![]() | Moreover, we need more free reference software like 'tmproc: A Topic Maps implementation (in Python) . I am strongly convinced that the availability of such open source tools will increase the number of critical minds which contribute. This in turn will probably lead to the discovery of more flaws and alternative interpretations in the standard, and to additional free software, perhaps even to innovativeGUI concepts. In addition, it will give people the chance to acquire hands-on experience of what it is to work withTMs . |
IT ![]() KM ![]() KO ![]() Open Source ![]() TMs ![]() Topic Maps, reference software Topic Maps, tutorial | It would be fruitful to arrange for some intellectual cross-over, maybe in the form of a small workshop. The excitingTMs are indeed an ideal catalyst for mutual learning experiences for proponents from the partially overlapping communities ofKO ,KM andIT . (The experts onKR have been subsumed under those communities).KO people need more explanation and hands-on training,KM people require a holistic, interdisciplinary methodology, andIT people may welcome suggestions about related work in areas based on a different epistemological horizon. |
IT ![]() KM ![]() KO ![]() KR TM ![]() Topic Maps, interdisciplinary workshop XTM-WG | Even though this is very ambitious indeed, as theTM movement gained momentum, we now may be closer to that goal. Several companies becameTM -aware (e.g. by Holger Rath's article or by theTM tutorials at this conference). The newXTM-WG will hopefully push in the same direction. |
KM ![]() KO ![]() Knowledge Management, methodology TM ![]() TMs ![]() | In this technicalTM session I - aKM apprentice - will share my limited insights onTMs from aKO perspective in order to learn from you. My main message is: Yes, you should seriously considerTMs if you plan for applications on top of digitally organized intellectual assets. But make sure to include strongKO expertise in yourKM methodology and team applyingTMs , since the classical challenges ofKO will inevitably haunt you in a new disguise, even with this promising technology. |
KO ![]() TMs ![]() | After having explained the nature ofKO
in general and the instrumental role ofTMs
forKO
in particular, I will shortly sketch three typicalKO
scenarios in whichTMs
could be fruitfully applied. This should deepen your understanding of the nature ofKO
tasks and of theTMs
' potential to better accomplishing those tasks. The core of this talk is about the most interesting aspect of the relationship betweenKO
andTMs
: The possible role ofTMs
within the classicalKO
challenges of how to: |
KO ![]() TMs ![]() | What isKO about? How doTMs relate to it? |
ISKO ![]() KO ![]() Knowledge Organization, definition of | KO is interested in optimizing the organization (the conceptual access structure) of knowledge repositories in order to support easier retrieval, creation and sharing of knowledge for user communities. (You may find a wealth of information aboutKO via the homepage ofISKO , especially in its quarterly journal "Knowledge Organization" , and in theISKO conference proceedings ). The aim ofKM (especially in knowledge-intensive enterprises) resembles this optimization to a certain extent, becauseKM has to ensure strategically that all important knowledge assets and flows are known, utilized and enhanced according to their respective long-term contribution to the business value. |
Classification ISKO ![]() KM ![]() KO ![]() Knowledge Management, aim of Ontology Thesaurus Vocabulary Control | A basic conviction ofKO is that there exist important domains where some form of vocabulary control is beneficial. Hence the interest ofKO in principles of classifications, thesauri, and ontologies. |
Metadata ![]() Subject Access Points Subject Indexing Process | The aim of an optimal conceptual access structure requires the careful design and steady maintenance of additional knowledge (meta-)structures. In order to provide the most useful subject access points for various user communities, indexers have to judge the potential subjects of an item within a collection from various viewpoints. This costly intellectual operation creates added value and obviously results in metadata. One is acquainted with such metadata from database indexing in the form of entries on library records, but also from subject-oriented clearinghouses (subject gateways, ) on the internet. Another form is sophisticated back-of-the-book indexing of scientific and technical writings (see e.g. ). |
KO ![]() TMs ![]() | At that point we can already see the type of relation betweenKO andTMs :TMs are instrumental toKO , since they constitute (yet another) basic technology whichKO may employ inter alia within a broader methodology in order to provide improved information retrieval and collaboration services. Therefore,TMs can indeed play an important role withinKO , if they (together with other technologies) make it easier to provide innovativeKO services. On the other hand, the requirements ofKO help to spot whatTMs are good for and what they do not achieve. |
KO ![]() Knowledge Management, collaborative Knowledge Structures, user viewpoints in SOAPs Subject Gateway TMs ![]() Topic Maps, consistency of Topic Maps, search and navigation in | In sum, if it is really true thatTMs are |
|
KO ![]() | thenKO would be well advised to use that tool. |
KM ![]() KO ![]() TM ![]() | I think, it is now obvious whyTM ,KO andKM experts should talk to each other: Although a fool with a tool still remains a fool, a tiger team with the right approach and tool set may accomplish the breakthrough. |
KO ![]() Knowledge Organization, scenarios TMs ![]() | A short sketch of three typicalKO scenarios in whichTMs may be usefully applied |
KM ![]() KO ![]() TMs ![]() | All threeKO scenarios make typicalKO /KM tasks more vivid and illustrate the point thatTMs may have a great potential forKO tasks as a basis technology, but that a more comprehensive approach and much further co-operation is needed. |
Scenario 1: Knowledge structures in the social sciences: database indexing, a specialized information service and a clearinghouse |
| SOLIS | The Informationszentrum Sozialwissenschaften, Bonn, Germany , is (inter alia) responsible for the national research databases of the social sciences (SOLIS ,FORIS ). In database indexing, all documents are assigned an abstract, controlled descriptors and classifications. From this pool, specialized information services (so-calledsoFids ) on 28 topics are compiled intellectually. There is some overlap between the topics. Each such service (view) superimposes its own conceptual structure (its table of contents) and filtering upon the already existing structures in the databases. One topic is "migration and ethnic minorities". In this case, a subject gateway linking to external, online information about that topic has recently been started . Again, it has its own conceptual structure. |
| FORIS, Information system on research projects in the social sciences soFids | We are looking for a means to: |
CD-ROM, Compact Disk read-only Memory/Media ![]() soFid | Scenario 2: A virtual reference room for digital cultural heritage |
MMI ![]() | We take the mission and the objective of theMMI from its homepage : |
| One of the pending research projects applies the concept of a digital (virtual) reference room to selected cultural heritage resources, e.g. in the Limburg region. The material is virtually combined to support specialized usage contexts and tasks (e.g. learning about a certain painting technique). Such views put additional requirements on the subject metadata. This poses the challenge of how to cope with the semantic heterogeneity of those diverse resources and their metadata. Of course, in this case, the original sources cannot be changed, either. |
KM ![]() | Scenario 3:KM : Innovative information services based on controlled vocabulary |
CKO, Central Knowledge Officer ![]() | TheCKO of a large technical consultancy sees the necessity to replace the existing, autonomous, redundant and decentrally maintained keyword lists with a centralized repository of structured vocabulary. In his eyes, the homogenization of the vocabulary and its synchronization with global classifications and translation aids is an absolute precondition for efficient search engines, push and pull services, interest profiles, portals, yellow pages, etc. His main concern is the bottleneck of developing and maintaining appropriate vocabularies, not the availability of technology or applications. To reduce the expected high efforts, he wants to buy and re-use as much knowledge structures as possible. A large part of the integration effort would be easier if those structures were formally defined, and more context information was available. The provision of knowledge structures marked up asTMs will open up business opportunities, but only if interested parties will agree on application-orientedTM semantics. |
KO ![]() Knowledge Order Systems, comparability and compatibility of Knowledge Organization, principled approach to Metadata, heterogeneous, merging of TM ![]() TMs ![]() | TMs and classicalKO challenges: How to organize knowledge in a principled way? How to make order systems interoperable? How to deal with heterogeneous subject data? |
KO ![]() | Once we are faced with more than one content provider and stakeholder, with more than one rather homogeneous user community, and once we move from homogeneous databases to the plethora of independent internetworked information resources, searching and navigating by subject requires that we (at least virtually) bridge the gap between the numerous schemata which organize knowledge by subject. Hence we are faced with the well-knownKO problem that it is extremely difficult to achieve comparability and compatibility between various schemata which organize knowledge by subject . All the more this extends to the problem of mapping, merging and integrating the corresponding subject metadata referring to those different schemata. |
XML ![]() | Let's assume that all your data is "XML -ified", each data source has a predefined basic order structure, and your objects have been associated with metadata derived from these structures. Current best practice forWeb resources suggests to use aRDF -embedded cousin of Dublin Core , which points to a machine-readable version of your order system. |
RDF ![]() TM ![]() TMs ![]() Web ![]() XML ![]() | Only recently have documentalists recognized the full power ofXML /RDF as a basis for information systems . If there exists an important difference betweenTMs andRDF , this may have far reaching implications. Given we already possess a vocabulary: Which standard shall we adhere to? WhileRDF is more general and it may be possible to write a converter between both ,RDF is rather centered around the resources instead of around the cross-resource knowledge structures. Moreover, it is important to note that only aTM processor can make useful interpretations, sinceRDF does not deal with the semantics ( , , pp. 21ff.). |
Metadata, Dublin Core TM ![]() TMs ![]() XML/RDF, comparison to Topic Maps | As most of this knowledge is implicit, the computer has no clue of the underlying concepts and is limited to string processing.TMs allow us to formally define such schemata. The standard designers, however, wisely did not prescribe the user semantics of the structured link network. (A fact which we enjoy in lengthy threads on theTM mailing list). |
TMs ![]() Web ![]() | Given that the key players decide thatTMs
are the preferable alternative, especially for theWeb
: Then we still have to resolve: |
KO ![]() TMs ![]() | How can we define theKO structures inTMs in a principled way? |
KO ![]() | We would like to know if it was possible to create knowledge structures in a principled way in order to ease their conjoint use. How can we plan today for future merging? (Note that this is more than asking for the architectural "organizing principle" of the topic paradigm itself). The answer is: WhileKO offers some guiding principles (cf. e.g. , ), there are more open problems than solutions. |
Let me mention somedifficulties
:
|
TMs ![]() | As you cannot simply throw order systems (or their metadata) together and inappropriately use them out of their context, someone seriously working withTMs will also need strong expertise in how to create useful conceptual structures to organize knowledge in a principled way. His business may depend on the state of research in such a methodology. |
KO ![]() TMs ![]() | If we expect large-scaleTMs to find widespread application, it is self-evident that the physical sharing ofTMs per se will not alleviate the problem of incompatibleKO schemata.TMs can be a vehicle for semantic integration, but, on the contrary, the heterogeneity will increase, because it has become easier to create such schemata. |
| I now turn to a fewprinciples : |
KO ![]() TM ![]() | Fortunately, theTM community has already started the fruitful exchange by discussingKO -relevant topics (such as constraint mechanisms like schemata or templates), relationship types, validation and inferencing services, or the principles of analytico-synthetic (faceted)KO schemata). It is always helpful to check with a good handbook on thesaurus construction and maintenance, or on the subject indexing process. Regrettably, books concentrating on conceptual issues are rare, andKO will have to integrate the new requirements and possibilities into new textbooks. |
| It is currently debated whether a constructive way to guide user-oriented depth indexing exists at all . I believe that a thorough, domain-oriented analysis of the types of user requests with the relevant answers, together with their embedding into the social praxis of this special community, will ultimately uncover pragmatically relevant core knowledge structures. |
Facet Classification KO ![]() TM ![]() | The next step is to find out how these complex structures can be broken down and be expressed as a combination of simpler constructs. This brings us to the most prominent way to design principledKO structures: Facetted classification: Every compound subject can be synthesized from a set of elementary, independent building blocks, using the grammar of a powerful knowledge order language. The definition of concepts as specialized composition of faceted (mutually-exclusive) sorts which are subsumed by postulated, very general basic categories avoids enumeration. However, it needs a great deal of expertise to find the right building blocks! Nota bene: Apart from the name, the advancedTM concept "facet" is not related to that method, and calling yourTM topics faceted does not guarantee a usefulKO structure. |
| Much like object-oriented models, faceted knowledge structures are advantageous if you want to merge them: The structure of the building blocks is clearer, and you only have to handle fewer and more abstract elements. |
| Semantic Relations, standardization of Semantic Relations, types of TMs ![]() | Which semantic relation types should we use and standardize inTMs ? |
KO ![]() | KO has at length dealt with the question of which types of relationships are needed. This is more than part-of/has-a, is-a and instance-of. The "related terms" relationship has been semantically differentiated by various specialties, including pragmatism in linguistics and rhetoric structuring theory. However, theKO community did not achieve consensus on which relations to standardize and therefore still lives with the outdated thesaurus standards. Some programs for the structuring of vocabulary offer up to 30 relation types, but only very few are actually used in documentary practice. |
TM ![]() | The reason may be that the very differentiated relation types were not directly more useful, since they were not supported by retrieval software. In addition, the less predictable the assignment of a specific relation, the more errors happen. In sum, the extra work did not pay off. But with ontological engineering this situation may change, because inference and validation services need fine-grained relations. A lesson for the standardization ofTM templates may be that consensus on such sophisticated templates can only be achieved in specialized domains. |
TMs ![]() | What approaches exist to tackle the scalable interrelating ofTMs in order to achieve layered semantic interoperability? |
KO ![]() TM ![]() TMs ![]() Topic Maps, merging of | The original requirement that had led to the development ofTMs was that of a publisher who wanted to merge indices in technical documentation. But how canTM applications merge topics if even similar topics within the same scope may have different extentions? WithTMs one can express conceptual structures, but, of course,TMs do not come up with valid fusion strategies. Without some background inKO or comparable experience, naive merging will result in a big pile of rubbish in which all context will be lost. |
| Information Provision, layered model of decentral Semantic Interoperability TMs ![]() | Because the problem of how to reorganize, enhance and semantically integrate heterogeneous subject data will persist withTMs , I will very briefly discuss concepts for achieving semantic interoperability within a layered model of decentral information provision. Here the focus is on semi-automatic methods which depend on intellectually maintained schemata. |
TMs ![]() | In my view, ideally one would improve all schemata involved towards faceted schemata and reclassify the items. However, limited resources render this approach rather unfeasible. In this situation,TMs could be helpful, because they allow to define structures independent from and across the original documents, they support a more formal definition, they are open for alternative views, and they make collaborative work on evolving structures possible. |
IT ![]() Metadata Integration, transfer methods Metadata, quality control TMs ![]() | ThusTMs could be one aptIT that fits into Krause's layered model of information provision in which no longer a central agency exerts its authority in subject indexing and vocabulary control upon agencies located lower in the hierarchy, but in which a group of partners co-operate. Such a strategy does not result in uniform metadata, but leads to layers of heterogeneous metadata with different quality control procedures. Intellectually controlled high-quality subject schemata lie in the heart of those layers. Intelligent transfer components are sought which can improve on subject data on outer layers by using the structure of inner layers. The main methods are: The compilation of cross-concordances which map between entries, and a combination of quantitative-statistical with qualitative-deductive approaches. The right mix seems to be domain-dependent and is hitherto unknown. Personally, I am convinced that qualitative methods and case analysis will yield rich material and exploitable ideas for transfer strategies. |
| ViBSoZ | The ongoing research projectViBSoZ ( , ) explores how to cope with heterogeneous subject data in the social sciences. TheSIMS project "Search Support for Unfamiliar Metadata Vocabularies" by Michael Buckland et al. is a related approach which also includes issues of scalability and information agents. Several projects have tried to automatically gather and experimentally classifyHTML documents in one specialty according to existing knowledge structures. E.g. Koch and Ardö have thoroughly compared the results both with intellectual classifications and with expert judgements. At best, 2/3 of the results match. |
HTML, Hypertext Markup Language ![]() Metadata Integration, research projects SIMS TMs ![]() | In sum, we dispose of no overall convincing strategy to achieve semantic interoperability, but of a broad range of necessarily heuristic methods. Theory does not say much about this "repair case" in which most systems to be integrated are not principle-based and much context has already been lost. This stresses that it is worthwile to put great effort in the meticulous, intellectual maintenance of conceptual structures, since such key assets are at the heart of the layered model. All transfer components (including automatic ones) will depend on the quality of the innermost schema.TMs could be one tool with which knowledge structures could be maintained more easily, and thus more time could be dedicated towards better quality. Such high-quality knowledge structures will be needed anyway by clever strategies in next-generation ontology- and agent-based knowledge services. |
TMs ![]() | Questions about the relationship betweenTMs and other means to computationally handle semantics in next-generation knowledge services |
AI, artificial intelligence ![]() TMs ![]() | The vision of high-level ontology- and agent-based knowledge services is not new. Likewise, at first sightTMs seem to be nothing more than a new format to mark up what formerly was expressed as assertions and rules inAI databases. As there have been other formats and languages before which did not receive that much attention, the purpose of this short section is to ask, whether they are already superseded, or are just variants, or whether they constitute a welcome complement toTMs (e.g. in order to validate the semantics of aTM application according to aTM schema (or a similar mechanism)). This question is of relevance, since an innovative information service provider is interested in estimating the survival power of a technology before making huge investments. |
AI, artificial intelligence ![]() Artificial Intelligence Information Agents Knowledge Representation SHOE Semantics, computational TM ![]() TMs ![]() Terminological Logics XML ![]() | BeforeTMs , you may have thought about introducing the computational semantics needed for metadata fusion by equipping information agents with clever heuristics based onAI tools. After all, that's what the validation and inferencing services of terminological logic's subsumption is good for. You are right if you object that this is overkill, that one cannot make everything explicit, and that it is computationally demanding. So which other ontological tools did you use instead to express your knowledge structures?XML -encodedSHOE ? Its relativeCKML ? Stanford'sKQML /KIF ( , , )? Why not? Now withTMs : Would the higherTM services differ from general tell-ask-performatives of information agent languages likeACL ? |
ACL, Agent Communication Language ![]() CKML, Conceptual Knowledge Markup Language KIF KQML ![]() TM ![]() TMs ![]() Topic Maps, and Information Agents Topic Maps, validation services for | I would like to learn more about the relation between this research andTMs . Maybe someone can point me in the right direction? I know of no demandingTM validation service. Is it possible to convert between the formats or to communicate between applications? How can we achieve that information agents exploit knowledge structures expressed inTMs ? |
Outlook and conclusion |
AI, artificial intelligence ![]() KO ![]() Knowledge Organization, with Topic Maps, research directions | My personal experience withKO
is that because knowledge structures are a socio-cultural product,AI
modeling is only of limited help. During the process of detecting and exploring emerging knowledge structures, a tool is needed that allows to start less formally. Thus I recommend to investigate:
|
Domain Analysis KO ![]() TM ![]() TMs ![]() | Altogether, I like to see more information agents which rework subject data with their information strategies and which are informed by improved versions of ontological models. |
KO ![]() TM ![]() TMs ![]() | In conclusion,TM -based services may alleviateKO tasks, but strongKO expertise is indispensable. The main implication ofTMs forKO is not thatKO thesauri and classifications can (trivially) be defined and maintained asTMs , but that - like with hypertext - there is a paradigm change: TheKO methodology must be partially redesigned for collaborative knowledge building activities on distributed resources. This paper is a first attempt to stimulate co-operation between the specialties, but much more work is necessary. What about the tutorial white paper and the reference software? What about joint projects? |
| Acknowledgements |
| I am greatly indebted to Steve Pepper for his encouragement. |
| Bibliography |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
![]() |
Using Topic Maps | Table of contents | Indexes | Answer is just a question [of matching Topic Maps] | ![]() | |||