[topicmapmail] Testbed for Subject Identity Measure

Dipl.-Wirtsch.-Inf. Lutz Maicher [Universität Leipzi g] maicher@informatik.uni-leipzig.de
Thu, 24 Jun 2004 10:56:51 +0200


Dear all,

In our current research project at University of Leipzig, department of NLP,
we develop a tool for the automatic generation of Topic Maps from texts in
distributed environments. As a part of this project we research on merging
of distributed Topic Maps.

In such distributed scenario the equality rules of TMDM failure because
distributed Topic Map authors maybe don't agree about a common vocabulary
for declaring Subjects. Therefore we develop a SIM (Subject Identity
Measure) which bases on language independent NLP algorithms. This SIM is
some kind of likelihood whether two Topics describe the same Subject. The
value of this measure may support users to decide which Topics should be
merged if two distributed Topic Maps concur. This approach might be
interesting especially for Topic Maps which make assertions about generic
Subjects, for example: "Introduction of quality management in our company".

But for the development of the SIM we need a testbed. We need two Topic Maps
which describes similar domains with "generic" Subjects. Unfortunately we
don't have such Topic Maps. If anybody can aid us with some data we are
pleased with your contact.

If we have first results we will post it at the mailinglist in hope for a
vital discussion. If anybody is interested in advance we look forward to
your questions.

Best regards
Lutz Maicher

____________________________________________________________________________
_____
Dipl.-Wirtsch.-Inf. Lutz Maicher
Graduiertenkolleg Wissensrepräsentation | Universität Leipzig
Abteilung Automatische Sprachverarbeitung | Institut für Informatik |
Augustusplatz 10-11 | 04109 Leipzig

fon 0341 97 32 303 | mail: maicher + informatik.uni-leipzig.de
http://www.informatik.uni-leipzig.de/~maicher/