[topicmapmail] Fragmented XTM for web metadata, and some
ontology?
Kal Ahmed
kal@techquila.com
26 Jun 2003 22:21:11 +0100
Alexander Johannesen proposed:
> > <topic id="dc:type">
> > <subjectIdentity>
> > <subjectIndicatorRef
> > xlink:href="http://purl.org/dc/elements/1.1/type" />
> > </subjectIdentity>
> > </topic>
> >
> > ( id-attributes follow the DC recomended guideline for DC in XML )
>
and On Thu, 2003-06-26 at 14:29, Murray Altheim wrote:
> I'm not sure when the guidelines were written, but the colon is prohibited
> as a name character nowadays, in accordance with the XML Namespaces 1.1
> Recommendation:
>
> http://www.w3.org/TR/xml-names11/#Conformance
>
> The reason it was allowed as a Name character was to allow experimentation;
> in the end I believe it was decided that "namespaced IDs" didn't make much
> sense. Thank God for that.
>
Perhaps you can use the dc.title form that is used in the HTML
representation of DC instead ?
> > I had some problems groking the associative DC metadata in Jan's
> > document, but that might be because I don't understand why
> > they should be expressed that way. Maybe my thinking is very
> > slow, but I'd like an ontology (called FXTM for now) ;
> >
> [example elided...]
> > My thinking says that for a given page, you have the following XTM;
> >
> > <!-- root node is the fragmented XTM's resource -->
> >
> > <topic id="root">
> > <instanceOf>
> > <topicRef xlink:href="#fxtm:page"/>
> > </instanceOf>
> > <occurrence>
> > <instanceOf>
> > <topicRef xlink:href="#dc:title" />
> > </instanceOf>
> > <resourceData>Some title</resourceData>
> > </occurrence>
> > <occurrence>
> > <instanceOf>
> > <topicRef xlink:href="#dc:date:publish" />
> > </instanceOf>
> > <resourceData>[Some date]</resourceData>
> > </occurrence>
> > </topic>
> >
> > Where the TMs for FXTM and DC are implicit part of the
> > namespaces used (and merged in according to the FXTM
> > spec?).
>
> I don't think it's appropriate semantically to include properties
> of a given topic as occurrences of that topic -- it's just not
> sensible, doesn't "read" right to me. You yourself stated the sentence:
>
> "here is the resources"
>
> which is essentially saying "here are the untyped values". Since we
> want to both name and type our values, and because occurrences don't
> have names, each facet is a topic.
>
This is where I differ. My understanding of the semantics of occurrence
is that an occurrence specifies some information that is in some way
related to the subject. I do understand that a number of other topic map
practitioners believe that an occurrence is only a resource in which the
subject is mentioned or described in some way - so whereas I would say
that metadata such as creation date can be expressed in an occurrence,
they would point at the occurrence and say "where's the subject ?" :-)
With regards to the DC title element I have more sympathy for Jan
Algermissen's analysis that dc.title == baseName.
> > I don't know. Maybe I'm mixing things up too much, or
> > maybe I'm missing the incredible power of making the
> > occurrences topics instead with associations binding them, but I feel
> > that becomes more of an application specific way than a general "here
> > is the resources" way;
> >
What the RDF people found was that if you have literals you reduce the
reliability of merging (smooshing to use the RDF term...no, really!) but
perhaps for a meta data element that is a value from a range (e.g. date,
age, size, GPS position) that is all you can do scalably.
<snip/>
> >
> > Basically, what I want is a PSI set for the FXTM set
> > (both roles and types), and a set of PSIs for DC. The
> > latter is the easy part. Where should I turn for the former? Is there
> > anyone looking into a loose web browsing ontology?
>
> You might look at the "topic map-based" XFML, at http://xfml.org/
> which I believe is designed to solve either the same or a similar
> problem (depending on how I understand your question).
>
> By some coincidence, I happen to be working up a short specification
> for doing facets in XTM, with the idea that you can associate a
> name-value pair to a given topic.
>
> BTW, Kal Ahmed has a PSI set for faceted classification (FC), and
> we're discussing various aspects of it right now. I'm looking at the
> description of facets in ISO 13250 as well as how FC works, as while
> the use of "facet" is the same, there are some differences (as might
> be imagined from terms coming from different communities). I'm
> particularly interested in the overlap, for my own work.
>
> I hope to publish the "Facets in XTM" draft within a week or so. If
> I haven't by July 10th, it'll probably be in two weeks after that
> (as I'm out of town and probably disconnected). I'll be updating my
> "Datatypes in XTM" document at that time as well, as it is used in
> typing the facet values, e.g.
>
> "Birthday" has type "dateTime" and value "1961-07-04T20:00:00Z"
> "Birthday" has type "date" and value "1961-07-04"
>
Ranges are the issue that I haven't addressed with my proposal for
facetted classification - in my model a facet value is taken from some
hierarchical classification scheme. So the facetted classification model
that I propose is merely a way of indexing topics by multiple disjoint
hierarchies.
> I have had a DC topic map for about two years now, unpublished. Since
> that might be useful with the facet TM, if I have time I'll also put
> that online.
>
> Since the term "facet" also occurs in XML Schema datatypes, I'm
> starting to feel a bit surrounded by the damn things, like some
> kind of bad drug trip...
>
As long as the topics don't turn into lizards and you don't end up in a
Las Vegas casino with your lawyer, you'll do just fine :)
Cheers,
Kal