[topicmapmail] PSIs - alternatives
Simon Grant
asimong at btinternet.com
Thu Jun 22 06:52:13 EDT 2006
This strikes me as very clear and persuasive reasoning.
At 20:50 2006-06-21, Steve Pepper wrote:
>The purpose of the subject indicators (or resource descriptors) in the
>Published Subjects (or PRI) paradigm is to provide *humans* with just enough
>information to decide whether or not the PSI/PRI suits their needs, and they
>should therefore contain the minimum number of assertions necessary to
>achieve that purpose. The more assertions, the greater the likelihood that
>people will avoid using the PSI because they disagree with the opinions that
>the minter of the PSI holds about the subject in question. This defeats the
>whole purpose of PSIs, which is to establish agreement concerning the
>*identity* of subjects, not agreement concerning *opinions* about those
>subjects.
Indeed, one of the key precursors to finding agreement is to be sure
to pick something people are able to agree on - the identity of
subjects - and to avoid things people are likely to disagree on.
I think what would be useful to look at is the information necessary
within the PRI/PRD system as a whole. There is
1. as Steve above (providing humans with information)
2. whatever else may be necessary towards a self-sustaining
infrastructure and motivation to use it.
It seems to be Steve's position that all of 2. can be provided
outside the PRD, with the PRD used exclusively for 1. This is
plausible to the extent that proposals for providing the rest of 2.
are plausible. What I read here is that one class of approaches,
involving central repositories or directories, is not thought to be
plausible by many.
Steve's requirements, set out in section 2 of "The Case for Published
Subjects" at
http://www.ontopia.net/topicmaps/materials/The_Case_for_Published_Subjects.pdf
also seem very clear and persuasive. Section 2.4, "Easy for humans" has
1) given a subject, it should be easy to find an identifier (if one
already exists);
2) given a subject, it should be easy to mint an identifier (if
nothing suitable already exists);
3) given an identifier, it should be easy to find out what subject it
identifies; and
4) given an identifier, it should be easy to insert it into a
document, index, database or other source of information.
and section 2.5, "Easy for computers" has
"Any processing that requires more than simple string comparison
(whether it be string normalization, network retrieval of a resource,
or some form of computation), will invariably lead to more
heavyweight and less reliable applications."
A key consideration in discussion here is, what if (as seems likely)
people persist in minting "their own" identifiers for subjects where
there are already identifiers? There are many reasons why this is
likely to happen, including ignorance, laziness, and failure of
discovery mechanisms, but also dissatisfaction with the accuracy or
reliability of another authority. And when it has happened, as it
inevitably will, following Steve's "easy for computers" requirement
it will be essential for everyone to converge on the same one. Will
this happen, and how long will it take? What will happen in the meantime?
My suggestion addresses both the situations where people won't do the
merging, and the situation where alternatives exist prior to merging.
Having a list of accepted equivalent PRIs included means that the
machine comparison is still relatively easy - it would involve
fetching the two PRDs and then string comparison of two lists against
each other. That's all. Maybe Steve and others think that even that
is too much to ask. If so, I have no answer. But is it really too
much to ask, when it would produce what looks like a workable
short-to-long term solution without the need for anything centralised?
Simon
--
Simon Grant http://www.simongrant.org/home.html
Please continue to use my established e-mail address
a (just by itself) (at) simongrant.org
More information about the topicmapmail
mailing list