The QName to URI quagmire

- <xs:annotation>

<xs:documentation source="http://www.w3.org/TR/xmlschema-2/#int" />

</xs:annotation>

- <xs:restriction base="xs:long">

<xs:minInclusive value="-2147483648" id="int.minInclusive" />

<xs:maxInclusive value="2147483647" id="int.maxInclusive" />

</xs:restriction>

</xs:simpleType>

The URI reference:

http://www.w3.org/2001/XMLSchema#int

is specified as the URI for the XML Schema concept "int" which corresponds to the QName "xs:int" (it is implied that the QName prefix "xs" is bound to the URI http://www.w3.org/2001/XMLSchema).

This mapping of QName to URI is consistent with what we expect from HTML where the value of the ID attribute serves as a fragment identifier for a URI reference.

RDF

Unfortunately RDF uses a different mapping to create a URI from a QName: simple concatenation of the namespace URI and local name. For example the RDF snippet:

<rdf:RDF xmlns:rdf="http://www.w3.org/2000/01/22-rdf-syntax#"

xmlns:xsd="http://www.w3.org/2001/XMLSchema">

<rdf:Description rdf:about="http://example.org/foo">

<xsd:int>35</xsd:int>

</rdf:Description>

</rdf:RDF>

should produce the triple:

[http://example.org/foo, http://www.w3.org/2001/XMLSchema#int, "35"]

but instead produces:

[http://example.org/foo, http://www.w3.org/2001/XMLSchemaint, "35"]

The 'problem' is that the RDF Model and Syntax 1.0 recommendation specifies that a URI is constructed from a QName by string concatentation of the namespace URI and local name (as opposed to XML Schema which uses the local name as the fragment identifier).

This results in problems for applications such as DAML (http://www.daml.org) which specify use of XML Schema datatypes in RDF documents.

The QName -> URI mapping problem gets worse for XML Schema complex types. The general XML Schema datatype mechanism uses the value of the name attribute as the local name part of the datatype QName. The difficulty arrises in that this name value can be overloaded among attribute, element and type definitions. For example the definition:

<xs:complexType name="simpleType" abstract="true">
    <xs:complexContent>
      <xs:extension base="xs:annotated">
        <xs:group ref="xs:simpleDerivation"/>
        <xs:attribute name="final" type="xs:simpleDerivationSet"/>
        <xs:attribute name="name" type="xs:NCName">
          <xs:annotation>
            <xs:documentation>
              Can be restricted to required or forbidden
            </xs:documentation>
          </xs:annotation>
        </xs:attribute>
      </xs:extension>
    </xs:complexContent>
</xs:complexType>

<xs:element name="simpleType" type="xs:topLevelSimpleType" id="simpleType">
    <xs:annotation>
      <xs:documentation
        source="http://www.w3.org/TR/xmlschema-2/#element-simpleType"/>
    </xs:annotation>
</xs:element>

The problem this creates, in the intertwined specification quagmire, is that the fragment identifier syntax is dependent on the media type of a document. For example according to RFC 3023, the media type application/xml uses specifies XPointer as the fragment identifier syntax. A new fragment identifier syntax for XML Schema would need to use a new media type (e.g. application/w3c-schema+xml) yet the XML Schema specification prefers the media type application/xml). These issues need to be thought through and coordinated.

At the very least, the RDF XML Syntax, which is currently being revised, at the very least needs to interoperate with and use XML Schema builtin datatypes. This can be accomplished in a relatively straightforward fashion using the following rule for creating a URI from a QName: When the namespace URI ends in an alphanumeric character treat the local name as a fragment identifier, i.e. insert a '#' between the URI and localname. When the namespace URI ends in a character other than an alphanumeric the current system of simple concatentation is used. The effect of this would be negligable for current applications because essentially all RDF applications use namespaces that end in '#' or '/' (for the very reason that otherwise there are problems).

XML Schema's use of QNames