Modeling schema composition in Alloy

A working paper prepared for the W3C XML Schema Working Group
C. M. Sperberg-McQueen

This document was originally prepared in 2007, as shown.

It was made public by the author in June 2009; the only changes made for publication were (a) the updating of stylesheet and other links to reflect the location of the public version and (b) the addition of this note.

Introduction

This document describes a simple model of XSDL schema composition in Alloy. One goal is to help clarify the design of composition; another is to generate test cases for testing the behavior of existing processors. If we know what kinds of cases are treated consistently by existing XSDL software, we have a better chance of ensuring that the description of schema composition in XSDL 1.1 does not unnecessarily cause interoperability problems between 1.0 and 1.1 processors. Also, empirical information on existing processor behavior can suggest places where XSDL 1.0 is underspecified, contradictory, unclear, or too subtle.

The document first discusses aspects of schema composition we may want to model, then addresses the problem of building test cases with interpretable results. Then a simple model of schema composition is presented in Alloy notation, and some sample instances of the model are presented. Later sections describe, in turn: the form of test cases to be generated from instances of the model; a small number of tests hand-made on the basis of Alloy instances (and in some cases from user reports), with the results of running those tests on existing processors; code that generates test cases automatically from Alloy model instances; and the tests generated in this way, with the results obtained from running them.

In this version of this document, familiarity with both XSDL and Alloy is assumed. If they have sufficient tolerance for partially understood technical detail, readers lacking some of that knowledge may nevertheless find it comprehensible in parts.

Sources of variation

To generate test cases with reasonably good coverage of schema composition, it would be convenient to be able to model the points on which the behavior of processors may or must vary. These include:

- processor policy for component acquisition: are schemaLocation hints followed? In principle, the policy could be anything; for testing, we probably need to specify a choice of:
    - follow no schemaLocation hints; read only the schema documents specified at invocation time (this involves using a proxy server that ensures that the required attempts to dereference the schema documents will fail)
    - follow schemaLocation hints only when required, i.e. only on xs:include and xs:redefine elements; ignore schemaLocation hints on xs:import and in xsi:schemaLocation and xsi:noNamespaceSchemaLocation attributes in the instance
    - follow the first schemaLocation hint for each namespace not covered by a schema document named at invocation time
    - follow all schemaLocation hints (even multiples)

  Note that the existing XML Schema test suite appears to be built on the assumption that the choice of processor policy for component acquisition is unproblematic. The test suite catalog does allow several schema documents to be specified for a test, using the schemaTest element and its schemaDocument children. It is not clear from the documentation, however, whether processors running the test are expected to read those schema documents and no others, or to read all of those schema documents and optionally any others they may point at.
- starting documents: which schema documents are specified at invocation time?
- document availability: for each set of documents, some subset may be unavailable
- schemaLocation hints in instance: for each namespace, does the instance point to a starting schema document for that namespace? Does it do so once, more than once, or not at all? In what order do the schemaLocation hints occur?
- order of lists: when several schema documents are specified at invocation time, or in the same schemaLocation attribute, what order are they listed in? Does it matter? (It clearly does matter sometimes. Consider an instance with a single namespace / schema document pair in its schemaLocation attribute. At least two processors build a schema and validate the instance without complaint, but if a second namespace / schema document pair is added to the schemaLocation attribute, they fail to find the declaration for the ns1:wrapper element. If the order of the schema location pairs is changed, the behavior of the processors changes.)
- schemaLocation information in the schema documents: for each pair of schema documents A and B, does A include B? B include A? import? redefine? N.B. I'll use the verb call to cover all three of these relations: A calls B if and only if A includes B or A redefines B or A imports B.
- single call vs. multiple call: if A includes B, does it do so once? twice? how many times? Ditto for import and redefine.

It's not clear what interpretation the XSDL spec supplies for schema document A containing two xs:redefine elements each pointing to document B and redefining the same or different components. But I have not noticed any rules which explicitly forbid it. The example serves a useful purpose either way: either we can demonstrate that the spec forbids it, or we can demonstrate that the spec allows it and assigns a particular interpretation to it, or we cannot demonstrate either of those propositions, which is itself a useful, although not a particularly happy, result.

- namespace matching: when A calls B, do their namespaces match or differ? The constraints in XSDL are straightforward enough to check that there doesn't seem much point in trying to exercise all possible variations here; it's probably enough to have some simple examples of violations of these rules, and then to generate other test cases from schema documents which obey these rules:
    - if A includes or redefines B, then their target namespaces match, or B lacks a target namespace
    - if A imports B, then their target namespaces differ
- I don't see rules forbidding:
    - cycles on any kind of composition
    - self-inclusion or self-redefinition (N.B. self-import fails the namespace-cleanliness rules)
    - multiple calls to the same schema document of different types
    - multiple calls of the same type to the same schema document
- behavior with respect to missing components; the possible kinds of missing component appear to be (see , Appendix A, for slightly more detail):
    - attribute {type definition}
    - element {type definition}
    - element {substitution group affiliation}
    - type definition {base type definition}
    - type definition {attribute uses}
    - type definition {content type} (simple type may be unavailable)
    - type definition content model may refer to unavailable elements
    - attribute group may refer to unavailable attributes
    - named model group may refer to unavailable elements
    - element identity constraints may be unavailable
    - simple type definition {member type definition}
    - simple type definition {item type definition}

  It might simplify life if we can successfully test schema composition without getting into missing components.
- lazy assembly vs. eager assembly: does the processor gather all schema documents together first, then build the schema, then validate? Or does it begin validation with a partial schema, to which components are added when needed? In principle, the XSDL spec has been written to make lazy assembly possible, but we are not supposed to be able to tell the difference. And indeed one rationale for the hand-waving in XSDL's description of component acquisition is that leaving processors such freedom allows any odd results of lazy assembly to be understood instead as quirky choices during component acquisition. It's not clear whether test cases should, or can, actually expose the reality of the processor's behavior in this area, but it may be possible to construct test cases which present different challenges for lazy and eager assemblers.

A simple model of schema composition

Trying to generate test cases to cover all possible combinations of the variables identified above may be very time-consuming and produce more test cases than we can conveniently handle. If we assume

- three schema documents
- some subset (zero to three) of documents specified as starting documents
- some subset actually available
- some subset of schema documents pointed to from the instance, in either or both of two locations
- some network of call relations (include, redefine, import) among schema documents
- if one schema document calls another, it may do so either once or twice (strictly speaking, it might do so any number of times, but it seems unlikely that processors will handle two such calls but not three, or vice versa; so we limit the number to two, for simplicity in the calculation we are performing)

then there are 8 × 8 × 64 × 7,625,597,484,987 cases to be generated, i.e. (2 ** 3) sets of starting documents, × (2 ** 3) sets of documents actually available, × (4 ** 3) distinct patterns of schemaLocation hints from the instance, × (27 ** 9) different general labeled graphs of call relations. That is about 3.1E16 (31 quadrillion) cases.
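The arithmetic can be checked with a few lines of Python. This is only a sketch: the variable names are invented here, and the readings of 4 ** 3 (each document hinted at from neither, one, or both of two locations) and of 27 ** 9 (for each of the nine ordered pairs of documents, zero, one, or two calls of each of the three kinds) are my interpretation of the figures given above.

```python
# Sketch: reproduce the case count for three schema documents.
DOCS = 3

starting_sets = 2 ** DOCS      # any subset may be named at invocation time
available_sets = 2 ** DOCS     # any subset may actually be retrievable

# Assumed reading: each document is pointed to from neither, the first,
# the second, or both of two locations in the instance.
instance_hints = 4 ** DOCS

# Assumed reading: each ordered pair of documents (self-pairs included)
# carries 0, 1, or 2 calls for each of include, redefine, and import.
labelings_per_pair = 3 ** 3
call_graphs = labelings_per_pair ** (DOCS * DOCS)

total = starting_sets * available_sets * instance_hints * call_graphs
print(f"{call_graphs:,} call graphs")
print(f"{total:,} cases, about {total:.1e}")
```

The total is 2 ** 12 × 3 ** 27, which is a little over 3.1E16.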

Even if these 31 quadrillion test cases could be generated with acceptable speed, it might not be easy to find people willing to run them all for various processors. It may be a good idea, therefore, to try to divide the space a little and make some simplifying assumptions in exchange for a more tractable set of test cases.

We are also not necessarily required to generate all possible test cases; a systematic or random selection might suffice to point to important unanswered questions.

In the spirit, therefore, of starting with a fairly simple model of the problem, we can begin by ignoring some aspects of schema composition and modeling just part of the phenomenon.

We are unlikely to need to reuse this model, but in case we do, and for the sake of simple documentation, we define this model as an Alloy model named schema_composition01. The body of the model consists of some rules about namespaces, some about schema documents, and finally some miscellaneous predicates.

    module schema_composition01

    // First simple model of schema composition. Think of this
    // as a sanity check.

    // This model oversimplifies by not considering cases where
    // the same schema document includes, redefines, or imports
    // other schema documents more than once.

For purposes of this model, namespaces are objects which have identity and can thus be distinguished from each other. They have no other properties of interest. We say this with a very simple Alloy signature declaration:

    // Namespaces are just things that have identity. We don't
    // care about their properties.
    sig NS {}

Schema documents, on the other hand, do have an internal structure we care about. We ignore most of what can actually happen in a schema document: we don't model element or attribute declarations or type definitions, only the relations between schema documents (and schemas) defined in section 4 of the XSDL spec, namely inclusion, import, and redefinition. For now we ignore the sequence of xs:include, xs:import, and xs:redefine elements, and also the possibility of multiple arcs with the same values for originating node, target node, and label. It may be worth modeling the order of calls, and multiple redundant calls, in a later variation of the model. Also, each schema document has an optional target namespace.

    // Schema documents have a target namespace, includes,
    // redefines, and imports.
    sig SchemaDoc {
      tns : lone NS,
      includes : set SchemaDoc,
      redefines : set SchemaDoc,
      imports : set SchemaDoc
    }

It's informative to check to see whether the model defined so far has any instances. To do this, we define an Alloy predicate asking for a non-empty world, and ask Alloy to run it.

    pred non_empty_universe(S : SchemaDoc) {}
    run non_empty_universe for 3

When we execute the command run non_empty_universe for 3, the Alloy Analyzer presents us with an instance of the predicate we have defined, in which no signature has more than three instances. The first instance found has just one schema document; in the graphical form generated by Alloy, it looks like this:

A one-document instance of the model

The figure shows a green oval labeled SD ($non_empty_universe_S) imports: SD includes: SD, with two arcs running out from the oval and back to the oval, labeled imports and includes. There is also a tan-colored box labeled NS. The world has one schema document, SD, and one namespace, NS. SD has no target namespace, and SD both includes and imports itself.

Another more complex example may also be worth showing:

A three-document instance of the model

The figure shows three green ovals labeled SD0, SD1, and SD2. Several arcs run from oval to oval, as described in the text. There is also a tan-colored box labeled NS. There are three schema documents, none with a target namespace: SD0 both includes and redefines SD2. SD1 both includes and redefines both itself and SD0. SD2 includes SD1, redefines SD1, and imports SD0, SD1, and itself.

Notice that neither of these two instances actually represents a legal schema: the self-imports violate the rule that the namespace being imported must differ from the schema document's target namespace. Other instances generated by the model so far violate the rules about namespace matching for inclusions and redefinitions.

We can define a predicate which is true of a schema document S just in case S follows the basic rules for namespaces in schema composition:

- If S includes a document A, then either their target namespaces match, or else A has no target namespace. (Here and in the other rules, an absent target namespace matches another absent target namespace, and does not match an actual target namespace.) In Alloy, this can be expressed as

      all SI : S.includes | (SI.tns = S.tns or no SI.tns)

  for the case where S has a target namespace, and as

      all SI : S.includes | (no SI.tns)

  otherwise. A literal paraphrase of the first expression may be helpful: For all SI in the set S.includes, it is the case that SI.tns = S.tns or else that there exists no namespace SI.tns (SI.tns is the empty set). A paraphrase of the second: For all SI in the set S.includes, it is the case that there is nothing in the set SI.tns.
- If S redefines a document A, then either their target namespaces match, or else A has no target namespace. The Alloy expression is just as for inclusions, if we substitute redefines for includes.
- If S imports a document A, then their target namespaces differ. (More exactly, the XSDL spec would say that S imports a namespace N, and identifies A as a schema document for that namespace; for purposes of this paper, we can say S imports A for this.) In Alloy:

      all SI : S.imports | (SI.tns != S.tns or no SI.tns)

  (if S has a target namespace) or as

      all SI : S.imports | (some SI.tns)

  (otherwise).

Putting these together, we have the Alloy definition of the nsok (namespace OK) predicate:

    pred nsok[S : SchemaDoc] {
      some S.tns implies (
        (all SI : S.includes + S.redefines | (SI.tns = S.tns or no SI.tns))
        and
        (all SI : S.imports | (SI.tns != S.tns or no SI.tns))
      )
      no S.tns implies (
        no S.(includes+redefines).tns
        and
        (all SI : S.imports | some SI.tns)
      )
    }
    run nsok for 3
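For readers less familiar with Alloy, the same rules can be sketched in Python and applied to the three-document instance described earlier. This is an illustration only, not part of the model: the dictionary layout and the function names are invented here, and None stands for an absent target namespace.

```python
# Sketch only: a dictionary stands in for the SchemaDoc signature.
# "tns" is None when the document has no target namespace.
docs = {
    "SD0": {"tns": None, "includes": ["SD2"], "redefines": ["SD2"],
            "imports": []},
    "SD1": {"tns": None, "includes": ["SD1", "SD0"],
            "redefines": ["SD1", "SD0"], "imports": []},
    "SD2": {"tns": None, "includes": ["SD1"], "redefines": ["SD1"],
            "imports": ["SD0", "SD1", "SD2"]},
}

def nsok(name, docs):
    """Return True if the named document obeys the namespace rules."""
    s = docs[name]
    called = [docs[d]["tns"] for d in s["includes"] + s["redefines"]]
    imported = [docs[d]["tns"] for d in s["imports"]]
    if s["tns"] is not None:
        # Included and redefined documents match or have no namespace;
        # imported documents differ (or have no namespace).
        return (all(t == s["tns"] or t is None for t in called)
                and all(t != s["tns"] or t is None for t in imported))
    # No target namespace: called documents must have none either,
    # and every imported document must have one.
    return (all(t is None for t in called)
            and all(t is not None for t in imported))

def all_clean(docs):
    """Return True if every document in the world is namespace-OK."""
    return all(nsok(name, docs) for name in docs)

for name in docs:
    print(name, nsok(name, docs))
print("all clean:", all_clean(docs))
```

SD0 and SD1 pass, but SD2 fails because it imports documents that have no target namespace while having none itself, so the world as a whole is not clean.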

Armed with the nsok predicate, we can define a predicate that holds for worlds in which each schema document is namespace-OK:

    pred all_clean() {
      all S : SchemaDoc | nsok[S]
    }
    run all_clean for 1
    run all_clean for 2
    run all_clean for 3

There might be some mileage in defining a signature for the class of namespace-OK schema documents; if we did, then all_clean might be defined this way:

    sig NSCleanSchemaDoc extends SchemaDoc {}{ nsok[this] }
    pred all_clean() { no SchemaDoc - NSCleanSchemaDoc }

The body of this alternative all_clean predicate says: the set difference of the set of schema documents and the set of namespace-OK schema documents (SchemaDoc - NSCleanSchemaDoc) is empty (there exists no member of the set).

If they are helpful in eliminating uninteresting test cases, further constraints might be defined. Some obvious candidates (not defined here now because in fact I think the cases which violate these constraints are interesting):

- No self-inclusion, no self-redefinition. (I.e. no cycles of length 1. Longer cycles need not and should not be forbidden, except to explore the implications of adding such a constraint to the spec.)
- No overlap between the includes and the redefines of a given document. (Again, overlap in the transitive closure should not be forbidden, for test cases intended to explore XSDL 1.0 and its implementations, since in reality we know such cases arise and cause problems for implementations and users.)

It will be useful to generate some test cases which do not obey the namespace-OK constraints (or the others, if we later define them), but for the most part, we will be more interested in test cases which don't violate the namespace matching constraints, since they are clear and easily checked.

Recent versions of Alloy expose a Java API to the Alloy Analyzer, by means of which it's possible to perform the operations offered by the Analyzer's GUI, under program control. With the help of the Alloy team (in particular Felix Chang), it has proven straightforward to construct a Java program to load the model given above and write out XML descriptions of instances of the model. The format of these XML descriptions is described further below.

As an example, when the Alloy command run all_clean for 1 is executed, several instances of the model are generated. These are perhaps useful because they compactly exercise conditions of self-inclusion and self-redefinition. If we are satisfied that these instances provide an adequate check on processor behavior in such cases, we can then eliminate self-arcs from the model when generating later test cases. (If self-arcs may interact with code for other inclusions and redefinitions, we won't want to eliminate self-arcs from the cases with multiple schema documents.) A summary of the single-document test cases follows:

m1a00.xml 0 NS, 0 SchemaDoc.

m1a01.xml 0 NS, 1 SchemaDoc. SchemaDoc$0

m1a02.xml 0 NS, 1 SchemaDoc. SchemaDoc$0 redefines: SchemaDoc$0.

m1a03.xml 0 NS, 1 SchemaDoc. SchemaDoc$0 includes: SchemaDoc$0.

m1a04.xml 0 NS, 1 SchemaDoc. SchemaDoc$0 includes: SchemaDoc$0; redefines: SchemaDoc$0.

m1a05.xml 1 NS, 1 SchemaDoc. SchemaDoc$0 redefines: SchemaDoc$0.

m1a06.xml 1 NS, 0 SchemaDoc.

m1a07.xml 1 NS, 1 SchemaDoc. SchemaDoc$0

m1a08.xml 1 NS, 1 SchemaDoc. SchemaDoc$0 tns: NS$0.

m1a09.xml 1 NS, 1 SchemaDoc. SchemaDoc$0 tns: NS$0. redefines: SchemaDoc$0.

m1a10.xml 1 NS, 1 SchemaDoc. SchemaDoc$0 tns: NS$0. includes: SchemaDoc$0.

m1a11.xml 1 NS, 1 SchemaDoc. SchemaDoc$0 tns: NS$0. includes: SchemaDoc$0; redefines: SchemaDoc$0.

m1a12.xml 1 NS, 1 SchemaDoc. SchemaDoc$0 includes: SchemaDoc$0; redefines: SchemaDoc$0.

m1a13.xml 1 NS, 1 SchemaDoc. SchemaDoc$0 includes: SchemaDoc$0.

It will be noted that some of these are essentially isomorphic: if the schema document has no target namespace, the presence of an unused namespace in the world makes no material difference. If the presence of such extraneous namespaces proves irritating (as it surely will, since it will double the number of cases to process without adding further information), a predicate can be added to require that all namespaces in the model be the target namespace of some schema document.
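The count of single-document cases, and the saving from dropping unused namespaces, can be cross-checked by brute force. The sketch below is a hand enumeration in Python, not a use of the Alloy Analyzer; the function names and the tuple encoding of a world are invented for the illustration, and it covers only worlds with at most one namespace and at most one schema document.

```python
from itertools import product

def nsok_alone(tns, includes, redefines, imports):
    """nsok for a document whose only possible call target is itself."""
    target = tns
    if tns is not None:
        call_ok = target == tns or target is None
        import_ok = target != tns or target is None
    else:
        call_ok = target is None
        import_ok = target is not None
    return ((call_ok or not (includes or redefines))
            and (import_ok or not imports))

def clean_worlds(n_ns, every_ns_used=False):
    """Clean worlds with n_ns (0 or 1) namespaces and at most one document."""
    worlds = []
    if not (every_ns_used and n_ns == 1):
        worlds.append("no document")
    tns_choices = [None] if n_ns == 0 else [None, "NS$0"]
    flags = (False, True)
    for tns, inc, red, imp in product(tns_choices, flags, flags, flags):
        if every_ns_used and n_ns == 1 and tns is None:
            continue  # the namespace would be unused in this world
        if nsok_alone(tns, inc, red, imp):
            worlds.append((tns, inc, red, imp))
    return worlds

total = len(clean_worlds(0)) + len(clean_worlds(1))
pruned = len(clean_worlds(0, True)) + len(clean_worlds(1, True))
print(total, pruned)
```

This prints 14 9: fourteen worlds, matching m1a00 through m1a13, of which nine remain if every namespace must be some document's target namespace. Self-import never survives, since a document's namespace cannot differ from itself.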

Before we attempt to generate any concrete test cases from these or other instances of this model, however, there are some other problems we need to consider; we turn to those problems in the next section.

Making interpretable test cases

The primary goal of the test cases is interoperability testing, not conformance testing. That is, we wish to be able to tell whether different processors behave the same way, or in different ways, when handling a particular test case. For this purpose it is not essential that we specify what the expected result of a given test is, only that it be possible to tell what the processor did. If the spec is clear, then either there will be a clearly expected result, or else it will be clear that the behavior is unspecified. If it's not clear what result is required by the spec for a given test, or whether a particular result is required, that test is not much good for conformance checking (except to illustrate that the spec is underspecified). But it can nevertheless be useful for interoperability testing.

XSDL 1.0 defines what PSVI should result from a given input document, given a particular set of schema components. But it does not require a processor to expose to clients any particular information about what set of schema components was actually used in the validation. Nor does it require the processor to follow any particular rules for finding schema documents or other sources of schema components (aka component acquisition): at a first approximation, processors may effectively do whatever they like when it comes to schema composition. XSDL 1.1 does require that a processor document what component acquisition policies it follows (or can follow at user option), but XSDL 1.0 does not.

So one important challenge for construction of test cases is to try to ensure that from the test results it is possible to tell precisely what components were present in the validating schema. If schema document A includes schema document B, then we need to be able to tell, from the validation results, whether B was actually included or not. If schema document B is included by A and redefined by C, then we need to be able to tell whether the relevant components from B have been redefined or not. And so on. (By phrases like B is redefined by C I mean that schema document C contains an xs:redefine element which specifies schema document B as a document to be consulted. At the moment, it doesn't matter which components of B are redefined in C; we need to choose the components and the terms of redefinition in such a way as to ensure that we can tell whether the xs:redefine was honored or not.)

At the schema author's level, we would like to be able to answer questions like:

- Did the processor consult this particular schema document?
- Did the processor honor this particular xs:include element?
- Did the processor honor this particular xs:redefine element?
- Did the processor honor this particular xs:import element?
- If a schema document has been called more than once in different ways, which call was actually processed? Was the schema document included or imported? Included or redefined?
- Why did the processor consult this particular document? Was it because the document was named at invocation time? Pointed to from the document instance? Pointed to from another schema document being consulted?
- In what order did the processor consult the various schema documents?

(If a document is reachable more than once, in conflicting ways, a processor might raise an error. Or it might process the document the first time it sees a reference, and then decline to process it again when it sees the second reference, either because it believes the document has already been processed correctly or because it knows the second round of processing would lead to an error. In the latter cases, it will be useful to know whether a schema document was processed as for an include, or as for a redefinition.)

It is not entirely clear whether test cases can easily be constructed which allow all of these questions to be answered. But I believe we can construct test cases which allow us to answer the following questions for any schema document, by looking at the validation results from any processor which either provides access to the entire PSVI or at least issues error or warning messages for invalid elements, as long as it does not stop work as soon as it sees the first invalid element. (Let us call such processors indefatigable, since they do not tire of reporting validity problems. If we encounter processors which do stop as soon as they know that the root element is invalid, then we will need to change from the single test document outlined below to a set of 27, 54, or 81 test documents.)

- Was this schema document consulted?
- Were the components redefined?
- By what other schema document were the components redefined?

We do this by making each schema document in the test define a readily identifiable set of components, which will be detectable if present in the schema, but which will not cause problems by their absence. We also ensure that if any schema document redefines components from any schema document, it does so in a characteristic way which allows us to tell which redefinitions were involved. Finally, we provide a test document which provides element instances which will be valid, or invalid, depending on which components are present in the schema.

The discussion below assumes we define three (or at most three) schema documents with one, two, or three different values for their target namespaces; the principles can be extended for larger finite numbers of schema documents. The three schema documents are called A, B, and C.

Basic tracer components

Each schema document defines (for its target namespace) an element and a type with the same name as the schema document. That is, schema document A declares an element named a, of type tns:a, and schema documents B and C similarly define elements named b and c, of type tns:b and tns:c, respectively.

For simplicity, we make the tracer components be simple types; we will define different pattern facets at different places, to make it easy to tell which constraint was imposed where. The basic tracer components have a very simple constraint: element a, of type a, must contain at least one a, and so on. So the full definition of type tns:a needs only a single pattern facet requiring at least one a in the literal.

Test document wrapper

If the test document contains some a, b, and c elements in the appropriate namespaces, with at least some of them guaranteed to be invalid, then we can tell which schema documents were consulted by noticing which elements are actually validated.

The elements should match a lax wildcard to ensure that the absence of an element declaration won't lead to extraneous error messages. So the test document should have a root element whose declared content is a lax wildcard.

At the moment, I'm not sure how to guarantee that the wrapper element is always declared no matter what.

Redefinition traces

The test cases we'll generate will, or won't, include cases of arbitrary schema documents redefining (components in) arbitrary schema documents. To ensure that the redefinition of a type is visible, each xs:redefine element will impose a distinct constraint, in the form of a unique pattern.

When schema document A redefines schema document B, for example, we will add the pattern .*A.* to type b. So if instances of type b are valid only when they contain an A, we can infer that type b was redefined by schema document A. If they also require a C, then type b was also redefined by schema document C.
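The effect of stacking such patterns can be sketched in Python. This is an approximation under stated assumptions, not a schema processor: the function names are invented here, the basic pattern is assumed to take the form .*a.* by analogy with the redefinition pattern .*A.*, and Python's re.fullmatch stands in for XSDL pattern matching, which always applies to the whole literal. Pattern facets added in successive restriction steps must all be satisfied.

```python
import re

def tracer_patterns(type_name, redefined_by=()):
    """Patterns on a tracer type: the basic one, plus one per redefiner."""
    # Assumption: the basic pattern has the form .*a.* for type a.
    basic = [f".*{type_name}.*"]
    # Each redefining schema document adds a pattern with its capital.
    return basic + [f".*{doc}.*" for doc in redefined_by]

def is_valid(literal, patterns):
    """True if the literal satisfies every pattern in the stack."""
    return all(re.fullmatch(p, literal) for p in patterns)

# Type b, redefined by schema documents A and C.
patterns = tracer_patterns("b", redefined_by=("A", "C"))
for literal in ("Aa.bCc", ".a.bCc", "Aa.b.c", "AaBbCc"):
    print(literal, is_valid(literal, patterns))
```

Only the literals containing b, A, and C together (here Aa.bCc and AaBbCc) are accepted; dropping either capital makes the literal invalid, which is what makes each redefinition visible in the results.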

This does not allow us to tell whether the redefinition chain runs A → C → B or C → A → B. If that turns out to be of interest, we can add length facets, with A requiring a minimum length of 1, B requiring minLength 2, and C requiring 3. In that case, the chain A → C → B would be illegal (you can't restrict a minimum length of 3 by requiring a minimum length of 1, and XSDL forbids it as nonsensical), while C → A → B would be legal. We can tell which chain is present by checking for errors in schema construction.

The test document

The test document itself consists of a wrapper containing a series of a, b, and c elements in various namespaces.

For now, we don't try to distinguish lazy from eager assembly, but put schema location information for all three schema documents on the root element. And we don't vary the sequence of children to try to detect sequence dependencies in the processors.

There are six constraints (the patterns requiring, respectively, one of abcABC to appear in the literal), but we don't need to check for all combinations. For the constraints imposed by schema document A, for example, we need to check for the cases of

- no constraint at all (no a or A required)
- basic constraint (a required but not A)
- redefinition constraint (A required)

Since there are three such triples (one each for schema documents A, B, and C), we need 3 ** 3 = 27 instances of each element a, b, and c. For now, I assume we only need to look at a, b, and c elements in the appropriate namespace.

The outer shell of the test document looks like this: <test:wrapper >    </test:wrapper>

The namespace and schema location attributes will depend on the particular test case being generated. For the sake of this example assume that schema documents A and B have target namespace ns0 (i.e. http://www.w3.org/XML/Group/2007/09/schema_composition/ns0) and that C has no target namespace. For now, at least, we'll put the wrapper element into namespace http://www.w3.org/XML/Group/2007/09/schema_composition/tests0 and assume it's got a schema document at driver.xsd. Then the relevant attributes will read:

    xmlns:test="http://www.w3.org/XML/Group/2007/09/schema_composition/tests0"
    xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
    xmlns:ns0="http://www.w3.org/XML/Group/2007/09/schema_composition/ns0"
    xsi:schemaLocation="
      http://www.w3.org/XML/Group/2007/09/schema_composition/tests0 driver.xsd
      http://www.w3.org/XML/Group/2007/09/schema_composition/ns0 a.xsd
      http://www.w3.org/XML/Group/2007/09/schema_composition/ns0 b.xsd"
    xsi:noNamespaceSchemaLocation="c.xsd"

The set of tests for type a work systematically through the 27 possible combinations of nothing, lowercase only, uppercase for A, B, and C:

    <ns0:a>......</ns0:a> <ns0:a>.....c</ns0:a> <ns0:a>....Cc</ns0:a>
    <ns0:a>...b..</ns0:a> <ns0:a>...b.c</ns0:a> <ns0:a>...bCc</ns0:a>
    <ns0:a>..Bb..</ns0:a> <ns0:a>..Bb.c</ns0:a> <ns0:a>..BbCc</ns0:a>
    <ns0:a>.a....</ns0:a> <ns0:a>.a...c</ns0:a> <ns0:a>.a..Cc</ns0:a>
    <ns0:a>.a.b..</ns0:a> <ns0:a>.a.b.c</ns0:a> <ns0:a>.a.bCc</ns0:a>
    <ns0:a>.aBb..</ns0:a> <ns0:a>.aBb.c</ns0:a> <ns0:a>.aBbCc</ns0:a>
    <ns0:a>Aa....</ns0:a> <ns0:a>Aa...c</ns0:a> <ns0:a>Aa..Cc</ns0:a>
    <ns0:a>Aa.b..</ns0:a> <ns0:a>Aa.b.c</ns0:a> <ns0:a>Aa.bCc</ns0:a>
    <ns0:a>AaBb..</ns0:a> <ns0:a>AaBb.c</ns0:a> <ns0:a>AaBbCc</ns0:a>

The tests for types b and c are exactly similar and need not be repeated here.

They do, however, have to appear in this document somewhere, in order for the literate programming system to generate a correct sample test document. So they have been hidden in this footnote. First type b:

    <ns0:b>......</ns0:b> <ns0:b>.....c</ns0:b> <ns0:b>....Cc</ns0:b>
    <ns0:b>...b..</ns0:b> <ns0:b>...b.c</ns0:b> <ns0:b>...bCc</ns0:b>
    <ns0:b>..Bb..</ns0:b> <ns0:b>..Bb.c</ns0:b> <ns0:b>..BbCc</ns0:b>
    <ns0:b>.a....</ns0:b> <ns0:b>.a...c</ns0:b> <ns0:b>.a..Cc</ns0:b>
    <ns0:b>.a.b..</ns0:b> <ns0:b>.a.b.c</ns0:b> <ns0:b>.a.bCc</ns0:b>
    <ns0:b>.aBb..</ns0:b> <ns0:b>.aBb.c</ns0:b> <ns0:b>.aBbCc</ns0:b>
    <ns0:b>Aa....</ns0:b> <ns0:b>Aa...c</ns0:b> <ns0:b>Aa..Cc</ns0:b>
    <ns0:b>Aa.b..</ns0:b> <ns0:b>Aa.b.c</ns0:b> <ns0:b>Aa.bCc</ns0:b>
    <ns0:b>AaBb..</ns0:b> <ns0:b>AaBb.c</ns0:b> <ns0:b>AaBbCc</ns0:b>

And then type c:

    <c>......</c> <c>.....c</c> <c>....Cc</c>
    <c>...b..</c> <c>...b.c</c> <c>...bCc</c>
    <c>..Bb..</c> <c>..Bb.c</c> <c>..BbCc</c>
    <c>.a....</c> <c>.a...c</c> <c>.a..Cc</c>
    <c>.a.b..</c> <c>.a.b.c</c> <c>.a.bCc</c>
    <c>.aBb..</c> <c>.aBb.c</c> <c>.aBbCc</c>
    <c>Aa....</c> <c>Aa...c</c> <c>Aa..Cc</c>
    <c>Aa.b..</c> <c>Aa.b.c</c> <c>Aa.bCc</c>
    <c>AaBb..</c> <c>AaBb.c</c> <c>AaBbCc</c>

It should be clear now how this test setup allows us to answer the low-level questions noted above:

Q. Was this schema document consulted?

A. If the elements it defines (a, b, or c) are being validated, then yes, it was consulted. This should be visible either from the PSVI dump (if the processor supports that) or from the presence or absence of validation error messages. The string ...... is not a member of any of the three types, so there should be an error message at least for the test element containing that string. If there are no messages about invalid elements, then either the elements are not being validated or else the processor does not issue error messages for invalid elements (in which case the results must be interpreted in the light of the information the processor actually does provide).

Q. Were the components redefined?

A. If every test element containing the appropriate lowercase letter (a for type a, b for type b, etc.) is valid, then the type has not been redefined. Otherwise, it has.

Q. By what other schema document were the components redefined?

A. If the valid instances of any type all contain a capital letter (A, B, or C), then the type was redefined by the associated schema document. If the only valid instance is AaBbCc, then the type was redefined by all three schema documents.
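The interpretation rules above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names are hypothetical, the patterns .*a.* and .*C.* are the ones quoted in the error messages and result tables of this paper, and the set of valid strings is computed here from the patterns rather than read from a processor's output.

```python
import re
from itertools import product


def test_strings():
    """Return the 27 test strings, in the order used in the test document."""
    parts = (("..", ".a", "Aa"), ("..", ".b", "Bb"), ("..", ".c", "Cc"))
    return ["".join(p) for p in product(*parts)]


def valid_strings(patterns):
    """Strings accepted by every pattern facet.

    XSD pattern facets must match the whole value, hence re.fullmatch.
    """
    return [s for s in test_strings()
            if all(re.fullmatch(p, s) for p in patterns)]


def redefined_by(valid):
    """Capital letters present in every valid string (empty: not redefined)."""
    return [cap for cap in "ABC" if valid and all(cap in s for s in valid)]


base = valid_strings([".*a.*"])                # type a, not redefined
redefined = valid_strings([".*a.*", ".*C.*"])  # type a, redefined by document C

print(len(base), redefined_by(base))            # 18 []
print(len(redefined), redefined_by(redefined))  # 6 ['C']
```

With only the basic facet, eighteen of the twenty-seven strings are valid and nine are not, which agrees with the nine error messages reported for type a in the hand-run tests below.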

Hand-created tests

Toy tests

Before discussing how to automate the task of generating test cases from Alloy's XML dumps, it may be useful to examine a few test cases constructed manually, as a way of thinking about details of what the test cases should look like.

The test case for m1a02 (described above at the end of section ) begins with a start-tag that points to the schema document for that test, and continues as shown above, with the a, b, and c elements all unqualified (not in any declared namespace). The test document instance for m1a03 looks similar, but points to a different schema document.

The schema for m1a03 reads, in its essentials, thus:

Sample schema for discussion of schema composition. CMSMcQ, 6 September 2007.

This is test case m1a03: 0 NS, 1 SchemaDoc.

SchemaDoc$0 includes: SchemaDoc$0.


A quick check of several processors shows that they all do the same thing with test m1a03: element a and type a are defined, and they have no problem with the inclusion cycle.

The schema document for m1a02 is similar, but instead of the inclusion it has a redefinition.

The test results for this case are suggestive.

One processor (processor C in the tables below in section ) basically ignores the redefinition: the schema used to validate has the basic definition of type a, but not the redefinition, as shown by the error messages (which have been reformatted here for legibility):

One processor (processor B in the tables below in section ) issues some error messages I do not understand, and then validates the document with a schema which apparently has a definition for element a and type a, but does not enforce either of the patterns.

    $ ~/bin/runProcessorB abc.m1a02.xml
    [Error] abc.m1a02.xsd:46:27: sch-props-correct.2: A schema cannot contain two global components with the same name; this schema contains two occurrences of ',a'.
    [Error] abc.m1a02.xsd:39:30: src-resolve: Cannot resolve the name 'a_fn3dktizrknc9pi' to a(n) 'type definition' component.
    [Error] abc.m1a02.xsd:39:30: cos-applicable-facets: Facet 'pattern' is not allowed by type a.
    abc.m1a02.xml: 3208 ms (82 elems, 1 attrs, 0 spaces, 742 chars)
    $

The PSVI output shows the result quite clearly (XML form in abc.m1a02.xml.psvi.out.xml, an HTML display form in abc.m1a02.xml.psvi.out.html).

A third processor (processor E in the tables below in section ) notices the cycle of redefinitions, but appears to try to deal with it anyway. (As shown below, a later version of processor E eliminates this looping behavior.)

    $ ~/bin/runProcessorE abc.m1a02.xml
    Error at xsd:redefine on line 36 of file:/Users/cmsmcq/2007/schema/exx/abc.m1a02.xsd: The schema document file:/Users/cmsmcq/2007/schema/exx/abc.m1a02.xsd includes or redefines itself recursively
    Error at xsd:redefine on line 36 of file:/Users/cmsmcq/2007/schema/exx/abc.m1a02.xsd: The schema document file:/Users/cmsmcq/2007/schema/exx/abc.m1a02.xsd includes or redefines itself recursively
    Error at xsd:redefine on line 36 of file:/Users/cmsmcq/2007/schema/exx/abc.m1a02.xsd: The schema document file:/Users/cmsmcq/2007/schema/exx/abc.m1a02.xsd includes or redefines itself recursively
    Error at xsd:redefine on line 36 of file:/Users/cmsmcq/2007/schema/exx/abc.m1a02.xsd: The schema document file:/Users/cmsmcq/2007/schema/exx/abc.m1a02.xsd includes or redefines itself recursively
    Error at xsd:redefine on line 36 of file:/Users/cmsmcq/2007/schema/exx/abc.m1a02.xsd: The schema document file:/Users/cmsmcq/2007/schema/exx/abc.m1a02.xsd includes or redefines itself recursively
    $

At the point shown, I interrupted the program.

A fourth processor (processor A in the tables below in section ) produces the output shown here. (I have elided some uninformative messages about the DTDs, or lack of them, in the test case.)

    $ runProcessorA abc.m1a02.xsd abc.m1a02.xml \
        > abc.m1a02.xml.procA.out.xml
    ...
    abc.m1a02.xml:8: element a: Schemas validity error : Element 'a' [ST 'a', facet 'pattern']: The value '......' is not accepted by the pattern '.*a.*'.
    abc.m1a02.xml:9: element a: Schemas validity error : Element 'a' [ST 'a', facet 'pattern']: The value '.....c' is not accepted by the pattern '.*a.*'.
    abc.m1a02.xml:10: element a: Schemas validity error : Element 'a' [ST 'a', facet 'pattern']: The value '....Cc' is not accepted by the pattern '.*a.*'.
    abc.m1a02.xml:11: element a: Schemas validity error : Element 'a' [ST 'a', facet 'pattern']: The value '...b..' is not accepted by the pattern '.*a.*'.
    abc.m1a02.xml:12: element a: Schemas validity error : Element 'a' [ST 'a', facet 'pattern']: The value '...b.c' is not accepted by the pattern '.*a.*'.
    abc.m1a02.xml:13: element a: Schemas validity error : Element 'a' [ST 'a', facet 'pattern']: The value '...bCc' is not accepted by the pattern '.*a.*'.
    abc.m1a02.xml:14: element a: Schemas validity error : Element 'a' [ST 'a', facet 'pattern']: The value '..Bb..' is not accepted by the pattern '.*a.*'.
    abc.m1a02.xml:15: element a: Schemas validity error : Element 'a' [ST 'a', facet 'pattern']: The value '..Bb.c' is not accepted by the pattern '.*a.*'.
    abc.m1a02.xml:16: element a: Schemas validity error : Element 'a' [ST 'a', facet 'pattern']: The value '..BbCc' is not accepted by the pattern '.*a.*'.
    abc.m1a02.xml fails to validate
    diumaget-3:~/2007/schema/exx cmsmcq$

That is, processor A does the same as processor C: it ignores the self-redefinition.

A variation on the test, in which the xsd:redefine element is empty, produced the same results as the original test, from each of the four processors.

This manual test seems to me to illustrate the utility of the model instances produced by Alloy, for both the goals outlined at the beginning of this paper. First, Alloy provides assistance with sanity checking a design: even the simplest instances of this simplest Alloy model of schema composition turn out to illustrate situations for which an ideal definition of schema composition must be prepared, but for which (judging by the evidence), the XSDL 1.0 spec does not provide a clear result. And tests constructed from these simple instances provide useful empirical data on the interoperability of existing XSDL 1.0 implementations.

Test set t0

Those colleagues who had volunteered to run some test cases asked for a small set of tests to try out first, so that they could make sure the structure of the test set and so on were clear. This section describes that test set, called t0.

Tests

Set t0 contains tests constructed by hand (which I had on hand when it became clear we needed a start-to-finish complete small set of tests very soon). A formal description is on the Web at ./t0/t0.testSet, formulated in the XML Schema Test Suite vocabulary for test catalogs, and the tests themselves may be browsed at ./t0.

The test groups are:

Test group PVB: tests created on the basis of a problem described by Paul V. Biron in private email.

There are three schema documents: standard.xsd defines type cc in namespace http://www.example.com/standard (which I'll just call standard). extensions.xsd imports the standard namespace (specifying standard.xsd as the schema location) and defines element e1 for an extensions namespace (prefix ext). top.xsd imports the extensions namespace (specifying extensions.xsd as the schema location) and redefines type standard:cc from standard.xsd. In the catalog, these are given in the order top, extensions, standard. (This turns out to matter.)

When difficulties arise here, they arise because standard.xsd is reached both via a redefinition element in top.xsd and via an import element in extensions.xsd.

There are three XML test documents, which differ only in the schemaLocation attribute on the root element: one points at top.xsd for the standard namespace and extensions.xsd for the extensions namespace, in that order, one points at them in the opposite order, and one points only at top.xsd.
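The call graph of the pvb schema documents is small enough to write down directly. The following Python sketch records the references described above and reports any schema document that is reached both by a redefine and by some other kind of reference. The data structure and function names are hypothetical and chosen only for illustration; no processor is claimed to work this way.

```python
from collections import defaultdict

# (referring document, kind of reference, referenced document),
# as described for test group pvb.
PVB_REFERENCES = [
    ("extensions.xsd", "import", "standard.xsd"),
    ("top.xsd", "import", "extensions.xsd"),
    ("top.xsd", "redefine", "standard.xsd"),
]


def reference_kinds(references):
    """Map each referenced document to the kinds of reference that reach it."""
    kinds = defaultdict(set)
    for _source, kind, target in references:
        kinds[target].add(kind)
    return kinds


def conflicting_targets(references):
    """Documents reached by a redefine and also by another kind of reference."""
    return sorted(target
                  for target, kinds in reference_kinds(references).items()
                  if "redefine" in kinds and len(kinds) > 1)


print(conflicting_targets(PVB_REFERENCES))  # ['standard.xsd']
```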

Test group abc: tests following the test construction pattern described above, with a call graph isomorphic to that in test group pvb. (These tests were created when I had trouble figuring out what was actually going on in the pvb documents; the idea was that these tests would be easier to interpret.)

Again, there are three schema documents: a.xsd (for namespace ns1) corresponds to standard.xsd. b.xsd corresponds to extensions.xsd: it imports ns1 (via a.xsd). c.xsd corresponds to top.xsd: it imports ns2 (via b.xsd) and redefines a component in ns1 (from a.xsd).

There are four XML test documents, which differ only in the schemaLocation attribute on the root element: one points at b.xsd, then c.xsd, one points at them in the other order, one points only at c.xsd, and one has no schemaLocation hint.

The schemaTest element in the test catalog lists the schema documents in the order c, a, b (so the top-level schema document is first).

Test group abc-2 is identical to group abc, except that the catalog lists the schema documents in the order b, a, c.

When processors pre-load the schema documents listed in the catalog, in order, then either there is no order dependency in the processor (in which case they should produce the same results for abc and for abc-2) or there is (in which case the difference in results for abc and abc-2 should trigger the order dependency).

For those processors which accept only a single schema document on the command line, or which accept several but ignore all but one, the difference in order turns into a difference in initial schema document point.
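The comparison between abc and abc-2 amounts to checking whether a processor's row of results changes when only the catalog order changes. A minimal Python sketch of that check follows; the function name is hypothetical, and the sample rows are transcribed from the D1 and E1 rows of the result tables later in this section.

```python
def order_dependent(results_by_group):
    """True if the rows of results differ between the test groups compared."""
    distinct_rows = {tuple(row) for row in results_by_group.values()}
    return len(distinct_rows) > 1


# One result per test instance (abc1.xml through abc4.xml).
d1 = {
    "abc": ["aC,b,c"] * 4,
    "abc-2": ["No declaration found for ns1:wrapper"] * 4,
}
e1 = {
    "abc": ["aC,b,c"] * 4,
    "abc-2": ["aC,b,c"] * 4,
}

print(order_dependent(d1))  # True
print(order_dependent(e1))  # False
```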

Test group cycle-include includes a test translated by hand from the Alloy instance labeled m1a03 above (section ): schema document a.xsd includes itself.

Test group cycle-redefine-1 includes another test translated by hand from an Alloy instance, m1a02 in the discussion of section : schema document a.xsd redefines the type a from schema document a.xsd.

Test group cycle-redefine-2 is a variation on the preceding: again, schema document a.xsd includes a redefine element pointing to schema document a.xsd, but this time the redefine element is empty (so that it's semantically very similar to an include; it may be identical, but I haven't checked the fine print of the spec on that question).

Test results

The tests of t0 have been run on several processors, sometimes in different configurations.

The processors for which we have data so far are anonymized here as A, B, C, etc., at least until further research shows the license agreements do not forbid dissemination of test results. When the same processor has been tested with multiple configurations, the same processor letter is used and configurations are distinguished by numbers: A1, A2, A3, etc. are all the same executable, invoked with different options and/or parameters.

Several processors allow schema documents to be supplied on the command line. In some cases, it's clear that all such schema documents are pre-loaded before the document is validated; in others, it appears that only one is pre-loaded. In no case does the documentation say what happens when multiple schema documents are specified, or mention any sequence dependencies. Since order dependencies are clearly important for some tests in t0, those processors have been tested in several configurations:

all: all schema document names given for the test group in the catalog are passed to the processor, in order.
reverse: all schema document names given for the test group in the catalog are passed to the processor, in reverse order.
first: only the first schema document named in the catalog is passed to the processor.
last: only the last schema document named in the catalog is passed to the processor.
none: no schema document named in the catalog is passed to the processor at invocation time; the only information available to the processor is the schema location hints in the test instance.

For example, processor D can be invoked with zero or more schema documents given on the command line. It is represented below by six rows in each table: those labeled D1 through D5 represent different invocation patterns; D6 is a newer version of the processor. For test group abc, where the test catalog (t0/t0.testSet) lists the schema documents in the order c.xsd, a.xsd, b.xsd, the codes mean:

D1 (all): invocation pattern $PROCESSORNAME $INSTANCENAME abc/c.xsd abc/a.xsd abc/b.xsd
D2 (first): invocation pattern $PROCESSORNAME $INSTANCENAME abc/c.xsd
D3 (reverse): invocation pattern $PROCESSORNAME $INSTANCENAME abc/b.xsd abc/a.xsd abc/c.xsd
D4 (last): invocation pattern $PROCESSORNAME $INSTANCENAME abc/b.xsd
D5 (none): invocation pattern $PROCESSORNAME $INSTANCENAME
D6 (all): invocation pattern $PROCESSORNAME $INSTANCENAME abc/c.xsd abc/a.xsd abc/b.xsd

Processor F does not accept multiple schema documents on its command line, so it has been tested only with the first and last configuration options.
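The five configurations can be expressed as a small function from the catalog order to the list of schema documents passed at invocation. This Python sketch is illustrative only: the function names are hypothetical, and the argument order (instance first, then schema documents) simply follows the invocation patterns given for processor D.

```python
def schema_arguments(config, catalog_docs):
    """Schema documents to pass on the command line for a configuration."""
    docs = list(catalog_docs)
    if config == "all":
        return docs
    if config == "reverse":
        return docs[::-1]
    if config == "first":
        return docs[:1]
    if config == "last":
        return docs[-1:]
    if config == "none":
        return []
    raise ValueError(f"unknown configuration: {config}")


def command_line(processor, instance, config, catalog_docs):
    """Argument list: processor, instance document, then schema documents."""
    return [processor, instance] + schema_arguments(config, catalog_docs)


abc_catalog = ["abc/c.xsd", "abc/a.xsd", "abc/b.xsd"]
for config in ("all", "first", "reverse", "last", "none"):
    print(config, schema_arguments(config, abc_catalog))
```

For the abc catalog order this reproduces the document lists shown for D1 through D5.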

In some cases, processors have other invocation options which have been tested separately.

For example, processor B is a library which ships with several sample applications. Shell scripts B1, B2, B3, and B4 were constructed to invoke different sample applications, to see whether the choice of sample application made any difference to validation behavior. (As may be seen, B1 through B3 all yield essentially the same validation results: although their error output differs, the differences reflect configuration issues unrelated to schema validity assessment. B4, on the other hand, provides different results, because its sample application is written with a different schema construction policy in mind.) B1, B2, and B3 all follow the same invocation pattern (all the schema documents listed in the catalog are passed on the command line); it is not yet known whether variations in the order of document names on the command line matter. (Configurations B6 through B9 will be constructed at some point to investigate this question.)

Processor G has been tested in two versions: G1-5 are different invocation patterns for the earlier version, G6-G10 for the later version.

PVB group

The results so far for test group pvb are summarized in this table. Note that the root element of each test document is ext:e1. In extensions.xsd, the declaration of ext:e1 refers to type standard:cc. The catalog lists the schema documents in the order top, extensions, standard. (In some forms of this paper, background colors have been supplied for the cells of the table. Similar results should have similar background colors, but the colors themselves are arbitrary and have no significance; sometimes the same color is used in different tables for different results.)

Test group pvb
Schema documents: t0/pvb/top.xsd t0/pvb/extensions.xsd t0/pvb/standard.xsd

Product | Test pvb / point-ext.top (t0/pvb/pt.ext.top.xml) | Test pvb / point-top.ext (t0/pvb/pt.top.ext.xml) | Test pvb / point-top (t0/pvb/pt.top.xml)
A1 (all) | No declaration found for ext:e1. | No declaration found for ext:e1. | No declaration found for ext:e1.
A2 (first) | Schema error in extensions.xsd: type standard:cc not found | Schema error in extensions.xsd: type standard:cc not found | Schema error in extensions.xsd: type standard:cc not found
A3 (reverse) | Schema error in extensions.xsd: type standard:cc not found | Schema error in extensions.xsd: type standard:cc not found | Schema error in extensions.xsd: type standard:cc not found
A4 (last) | No declaration found for ext:e1. | No declaration found for ext:e1. | No declaration found for ext:e1.
B1 (all) | no errors | no errors | no errors
B2 (all) | no errors | no errors | no errors
B3 (all) | no errors | no errors | no errors
B4 (none) | no errors | no errors | No declaration found for ext:e1
C1 (none) | Invalid: e1 not allowed (see note on C1 below) | Error in extensions.xsd: type standard:cc not found. | Error in extensions.xsd: type standard:cc not found.
D1 (all) | no errors | no errors | no errors
D2 (first) | no errors | no errors | no errors
D3 (reverse) | Invalid: ext:e1 not allowed here (i.e. at end of type standard:cc). | Invalid: ext:e1 not allowed here (i.e. at end of type standard:cc). | Invalid: ext:e1 not allowed here (i.e. at end of type standard:cc).
D4 (last) | Invalid: ext:e1 not allowed here (i.e. at end of type standard:cc). | Invalid: ext:e1 not allowed here (i.e. at end of type standard:cc). | Invalid: ext:e1 not allowed here (i.e. at end of type standard:cc).
D5 (none) | Invalid: ext:e1 not allowed here (i.e. at end of type standard:cc). | no errors | no errors
D6 (all) | no errors | no errors | no errors
E1 (all) | no errors | no errors | no errors
E2 (first) | no errors | no errors | no errors
E3 (reverse) | no errors | no errors | no errors
E4 (last) | no errors | no errors | no errors
E5 (none) | no errors | no errors | no errors
F1 (first) | no errors | no errors | no errors
F2 (last) | Invalid: outer ext:e1 in error (probably should be standard:e1) | Invalid: outer ext:e1 in error (probably should be standard:e1) | Invalid: outer ext:e1 in error (probably should be standard:e1)
G1 (all) | Error in top.xsd: no ext:cc (sic) exists to be redefined | Error in top.xsd: no ext:cc (sic) exists to be redefined | Error in top.xsd: no ext:cc (sic) exists to be redefined
G2 (first) | Error in top.xsd: no ext:cc (sic) exists to be redefined | Error in top.xsd: no ext:cc (sic) exists to be redefined | Error in top.xsd: no ext:cc (sic) exists to be redefined
G3 (reverse) | Error in standard.xsd: duplicate standard:cc | Error in standard.xsd: duplicate standard:cc | Error in standard.xsd: duplicate standard:cc
G4 (last) | root element had no schema | element content invalid | element content invalid
G5 (none) | std:e1 used but not declared in schema | element content invalid | element content invalid
G6 (all) | no errors | no errors | no errors
G7 (first) | no errors | no errors | no errors
G8 (reverse) | Error in top.xsd: schema component already present cannot be redefined (cc) | Error in top.xsd: schema component already present cannot be redefined (cc) | Error in top.xsd: schema component already present cannot be redefined (cc)
G9 (last) | Error: schema component already present cannot be redefined (cc) | Error: schema component already present cannot be redefined (cc) | Error: schema component already present cannot be redefined (cc)
G10 (none) | no errors | no errors | no errors

Note on C1: The error message does not mention the namespace, but it appears that the element objected to is ext:e1, not standard:e1, and that the place it's objected to is inside type standard:cc, where it was added by top.xsd's redefinition of that type.

For each test, five different behaviors are exhibited.

Groups abc and abc-2

The results so far for test groups abc and abc-2 are these. Some of the notations may need glosses:

-,b,c: the error message shows that the basic pattern facets for types b and c are enforced; there are no error messages relating to type a.
aC,b,c: the error message shows that the basic pattern facets for types a, b, and c are enforced; additionally, type a has been redefined by schema document C, and the added facet (pattern .*C.*) is enforced.
aC*,b,c: the error message shows that the basic pattern facets for types a, b, and c are enforced; additionally, type a has been redefined by schema document C, and the added facet (pattern .*C.*) is enforced. However, some facets appear to be enforced irregularly: they produce fewer error messages than expected. Specifically: .*a.* (one error message not nine), .*C.* (five error messages not nine or twelve), .*b.* (three error messages not nine). (The pattern .*c.* does produce nine error messages.)
a,b,-: the error message shows that the basic pattern facets for types a and b are enforced; no error messages reflect the pattern facet for type c.

Test group abc
Schema documents: t0/abc/c.xsd t0/abc/a.xsd t0/abc/b.xsd

Processor | Test abc / point-c-only (t0/abc/abc1.xml) | Test abc / point-cb (t0/abc/abc2.xml) | Test abc / point-bc (t0/abc/abc3.xml) | Test abc / point-none (t0/abc/abc4.xml)
A1 (all) | No declaration found for ns1:wrapper. | No declaration found for ns1:wrapper. | No declaration found for ns1:wrapper. | No declaration found for ns1:wrapper.
A2 (first) | -,b,c | -,b,c | -,b,c | -,b,c
A3 (reverse) | -,b,c | -,b,c | -,b,c | -,b,c
A4 (last) | No declaration found for ns1:wrapper. | No declaration found for ns1:wrapper. | No declaration found for ns1:wrapper. | No declaration found for ns1:wrapper.
B1 (all) | aC,b,c | aC,b,c | aC,b,c | aC,b,c
B2 (all) | aC,b,c | aC,b,c | aC,b,c | aC,b,c
B3 (all) | aC,b,c | aC,b,c | aC,b,c | aC,b,c
B4 (none) | aC,b,c | aC,b,c | aC,b,c | No declaration found for ns1:wrapper
C1 (none) | a,b,- | aC,b,c | aC,b,c | No declaration found for ns1:wrapper, ns1:a, ns2:b, ns1:c
D1 (all) | aC,b,c | aC,b,c | aC,b,c | aC,b,c
D2 (first) | aC,b,c | aC,b,c | aC,b,c | aC,b,c
D3 (reverse) | No declaration found for ns1:wrapper | No declaration found for ns1:wrapper | No declaration found for ns1:wrapper | No declaration found for ns1:wrapper
D4 (last) | No declaration found for ns1:wrapper | No declaration found for ns1:wrapper | No declaration found for ns1:wrapper | No declaration found for ns1:wrapper
D5 (none) | No declaration found for ns1:wrapper | aC,b,c | aC,b,c | No declaration found for ns1:wrapper
D6 (all) | aC,b,c | aC,b,c | aC,b,c | aC,b,c
E1 (all) | aC,b,c | aC,b,c | aC,b,c | aC,b,c
E2 (first) | aC,b,c | aC,b,c | aC,b,c | aC,b,c
E3 (reverse) | aC,b,c | aC,b,c | aC,b,c | aC,b,c
E4 (last) | aC,b,c | aC,b,c | aC,b,c | a,b,-
E5 (none) | aC,b,c | aC,b,c | aC,b,c | no errors (lax validation)
F1 (first) | aC*,b,c | aC*,b,c | aC*,b,c | aC*,b,c
F2 (last) | Invalid: ns1:wrapper not allowed. | Invalid: ns1:wrapper not allowed. | Invalid: ns1:wrapper not allowed. | Invalid: ns1:wrapper not allowed.
G1 (all) | Error in c.xsd: no ns2:a (sic) exists to be redefined | Error in c.xsd: no ns2:a (sic) exists to be redefined | Error in c.xsd: no ns2:a (sic) exists to be redefined | Error in c.xsd: no ns2:a (sic) exists to be redefined
G2 (first) | Error in c.xsd: no ns2:a (sic) exists to be redefined | Error in c.xsd: no ns2:a (sic) exists to be redefined | Error in c.xsd: no ns2:a (sic) exists to be redefined | Error in c.xsd: no ns2:a (sic) exists to be redefined
G3 (reverse) | Error in a.xsd: duplicate ns1:a | Error in a.xsd: duplicate ns1:a | Error in a.xsd: duplicate ns1:a | Error in a.xsd: duplicate ns1:a
G4 (last) | a,b,- | a,b,- | a,b,- | a,b,-
G5 (none) | ns1:a not declared, ns2:b has no schema, ns1:c not declared | a,b,- | a,b,- | ns1:a not declared, ns2:b has no schema, ns1:c not declared
G6 (all) | aC,b,c | aC,b,c | aC,b,c | aC,b,c
G7 (first) | aC,b,c | aC,b,c | aC,b,c | aC,b,c
G8 (reverse) | Error in c.xsd: schema component already present cannot be redefined (a) | Error in c.xsd: schema component already present cannot be redefined (a) | Error in c.xsd: schema component already present cannot be redefined (a) | Error in c.xsd: schema component already present cannot be redefined (a)
G9 (last) | Error in c.xsd: schema component already present cannot be redefined (a) | Error in c.xsd: schema component already present cannot be redefined (a) | Error in c.xsd: schema component already present cannot be redefined (a) | a,b,-
G10 (none) | aC,b,c | aC,b,c | aC,b,c | neither valid nor invalid: no declaration found
H1 (all) | -,b,- | -,b,- | -,b,- | -,b,-

We don't currently have reliable results for these tests run on processor G.

The results for group abc-2 are similar in some ways.

Test group abc-2
Schema documents: t0/abc/b.xsd t0/abc/a.xsd t0/abc/c.xsd

Processor | Test abc-2 / point-c-only (t0/abc/abc1.xml) | Test abc-2 / point-cb (t0/abc/abc2.xml) | Test abc-2 / point-bc (t0/abc/abc3.xml) | Test abc-2 / point-none (t0/abc/abc4.xml)
A1 (all) | -,b,c | -,b,c | -,b,c | -,b,c
A2 (first) | No declaration found for ns1:wrapper. | No declaration found for ns1:wrapper. | No declaration found for ns1:wrapper. | No declaration found for ns1:wrapper.
A3 (reverse) | No declaration found for ns1:wrapper. | No declaration found for ns1:wrapper. | No declaration found for ns1:wrapper. | No declaration found for ns1:wrapper.
A4 (last) | -,b,c | -,b,c | -,b,c | -,b,c
B1 (all) | a,b,- | a,b,- | a,b,- | a,b,-
B2 (all) | a,b,- | a,b,- | a,b,- | a,b,-
B3 (all) | a,b,- | a,b,- | a,b,- | a,b,-
B4 (none) | aC,b,c | aC,b,c | aC,b,c | No declaration found for ns1:wrapper
C1 (none) | a,b,- | aC,b,c | aC,b,c | No declaration found for ns1:wrapper, ns1:a, ns2:b, ns1:c
D1 (all) | No declaration found for ns1:wrapper | No declaration found for ns1:wrapper | No declaration found for ns1:wrapper | No declaration found for ns1:wrapper
D2 (first) | No declaration found for ns1:wrapper | No declaration found for ns1:wrapper | No declaration found for ns1:wrapper | No declaration found for ns1:wrapper
D3 (reverse) | aC,b,c | aC,b,c | aC,b,c | aC,b,c
D4 (last) | aC,b,c | aC,b,c | aC,b,c | aC,b,c
D5 (none) | No declaration found for ns1:wrapper | aC,b,c | aC,b,c | No declaration found for ns1:wrapper
D6 (all) | No declaration found for ns1:wrapper | No declaration found for ns1:wrapper | No declaration found for ns1:wrapper | No declaration found for ns1:wrapper
E1 (all) | aC,b,c | aC,b,c | aC,b,c | aC,b,c
E2 (first) | aC,b,c | aC,b,c | aC,b,c | a,b,-
E3 (reverse) | aC,b,c | aC,b,c | aC,b,c | aC,b,c
E4 (last) | aC,b,c | aC,b,c | aC,b,c | aC,b,c
E5 (none) | aC,b,c | aC,b,c | aC,b,c | no errors (lax validation)
F1 (first) | Invalid: ns1:wrapper not allowed. | Invalid: ns1:wrapper not allowed. | Invalid: ns1:wrapper not allowed. | Invalid: ns1:wrapper not allowed.
F2 (last) | aC*,b,c | aC*,b,c | aC*,b,c | aC*,b,c
G1 (all) | Error in a.xsd: duplicate ns1:a | Error in a.xsd: duplicate ns1:a | Error in a.xsd: duplicate ns1:a | Error in a.xsd: duplicate ns1:a
G2 (first) | a,b,- | a,b,- | a,b,- | a,b,-
G3 (reverse) | Error in c.xsd: no ns2:a (sic) exists to be redefined | Error in c.xsd: no ns2:a (sic) exists to be redefined | Error in c.xsd: no ns2:a (sic) exists to be redefined | Error in c.xsd: no ns2:a (sic) exists to be redefined
G4 (last) | Error in c.xsd: no ns2:a (sic) exists to be redefined | Error in c.xsd: no ns2:a (sic) exists to be redefined | Error in c.xsd: no ns2:a (sic) exists to be redefined | Error in c.xsd: no ns2:a (sic) exists to be redefined
G5 (none) | ns1:a not declared, ns2:b has no schema, ns1:c not declared | a,b,- | a,b,- | ns1:a not declared, ns2:b has no schema, ns1:c not declared
G6 (all) | Error in c.xsd: schema component already present cannot be redefined (a) | Error in c.xsd: schema component already present cannot be redefined (a) | Error in c.xsd: schema component already present cannot be redefined (a) | Error in c.xsd: schema component already present cannot be redefined (a)
G7 (first) | Error in c.xsd: schema component already present cannot be redefined (a) | Error in c.xsd: schema component already present cannot be redefined (a) | Error in c.xsd: schema component already present cannot be redefined (a) | a,b,-
G8 (reverse) | aC,b,c | aC,b,c | aC,b,c | aC,b,c
G9 (last) | aC,b,c | aC,b,c | aC,b,c | aC,b,c
G10 (none) | aC,b,c | aC,b,c | aC,b,c | neither valid nor invalid: no declaration found
H1 (all) | -,b,- | -,b,- | -,b,- | -,b,-

For each test, there again appear to be several different behaviors.

Cyclic include and redefine groups

The final three test groups are the cyclic inclusion and the two cyclic redefinitions. The results use the following codes:

a,-: For elements of type a, the facet .*a.* is enforced, but not the facet .*A.*.
a,A: For elements of type a, both the facet .*a.* and the facet .*A.* are enforced.
loop: The processor repeatedly issued the same error message, until the job was terminated.
dup: The processor issued an error message complaining that the schema contained multiple.

Processor | Test cycle-include (Schema: t0/ci/a.xsd; Instance: t0/ci/abc.xml) | Test cycle-redefine-1 (Schema: t0/cr1/a.xsd; Instance: t0/cr1/abc.xml) | Test cycle-redefine-2 (Schema: t0/cr2/a.xsd; Instance: t0/cr2/abc.xml)
A1 (all) | loop | a,- | a,-
B1 (all) | dups | dups | a,-
B4 (none) | a,- | dups | a,-
C1 (none) | a,- | a,- | a,-
D1 (all) | a,- | loop | loop
D6 (all) | a,- | no schema (see note on D6 below) | no schema
E1 (all) | a,- | undefined type | a,-
F1 (first) | dups | dup (and exception) | dups
G1 (all) | duplicate a | no a to redefine | duplicate a
G6 (all) | a,- | a,A | a,-
H1 (all) | no schema | no schema | no schema

Note on D6: The processor notes the recursion and reports "Warning: No schema could be loaded at location a.xsd. Warning: Validation will continue without the schema at a.xsd." The validation of the instance then proceeds in strict wildcard mode and no declaration for the wrapper element is found. This is true for both cr1 and cr2.

Analysis and evaluation

The results shown above, of running the t0 tests on several processors, suggest a number of tentative conclusions and questions:

Most processors exhibit different behavior for the same set of schema documents and instances, depending on the order in which schema documents are given on the command line or listed in schemaLocation hints in the instances. (A third source of variation in sequence is the order in which includes, imports, and redefines occur in a schema document; test set t0 includes no examples of such variation.)

Some processors (E, B1-B3) exhibit no such sequence dependencies in this set of tests.

We might achieve better interoperability if the specification said explicitly either (a) that sequence is or should be significant in lists of schema documents given at invocation time, in calls from one schema document to another, and in schemaLocation hints (and if so, described what effect variations in order are to have), or (b) that sequence is not significant in the contexts defined by the spec, and should not be significant elsewhere (e.g. in command line interfaces).

None of the documentation for the processors tested describes in any obvious place whether the construction of the schema depends on the order in which multiple schema documents are named.

Some processors appear (further tests are probably required to tell for certain) to perform a sort of depth-first processing of schema documents: if documents A, B, and C are named on the command line, and A calls D and E, these processors will read and process D and E before B. A 'breadth-first' approach is also possible, in which D and E will be processed only after B and C.

The difference between depth- and breadth-first handling of schema documents will affect the results when multiple schema documents are available for the same namespace and the processor chooses to include only components from the first schema document encountered (plus schema documents included by that one).
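The two processing orders can be made concrete with a short Python sketch. It assumes, purely for illustration, that a processor visits each schema document at most once and follows references in the order in which they are listed; the function names and the dictionary representation of the call graph are hypothetical.

```python
from collections import deque


def depth_first(named, calls):
    """Process each named document and everything it calls before the next."""
    order, seen = [], set()

    def visit(doc):
        if doc in seen:
            return
        seen.add(doc)
        order.append(doc)
        for callee in calls.get(doc, []):
            visit(callee)

    for doc in named:
        visit(doc)
    return order


def breadth_first(named, calls):
    """Process all named documents first, then the documents they call."""
    order, seen = [], set()
    queue = deque(named)
    while queue:
        doc = queue.popleft()
        if doc in seen:
            continue
        seen.add(doc)
        order.append(doc)
        queue.extend(calls.get(doc, []))
    return order


calls = {"A": ["D", "E"]}  # A calls D and E, as in the example above
print(depth_first(["A", "B", "C"], calls))    # ['A', 'D', 'E', 'B', 'C']
print(breadth_first(["A", "B", "C"], calls))  # ['A', 'B', 'C', 'D', 'E']
```

Because visited documents are remembered, both traversals also terminate on a self-referencing document such as the a.xsd of the cyclic test groups.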

Processors vary in their (command-line or library) interfaces. Some processors allow zero or more schema documents to be specified on the command line, others require exactly one. Processor A allows arbitrarily many, but appears to ignore all but the last; the tests of set t0 are not suitable for proving this conclusively. Processor F requires exactly one schema document on the command line. Processors B4 and C1 don't accept schema document names on the command line at all.

It is difficult to see how to construct a consistent testing discipline for such disparate interfaces.

One can factor out the interface variability by providing, for each test case, a single 'driver' schema document, which does nothing at all but include the other schema documents. The name of this single driver document can be passed as a command-line parameter both to processors which accept multiple schema document arguments and to processors which allow at most one, or exactly one. If the driver is also pointed to from a single schemaLocation hint on the document root element, variability in the treatment of schemaLocation hints can also be factored out (and processors which don't support specification of schemas on the command line can be tested).
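A driver document of this kind is easy to generate. The Python sketch below builds one with the standard library; the function name and its parameters are hypothetical. Note one assumption: xs:include is appropriate only for schema documents with the driver's own target namespace (or with none), so a driver for a test group that spans several namespaces, such as abc, would need xs:import elements for the other namespaces instead.

```python
import xml.etree.ElementTree as ET

XSD_NS = "http://www.w3.org/2001/XMLSchema"


def driver_schema(schema_locations, target_namespace=None):
    """Return a schema document that only includes the given documents."""
    ET.register_namespace("xs", XSD_NS)
    root = ET.Element(f"{{{XSD_NS}}}schema")
    if target_namespace is not None:
        root.set("targetNamespace", target_namespace)
    for location in schema_locations:
        ET.SubElement(root, f"{{{XSD_NS}}}include",
                      {"schemaLocation": location})
    return ET.tostring(root, encoding="unicode")


print(driver_schema(["a.xsd", "b.xsd"]))
```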

On the other hand, it's not clear that a set of interoperability tests should factor out these sources of variation: a collection of tests which does not exercise the full interface of a processor is unlikely to provide a full picture of its interoperability with other processors.

Do processors behave the same way when multiple schema documents for the same namespace are mentioned to the processor (via invocation options or references in other schema documents or hints in the XML document instance)?

Not all processors are indefatigable. Some, that is, do not issue more than one error message for the style of test document described above: once the first a element with content ...... is recognized as invalid, no further error messages are emitted. This makes their results difficult to interpret.

Further test sets should therefore generate not a single test instance for a given call graph, but eighty-one (one for each of the twenty-seven a, b, and c elements in the test document described above).

The XML Schema Test Suite vocabulary describes the expected results of a test as the document element being valid, invalid, or having validity=notKnown; other items and properties in the PSVI cannot be specified. As a result, some test harnesses for schema tests stop processing (as noted above) as soon as the validity result is known.

Another result is that the test catalog cannot describe all of the relevant properties of a conforming result.

Some practical observations may also be worth recording:

The schema for the test suite vocabulary reports that the official namespace name is http://www.w3.org/2003/XMLSchema/TestSuite/PLACEHOLDER — but in fact uses the namespace name TestSuite (which is not namespace-well-formed).

At least one tester reports that his test setup had problems with t0/t0.testSet; either there is more than one version of the schema floating around, or there are unwritten conventions I have omitted to follow.

When an invalid element is encountered, some processors but not all mention the names of the element and its type; several mention only the location in the input stream, in line and character number. Reliable interpretation of the results requires that one know for certain whether the element on line 78 is one of the final 'b' elements, or one of the early 'c' elements in the test document.

When, however, the constraint violated is a facet of a simple type, then virtually all processors mention the literal representation of the value in their error messages.

It would simplify automatic analysis of error messages if the literals contained clear information on their parent element and its type. Inserting the names 'a', 'b', 'c' would interfere with the pattern constraints; perhaps the generic identifier should be rot13-encoded, so that all instances of ns1:a begin ns1:n or ns1:rot13(n).
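The rot13 idea can be sketched in a few lines. The helper name encoded_name is hypothetical, and whether the encoded letters actually stay clear of the pattern constraints in the test schemas is an assumption that would need checking against those patterns.

```python
import codecs

def encoded_name(prefix, local_name):
    """Return prefix:rot13(local_name), e.g. ns1:a becomes ns1:n."""
    return f"{prefix}:{codecs.encode(local_name, 'rot_13')}"

for name in ("a", "b", "c"):
    print(name, "->", encoded_name("ns1", name))
# a -> ns1:n
# b -> ns1:o
# c -> ns1:p
```

Because rot13 is its own inverse, a script analysing error messages can recover the element name by applying the same encoding again.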

It is a mistake to place DTDs, stylesheets, and other auxiliary materials in a directory called aux: this name is reserved in MS-Windows systems and no directory of this name can be created.
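A check of this kind can be run over a test set before it is published. The sketch below tests path components against the classic reserved device names on Windows (CON, PRN, AUX, NUL, COM1 to COM9, LPT1 to LPT9), which are reserved with or without an extension; the path used in the example is hypothetical.

```python
import re

# Reserved device names, optionally followed by an extension.
_RESERVED = re.compile(
    r"^(CON|PRN|AUX|NUL|COM[1-9]|LPT[1-9])(\..*)?$", re.IGNORECASE
)

def reserved_components(path):
    """Return the components of path that Windows would refuse to create."""
    parts = re.split(r"[\\/]+", path)
    return [part for part in parts if _RESERVED.match(part)]

print(reserved_components("t0/aux/common.dtd"))        # ['aux']
print(reserved_components("t0/auxiliary/common.dtd"))  # []
```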

The use of DTDs appears to cause complications for some XML tools; use of the tests is simpler if DTDs are not used at all.

It's helpful to supply explicit XML declarations on all XML documents in the test set.

Generating test cases

[Description here of dumping instances from Alloy, the XML format for instances, the XML format for test cases, and the XSLT to go from the one to the other. The test cases should be described by a catalog following the schema agreed on by the XML Schema Working Group, with the possible exception that in some cases, I do not know what the expected result should be. I'll either leave it out, or claim that the expected result is that the schema is valid and the document invalid. This section also needs a description of the test cases, and pointers to them on the server.]

Dumping Alloy instances in XML

The Java API for Alloy makes it possible to load an Alloy model and then loop through the commands (predicates and assertions) in the model, generating instances for each command. Each instance can be written out to disk in XML form, using the writeXML method of the A4Solution class. (I won't go into further details of the Java here; consult the Alloy documentation or discussion list, or the author, for more information.)

The XML dump format currently used by Alloy 4 is straightforward and reasonably self-explanatory. For example, the command run all_clean for 3 generates a number of instances, including this one:

Test results

[Description here of test results for specific processors for the test cases described above.]

Conclusion and further work

This paper has demonstrated that even a simple Alloy model of schema composition can be useful, both in sanity checking designs for this part of the schema spec and in generating test cases for interoperability testing (and, once the required behaviors are more clearly specified, also for conformance testing).

The first item of further work is to complete this paper by automating the generation of test cases from Alloy instances and the tabulation of results (for t0, manual inspection of the output has sufficed, but this has proven time-consuming even for t0; for larger test sets it will be impracticable) and then by generating test cases, using them to test as many XSDL 1.0 processors as is feasible, and studying the results.

Further models should also be developed (to be reported in separate papers, not this one), to capture more of the details of schema composition and allow the formulation of assertions to test possibly interesting propositions (e.g. for two schema documents in the same namespace, the order in which they are imported or included or redefined by other schema documents is immaterial; both orders produce the same result). As formulated, I expect this proposition to be false, but it expresses a central idea which I believe is true; getting the formulation accurate is a challenge with which light-weight formal methods like Alloy can assist.

Thanks are due to Daniel M. Jackson and the Alloy team for the tool itself and for assistance with questions and problems; to Liam Quin, for discussion of the issues raised in this paper; to David Ezell and Michael Kay for helpful discussions and for assistance in gathering test results.

References

Jackson, Daniel. Software abstractions: Logic, language, and analysis. Cambridge: MIT Press, 2006.

Sperberg-McQueen, C. M. Notes on RQ-151: Schema composition. Working paper prepared for the W3C XML Schema Working Group. [Cambridge, Sophia-Antipolis, Tokyo]: World Wide Web Consortium, 2004. Available on the Web at http://www.w3.org/2004/07/msm/rq-151.notes.xml; the discussion of missing information is in appendix A, http://www.w3.org/2004/07/msm/rq-151.notes.xml#missinginfo. (Accessible to W3C members only.)

W3C (World Wide Web Consortium). XML Schema Part 1: Structures. W3C Recommendation 2 May 2001. Second edition, W3C Recommendation 28 October 2004. Ed. Henry S. Thompson et al. [Cambridge, Sophia-Antipolis, Tokyo]: World Wide Web Consortium, 2001, 2004. Available on the Web at http://www.w3.org/TR/2004/REC-xmlschema-1-20041028/. Latest version at http://www.w3.org/TR/xmlschema-1/.

W3C (World Wide Web Consortium). W3C XML Schema Definition Language (XSDL) 1.1 Part 1: Structures. W3C Working Draft 30 August 2007, ed. Sandy Gao et al. [Cambridge, Sophia-Antipolis, Tokyo]: World Wide Web Consortium, 2007. Available on the Web at http://www.w3.org/TR/2007/WD-xmlschema11-1-20070830/. Latest version at http://www.w3.org/TR/xmlschema11-1/.

To do

Define and use a concise XML representation for instances of the model. Define the vocabulary; call it m1. Document it, or at least illustrate it.

Write XSLT to go from Alloy dump format to m1 format. Point to the stylesheet.

Write XSLT to go from m1 format to a set of test cases:

schema documents as specified

single test instance document (wrapper with 81 children)

(if necessary) 81 small test instances

The schema for the XML Schema test suite allows only the values valid, invalid, and notKnown as expected results. There is no facility, as the test suite is currently specified, for describing a more complicated result (like the valid / invalid / notKnown pattern expected among the 81 children of the root, for the single-document test).

To avoid pointless redundancy, it may be better not to generate distinct test instances for each test set; instead, one could generate all four possible flavors (no namespace, namespace ns0, ns1, ns2) of the twenty-seven copies of each of the three element types a, b, and c. That's 324 single-element test cases (assuming the test instances don't need schemaLocation information), which can be held in a common directory.

In that case, the appropriate selection of the eighty-one elements to validate will be reflected only in the instanceTest elements of the test set catalog, not in the file system.

The wrapper-plus-children test can also be held in common (again, assuming no schemaLocation hints are required); there are sixty-four such documents. (If we required document a always to have namespace ns0 or no namespace, b to have no namespace, ns0, or ns2, and allow only c to have ns1, ns2, ns3, or none, then there would be only twelve 82-element test documents needed. But I don't know a good way to enforce that without either ugly ad-hoc rules in the model or more work in the XSLT to go from m1 to test cases.)
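The counts given above can be checked by enumeration. The sketch assumes, as one reading of the text, that the sixty-four wrapper documents arise from choosing one of the four namespace flavors independently for each of the three element types; the empty string stands for 'no namespace'.

```python
from itertools import product

flavors = ["", "ns0", "ns1", "ns2"]   # "" stands for no namespace
element_types = ["a", "b", "c"]
copies = 27

# One single-element test case per flavor, element type, and copy.
single_element_cases = list(product(flavors, element_types, range(copies)))
print(len(single_element_cases))      # 324

# One wrapper document per assignment of a flavor to each element type.
wrapper_documents = list(product(flavors, repeat=len(element_types)))
print(len(wrapper_documents))         # 64

# Children of a single wrapper: twenty-seven copies of each element type.
print(copies * len(element_types))    # 81
```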

catalog entry for the test set, describing the schema documents and test instances (either one of them, or eighty-two of them)

Write XSLT to translate m1 format to an HTML description of a set of test cases; add it to the stylesheet for this document.

Define a model to explore some issues of sequencing:

    abstract sig Call { target : SchemaDoc }
    sig Include extends Call {}
    sig Import extends Call {}
    sig Redefine extends Call {}
    sig SchemaDoc {
      tns : lone NS,
      calls : seq Call
    }

Note that you may need to override the default bounds on sequences to get sequences long enough to cover cases where the same schema document is called more than once.

Extend the sequence-based model to include set-valued properties analogous to those in model m1, something like this:

    abstract sig Call { target : SchemaDoc }
    sig Include extends Call {}
    sig Import extends Call {}
    sig Redefine extends Call {}
    sig SchemaDoc {
      tns : lone NS,
      calls : seq Call,
      includes : set SchemaDoc,
      imports : set SchemaDoc,
      redefines : set SchemaDoc
    }{
      includes = (calls.elems & Include).target
      imports = (calls.elems & Import).target
      redefines = (calls.elems & Redefine).target
    }

This may allow formulation of useful assertions about the equivalence of the sequence-based model and the set-based model. (Perhaps not, as we still don't actually say in the model what inclusion, import, or redefinition mean.)
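The relation between the two views can be illustrated outside Alloy. The Python sketch below mirrors the signatures only loosely: document names stand in for SchemaDoc atoms, the kind strings are an assumption of the sketch, and nothing here says what inclusion, import, or redefinition mean.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Set

@dataclass(frozen=True)
class Call:
    kind: str      # "include", "import", or "redefine"
    target: str    # name of the called schema document

@dataclass
class SchemaDoc:
    name: str
    tns: Optional[str] = None
    calls: List[Call] = field(default_factory=list)

    def _targets(self, kind: str) -> Set[str]:
        # Set-valued view derived from the call sequence.
        return {call.target for call in self.calls if call.kind == kind}

    @property
    def includes(self) -> Set[str]:
        return self._targets("include")

    @property
    def imports(self) -> Set[str]:
        return self._targets("import")

    @property
    def redefines(self) -> Set[str]:
        return self._targets("redefine")

# Two variants of C that differ only in the order of their calls.
c1 = SchemaDoc("C", calls=[Call("redefine", "A"), Call("import", "B")])
c2 = SchemaDoc("C", calls=[Call("import", "B"), Call("redefine", "A")])

print(c1.calls == c2.calls)                                      # False
print((c1.imports, c1.redefines) == (c2.imports, c2.redefines))  # True
```

The set-valued properties agree for the two variants although the sequences differ, which is exactly the information a purely set-based model cannot express.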

Automate the running of tests, from the catalog, for various validators, and produce standard test-report XML. Write more tools to simplify the interpretation of the verbose output (along the lines of rdsx.sh).

Lessons learned from work with manually constructed test set; things to do differently next time.

Do not use aux as the name of a directory with auxiliary information. It clashes with a legacy restriction on Windows machines (AUX was or is a device driver on those machines and nothing in the file system is allowed to use that name). So call it something else.

The use of DTDs clearly causes problems for some testers; strip them out before putting things on the server.

Supply XML declarations on all documents.

Make a variant of the current abc tests, in which C redefines from A first, and then imports B; in some cases, this is order-sensitive.