August 1, 2005

XML Enhances Java

If you thought it's only Microsoft who's working on integrating XML into the core of programming languages, look at what IBM does for Java. This is a manstream trend now. XML Enhancements for Java are an emerging technology from IBM, which provides a set of language extensions that facilitate XML ...

What are XML Enhancements for JavaTM?

XML Enhancements for Java (XJ) are a set of extensions to Java 1.4 that integrate support for XML, XML Schema and XPath 1.0 into the language. The advantages of XJ over existing mechanisms for XML development are:

o Familiarity (for the XML Programmer) : XML processing in XJ is consistent with open XML standards.
o Robustness : XJ programs are strongly typed with respect to XML Schemas. The XJ compiler can detect errors in uses of XPath expressions and construction of XML data.
o Easier Maintenance: Since XJ programs are written in terms of XML and not low-level APIs such as DOM or SAX, they are easier to maintain and modify if XML Schemas change.
o Performance: Since the compiler is aware of the use of XML in a program, it can optimize the runtime representation, parsing, and XPath evaluation of XML.
In XJ, one can import XML schemas just as one does Java classes. All the element declarations in the XML schema are then available to programmers as if they were Java classes. Programmers can write inline XPath expressions on these classes, and the compiler checks them for correctness with respect to the XML schema. In addition, the compiler performs optimizations in order to improve the evaluation of XPath expressions. A programmer may construct new XML documents by writing XML directly inline. Again, the compiler ensures correctness with respect to the appropriate schema. By integrating XML and Java, XJ allows programmers to reuse existing Java libraries in the development of XML code and vice-versa.
Here are some samples of what XJ allows:
XPath integration:
int min = 70; 
Sequence<year> ys = sd[|/year[sum(.//sales) > $min]|];
Inline XML construction:
region r = new region(<region> 
    <name>NorthEast</name> 
    <sales unit='GBP'>75</sales> 
</region>);
Dynamic one:
float conversion = 1.9; 
salesdata s = 
  new salesdata( 
    <salesdata> 
      <year> 
        {y} 
        <sales unit='Dollars'>{grossSales * conversion}</sales> 
        {r} 
      </year> 
    </salesdata>); 
And many more. Find XJ documentation here.

New in .NET 2.0: Push-Based XML Validation with XmlSchemaValidator Class

This is a real hidden gem in .NET 2.0 everybody (including me) have pretty much overlooked. XmlSchemaValidator class from the System.Xml.Schema namespace is a push-based W3C XML Schema validatation engine. Push-based means different processing model - an opposite for pull-based one. Think about how you work with XmlWriter (push) and ...

Which scenarios does XmlSchemaValidator enable:

  1. Validation of XML in-place, whithout necessity to reparse it by reading via XmlReader. This is actually how new XmlDocument.Validate() method is implemented.
  2. Validation of custom XML or even viewed-as-XML data stores
  3. Validation during XML construction - now it's possible to create validating XmlWritrer. And I wonder why it's not done yet? That's a job for XML MVPs for sure.
  4. Partial validation
  5. Access to PSVI (Post Schema Validation Information)
  6. Retrieving Expected Particles, Attributes, and Unspecified Default Attributes - this is how XML Editor in Visual Studio 2005 smart Intellisense works.
Quite impressive list and quite impressive class.