Transforming only a part of XML document

May 12, 2004 12:06 PM | 11 Comments | 16 TrackBacks

Daniel writes about transforming a portion of an XML document using XPathNavigatorReader. This is a common unpleasant surprise for people with MSXML experience, who are used to the fooNode.transformNode() method, where only fooNode and its descendants are visible to the XSLT stylesheet. In .NET it's different: no matter which DOM node you pass to the XslTransform.Transform() method, the whole XmlDocument tree is visible to the stylesheet. MSDN suggests loading the portion of XML you want to transform into a temporary XmlDocument and passing that to the transformation. Too bad.
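
The temporary-document workaround can be illustrated outside .NET as well. The following is a rough Python sketch using the standard xml.dom.minidom module, not the .NET API; the helper name isolate_subtree and the sample document are made up for illustration. It shows that a node handed over on its own can still reach the rest of the document through its parent, and that copying the subtree into a fresh document cuts that link at the price of a temporary tree.

```python
from xml.dom import minidom


def isolate_subtree(node):
    """Copy `node` and its descendants into a new, otherwise empty document.

    Hypothetical helper: a stand-in for the "temporary document" workaround.
    """
    temp = minidom.Document()
    temp.appendChild(temp.importNode(node, True))  # True = deep copy
    return temp


doc = minidom.parseString(
    "<orders><order id='1'><item>pen</item></order><order id='2'/></orders>"
)
first_order = doc.getElementsByTagName("order")[0]

# The node alone still exposes the whole tree through its parent.
print(first_order.parentNode.tagName)            # orders

# After copying, the subtree root is the document element of a new tree.
temp = isolate_subtree(first_order)
print(temp.documentElement.tagName)              # order
print(len(temp.getElementsByTagName("order")))   # 1
```

The copy is what makes this approach unattractive for large subtrees: every node is duplicated before the transformation even starts.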

What Daniel proposes is to load the portion of XML to be transformed into a temporary XPathDocument instead of an XmlDocument. Well, that's better, but what I'd like to ask is: why does one need any temporary tree at all? It's a piece of cake to write a custom XPathNavigator that limits navigation to a specified subtree. I did that once with XmlNodeNavigator, which represents an XPathNavigator over an XmlNode. Less than 10 KB of code. It's an ideal solution for transforming only a subtree of an XmlDocument. No temporary objects, just lightweight code between the XmlDocument and XSLT that prevents navigation from going higher than the specified node.
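
The idea of a navigator that refuses to leave a subtree can be sketched in a few lines. This is a toy Python model over xml.dom.minidom nodes, not the actual XmlNodeNavigator code; the class name SubtreeCursor and its methods are hypothetical and only mimic the MoveToParent/MoveToFirstChild/MoveToNext style of a cursor API.

```python
from xml.dom import minidom


class SubtreeCursor:
    """Toy cursor that never moves outside the subtree rooted at `root`."""

    def __init__(self, root):
        self._root = root
        self._node = root

    @property
    def name(self):
        return self._node.nodeName

    def move_to_parent(self):
        # The subtree root has no visible parent.
        if self._node is self._root:
            return False
        self._node = self._node.parentNode
        return True

    def move_to_first_child(self):
        child = self._node.firstChild
        if child is None:
            return False
        self._node = child
        return True

    def move_to_next(self):
        # Siblings of the subtree root lie outside the exposed tree.
        if self._node is self._root or self._node.nextSibling is None:
            return False
        self._node = self._node.nextSibling
        return True


doc = minidom.parseString("<a><b><c/><d/></b><e/></a>")
cursor = SubtreeCursor(doc.documentElement.firstChild)  # expose only <b>

steps = [
    cursor.move_to_parent(),       # False: <a> is above the subtree root
    cursor.move_to_next(),         # False: <e> is a sibling of the root
    cursor.move_to_first_child(),  # True: now on <c>
    cursor.move_to_next(),         # True: now on <d>
    cursor.move_to_parent(),       # True: back on <b>
]
print(steps, cursor.name)  # [False, False, True, True, True] b
```

No nodes are copied here: the cursor only adds an identity check against the subtree root before each upward or sideways move.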

That's perfect for XmlDocument. For XPathDocument we need another one. Or, better, we need a generic XPathNavigatorNavigator (did we really go nuts with these XML beasts?). XPathNavigatorNavigator should allow navigating over a given XPathNavigator, but should not allow moving it outside the subtree. Comments?

XPathNavigatorReader and XmlNodeNavigator are both part of the Mvp.Xml project.

11 Comments

Oleg Tkachenko | August 21, 2004 7:34 PM

Joe, the answer depends on the API you are using. If it's XmlDocument, you can create a new element and assign the comment's value to its InnerXml property. Alternatively, instead of commenting a node out, you can move the element to another branch of the tree (e.g. deleted contacts).
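
To make the suggestion concrete, here is a rough sketch of the same idea in Python with xml.dom.minidom rather than XmlDocument: the comment's text is parsed as XML and the resulting element replaces the comment node. The function name restore_commented_element and the sample contacts document are invented for illustration, and the sketch assumes each comment holds exactly one well-formed element.

```python
from xml.dom import minidom, Node


def restore_commented_element(comment):
    """Replace a comment node with the element written inside it.

    Assumes the comment text is a single well-formed element.
    """
    fragment = minidom.parseString(comment.data.strip())
    restored = comment.ownerDocument.importNode(fragment.documentElement, True)
    comment.parentNode.replaceChild(restored, comment)
    return restored


doc = minidom.parseString(
    "<contacts>"
    "<!--<contact><name>Ann</name></contact>-->"
    "<contact><name>Bob</name></contact>"
    "</contacts>"
)

# Pick the first comment among the children of <contacts>.
comment = next(
    n for n in doc.documentElement.childNodes if n.nodeType == Node.COMMENT_NODE
)
restore_commented_element(comment)

names = [n.firstChild.data for n in doc.getElementsByTagName("name")]
print(names)  # ['Ann', 'Bob']
```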

Joe | August 19, 2004 11:07 PM

I need some help!
Maybe one of you guys could turn on the light for me.
I have an XML file with data such as name, phone and birthday for my friends.
I am building an application to manage this "database".
When I lose contact with a friend, I comment out that friend's node in the XML file (this works very well).

But ...

Is there a way to re-enable it (make this comment node become an element node again) using something other than a text editor?

Thanks for any help
Joe

Oleg Tkachenko | June 24, 2004 11:14 AM

Good name, I think. Self-explanatory and follows naming conventions. I wish I were that good at naming :)

Daniel Cazzulino | June 24, 2004 6:05 AM

I saw your updates to your classes. Cool :)
And now, users of the Mvp.Xml project (http://sf.net/projects/mvp-xml) also have the same functionality for arbitrary XPathNavigator implementations: http://weblogs.asp.net/cazzu/posts/164243.aspx. I called it the SubtreeXPathNavigator... how does it sound?

Vincent Jacquet | May 24, 2004 11:39 AM

It is an elegant solution, much better than loading a new XmlDocument with the OuterXml of the node you want to transform.

However, if you modify the stylesheet, you can also transform only a part of an XML document while keeping the context, i.e. the ancestors, siblings, ...
You may want to take a look at http://www.flowgroup.fr/tech_transformNodeWithStartingMode_us.htm.

Daniel Cazzulino | May 16, 2004 8:55 PM

You're definitely right, Oleg. Your XmlNodeNavigator is the appropriate one to solve the sub-XmlDocument loading problem. However, I also wanted to stress (and move users to) the more performant XPathDocument for transformations.
I'll investigate an XPathSubNavigator (?!) to limit navigation scope. Looks like an interesting thing to do, and it will completely avoid generating a new XPathDocument....
So, the user now has XmlNodeNavigator if they already have an XmlDocument, and XPathNavigatorReader if they have an XPathDocument instead.
BTW, you really need to document and create a bunch of tests for your classes ;) If I have some spare time I will try to do it for you.

Oleg Tkachenko | May 16, 2004 1:21 PM

Done with documentation and almost done with tests, Daniel!

Oleg Tkachenko | May 13, 2004 11:55 AM

Well, if I understand correctly, namespace navigation is simple: on each element node (only elements can bear namespace nodes), there is a list of namespace nodes to expose: one implicit (the xml namespace) and one for each namespace in effect on the element.
WRT XmlNodeNavigator and the hypothetical XPathNavigatorNavigator, they can rely on the underlying XPathNavigators when exposing namespace nodes.
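
The rule described here (one implicit xml namespace plus one node per namespace in effect on the element) can be sketched as a walk up the ancestor chain. This is a rough Python illustration over xml.dom.minidom, not how any .NET navigator is implemented; the function name in_scope_namespaces is hypothetical. Note that the walk deliberately continues above the element it starts from, which is exactly where declarations made outside an exposed subtree would be picked up.

```python
from xml.dom import minidom, Node

XML_NS = "http://www.w3.org/XML/1998/namespace"


def in_scope_namespaces(element):
    """Return {prefix: uri} for the namespaces in effect on `element`.

    The default namespace is stored under the empty prefix "".
    """
    scope = {}
    node = element
    while node is not None and node.nodeType == Node.ELEMENT_NODE:
        for name, value in node.attributes.items():
            if name == "xmlns":
                prefix = ""
            elif name.startswith("xmlns:"):
                prefix = name[len("xmlns:"):]
            else:
                continue
            # The nearest declaration wins, so never overwrite an entry.
            scope.setdefault(prefix, value)
        node = node.parentNode
    # An empty default declaration means "no default namespace".
    if "" in scope and not scope[""]:
        del scope[""]
    scope["xml"] = XML_NS  # implicit, always present
    return scope


doc = minidom.parseString(
    "<a xmlns='urn:default' xmlns:p='urn:p'>"
    "<b xmlns:q='urn:q' xmlns:p='urn:p2'/>"
    "</a>"
)
b = doc.documentElement.firstChild
print(sorted(in_scope_namespaces(b).items()))
```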

Luc Cluitmans | May 13, 2004 11:41 AM

I was referring to the namespaces mostly because of the lack of correct namespace handling examples.

Because of that lack, and the underdocumentation of what the namespace-related methods (MoveToFirstNamespace/ MoveToNextNamespace/ MoveToNamespace/ GetNamespace) should expose exactly (the framework docs basically just refer to the XML specs), I am still confused about what those methods should expose.

And therefore I don't know for sure, but can imagine, that there might be problems with namespaces being declared outside the exposed part of the document, which should be 'transferred' to the document element of the exposed branch.

But then again, it might just be my lack of understanding of how Namespaces should be handled in XPathNavigator that causes me to see problems where there are none...

Oleg Tkachenko | May 13, 2004 10:40 AM

You are right about MoveToId() (the same problem exists with MoveTo()).
Well, I believe inefficiency in a couple of rarely used methods is quite a reasonable price.

And which namespace effects do you mean?

Luc Cluitmans | May 12, 2004 3:12 PM

I was thinking a bit about something like your XPathNavigatorNavigator a few weeks ago (but haven't implemented it yet). The only problematic function is MoveToId(). To handle it correctly, you need to check whether the parent implementation moves to a node that is in the exposed tree at all. That is not too hard to get working, but will probably not be very efficient.
Also, there may be some effects on the namespace navigation methods. But then, I have yet to see a public source XPathNavigator subclass that handles namespaces correctly at all. All implementations I have seen always return String.Empty, which is not a valid implementation, even for navigators that don't handle namespaces, because the default xml: namespace is always there...
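
The MoveToId() check mentioned above can be sketched as follows. This is a toy Python version over xml.dom.minidom, not .NET code; the names is_in_subtree and move_to_id are hypothetical, and a plain "id" attribute stands in for a DTD-declared ID. After the lookup over the whole document succeeds, the target's ancestors are walked to confirm that it lies under the exposed root, which is the extra cost the comment refers to.

```python
from xml.dom import minidom


def is_in_subtree(node, root):
    """True if `node` is `root` or one of its descendants."""
    while node is not None:
        if node is root:
            return True
        node = node.parentNode
    return False


def move_to_id(root, id_value):
    """Return the element with the given id, or None if it is missing
    or lies outside the subtree rooted at `root`."""
    # The lookup itself sees the whole document, like the wrapped navigator.
    for element in root.ownerDocument.getElementsByTagName("*"):
        if element.getAttribute("id") == id_value:
            return element if is_in_subtree(element, root) else None
    return None


doc = minidom.parseString("<a><b id='x'><c id='y'/></b><d id='z'/></a>")
b = doc.documentElement.firstChild  # expose only <b>

inside = move_to_id(b, "y")
outside = move_to_id(b, "z")
print(inside.tagName, outside)  # c None
```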
