design pattern for nested xml?

lawpoop · Aug 3, 2009

I have two questions about the 'best' way to design an XML document.
The first is about tagging a collection of similar items as such, and
the other is about a nested structure.

I'm designing an app that can handle an arbitrary tree of a company's
organizational structure. Offices are actual physical locations, while
regions are nestable logical groupings of the offices.

When a region has offices as children, I've enclosed the office tags
in <offices></offices>. However, I don't enclose nested regions in
<regions></regions> tags. Should I? Why or why not? I've noticed that
Michael Kay seems to always enclose repeating items in collection
tags, while other examples don't.

Here's an example.

<company>
  <region>
    <name>Northeast</name>
    <region>
      <name>New York</name>
      <offices>
        <office>
          <name>New York Office</name>
          <address>...</address>
        </office>
        <office>
          <name>Albany</name>
          <address>...</address>
        </office>
      </offices>
    </region>
    <region>
      <name>Boston</name>
    </region>
    <region>
      <name>Newark</name>
    </region>
  </region>
  <region>
    <name>Gulf Coast</name>
    <region>
      <name>Dallas</name>
    </region>
    <region>
      <name>Dallas</name>
    </region>
  </region>
</company>

As for my second question: if I should enclose any set of regions
in <regions></regions> tags, then shouldn't I enclose the entire tree
in <regions></regions> tags? In that case, couldn't I replace
<company>, or must the root node's name always be unique, even with
nested XML?

Andy Dingley · Aug 4, 2009

lawpoop said:
I have two questions about the 'best' way to design an XML document.

For your example, I wouldn't use XML, I'd go straight to RDF, RDF
Schema and maybe even OWL.

Your offices aren't a "tree", they're a graph (multiple roots, not a
simple branching tree). XML is poor at these.

Your identifier structure is crucial to success here. Use URIs.

Typing (as practised by RDFS) would be extremely useful to you.

The downside of using RDF is that it does rather restrict the
implementation language to Java (or Scala, if you're fashionable!) as
that's where the good RDF toolsets are (start by looking at Jena).

Martin Honnen · Aug 4, 2009

lawpoop said:
I have two questions about the 'best' way to design an XML document.
The first is about tagging a collection of similar items as such, and
the other is about a nested structure.

I'm designing an app that can handle an arbitrary tree of a company's
organizational structure. Offices are actual physical locations, while
regions are nestable logical groupings of the offices.

When a region has offices as children, I've enclosed the office tags
in <offices></offices>. However, I don't enclose nested regions in
<regions></regions> tags. Should I? Why or why not? I've noticed that
Michael Kay seems to always enclose repeating items in collection
tags, while other examples don't.

In my view you should at least be consistent within one XML document
format: either use offices to wrap office elements and regions to wrap
region elements, or don't use wrappers at all. Using one style for one
part of the document and a different style for another is confusing.

lawpoop said:
Here's an example.

<company>
  <region>
    <name>Northeast</name>
    <region>
      <name>New York</name>
      <offices>
        <office>
          <name>New York Office</name>
          <address>...</address>
        </office>
        <office>
          <name>Albany</name>
          <address>...</address>
        </office>
      </offices>
    </region>
    <region>
      <name>Boston</name>
    </region>
    <region>
      <name>Newark</name>
    </region>
  </region>
  <region>
    <name>Gulf Coast</name>
    <region>
      <name>Dallas</name>
    </region>
    <region>
      <name>Dallas</name>
    </region>
  </region>
</company>

lawpoop said:
As for my second question: if I should enclose any set of regions
in <regions></regions> tags, then shouldn't I enclose the entire tree
in <regions></regions> tags? In that case, couldn't I replace
<company>, or must the root node's name always be unique, even with
nested XML?

You need a single root element, but its name does not have to be
unique within the document.
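
To illustrate the point about the root element, here is a minimal sketch using Python's standard xml.etree.ElementTree module. The layout with <regions> as both the root and the nested wrapper is a hypothetical variant of the example above, not something anyone in the thread posted. It suggests that a parser accepts a root name that is reused deeper in the tree, but rejects a document with two top-level elements.

```python
import xml.etree.ElementTree as ET

# Hypothetical layout: <regions> is the root and also wraps nested regions.
doc = """
<regions>
  <region>
    <name>Northeast</name>
    <regions>
      <region><name>New York</name></region>
      <region><name>Boston</name></region>
    </regions>
  </region>
  <region>
    <name>Gulf Coast</name>
  </region>
</regions>
"""

root = ET.fromstring(doc)

# The root's name reappears further down the tree; parsing still succeeds.
nested = root.findall(".//regions")
top_level = [r.findtext("name") for r in root.findall("region")]

# Two top-level elements, by contrast, do not form a well-formed document.
try:
    ET.fromstring("<region/><region/>")
    two_roots_ok = True
except ET.ParseError:
    two_roots_ok = False
```

So replacing <company> with <regions> is possible as far as well-formedness goes; whether it reads better is a separate design question.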

lawpoop · Aug 4, 2009

Martin Honnen said:
In my view you should at least be consistent within one XML document
format: either use offices to wrap office elements and regions to wrap
region elements, or don't use wrappers at all. Using one style for one
part of the document and a different style for another is confusing.

Martin, thanks for your perspective.

Are there any arguments for or against wrapping collections besides
consistency? For instance, say I want to run XSLT over this XML.
Offices may or may not exist under a particular region. Would it be
any easier to check for an offices collection, or to just look for an
office element?

Martin Honnen · Aug 4, 2009

lawpoop said:
Are there any arguments for or against wrapping collections besides
consistency? For instance, say I want to run XSLT over this XML.
Offices may or may not exist under a particular region. Would it be
any easier to check for an offices collection, or to just look for an
office element?

XSLT and XPath can access
offices/office
or
office
if a region element is the context node. That should not make a
difference in terms of XSLT/XPath.
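
As a rough illustration of the two paths, here is a sketch in Python using the standard xml.etree.ElementTree module, which supports only a limited XPath subset and is used here as a stand-in for an XSLT processor. The helper office_names and the small sample regions are made up for this example. In both layouts a region without offices simply yields an empty result, so neither layout needs a special existence check.

```python
import xml.etree.ElementTree as ET

# Same region in the two layouts discussed above, plus one with no offices.
wrapped = ET.fromstring(
    "<region><name>New York</name>"
    "<offices><office><name>Albany</name></office></offices></region>"
)
flat = ET.fromstring(
    "<region><name>New York</name>"
    "<office><name>Albany</name></office></region>"
)
empty = ET.fromstring("<region><name>Boston</name></region>")


def office_names(region, path):
    """Return office names found under `region` via the relative `path`."""
    return [office.findtext("name") for office in region.findall(path)]


# With the region as context node, only the relative path differs.
wrapped_names = office_names(wrapped, "offices/office")
flat_names = office_names(flat, "office")

# A region without offices gives an empty list in either layout.
empty_wrapped = office_names(empty, "offices/office")
empty_flat = office_names(empty, "office")
```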
