Good vs Valid XML: Cheap is Dear
/For many years I preached the merits of XML-first and XML-early workflows before it was the norm. Now my platform is "good vs valid XML."
Any service provider can provide XML.
Indeed automated XML is pretty much a standard output from most systems that have anything to do with publishing. It's been 13 years since Microsoft Office introduced the XML formats for Excel and Word files.
Yet when I hit the road and speak with publishers about their challenges, a lot comes back to what I put into this bucket of "good vs valid XML." There is a distinction between a valid XML file and a good valid XML file. You can have a file that is valid but doesn’t really achieve the goal of what the content is supposed to be. What happens too often is that budgets demand, or conversion teams choose to do whatever is easiest (i.e., cheapest) instead of doing the right thing to create a good XML file.
Let's look at some examples
Glossary Example
Following is the rendered text and image:
Following are examples of what I call "good" XML and "valid" XML. Take note of the tagging structure used. The <dl> tag itself better defines the content and provides inherent semantic meaning. The valid XML example is missing alternative text. Without alternative text publishers are missing out on improving SEO and, more important, are failing at content accessibility.
Good XML
Definition term with class to differentiate languages
Valid XML
Definition term in paragraph element with strong element.
- amphibian
- (am fib´ ē ən) An animal that lives part of its life in water and part of its life on land. My pet frog is an amphibian.
- anfibio
- Animal que pasa parte de su vida
en el agua y parte en tierra.
Mi ranita es un
anfibio.
amphibian (am fib´ ē ən) An animal that lives part of its life in water and part of its life on land. My pet frog is an amphibian.
anfibio Animal
que pasa parte de su vida
en el agua y parte en tierra.
Mi ranita es un anfibio.
Annotated Text Example
Following is the rendered text and image:
The good and valid XML demonstrate an image with annotated text (good) and just an image (valid). Take note of the alternative text in the valid XML example. This description is virtually useless to a visually impaired reader.
Good XML
A purpose statement explains a website’s overall goals and the specific objectives that will be used to achieve those goals.
primary goal
secondary goals
objectives
Regifting Website
Purpose Statement:
The goal of the reusable and …
Valid XML
A purpose statement explains a website’s overall goals and the specific objectives that will be used to achieve those goals.
Alt Text Example
This example demonstrates an image with alt text (good) compared with XML just as an image (valid). Alt text improves discoverability and supports accessibility.
Good XML
I bet the folks at home would like to know what we’re going to do this year!
Valid XML
Takeaways
Talk to your vendor about the quality of the XML they produce. The proliferation of offshore vendors has brought pricing models down and this has impacted quality. While price is of great importance and low-cost XML is attractive, publishers are finding that thoughtfulness and editorial quality have been slipping away. With so much technology integrated into publishers’ workflows, it is easy to forget that human QA ensures premium editorial and production services.
Good XML is critical for accessibility
Good XML improves downstream discoverability
Good XML involves automation plus human intervention and that equals quality