Comment On XML Kōan from the Fourth Dimension

John Y recently had to deal with an XML-like dump from a "4D" database. This dump used a peculiar form of abbreviation in which letters were chosen seemingly at random from field names, in order to meet the well known XML limitation of only allowing 5 characters per tag name. [expand full text]
« PrevPage 1 | Page 2 | Page 3Next »

Re: XML Kōan from the Fourth Dimension

2007-10-12 09:04 • by Foosball Girl In My Dreams
</>*

*Comment_Not_Found

Re: XML Kōan from the Fourth Dimension

2007-10-12 09:05 • by batasrki
Oh yes, XML's tag name limitations are well known. Damn Tim Bray and his invention. :-)

Re: XML Kōan from the Fourth Dimension

2007-10-12 09:06 • by snoofle
156871 in reply to 156869
Now now, maybe the intention was to say that he (in the tag) had nothing worth saying - a null tag

Re: XML Kōan from the Fourth Dimension

2007-10-12 09:08 • by David (unregistered)

in order to meet the well known XML limitation of only allowing 5 characters per tag name.


I hope you're bullshiting me right? It must be, it isn't, isn't it? Please tell me it isn't, please? Otherwise all of my XML of the last X years has been invalid...
Nah, you must be bullshitting me, aren't you? Seriously, please tell me this is bullshit.

Re: XML Kōan from the Fourth Dimension

2007-10-12 09:09 • by pcooper
I think constructions like that may be legal in some applications of SGML. But not in XML.

I've always been amused that <_/> is a legal XML document, though.

Re: XML Kōan from the Fourth Dimension

2007-10-12 09:15 • by FDF (unregistered)
156874 in reply to 156872
David:

in order to meet the well known XML limitation of only allowing 5 characters per tag name.


I hope you're bullshiting me right? It must be, it isn't, isn't it? Please tell me it isn't, please? Otherwise all of my XML of the last X years has been invalid...
Nah, you must be bullshitting me, aren't you? Seriously, please tell me this is bullshit.


Of course he's not bullshitting you

Re: XML Kōan from the Fourth Dimension

2007-10-12 09:17 • by cynic (unregistered)
Imagine if namespaces were used to bridge the "limitation"

<dicti:onary>
<defin:ition>
</defin:ition>
</dicti:onary>

Re: XML Kōan from the Fourth Dimension

2007-10-12 09:25 • by DaveK
156877 in reply to 156872
David:

in order to meet the well known XML limitation of only allowing 5 characters per tag name.


I hope you're bullshiting me right? It must be, it isn't, isn't it? Please tell me it isn't, please? Otherwise all of my XML of the last X years has been invalid...
Nah, you must be bullshitting me, aren't you? Seriously, please tell me this is bullshit.


YHBT, fairly comprehensively!

Re: XML Kōan from the Fourth Dimension

2007-10-12 09:26 • by ObiWayneKenobi
WTF? I've seen plenty of XML that use more than 5 characters in tags. Are you telling me this isn't really valid XML, despite conforming to the W3C standards for XML?

Re: XML Kōan from the Fourth Dimension

2007-10-12 09:27 • by Talchas
156880 in reply to 156878
The five letters thing was sarcasm...

Re: XML Kōan from the Fourth Dimension

2007-10-12 09:31 • by AndyBee (unregistered)
156882 in reply to 156878
good grief.

In future perhaps we should have <sarcasm></sarcasm> tags?

Re: XML Kōan from the Fourth Dimension

2007-10-12 09:35 • by Sunstorm
156883 in reply to 156882
AndyBee:
good grief.

In future perhaps we should have <sarcasm></sarcasm> tags?

<srcsm></srcsm>, get it right.

Re: XML Kōan from the Fourth Dimension

2007-10-12 09:37 • by Darkstar (unregistered)
156885 in reply to 156880
Talchas:
The five letters thing was sarcasm...


<sarcasm>NOW you tell me, it has taken me hours to go through all our XML documents and redo the tags - not to mention building up a cross reference XML document indicating what the tags were and are now.</sarcasm>

Re: XML Kōan from the Fourth Dimension

2007-10-12 09:38 • by Grant (unregistered)
And even without the <sarcasm /> tags, it's probably worth noting that the XML specs are available for free on the iterweb. The interweb is a neat little tool that really helps software developers out. If you haven't checked it out yet, give it a try.

Re: XML Kōan from the Fourth Dimension

2007-10-12 09:40 • by Mean Mr. Mustard (unregistered)
Actually, all English communication is supposed to be words of five characters or less. About the only legal words on the Internet are those CAPTCHA thingies.

Re: XML Kōan from the Fourth Dimension

2007-10-12 09:42 • by An apprentice (unregistered)
156888 in reply to 156886
Grant:
And even without the <sarcasm /> tags, it's probably worth noting that the XML specs are available for free on the iterweb. The interweb is a neat little tool that really helps software developers out. If you haven't checked it out yet, give it a try.


Excuse me, but how exactly can a series of tubes help in software development?

Does anyone ever actually bother to change the subject line?

2007-10-12 09:42 • by Andy Goth (unregistered)
XML is overused and underused at the same time. I see it used for things that would have been handled better by traditional formats or custom formats. But then I also see ugly custom "I can't believe it's not XML!" formats used where XML would have done the trick nicely. I suspect the decision to use XML is primarily driven by marketing and management, rarely by developers who might actually have a clue about what's appropriate.

Also, I once saw XML used to contain HTML. A literal ampersand had to be written out thusly: &amp;amp; . In context it looked like this: <mydata>Hello &lt;b&gt;Alex &amp;amp; Jake!&lt;/b&gt;</mydata> . Wow.

Re: Does anyone ever actually bother to change the subject line?

2007-10-12 09:53 • by brazzy
156890 in reply to 156889
Andy Goth:
XML is overused and underused at the same time. I see it used for things that would have been handled better by traditional formats or custom formats. But then I also see ugly custom "I can't believe it's not XML!" formats used where XML would have done the trick nicely. I suspect the decision to use XML is primarily driven by marketing and management, rarely by developers who might actually have a clue about what's appropriate.

You forgot "badly used". I currently work on a large banking app that uses XML to communicate with a certain backend system - perfectly legitimate use of XML. Unfortunately, the backend system provider's understanding of XML is adequately summarized by this verbatim quote:

"The schemas are just considered documentation and are not binding. The actual binding specification of the communication format are these Excel sheets."

Re: XML Kōan from the Fourth Dimension

2007-10-12 10:06 • by Martin (unregistered)
All of the XHTML is bad then! <a href=""> </a> was illegal for so long!

Re: XML Kōan from the Fourth Dimension

2007-10-12 10:08 • by FredSaw
156894 in reply to 156883
Sunstorm:
AndyBee:
good grief.

In future perhaps we should have <sarcasm></sarcasm> tags?
<srcsm></srcsm>, get it right.
How about...
<sar:chasm>Signifying the expanse between what was said and what was understood</sar:chasm>

Re: XML Kōan from the Fourth Dimension

2007-10-12 10:10 • by gabba (unregistered)
I like how the dump was "XML-like". No doubt the programmer thought improving on XML was way better than conforming to the standard.

Re: Does anyone ever actually bother to change the subject line?

2007-10-12 10:11 • by FredSaw
156896 in reply to 156890
brazzy:
"The schemas are just considered documentation and are not binding. The actual binding specification of the communication format are these Excel sheets."
Terms like "justifiable homicide" spring to mind.

Re: XML Kōan from the Fourth Dimension

2007-10-12 10:14 • by Bosshog (unregistered)
156897 in reply to 156883
Sunstorm:
AndyBee:
good grief.

In future perhaps we should have <sarcasm></sarcasm> tags?

<srcsm></srcsm>, get it right.


Brillant! :D

Re: XML Kōan from the Fourth Dimension

2007-10-12 10:32 • by d3matt
156906 in reply to 156891
Martin:
All of the XHTML is bad then! <a href=""> </a> was illegal for so long!

Wrong. The tag is <a />. The href is an attribute of <a />. Also, href is less five characters as well so it is perfectly fine :)

Re: XML Kōan from the Fourth Dimension

2007-10-12 10:35 • by John Cowan (unregistered)
156909 in reply to 156873
Quite so. In full SGML, "</> means "End the current element", and "<>" means "Repeat the most recent start-tag". So "<p>foo</><>bar</>" was short for "<p>foo</p><p>bar</p>".

It's also true that the default maximum length for tags is 8 characters (not 5), though implementations could and did extend that.

Re: XML Kōan from the Fourth Dimension

2007-10-12 10:45 • by Charlie (unregistered)
156913 in reply to 156872
David:

in order to meet the well known XML limitation of only allowing 5 characters per tag name.


I hope you're bullshiting me right? It must be, it isn't, isn't it? Please tell me it isn't, please? Otherwise all of my XML of the last X years has been invalid...
Nah, you must be bullshitting me, aren't you? Seriously, please tell me this is bullshit.


You must be new here. This is how Alex et. al. do humor.

Re: XML Kōan from the Fourth Dimension

2007-10-12 10:46 • by LoL (unregistered)
Is it just me or is the real WTF some of these comments?

Re: XML Kōan from the Fourth Dimension

2007-10-12 10:49 • by Soul-Grinding Madness Looms (unregistered)
156917 in reply to 156909
John Cowan:
Quite so. In full SGML, "</> means "End the current element", and "<>" means "Repeat the most recent start-tag". So "<p>foo</><>bar</>" was short for "<p>foo</p><p>bar</p>".

Just one of many, many SGML WTFs, which seems to be designed to frustrate automatic processing as much as possible. Why they ever picked this to base HTML on... XML is basically SGML after de-WTF-ification. Not that XML is completely free of brain damage itself, mind you, but SGML is from the days where people thought that ADD A TO B GIVING C would be nice to have as a statement in a programming language.

Re: XML Kōan from the Fourth Dimension

2007-10-12 10:58 • by NiceWTF (unregistered)
156927 in reply to 156872
David:

in order to meet the well known XML limitation of only allowing 5 characters per tag name.


I hope you're bullshiting me right?


I think your sarcasm detector is broken.

Re: XML Kōan from the Fourth Dimension

2007-10-12 11:13 • by Hmmmm... (unregistered)
"The schemas are just considered documentation and are not binding. The actual binding specification of the communication format are these Excel sheets."


Doesn't seem unreasonable to me. You have normative (the spreadsheets) and descriptive (the schemas). If there is a inconsistency between the two, you hold up the defined "correct" copy as the standard and use that to fix the schemas...

Re: XML Kōan from the Fourth Dimension

2007-10-12 11:45 • by Synonymous Awkward (unregistered)
156953 in reply to 156917
Soul-Grinding Madness Looms:
John Cowan:
Quite so. In full SGML, "</> means "End the current element", and "<>" means "Repeat the most recent start-tag".

Just one of many, many SGML WTFs, which seems to be designed to frustrate automatic processing as much as possible.

Actually, if </> were enforced as the only way to end a tag, it'd make parsing a bit faster since well-formed-ness checking is so much simpler. Of course, you lose the sanity-checking provided by the repeated tag-name. And you've basically just reinvented S-exps.

But oh well. :-)

Re: XML Kōan from the Fourth Dimension

2007-10-12 12:00 • by snoofle
156958 in reply to 156917
Soul-Grinding Madness Looms:

... SGML is from the days where people thought that ADD A TO B GIVING C would be nice to have as a statement in a programming language.

I'm from those days, and I worked on a project where management thought it would be "cool" to use this new-fangled SGML (to represent component parts of documents that had been scanned in so they could be electronically updated instead of completely retyped; at the time, it was an improvement over what we had available). Of course, this was in the days where we had to hand-crank the computer to get it to do anything...

Re: XML Kōan from the Fourth Dimension

2007-10-12 12:05 • by FredSaw
156962 in reply to 156933
Hmmmm...:
"The schemas are just considered documentation and are not binding. The actual binding specification of the communication format are these Excel sheets."


Doesn't seem unreasonable to me. You have normative (the spreadsheets) and descriptive (the schemas). If there is a inconsistency between the two, you hold up the defined "correct" copy as the standard and use that to fix the schemas...
That makes sense. So, whenever a client wants to connect with a web or WCF service, the service can put the excel spreadsheet on a wooden table, take a polaroid of it, and fax that over to the client to establish the contract.

Re: Does anyone ever actually bother to change the subject line?

2007-10-12 12:05 • by Michael (unregistered)
156963 in reply to 156890
brazzy:
"The schemas are just considered documentation and are not binding. The actual binding specification of the communication format are these Excel sheets."

I can't wait until they migrate to Office 2007 with their MSOOXML Excel sheet, the infinite recursion should literally make people explode.

Re: XML Kōan from the Fourth Dimension

2007-10-12 12:10 • by Fish Basket Gordo (unregistered)
156965 in reply to 156894
FredSaw:
Sunstorm:
AndyBee:
good grief.

In future perhaps we should have <sarcasm></sarcasm> tags?
<srcsm></srcsm>, get it right.
How about...
<sar:chasm>Signifying the expanse between what was said and what was understood</sar:chasm>


What great wit. I didn't know Oscar Wilde had the Internet in the hereafter. (I am purposely leaving off any tags to indicate my tone.)

Re: Does anyone ever actually bother to change the subject line?

2007-10-12 12:40 • by jayh (unregistered)
156969 in reply to 156896
homocide:

accidental
justifiable
laudable

Re: XML Kōan from the Fourth Dimension

2007-10-12 12:50 • by Daniel15
156972 in reply to 156888
An apprentice:
Grant:
And even without the <sarcasm /> tags, it's probably worth noting that the XML specs are available for free on the iterweb. The interweb is a neat little tool that really helps software developers out. If you haven't checked it out yet, give it a try.


Excuse me, but how exactly can a series of tubes help in software development?

On the other hand... A large truck, now *that* would be useful.
:)

Re: Does anyone ever actually bother to change the subject line?

2007-10-12 12:52 • by Darien H (unregistered)
156973 in reply to 156889
Andy Goth:
Also, I once saw XML used to contain HTML. A literal ampersand had to be written out thusly: &amp;amp; . In context it looked like this: <mydata>Hello &lt;b&gt;Alex &amp;amp; Jake!&lt;/b&gt;</mydata> . Wow.


There was this MLS service which abruptly cut off the ability to downloads SQL dumps for a client and then said everyone had to use their badly-documented SOAP service and a demo/test site which would inexplicably fail every other invocation.

The XML specification for my query simply would not work when I put it into the soap body.

Then I found out that it wanted my XML in <![CDATA[ tags...

Re: XML Kōan from the Fourth Dimension

2007-10-12 12:57 • by brazzy
156976 in reply to 156933
Hmmmm...:
"The schemas are just considered documentation and are not binding. The actual binding specification of the communication format are these Excel sheets."


Doesn't seem unreasonable to me. You have normative (the spreadsheets) and descriptive (the schemas). If there is a inconsistency between the two, you hold up the defined "correct" copy as the standard and use that to fix the schemas...

You've never used non-trivial XML communication, have you?
XML schemas are designed and perfectly fit to provide a formal normative description of an XML format. Using Excel sheets means you throw away all formal regidity and introduce endless possibilities for inconsistence, vagueness and openness to interpretation that would simply not exist if you used a schema.

Using Excel sheets instead of a schema to specify an XML format is like being faced with the task of putting a nail into a wall and deliberately choosing an empty bottle instead of a (prefectly available) hammer as a tool.

Re: Does anyone ever actually bother to change the subject line?

2007-10-12 13:04 • by Desperate for True WTFs (unregistered)
156977 in reply to 156890
brazzy:
Andy Goth:
XML is overused and underused at the same time. I see it used for things that would have been handled better by traditional formats or custom formats. But then I also see ugly custom "I can't believe it's not XML!" formats used where XML would have done the trick nicely. I suspect the decision to use XML is primarily driven by marketing and management, rarely by developers who might actually have a clue about what's appropriate.

You forgot "badly used". I currently work on a large banking app that uses XML to communicate with a certain backend system - perfectly legitimate use of XML. Unfortunately, the backend system provider's understanding of XML is adequately summarized by this verbatim quote:

"The schemas are just considered documentation and are not binding. The actual binding specification of the communication format are these Excel sheets."


And this is why I continue to read Daily WTF. Even when the owners of the site can't manage to post a real WTF for weeks on end, occasionally, a user comment strikes WTF gold!

Re: XML Kōan from the Fourth Dimension

2007-10-12 13:08 • by Edward Royce (unregistered)
Hmmmm.

<spoon></spoon>

Re: Does anyone ever actually bother to change the subject line?

2007-10-12 13:12 • by Nonymous (unregistered)
156983 in reply to 156889
Andy Goth:
XML is overused and underused at the same time. I see it used for things that would have been handled better by traditional formats or custom formats. But then I also see ugly custom "I can't believe it's not XML!" formats used where XML would have done the trick nicely. I suspect the decision to use XML is primarily driven by marketing and management, rarely by developers who might actually have a clue about what's appropriate.

Also, I once saw XML used to contain HTML. A literal ampersand had to be written out thusly: &amp;amp; . In context it looked like this: <mydata>Hello &lt;b&gt;Alex &amp;amp; Jake!&lt;/b&gt;</mydata> . Wow.


Or they could just use a CDATA block...

Re: XML Kōan from the Fourth Dimension

2007-10-12 13:18 • by AdT (unregistered)
156985 in reply to 156872
David:
I hope you're bullshiting me right?


Hi David,

the irony meter manufacturer just called. Your device is malfunctioning. Please take it to the nearest service station ASAP.

Re: XML Kōan from the Fourth Dimension

2007-10-12 13:25 • by AdT (unregistered)
156987 in reply to 156976
brazzy:
Using Excel sheets means you throw away all formal regidity and introduce endless possibilities for inconsistence, vagueness and openness to interpretation that would simply not exist if you used a schema.


Not to mention that there aren't too many tools that can automatically check the validity of an XML document using an Excel spreadsheet "schema".

Re: XML Kōan from the Fourth Dimension

2007-10-12 13:39 • by G Money (unregistered)
156992 in reply to 156886
Grant:
And even without the <sarcasm /> tags, it's probably worth noting that the XML specs are available for free on the iterweb. The interweb is a neat little tool that really helps software developers out. If you haven't checked it out yet, give it a try.


I'll wait until it's available on computers.

Re: XML Kōan from the Fourth Dimension

2007-10-12 13:45 • by phaedrus
156995 in reply to 156914
LoL:
Is it just me or is the real WTF some of these comments?


You must also be new here. TRWTF(tm) is always the comments. That's why you read them.

Re: XML Kōan from the Fourth Dimension

2007-10-12 13:56 • by Pablo (unregistered)
156996 in reply to 156927
The guy hoped it was a joke, and begged for confirmation.His sarcasm detector was fine.

His hope that anything in IT, anywhere, ever, is even remotely sane, is right on the edge. It's okay David. You don't have to break down yet.

Re: Does anyone ever actually bother to change the subject line?

2007-10-12 14:39 • by gasman
157003 in reply to 156983
Nonymous:
Andy Goth:

Also, I once saw XML used to contain HTML. A literal ampersand had to be written out thusly: &amp;amp; . In context it looked like this: <mydata>Hello &lt;b&gt;Alex &amp;amp; Jake!&lt;/b&gt;</mydata> . Wow.


Or they could just use a CDATA block...


...and turn an ugly-but-working data format into one that breaks as soon as your user input contains the characters ]]>.

Re: XML Kōan from the Fourth Dimension

2007-10-12 14:39 • by Zygo (unregistered)
157004 in reply to 156917
Soul-Grinding Madness Looms:
Just one of many, many SGML WTFs, which seems to be designed to frustrate automatic processing as much as possible. Why they ever picked this to base HTML on...


Because the available alternatives at the time were SGML, ASN.1, Gopher's hypertext format, or something from Microsoft.

If the timing had played out a little differently, web browsers would have shipped with BSD and web pages would probably end up written in m4 or something. <shiver>

Re: XML Kōan from the Fourth Dimension

2007-10-12 15:16 • by iMalc (unregistered)
Of course we all know it was supposed to be:
<>There is no spoon</>
« PrevPage 1 | Page 2 | Page 3Next »

Add Comment