X-Git-Url: http://gitweb.michael.orlitzky.com/?a=blobdiff_plain;f=doc%2Fman1%2Fhtsn-import.1;h=aebfb062bd0c4030eb047e8791bc82e17e286189;hb=449d86461d8afd7839de750ec48339a4c0f735d0;hp=7a215b142c420e2931045a895daa1d3ebba974c6;hpb=5a8d0ad5929fb297c74ea0802c11f9aa94b25ce7;p=dead%2Fhtsn-import.git

diff --git a/doc/man1/htsn-import.1 b/doc/man1/htsn-import.1
index 7a215b1..aebfb06 100644
--- a/doc/man1/htsn-import.1
+++ b/doc/man1/htsn-import.1
@@ -268,6 +268,21 @@ construct the DTDs ourselves, the results are sometimes inconsistent.
 Here we document a few of them.
 
 .IP \[bu] 2
+\fInewsxml.dtd\fR
+
+The TSN DTD for news (and almost all XML on the wire) suggests that
+there is a exactly one (possibly-empty) element present in each
+message. However, we have seen an example (XML_File_ID 21232353) where
+an empty followed a non-empty one:
+
+.fi
+Odd Man Rush: Snow under pressure to improve Isles quickly
+
+.nf
+
+We don't parse this case at the moment.
+
+.IP \[bu]
 \fIOdds_XML.dtd\fR
 
 The elements here are supposed to be associated with a set of