From: Michael Orlitzky Date: Fri, 4 Jul 2014 20:29:36 +0000 (-0400) Subject: Add a note to the man page about the double-SMS in a news sample. X-Git-Tag: 0.0.6~34 X-Git-Url: http://gitweb.michael.orlitzky.com/?p=dead%2Fhtsn-import.git;a=commitdiff_plain;h=449d86461d8afd7839de750ec48339a4c0f735d0 Add a note to the man page about the double-SMS in a news sample. --- diff --git a/doc/man1/htsn-import.1 b/doc/man1/htsn-import.1 index 7a215b1..aebfb06 100644 --- a/doc/man1/htsn-import.1 +++ b/doc/man1/htsn-import.1 @@ -268,6 +268,21 @@ construct the DTDs ourselves, the results are sometimes inconsistent. Here we document a few of them. .IP \[bu] 2 +\fInewsxml.dtd\fR + +The TSN DTD for news (and almost all XML on the wire) suggests that +there is a exactly one (possibly-empty) element present in each +message. However, we have seen an example (XML_File_ID 21232353) where +an empty followed a non-empty one: + +.fi +Odd Man Rush: Snow under pressure to improve Isles quickly + +.nf + +We don't parse this case at the moment. + +.IP \[bu] \fIOdds_XML.dtd\fR The elements here are supposed to be associated with a set of