X-Git-Url: http://gitweb.michael.orlitzky.com/?a=blobdiff_plain;f=doc%2Fman1%2Fhtsn-import.1;h=5e3d5ac21a720c70b38f1e1defb90020f0276e95;hb=b2c39ebe5ff9c1ea3224231df5078c52d0ad8737;hp=7a215b142c420e2931045a895daa1d3ebba974c6;hpb=5e06d6a189fd5bc1cbc67a349bbee5e168d3bf24;p=dead%2Fhtsn-import.git

diff --git a/doc/man1/htsn-import.1 b/doc/man1/htsn-import.1
index 7a215b1..5e3d5ac 100644
--- a/doc/man1/htsn-import.1
+++ b/doc/man1/htsn-import.1
@@ -268,6 +268,21 @@ construct the DTDs ourselves, the results are sometimes inconsistent.
 Here we document a few of them.
 
 .IP \[bu] 2
+\fInewsxml.dtd\fR
+
+The TSN DTD for news (and almost all XML on the wire) suggests that
+there is a exactly one (possibly-empty) element present in each
+message. However, we have seen an example (XML_File_ID 21232353) where
+an empty followed a non-empty one:
+
+.fi
+Odd Man Rush: Snow under pressure to improve Isles quickly
+
+.nf
+
+We don't parse this case at the moment.
+
+.IP \[bu]
 \fIOdds_XML.dtd\fR
 
 The elements here are supposed to be associated with a set of
@@ -285,7 +300,7 @@ There appear to be two types of weather documents; the first has
 contained within . While it would be possible to parse both,
 it would greatly complicate things. The first form is more common, so
 that's all we support for now. An example is provided as
-schemagen/weatherxml/20143655.xml.
+doc/xml-samples/weird-weatherxml.xml.
 
 .SH DEPLOYMENT
 .P