From: Michael Orlitzky Date: Tue, 22 Jul 2014 18:04:03 +0000 (-0400) Subject: Document weird date/time issues in the man page. X-Git-Tag: 0.0.9~19 X-Git-Url: https://gitweb.michael.orlitzky.com/?a=commitdiff_plain;h=e3ebe2e3e72155cd0e526386ac4eb970152255d7;p=dead%2Fhtsn-import.git Document weird date/time issues in the man page. --- diff --git a/doc/man1/htsn-import.1 b/doc/man1/htsn-import.1 index 616f7e8..9c76165 100644 --- a/doc/man1/htsn-import.1 +++ b/doc/man1/htsn-import.1 @@ -316,6 +316,137 @@ report these as unsupported and then \(dqsucceed\(dq so that the offending document can be removed if desired. An example is provided as test/xml/weatherxml-backwards-teams.xml. +.SH DATE/TIME ISSUES + +Dates and times appear in a number of places on the feed. The date +portions are usually, fine, but the times often lack important +information such as the time zone, or whether \(dq8 o'clock\(dq means +a.m. or p.m. + +The most pervasive issue occurs with the timestamps that are included +in every message. A typical timestamp looks like, + +.nf + May 24, 2014, at 04:18 PM ET +.fi + +The \(dqtime zone\(dq is given as \(dqET\(dq, but unfortunately +\(dqET\(dq is not a valid time zone. It stands for \(dqEastern +Time\(dq, which can belong to either of two time zones, EST or EDT, +based on the time of the year (i.e. whether or not daylight savings +time is in effect). Since we can't tell from the timestamp, we always +parse these as EST which is UTC-5. When daylight savings is in effect, +they will be off by an hour. + +Here's a list of the ones that may cause surprises: + +.IP \[bu] 2 +\fIAutoRacingResultsXML.dtd\fR + +The elements contain a full date and time, but no time zone +information: + +.nf +5/24/2014 2:45:00 PM +.fi + +We parse them as UTC, which will be wrong when stored, +but \(dqcorrect\(dq if the new UTC time zone is ignored. + +.IP \[bu] +\fIAuto_Racing_Schedule_XML.dtd\fR + +The and elements are combined into on field in +the database, but no time zone information is given. For example, + +.nf +02/16/2013 +08:10 PM +.fi + +As a result, we parse and store the times as UTC. The race times are +not always present in the database, but when they are missing, they +are presented as \(dqTBA\(dq (to be announced): + +.nf +TBA +.fi + +Since the dates do not appear to be optional, we store only the race +date in that case. + +.IP \[bu] +\fIearlylineXML.dtd\fR + +The