X-Git-Url: http://gitweb.michael.orlitzky.com/?a=blobdiff_plain;f=doc%2Fman1%2Fhtsn-import.1;h=a9a2f3c2f2bf8f522f9821c3b65cdc0b85ea52ca;hb=57959023bb6ff17726eb10a83869a7c4c5deec1a;hp=2c60d2f2474e2eff61e489abe98570743eb614ba;hpb=06ad1384dc1f6db82bf1d6cef8ce005c9fc985f2;p=dead%2Fhtsn-import.git diff --git a/doc/man1/htsn-import.1 b/doc/man1/htsn-import.1 index 2c60d2f..a9a2f3c 100644 --- a/doc/man1/htsn-import.1 +++ b/doc/man1/htsn-import.1 @@ -48,13 +48,9 @@ pickle/unpickle everything already, this should be impossible. The XML document types obtained from the feed are uniquely identified by their DTDs. We currently support documents with the following DTDs: .IP \[bu] 2 -Auto_Racing_Schedule_XML.dtd -.IP \[bu] -CBASK_Lineup_XML.dtd (GameInfo) +AutoRacingResultsXML.dtd .IP \[bu] -cbaskpreviewxml.dtd (GameInfo) -.IP \[bu] -cflpreviewxml.dtd (GameInfo) +Auto_Racing_Schedule_XML.dtd .IP \[bu] Heartbeat.dtd .IP \[bu] @@ -62,43 +58,129 @@ Injuries_Detail_XML.dtd .IP \[bu] injuriesxml.dtd .IP \[bu] -Matchup_NBA_NHL_XML.dtd (GameInfo) +newsxml.dtd +.IP \[bu] +Odds_XML.dtd +.IP \[bu] +scoresxml.dtd .IP \[bu] -MLB_Gaming_Matchup_XML.dtd (GameInfo) +weatherxml.dtd .IP \[bu] -MLB_Lineup_XML.dtd (GameInfo) +GameInfo +.RS .IP \[bu] -MLB_Matchup_XML.dtd (GameInfo) +CBASK_Lineup_XML.dtd .IP \[bu] -MLS_Preview_XML.dtd (GameInfo) +cbaskpreviewxml.dtd .IP \[bu] -mlbpreviewxml.dtd (GameInfo) +cflpreviewxml.dtd .IP \[bu] -NBA_Gaming_Matchup_XML.dtd (GameInfo) +Matchup_NBA_NHL_XML.dtd .IP \[bu] -NBA_Playoff_Matchup_XML.dtd (GameInfo) +MLB_Gaming_Matchup_XML.dtd .IP \[bu] -NBALineupXML.dtd (GameInfo) +MLB_Lineup_XML.dtd .IP \[bu] -nbapreviewxml.dtd (GameInfo) +MLB_Matchup_XML.dtd .IP \[bu] -NCAA_FB_Preview_XML.dtd (GameInfo) +MLS_Preview_XML.dtd .IP \[bu] -newsxml.dtd +mlbpreviewxml.dtd .IP \[bu] -nhlpreviewxml.dtd (GameInfo) +NBA_Gaming_Matchup_XML.dtd .IP \[bu] -Odds_XML.dtd +NBA_Playoff_Matchup_XML.dtd .IP \[bu] -recapxml.dtd (GameInfo) +NBALineupXML.dtd .IP \[bu] -scoresxml.dtd +nbapreviewxml.dtd .IP \[bu] -weatherxml.dtd +NCAA_FB_Preview_XML.dtd +.IP \[bu] +NFL_NCAA_FB_Matchup_XML.dtd +.IP \[bu] +nflpreviewxml.dtd +.IP \[bu] +nhlpreviewxml.dtd +.IP \[bu] +recapxml.dtd +.IP \[bu] +WorldBaseballPreviewXML.dtd +.RE +.IP \[bu] +SportInfo +.RS +.IP \[bu] +CBASK_3PPctXML.dtd +.IP \[bu] +Cbask_All_Tourn_Teams_XML.dtd +.IP \[bu] +CBASK_AssistsXML.dtd +.IP \[bu] +Cbask_Awards_XML.dtd +.IP \[bu] +CBASK_BlocksXML.dtd +.IP \[bu] +Cbask_Conf_Standings_XML.dtd +.IP \[bu] +Cbask_DivII_III_Indv_Stats_XML.dtd +.IP \[bu] +Cbask_DivII_Team_Stats_XML.dtd +.IP \[bu] +Cbask_DivIII_Team_Stats_XML.dtd +.IP \[bu] +CBASK_FGPctXML.dtd +.IP \[bu] +CBASK_FoulsXML.dtd +.IP \[bu] +CBASK_FTPctXML.dtd +.IP \[bu] +Cbask_Indv_Scoring_XML.dtd +.IP \[bu] +CBASK_MinutesXML.dtd +.IP \[bu] +Cbask_Polls_XML.dtd +.IP \[bu] +CBASK_ReboundsXML.dtd +.IP \[bu] +CBASK_ScoringLeadersXML.dtd +.IP \[bu] +Cbask_Team_ThreePT_Made_XML.dtd +.IP \[bu] +Cbask_Team_ThreePT_PCT_XML.dtd +.IP \[bu] +Cbask_Team_Win_Pct_XML.dtd +.IP \[bu] +Cbask_Top_Twenty_Five_XML.dtd +.IP \[bu] +CBASK_TopTwentyFiveResult_XML.dtd +.IP \[bu] +Cbask_Tourn_Awards_XML.dtd +.IP \[bu] +Cbask_Tourn_Champs_XML.dtd +.IP \[bu] +Cbask_Tourn_Indiv_XML.dtd +.IP \[bu] +Cbask_Tourn_Leaders_XML.dtd +.IP \[bu] +Cbask_Tourn_MVP_XML.dtd +.IP \[bu] +Cbask_Tourn_Records_XML.dtd +.IP \[bu] +LeagueScheduleXML.dtd +.IP \[bu] +minorscoresxml.dtd +.IP \[bu] +Minor_Baseball_League_Leaders_XML.dtd +.IP \[bu] +Minor_Baseball_Standings_XML.dtd +.IP \[bu] +Minor_Baseball_Transactions_XML.dtd +.RE .P -The GameInfo and SportsInfo types do not have their own top-level +The GameInfo and SportInfo types do not have their own top-level tables in the database. Instead, their raw XML is stored in either the -\(dqgame_info\(dq or \(dqsports_info\(dq table respectively. +\(dqgame_info\(dq or \(dqsport_info\(dq table respectively. .SH DATABASE SCHEMA .P @@ -143,11 +225,11 @@ unique constraint in the top-level table's \(dqxml_file_id\(dq will prevent duplication in this case anyway. .P The aforementioned exceptions are the \(dqgame_info\(dq and -\(dqsports_info\(dq tables. These tables contain the raw XML for a +\(dqsport_info\(dq tables. These tables contain the raw XML for a number of DTDs that are not handled individually. This is partially for backwards-compatibility with a legacy implementation, but is mostly a stopgap due to a lack of resources at the moment. These two -tables (game_info and sports_info) still possess timestamps that allow +tables (game_info and sport_info) still possess timestamps that allow us to prune old data. .P UML diagrams of the resulting database schema for each XML document