X-Git-Url: https://gitweb.michael.orlitzky.com/?a=blobdiff_plain;f=doc%2FTODO;h=9ae15d28ff69fa94554a8c544ebe6ffc70b996fc;hb=7b33eb87f59d46ec97d619f7521c067d8a6b9308;hp=b399605601343013e70805b5cd886f4c0c306aa7;hpb=8eabc3df8903fdf6be13b3d17fa24edf71d7723c;p=dead%2Fhtsn-import.git

diff --git a/doc/TODO b/doc/TODO
index b399605..9ae15d2 100644
--- a/doc/TODO
+++ b/doc/TODO
@@ -1,34 +1,12 @@
-1. Re-enable DTD validation once
-   https://github.com/UweSchmidt/hxt/issues/15 is fixed.
-
-2. Write a test for test/xml/Odds_XML-long-import.xml once it no
-   longer takes 10 minutes to import (Postgres only?).
-
-3. We have DTDs but no sample XML for the following SportInfo types,
+1. We have DTDs but no sample XML for the following SportInfo types,
    which have therefore been left unimplmented for now:
 
-   * Cbask_Indv_No_Avg_XML.dtd
-   * Cbask_Indv_Shooting_XML.dtd
    * CBASK_StealsXML.dtd
-   * Cbask_Team_Scoring_Rebound_Margin_XML.dtd
-   * Cbask_Team_Scoring_XML.dtd
-   * Cbask_Team_Shooting_Pct_XML.dtd
-   * NFLGrassTurfDomeOutsideXML.dtd
-   * NFLInterceptionLeadersXML.dtd
-   * NFLPassingLeadersXML.dtd
-   * NFLReceivingLeadersXML.dtd
-   * NFLRushingLeadersXML.dtd
-   * NFLTackleFFLeadersXML.dtd
-   * NFLTopKickoffReturnXML.dtd
-   * NFLTopPuntReturnXML.dtd
-   * NFLYardsXML.dtd
-   * NFL_PuntingLeaders_XML.dtd
 
-4. The following DTD types were handled (in some form) by the old
+2. The following DTD types were handled (in some form) by the old
    FeedGrabber. They are not yet handled by htsn-import (some may not
    be valid):
 
-   * AutoRacingDriverList
    * AutoRacingGridXML
    * CBASK_Individual_Stats_XML
    * Cbask_Roster_XML
@@ -36,11 +14,9 @@
    * CBATeamScheduleXML
    * CFLTeamScheduleXML
    * CFLTotalTeamScheduleXML
-   * earlylineXML
    * Minor_Baseball_TeamScheduleXML
    * MinorLeagueHockeyTeamScheduleXML
    * MLB_Boxscore_XML
-   * MLB_earlylineXML
    * MLB_IndividualStats_XML
    * MLB_Probable_Pitchers_XML
    * MLB_Roster_XML
@@ -63,5 +39,15 @@
    * WNBA_Individual_Stats_XML
    * WNBATeamScheduleXML
 
-5. Consolidate all of the make_game_time functions which take a
+3. Consolidate all of the make_game_time functions which take a
    date/time and produce a combined time.
+
+4. Factor out test code where possible; a lot of them differ only in
+   the filename.
+
+5. Combine test XML files where possible. We don't need to e.g. import
+   two copies of newsxml just because one has an empty attribute that
+   we want to test: we could just delete an attribute from the first
+   file.
+
+6. Update the Auto_Racing_Schedule_XML schema diagram.