X-Git-Url: http://gitweb.michael.orlitzky.com/?a=blobdiff_plain;f=doc%2FTODO;h=9ae15d28ff69fa94554a8c544ebe6ffc70b996fc;hb=HEAD;hp=67a7727d158126935523778b8f6d81d4bf5f1083;hpb=03540a4a36867165073fc811b7ed3a89fea6786a;p=dead%2Fhtsn-import.git diff --git a/doc/TODO b/doc/TODO index 67a7727..9ae15d2 100644 --- a/doc/TODO +++ b/doc/TODO @@ -1,46 +1,12 @@ -1. Re-enable DTD validation once - https://github.com/UweSchmidt/hxt/issues/15 is fixed. - -2. Write a test for test/xml/Odds_XML-long-import.xml once it no - longer takes 10 minutes to import (Postgres only?). - -3. Add support the the second type of weatherxml (see man page). - -4. There are a few remaining document types that we need to parse - before "version one point oh." This list refers to an - old proprietary implementation, sorry: - - * Schedule_Changes_XML - -5. Test addition of a new field (weatherxml) and ensure that the - database schema is updated accordingly. Document this fact in the - man page. - -6. We have DTDs but no sample XML for the following SportInfo types, +1. We have DTDs but no sample XML for the following SportInfo types, which have therefore been left unimplmented for now: - * Cbask_Indv_No_Avg_XML.dtd - * Cbask_Indv_Shooting_XML.dtd * CBASK_StealsXML.dtd - * Cbask_Team_Scoring_Rebound_Margin_XML.dtd - * Cbask_Team_Scoring_XML.dtd - * Cbask_Team_Shooting_Pct_XML.dtd - * NFLGrassTurfDomeOutsideXML.dtd - * NFLInterceptionLeadersXML.dtd - * NFLPassingLeadersXML.dtd - * NFLReceivingLeadersXML.dtd - * NFLRushingLeadersXML.dtd - * NFLTackleFFLeadersXML.dtd - * NFLTopKickoffReturnXML.dtd - * NFLTopPuntReturnXML.dtd - * NFLYardsXML.dtd - * NFL_PuntingLeaders_XML.dtd -7. The following DTD types were handled (in some form) by the old +2. The following DTD types were handled (in some form) by the old FeedGrabber. They are not yet handled by htsn-import (some may not be valid): - * AutoRacingDriverList * AutoRacingGridXML * CBASK_Individual_Stats_XML * Cbask_Roster_XML @@ -48,11 +14,9 @@ * CBATeamScheduleXML * CFLTeamScheduleXML * CFLTotalTeamScheduleXML - * earlylineXML * Minor_Baseball_TeamScheduleXML * MinorLeagueHockeyTeamScheduleXML * MLB_Boxscore_XML - * MLB_earlylineXML * MLB_IndividualStats_XML * MLB_Probable_Pitchers_XML * MLB_Roster_XML @@ -74,3 +38,16 @@ * WNBA_Boxscore_XML * WNBA_Individual_Stats_XML * WNBATeamScheduleXML + +3. Consolidate all of the make_game_time functions which take a + date/time and produce a combined time. + +4. Factor out test code where possible; a lot of them differ only in + the filename. + +5. Combine test XML files where possible. We don't need to e.g. import + two copies of newsxml just because one has an empty attribute that + we want to test: we could just delete an attribute from the first + file. + +6. Update the Auto_Racing_Schedule_XML schema diagram.