User:Worf/Pokemon/Parser specs: Difference between revisions

No edit summary
 
(One intermediate revision by the same user not shown)
Line 57: Line 57:


Parse all output files a<AID>_[characters|pokemon].out to create a list of character and pokemon ids that are not present in any of the lists. (Take highest id, create list 1-<highest id>, remove all appearing ids from list, output list in missing_[characters|pokemon].out. Incorporate blacklist_[characters|pokemon].in file to remove further characters from the list that arent anime characters (pokemon live show characters for example).
Parse all output files a<AID>_[characters|pokemon].out to create a list of character and pokemon ids that are not present in any of the lists. (Take highest id, create list 1-<highest id>, remove all appearing ids from list, output list in missing_[characters|pokemon].out. Incorporate blacklist_[characters|pokemon].in file to remove further characters from the list that arent anime characters (pokemon live show characters for example).
=v2 Character parser=
Compare data already added to anidb with data from pm.net using http://anidb.net/perl-bin/animedb.pl?show=report&report.type=worf_pokemon_characterlist&report.csv=1&do.report=Generate+Report
Mappings file between pm.net and anidb: http://wiki.anidb.net/w/User:Worf/Pokemon/Parser_specs_mapping
<pre>
charid -> anidb.net character id
identifier -> pm.net character id of the character; http://pocketmonsters.net/character/<identifier> (remove leading 0s)
pokedexid -> pokedex id the character is a guise of; http://pocketmonsters.net/dex/<pokedexid> (remove leading 0s)
pokemon -> comma-separated list of pm.net character ids; list of pokemon the character owns
eids -> list of anidb.net episode ids the character appears in; check against pm.net->anidb.net mappings file
names -> list of comma-separated list of names; each element consists of name, type and language divided by ||
other -> description of the character
</pre>
Data that should be compared:
- whether or not a name differs between the two systems and whether a name is present in one but missing on the other system
- whether the description of a character is the same on both systems
- if the character is a pokemon (has a guise relation on anidb / has a "Pokemon Species" set on pm.net) check whether the relation is set on both systems
- whether all trainer->pokemon relations are set
- whether the pokemon->trainer relation is correctly set
- whether the episode appearances are the same on both systems
Output:
- Differences as specified above in a humanly readable format (pm.net_id, identifier, differing field(s))
- List of pm.net character IDs that don't appear in the report (ignore blacklisted IDs read from blacklist file)
=v2 Episode parser=
---todo---
MediaWiki spam blocked by CleanTalk.
MediaWiki spam blocked by CleanTalk.