Absolutely! So that’s why the scraper sends pull requests – to benefit from git’s version control.
For instance, the auto pull request sent on Friday already shows a change to a DAC CRS code – OOF was removed from flow types. You can see this in the pull request diff:
If you check the ‘type of flow’ sheet of the DAC CRS codelist xls, you can confirm this: code 20 (Other Official Flows) has gone.
Note that I’m following the same model here as mySociety’s EveryPoliticianBot. One improvement would be to create human-readable descriptions of the pull requests, as that bot does – rather than having to read diffs. But in general, the diffs here are likely to be small and relatively easy to understand.
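To give a rough idea of what a human-readable description might involve, here's a minimal Python sketch that compares two versions of a codelist and writes one sentence per change. It's not how EveryPoliticianBot or the scraper actually does it. The function name `describe_changes`, the `{code: name}` dict shape and the sample data are all assumptions made for illustration.

```python
def describe_changes(old, new):
    """Describe the differences between two {code: name} mappings.

    Hypothetical helper: assumes each codelist version has already been
    parsed into a plain dict keyed by code.
    """
    lines = []
    # Codes present before but missing now
    for code in sorted(old.keys() - new.keys()):
        lines.append(f"Removed code {code} ({old[code]})")
    # Codes that are new in this version
    for code in sorted(new.keys() - old.keys()):
        lines.append(f"Added code {code} ({new[code]})")
    # Codes present in both versions whose name has changed
    for code in sorted(old.keys() & new.keys()):
        if old[code] != new[code]:
            lines.append(f"Renamed code {code}: {old[code]} -> {new[code]}")
    return lines


# Illustrative sample data only, not the real flow type codelist
old = {"10": "ODA", "20": "Other Official Flows"}
new = {"10": "ODA"}

print("\n".join(describe_changes(old, new)))
# prints: Removed code 20 (Other Official Flows)
```

The "Removed" lines are also the starting point for a list of withdrawn codes, since each one is a code that was in the previous version but not the current one.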
The next step here would be to build on @bjwebb’s work, to pull this stuff into non-embedded codelists and maintain a list of withdrawn codes: