I find that with "IATI-wide queries" across all available data, there is quite a bit of curation to do:
- Uppercase and lowercase differences in identifiers (gb-CHC-... versus GB-CHC-...)
- Not (consistently) using an organisation identifier
- Changes of organisation identifier (NL-1 became XM-DAC-7, but most activities are still identified with NL-1-...)
- Using different organisation identifiers (ICCO, like most Dutch NGOs, has several legal entities in the Chamber of Commerce, which one to pick?)
- Dealing with secondary publishers: UNOCHA publishes FTS information on activities that have different donor codes (XM-OCHA-FTS5191 as yet another identifier for NL or NL-1/XM-DAC-7)
- Funky activity identifiers containing symbols like #,$,*,',| etc that often break standard tools without pre-processing
- Combining multiple organisations in one participating-org name, or using generic descriptions ("Oxfam partner(s)")
- Outdated datasets that are still active in the IATI Registry
I've done a query on all activities from NGOs (types 21,22,23) with recipient-country Nepal: it's a big graph, and there are some errors on my end to resolve, but it's the sort of output that helps me feedback to organisations on their data quality.
For instance, use "Find" and look for "ActionAid United Kingdom", you'll see:
* GB-CHC-274467 (ActionAid United Kingdom) working with "ActionAid Nepal"
* GB-CHC-27446721 (ActionAid UK) working with "ACTIONAID NEPAL"
Hope this is helpful, although perhaps not hopeful :-)