Update notes for 2020-10-22

2020-10-22 20:07:42 +03:00
parent 8614add1c9
commit f60bb8b10f
22 changed files with 67 additions and 27 deletions

@ -640,5 +640,20 @@ Authorization denied for action WORKFLOW_STEP_1 on COLLECTION:1072 by user 1759
- The submit step is defined, but has no users or groups
- I added the IITA submitters there and told Bosede to try again
- Add two new blocks to list the top communities and collections on AReS
- I want to extract all CRPs and affiliations from AReS to do some text processing and create some mappings...
- First extract 10,000 affiliations from Elasticsearch by only including the `affiliation` source:
```
$ http 'http://localhost:9200/openrxv-items-final/_search?_source_includes=affiliation&size=10000&q=*:*' > /tmp/affiliations.json
```
- Then I decided to try a different approach and I adjusted my `convert-mapping.py` script to re-consider some replacement patterns with acronyms from the original AReS `mapping.json` file to hopefully address some MEL to CGSpace mappings
- For example, to change this:
- find: International Livestock Research Institute
- replace: International Livestock Research Institute - ILRI
- ... into this:
- find: International Livestock Research Institute - ILRI
- replace: International Livestock Research Institute
- I re-uploaded the mappings to Elasticsearch like I did yesterday and restarted the harvesting
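- A minimal sketch of that find/replace swap in Python, assuming each mapping is a dict with `find` and `replace` keys (the real `mapping.json` layout and the logic in `convert-mapping.py` may differ; `invert_acronym_mapping` is a hypothetical helper name):
```python
import re

# Assumption: a replacement "with an acronym" looks like "<name> - <ACRONYM>".
ACRONYM_SUFFIX = re.compile(r"^(?P<name>.+) - (?P<acronym>[A-Z][A-Za-z]+)$")


def invert_acronym_mapping(mapping):
    """Swap find/replace when replace is just the find value plus an acronym."""
    match = ACRONYM_SUFFIX.match(mapping["replace"])
    if match and match.group("name") == mapping["find"]:
        # The acronym form becomes the pattern; the plain name becomes the result.
        return {"find": mapping["replace"], "replace": mapping["find"]}
    # Anything else is passed through unchanged.
    return mapping


# Illustrative entries only; the second one is made up to show a pass-through.
mappings = [
    {
        "find": "International Livestock Research Institute",
        "replace": "International Livestock Research Institute - ILRI",
    },
    {"find": "ILRI", "replace": "International Livestock Research Institute"},
]

converted = [invert_acronym_mapping(m) for m in mappings]
```
- Only entries whose replacement is exactly the find value plus a ` - ACRONYM` suffix are swapped, so unrelated mappings stay as they were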
<!-- vim: set sw=2 ts=2: -->