mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-27 05:49:12 +01:00
Update notes for 2020-10-22
@ -640,5 +640,20 @@ Authorization denied for action WORKFLOW_STEP_1 on COLLECTION:1072 by user 1759
- The submit step is defined, but has no users or groups
- I added the IITA submitters there and told Bosede to try again
- Add two new blocks to list the top communities and collections on AReS
- I want to extract all CRPs and affiliations from AReS to do some text processing and create some mappings...
- First extract 10,000 affiliations from Elasticsearch by only including the `affiliation` source:

```
$ http 'http://localhost:9200/openrxv-items-final/_search?_source_includes=affiliation&size=10000&q=*:*' > /tmp/affiliations.json
```

- Then I decided to try a different approach: I adjusted my `convert-mapping.py` script to reconsider some replacement patterns with acronyms from the original AReS `mapping.json` file, to hopefully address some MEL to CGSpace mappings
- For example, to change this:
  - find: International Livestock Research Institute
  - replace: International Livestock Research Institute - ILRI
- ... into this:
  - find: International Livestock Research Institute - ILRI
  - replace: International Livestock Research Institute
- I re-uploaded the mappings to Elasticsearch like I did yesterday and restarted the harvesting
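
For the text processing on the extracted affiliations, a minimal sketch could look like the following. It assumes the saved file is a standard Elasticsearch search response (`hits.hits[]._source`) and that `affiliation` holds either a single string or a list of strings; the function names `count_affiliations` and `load_counts` are hypothetical and not part of any existing script.

```python
import json
from collections import Counter


def count_affiliations(response: dict) -> Counter:
    """Count affiliation strings in an Elasticsearch search response."""
    counts = Counter()
    for hit in response.get("hits", {}).get("hits", []):
        value = hit.get("_source", {}).get("affiliation")
        if value is None:
            continue
        # Assumption: the field is either one string or a list of strings
        values = [value] if isinstance(value, str) else value
        counts.update(
            v.strip() for v in values if isinstance(v, str) and v.strip()
        )
    return counts


def load_counts(path: str) -> Counter:
    """Read a saved search response from disk and count its affiliations."""
    with open(path) as f:
        return count_affiliations(json.load(f))


# Example usage (not run here):
# for name, n in load_counts("/tmp/affiliations.json").most_common(20):
#     print(f"{n}\t{name}")
```

Sorting by frequency puts the most common spellings first, which is probably where mappings would matter most.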
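
The find/replace swap in the ILRI example could be sketched roughly like this. This is not the actual `convert-mapping.py` logic, which is not shown in these notes; the function `invert_acronym_mapping` and the " - ACRONYM" suffix pattern are assumptions made for illustration only.

```python
import re

# Assumed pattern: a name followed by " - " and an acronym, e.g. " - ILRI"
ACRONYM_SUFFIX = re.compile(r"^(?P<name>.+) - (?P<acronym>[A-Z][A-Za-z]+)$")


def invert_acronym_mapping(find: str, replace: str) -> tuple[str, str]:
    """Swap find and replace when replace is just find plus an acronym suffix.

    Other mappings are returned unchanged.
    """
    match = ACRONYM_SUFFIX.match(replace)
    if match and match.group("name") == find:
        # The acronym form becomes the pattern to find,
        # and the plain name becomes the replacement
        return replace, find
    return find, replace
```

Only pairs where the replacement is exactly the search term plus an acronym suffix are inverted, so unrelated mappings pass through untouched.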
<!-- vim: set sw=2 ts=2: -->