Add notes for 2022-05-25

This commit is contained in:
2022-05-25 17:05:40 +03:00
parent f783b75f4e
commit cc24e999df
28 changed files with 105 additions and 34 deletions

View File

@ -176,4 +176,41 @@ $ csvgrep -c matched -m false /tmp/2022-05-13-ror.csv | csvcut -c organization >
- Update CGSpace to latest `6_x-prod` branch, which removes a handful of Mirage 2 themes and migrates to Node.js 14 and some newer build deps
- Run all system updates on CGSpace (linode18) and reboot it
## 2022-05-25
- Maria Garruccio sent me a handful of new ORCID identifiers for Alliance staff
- We currently have 1349 unique identifiers and this adds about forty-five new ones (!):
```console
$ grep -oE '[A-Z0-9]{4}-[A-Z0-9]{4}-[A-Z0-9]{4}-[A-Z0-9]{4}' ~/src/git/DSpace/dspace/config/controlled-vocabularies/cg-creator-identifier.xml | sort | uniq | wc -l
1349
$ cat ~/src/git/DSpace/dspace/config/controlled-vocabularies/cg-creator-identifier.xml /tmp/new-abc-orcids.txt | grep -oE '[A-Z0-9]{4}-[A-Z0-9]{4}-[A-Z0-9]{4}-[A-Z0-9]{4}' | sort | uniq > /tmp/2022-05-25-combined-orcids.txt
$ wc -l /tmp/2022-05-25-combined-orcids.txt
1395 /tmp/2022-05-25-combined-orcids.txt
```
- After combining and filtering them I resolved their names using my `resolve-orcids.py` script:
```console
$ ./ilri/resolve-orcids.py -i /tmp/2022-05-25-combined-orcids.txt -o /tmp/2022-05-25-combined-orcids-names.txt
```
- There are some names that changed, so I need to run them through the `fix-metadata-values.py` script:
```console
$ cat 2022-05-25-update-orcids.csv
cg.creator.identifier,correct
"Andrea Fongar: 0000-0003-2084-1571","ANDREA CECILIA SANCHEZ BOGADO: 0000-0003-4549-6970"
"Bekele Shiferaw: 0000-0002-3645-320X","Bekele A. Shiferaw: 0000-0002-3645-320X"
"Henry Kpaka: 0000-0002-7480-2933","Henry Musa Kpaka: 0000-0002-7480-2933"
"Josephine Agogbua: 0000-0001-6317-1227","Josephine Udunma Agogbua: 0000-0001-6317-1227"
"Martha Lilia Del Río Duque: 0000-0002-0879-0292","Martha Del Río: 0000-0002-0879-0292"
$ ./ilri/fix-metadata-values.py -i 2022-05-25-update-orcids.csv -db dspace -u dspace -p 'fuuu' -f cg.creator.identifier -m 247 -t correct -d -n
Connected to database.
Would fix 4 occurences of: Andrea Fongar: 0000-0003-2084-1571
Would fix 1 occurences of: Bekele Shiferaw: 0000-0002-3645-320X
Would fix 2 occurences of: Josephine Agogbua: 0000-0001-6317-1227
Would fix 34 occurences of: Martha Lilia Del Río Duque: 0000-0002-0879-0292
```
<!-- vim: set sw=2 ts=2: -->