1
0
mirror of https://github.com/alanorth/cgspace-notes.git synced 2025-01-27 05:49:12 +01:00

Add notes for 2022-05-30

This commit is contained in:
2022-05-30 16:00:02 +03:00
parent b5642c03f2
commit e478850def
28 changed files with 74 additions and 33 deletions

@ -232,4 +232,25 @@ Would fix 34 occurences of: Martha Lilia Del Río Duque: 0000-0002-0879-0292
- Start a harvest on AReS
## 2022-05-30
- Help IITA with some collection authorization issues on CGSpace
- Finally looking into Peter's Altmetric export from 2022-02
- We want to try to compare some of the information about open access status with that in CGSpace
- I created a new column for all items that have CGSpace handles using this GREL:
```console
"https://hdl.handle.net/" + value.match(/.*?(10568\/\d+).*?/)[0]
```
- With that I can do a join on the CGSpace metadata and perhaps clean up some items
```console
$ ./bin/dspace metadata-export -f 2022-05-30-cgspace.csv
$ csvcut -c 'id,dc.identifier.uri[en_US],dcterms.accessRights[en_US],dcterms.license[en_US]' 2022-05-30-cgspace.csv | sed '1 s/dc\.identifier\.uri\[en_US\]/dc.identifier.uri/' > /tmp/cgspace.csv
$ csvjoin -c 'dc.identifier.uri' ~/Downloads/2022-05-30-Altmetric-Research-Outputs-CGSpace.csv /tmp/cgspace.csv > /tmp/cgspace-altmetric.csv
```
- Examining the data in OpenRefine I spot checked a few records where Altmetric and CGSpace disagree and in most cases I found Altmetric to be wrong...
<!-- vim: set sw=2 ts=2: -->