mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-27 05:49:12 +01:00
Add notes for 2022-05-30
@ -232,4 +232,25 @@ Would fix 34 occurences of: Martha Lilia Del Río Duque: 0000-0002-0879-0292
- Start a harvest on AReS
## 2022-05-30
- Help IITA with some collection authorization issues on CGSpace
- Finally looking into Peter's Altmetric export from 2022-02
- We want to compare Altmetric's information about open access status with what we have in CGSpace
- I created a new column for all items that have CGSpace handles using this GREL:
```console
"https://hdl.handle.net/" + value.match(/.*?(10568\/\d+).*?/)[0]
```
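
A rough Python equivalent of that GREL, as a sketch only and not something that was run in OpenRefine. The function name `handle_url` is made up for illustration, and returning `None` for values without a handle is an assumption about what is convenient, not a description of GREL's behavior:

```python
import re

# Same idea as the GREL pattern: capture the "10568/<digits>" handle
HANDLE_RE = re.compile(r"(10568/\d+)")


def handle_url(value):
    """Return a hdl.handle.net URL for the first CGSpace handle in value.

    Returns None when value is empty or contains no 10568/... handle.
    """
    match = HANDLE_RE.search(value or "")
    if match is None:
        return None
    return "https://hdl.handle.net/" + match.group(1)
```

This makes it easy to try the pattern against a few sample values before applying it to the whole column.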
- With that I can do a join on the CGSpace metadata and perhaps clean up some items
```console
$ ./bin/dspace metadata-export -f 2022-05-30-cgspace.csv
$ csvcut -c 'id,dc.identifier.uri[en_US],dcterms.accessRights[en_US],dcterms.license[en_US]' 2022-05-30-cgspace.csv | sed '1 s/dc\.identifier\.uri\[en_US\]/dc.identifier.uri/' > /tmp/cgspace.csv
$ csvjoin -c 'dc.identifier.uri' ~/Downloads/2022-05-30-Altmetric-Research-Outputs-CGSpace.csv /tmp/cgspace.csv > /tmp/cgspace-altmetric.csv
```
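
The join above is essentially an inner join on `dc.identifier.uri`. A minimal standard-library sketch of the same idea follows, assuming both files already share that column name. The helpers `read_csv` and `inner_join`, the `Title` column, and the sample rows are hypothetical, and csvjoin's exact output column layout is not reproduced here:

```python
import csv
import io


def read_csv(text):
    """Parse CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def inner_join(left_rows, right_rows, key="dc.identifier.uri"):
    """Inner-join two lists of dict rows on key.

    Rows without a match on the other side are dropped. If both sides
    share a non-key column name, the right-hand value wins.
    """
    index = {}
    for row in right_rows:
        index.setdefault(row[key], []).append(row)

    joined = []
    for left in left_rows:
        for right in index.get(left[key], []):
            merged = dict(left)
            # Copy the right-hand columns, skipping the shared key
            merged.update({k: v for k, v in right.items() if k != key})
            joined.append(merged)
    return joined


# Illustrative rows only, not real data
altmetric = read_csv(
    "dc.identifier.uri,Title\n"
    "https://hdl.handle.net/10568/1,Example A\n"
    "https://hdl.handle.net/10568/2,Example B\n"
)
cgspace = read_csv(
    "id,dc.identifier.uri,dcterms.accessRights[en_US]\n"
    "abc,https://hdl.handle.net/10568/1,Open Access\n"
)
joined = inner_join(altmetric, cgspace)
```

Only the handle present in both inputs survives the join, which is what limits the comparison to items that exist in CGSpace.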
- Examining the data in OpenRefine, I spot-checked a few records where Altmetric and CGSpace disagree, and in most cases I found Altmetric to be wrong...
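
One way the disagreeing records could be pulled out of the joined file for spot checking, sketched in Python rather than OpenRefine. Everything about the Altmetric side is an assumption: the column name `OA Status` and its true/false-style values are hypothetical placeholders, as is treating `Open Access` in `dcterms.accessRights[en_US]` as the only open value on the CGSpace side:

```python
def is_open_cgspace(access_rights):
    """Assumption: CGSpace marks open items with the value 'Open Access'."""
    return access_rights.strip().lower() == "open access"


def is_open_altmetric(oa_status):
    """Assumption: the (hypothetical) Altmetric column holds a true/false-like value."""
    return oa_status.strip().lower() in {"true", "yes", "open"}


def disagreements(rows,
                  altmetric_col="OA Status",  # hypothetical column name
                  cgspace_col="dcterms.accessRights[en_US]"):
    """Return the rows where the two sources disagree about open access."""
    return [
        row for row in rows
        if is_open_altmetric(row[altmetric_col]) != is_open_cgspace(row[cgspace_col])
    ]
```

The result is just a list of candidate rows; deciding which source is right still needs a manual look at each item.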
<!-- vim: set sw=2 ts=2: -->