mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-27 05:49:12 +01:00
Add notes
@@ -109,4 +109,42 @@ map $request_uri $new_uri {
}
```

## 2024-04-19

- Spend some time looking at duplicate DOIs again...
- Refresh ORCID identifiers from ORCID API and update CGSpace metadata and controlled vocabulary

## 2024-04-20

- I read an [interesting thread about DOI casing](https://github.com/greenelab/scihub/issues/9)
- Apparently the DOI specification says ASCII characters in DOIs are case-insensitive
- Indeed, [Crossref recommends lower case](https://www.crossref.org/documentation/member-setup/constructing-your-dois/) for all DOIs
- I was curious about the DOIs in our database so I checked before and after lower casing:

```console
localhost/dspace7= ☘ \COPY (SELECT DISTINCT(text_value) FROM metadatavalue WHERE dspace_object_id IN (SELECT uuid FROM item) AND metadata_field_id=220 AND text_value IS NOT NULL AND text_value !='') TO /tmp/dois-sql-before.txt;
COPY 25675
localhost/dspace7= ☘ \COPY (SELECT DISTINCT(lower(text_value)) FROM metadatavalue WHERE dspace_object_id IN (SELECT uuid FROM item) AND metadata_field_id=220 AND text_value IS NOT NULL AND text_value !='') TO /tmp/dois-sql-after.txt;
COPY 25666
```

- I need to investigate options for lower casing these in the repository, for example in a curation task, and in all workflows around DSpace metadata...
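
The two counts above (25675 before, 25666 after) mean there are nine fewer distinct values once everything is lower cased. A minimal Python sketch for listing which DOIs differ only by letter case follows. The function name `find_case_variants` and the sample values are hypothetical and only for illustration.

```python
from collections import defaultdict


def find_case_variants(dois):
    """Group DOI strings that differ only by letter case (hypothetical helper)."""
    groups = defaultdict(set)
    for doi in dois:
        doi = doi.strip()
        if doi:
            # Use the lower-cased form as the grouping key
            groups[doi.lower()].add(doi)
    # Keep only the groups that have more than one distinct spelling
    return {key: sorted(variants) for key, variants in groups.items() if len(variants) > 1}


# Made-up sample values, not real DOIs from the database
sample = [
    "https://doi.org/10.1234/ABC.1",
    "https://doi.org/10.1234/abc.1",
    "https://doi.org/10.1234/xyz.2",
]
print(find_case_variants(sample))
```

Since the function accepts any iterable of lines, it should also work when passed an open file handle for an export such as `/tmp/dois-sql-before.txt`, assuming one DOI per line.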

## 2024-04-23

- Spent some time writing a Java curation task to normalize DOIs in items when they enter the workflow edit step
- The workflow curation tasks are not documented very well but I got a basic configuration working
- I found a bug in DSpace curation tasks and discussed it on Slack
- I finalized the `NormalizeDOIs` curation task and released v7.6.1.1 of the [cgspace-java-helpers](https://github.com/ilri/cgspace-java-helpers) project

## 2024-04-24

- A bit more testing of the curation tasks
- I tested a patch by Mark Wood
- I added support for normalizing DOIs to this same format to my [csv-metadata-quality](https://github.com/ilri/csv-metadata-quality) project
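
For illustration, this kind of normalization could be sketched in Python roughly as below. This is not the code from `cgspace-java-helpers` or `csv-metadata-quality`. The function name `normalize_doi`, the accepted input prefixes, the `https://doi.org/` target form, and the loose validity check are all assumptions. These notes only establish that ASCII characters in DOIs are lower cased.

```python
import re
import string

# Assumed input forms: bare DOI, "doi:" prefix, or an http(s) doi.org / dx.doi.org URL
_DOI_PREFIX = re.compile(r"^(?:https?://(?:dx\.)?doi\.org/|doi:\s*)", re.IGNORECASE)

# Lower-case ASCII letters only, since the case-insensitivity applies to ASCII characters
_ASCII_LOWER = str.maketrans(string.ascii_uppercase, string.ascii_lowercase)


def normalize_doi(value):
    """Return a lower-cased https://doi.org/ URL, or None if the value does not look like a DOI."""
    doi = _DOI_PREFIX.sub("", value.strip())
    # Loose sanity check (an assumption, not full validation): DOI names begin with "10."
    if not re.match(r"^10\.[0-9.]+/\S+$", doi):
        return None
    return "https://doi.org/" + doi.translate(_ASCII_LOWER)


# Made-up example values
print(normalize_doi("http://dx.doi.org/10.1234/ABC-De"))
print(normalize_doi("doi:10.1234/XYZ"))
```

Returning `None` for unrecognized values, instead of guessing, leaves the decision about malformed metadata to the caller.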

## 2024-04-25

- I lowercased the remaining 3,900 DOIs on CGSpace that had uppercase ASCII characters

<!-- vim: set sw=2 ts=2: -->