mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-27 05:49:12 +01:00
Add notes for 2022-11-28
@ -367,4 +367,56 @@ java.lang.IndexOutOfBoundsException: 1-based index out of bounds: 2
- I synced DSpace 7 Test with CGSpace
  - I had to follow my notes from 2022-03 to delete the missing Atmire migrations

## 2022-11-28

- Update `ilri/fix-metadata-values.py` to update the `last_modified` date for items when it updates metadata
  - This should allow us to use the normal `index-discovery` (without `-b`), and REST API responses will show a correct last modified date
- Maria asked me to add some ORCID identifiers for Alliance staff to the controlled vocabulary
  - I also updated `add-orcid-identifiers-csv.py` to update the `last_modified` timestamp of the item
- I refactored my CGSpace Python scripts to use a helper `util.py` module with common functions
  - For now it only has the one for updating an item's `last_modified` timestamp, but I will gradually add more
- I also ran our list of ORCID identifiers against ORCID's API to see if anyone changed their name format
  - Then I ran them on CGSpace with `ilri/update-orcids.py` to fix them
- Normalize the `text_lang` values for CGSpace metadata again:

```console
localhost/dspacetest= ☘ SELECT DISTINCT text_lang, count(text_lang) FROM metadatavalue WHERE dspace_object_id IN (SELECT uuid FROM item) GROUP BY text_lang ORDER BY count DESC;
 text_lang │  count
───────────┼─────────
 en_US     │ 2912429
           │  108387
 en        │   12457
 fr        │       2
 vi        │       2
 es        │       1
 ␀         │       0
(7 rows)

Time: 624.651 ms
localhost/dspacetest= ☘ BEGIN;
BEGIN
Time: 0.130 ms
localhost/dspacetest= ☘ UPDATE metadatavalue SET text_lang='en_US' WHERE dspace_object_id IN (SELECT uuid FROM item) AND text_lang IN ('en', '');
UPDATE 120844
Time: 4074.879 ms (00:04.075)
localhost/dspacetest= ☘ SELECT DISTINCT text_lang, count(text_lang) FROM metadatavalue WHERE dspace_object_id IN (SELECT uuid FROM item) GROUP BY text_lang ORDER BY count DESC;
 text_lang │  count
───────────┼─────────
 en_US     │ 3033273
 fr        │       2
 vi        │       2
 es        │       1
 ␀         │       0
(5 rows)

Time: 346.913 ms
localhost/dspacetest= ☘ COMMIT;
```
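The same begin, update, verify, commit pattern could be scripted. The following is only a minimal sketch, not the code in `ilri/` or `util.py`: it uses Python's built-in `sqlite3` with a toy two-table schema as a stand-in for the real PostgreSQL database, and the function names `make_toy_db`, `normalize_text_lang` and `text_lang_counts` are hypothetical. Against PostgreSQL with psycopg2 the placeholders would be `%s` instead of `?`.

```python
import sqlite3


def make_toy_db():
    """Build an in-memory stand-in for the two tables used above (assumed, simplified schema)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE item (uuid TEXT PRIMARY KEY);
        CREATE TABLE metadatavalue (dspace_object_id TEXT, text_lang TEXT);
        INSERT INTO item VALUES ('a'), ('b');
        INSERT INTO metadatavalue VALUES
            ('a', 'en_US'), ('a', 'en'), ('a', ''),
            ('b', 'fr'), ('b', NULL),
            ('c', 'en');  -- 'c' is not an item, so it must stay untouched
        """
    )
    return conn


def normalize_text_lang(conn, target="en_US", variants=("en", "")):
    """Rewrite the given text_lang variants to `target` for item metadata only."""
    placeholders = ", ".join("?" for _ in variants)
    # The connection context manager commits on success and rolls back on error.
    with conn:
        cur = conn.execute(
            "UPDATE metadatavalue SET text_lang = ? "
            "WHERE dspace_object_id IN (SELECT uuid FROM item) "
            f"AND text_lang IN ({placeholders})",
            (target, *variants),
        )
        return cur.rowcount


def text_lang_counts(conn):
    """Return {text_lang: count} for item metadata; count(text_lang) skips NULLs."""
    rows = conn.execute(
        "SELECT text_lang, count(text_lang) AS n FROM metadatavalue "
        "WHERE dspace_object_id IN (SELECT uuid FROM item) "
        "GROUP BY text_lang ORDER BY n DESC"
    )
    return dict(rows.fetchall())


conn = make_toy_db()
updated = normalize_text_lang(conn)
print(updated, text_lang_counts(conn))
```

Because `count(text_lang)` only counts non-NULL values, the NULL group reports a count of zero, which is why the `␀` row in the console output above shows 0 both before and after the update.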

- Discussing the UN M.49 regions on CGSpace with Valentina and Abenet
  - The PRMS team is confused about our regions, which are mostly UN M.49 with some legacy stuff using different ones
  - I think we can fix all the stuff for Initiatives from this year very easily, then work on the legacy stuff later
- Also, I noticed that [country_converter was using the wrong UN M.49 region for Myanmar](https://github.com/konstantinstadler/country_converter/issues/124)
  - I submitted a [pull request](https://github.com/konstantinstadler/country_converter/pull/125)

<!-- vim: set sw=2 ts=2: -->