Add notes for 2023-01-10

This commit is contained in:
2023-01-10 22:22:03 +03:00
parent d1278a67d8
commit 967b16a966
124 changed files with 218 additions and 156 deletions

View File

@ -75,4 +75,32 @@ $ psql -c 'SELECT * FROM pg_locks pl LEFT JOIN pg_stat_activity psa ON pl.pid =
- I am going to disable that cron job for now and watch the server load for a few weeks
- Start a harvest on AReS
## 2023-01-08
- It's Sunday and I see some PostgreSQL locks belonging to dspaceCli that started at 05:00
- That's strange because I disabled the `dspace checker` one last week, so I'm not sure which this is...
- It's currently 2:30PM on the server so these locks have been there for almost twelve hours
- I exported the entire CGSpace to update the Initiative mappings
- Items were mapped to ~58 new Initiative collections
- Then I ran the ORCID import to catch any new ones that might not have been tagged
- Then I started a harvest on AReS
## 2023-01-09
- Fix some invalid Initiative names on CGSpace and then check for missing mappings
- Check for missing regions in the Initiatives collection
- Export a list of author affiliations from the Initiatives community for Peter to check
- Was slightly ghetto because I did it from a CSV export of the Initiatives community, then imported to OpenRefine to split multi-value fields, then did some sed nonsense to handle the quoting:
```console
$ csvcut -c 'cg.contributor.affiliation[en_US]' ~/Downloads/2023-01-09-initiatives.csv | \
sed -e 's/^"//' -e 's/"$//' -e 's/||/\n/g' | \
sort -u | \
sed -e 's/^\(.*\)/"\1/' -e 's/\(.*\)$/\1"/' > /tmp/2023-01-09-initiatives-affiliations.csv
```
## 2023-01-10
- Export the CGSpace Initiatives collection to check for missing regions and collection mappings
<!-- vim: set sw=2 ts=2: -->