mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-27 05:49:12 +01:00
Add notes for 2022-11-30
This commit is contained in:
@ -502,5 +502,28 @@ $ sort -k4,1 /tmp/duplicates.txt | \
|
||||
|
||||
- This worked very well, but there were some metadata values that were tripled or quadrupled, so it only deleted the first duplicate
|
||||
- I just ran it again two more times to find the last duplicates, now we have none!
|
||||
- I also generated another SQL file with commands to update the last modified timestamps of these items:
|
||||
|
||||
```console
|
||||
$ awk -F'\t' '{print $4}' /tmp/duplicates.txt | sort -u | sed "s/^\(.*\)$/UPDATE item SET last_modified=NOW() WHERE uuid='\1';/" > /tmp/update-timestamp.sql
|
||||
```
|
||||
|
||||
- Tezira said she was having trouble archiving submissions
|
||||
- In the afternoon I looked and found a high number of locks:
|
||||
|
||||
```console
|
||||
$ psql -c 'SELECT * FROM pg_locks pl LEFT JOIN pg_stat_activity psa ON pl.pid = psa.pid;' | grep -o -E '(dspaceWeb|dspaceApi|dspaceCli)' | sort | uniq -c | sort -n
|
||||
60 dspaceCli
|
||||
176 dspaceApi
|
||||
1194 dspaceWeb
|
||||
```
|
||||
|
||||
[!PostgreSQL database locks](/cgspace-notes/2022/11/postgres_locks_cgspace-day.png)
|
||||
|
||||
- The timing looks suspiciously close to when I was running the batch updates on the ILRI community this morning.
|
||||
- I restarted Tomcat and PostgreSQL and everything was back to normal
|
||||
- I found some items on CGSpace in Dinka, Ndogo, and Bari languages, but the `dcterms.language` field was "other"
|
||||
- That's so unfortunate! These languages are not in ISO 639-1, but they are in ISO 639-3, which uses Alpha 3 and has more space for languages
|
||||
- I changed them from other to use the three-letter codes, and I will suggest to the CG Core group that we use ISO 639-3 in the future
|
||||
|
||||
<!-- vim: set sw=2 ts=2: -->
|
||||
|
Reference in New Issue
Block a user