Add notes for 2017-02-28

This commit is contained in:
2017-02-28 18:57:31 +02:00
parent ff4dca769e
commit a3f0d88945
5 changed files with 107 additions and 1 deletions

View File

@ -290,6 +290,24 @@ $ grep -c "unable to find valid certification path" [dspace]/log/dspace.log.2017
- Regarding the `filter-media` issue I found earlier, it seems that the ImageMagick PDF plugin will also process JPGs if they are in the "Content Files" (aka `ORIGINAL`) bundle
- The problem likely lies in the logic of `ImageMagickThumbnailFilter.java`, as `ImageMagickPdfThumbnailFilter.java` extends it
- Run CIAT corrections on CGSpace
```
dspace=# update metadatavalue set authority='3026b1de-9302-4f3e-85ab-ef48da024eb2', confidence=600 where resource_type_id=2 and metadata_field_id=3 and text_value = 'International Center for Tropical Agriculture';
```
- CGNET has fixed the certificate chain on their LDAP server
- Redeploy CGSpace and DSpace Test to on latest `5_x-prod` branch with fixes for LDAP bind user
- Run all system updates on CGSpace server and reboot
## 2017-02-28
- After running the CIAT corrections and updating the Discovery and authority indexes, there is still no change in the number of items listed for CIAT in Discovery
- Ah, this is probably because some items have the `International Center for Tropical Agriculture` author twice, which I first noticed in 2016-12 but couldn't figure out how to fix
- I think I can do it by first exporting all metadatavalues that have the author `International Center for Tropical Agriculture`
```
dspace=# \copy (select resource_id, metadata_value_id from metadatavalue where resource_type_id=2 and metadata_field_id=3 and text_value='International Center for Tropical Agriculture') to /tmp/ciat.csv with csv;
COPY 1968
```
- And then using awk or uniq to either remove or print the lines that have a duplicate `resource_id` (meaning they belong to the same item in DSpace and are therefore duplicates), and then using the `metadata_value_id` to delete them