diff --git a/content/posts/2019-08.md b/content/posts/2019-08.md
index d5085ee78..199ded843 100644
--- a/content/posts/2019-08.md
+++ b/content/posts/2019-08.md
@@ -242,4 +242,28 @@ return os.path.basename(value)
 - Run system updates on AReS / OpenRXV dev server (linode20) and reboot it
 - Fix AReS exports on DSpace Test by adding a new nginx proxy pass
+## 2019-08-26
+
+- Peter sent 2,943 corrections to the author dump I had originally sent him on 2019-05-27
+  - I noticed that one correction had a missing space after the comma, i.e. "Adamou,A.", so I corrected it
+  - Also, I should add that as a check to the csv-metadata-quality pipeline
+  - Apply the corrections to my local dev machine in preparation for CGSpace:
+
+```
+$ ./fix-metadata-values.py -i ~/Downloads/2019-08-26-Peter-authors.csv -db dspace -u dspace -p 'fuuu' -f dc.contributor.author -m 3 -t correct
+```
+
+- Apply the corrections on CGSpace and DSpace Test
+  - After that I started a full Discovery re-indexing on both servers:
+
+```
+$ time schedtool -B -e ionice -c2 -n7 nice -n19 dspace index-discovery -b
+```
+
+- Peter asked me to add related citation aka `cg.link.citation` to the item view
+  - I created a [pull request](https://github.com/ilri/DSpace/pull/430) with a draft implementation and asked for Peter's feedback
+
+- Add the ability to skip certain fields from the csv-metadata-quality script using `--exclude-fields`
+  - For example, when I'm working on the author corrections I want to do the basic checks on the corrected fields, but not on the original fields, so I would use `--exclude-fields dc.contributor.author`
+
diff --git a/docs/2019-08/index.html b/docs/2019-08/index.html
index 73a62d373..70dc43d17 100644
--- a/docs/2019-08/index.html
+++ b/docs/2019-08/index.html
@@ -27,7 +27,7 @@ Run system updates on DSpace Test (linode19) and reboot it
-
+
@@ -59,9 +59,9 @@ Run system updates on DSpace Test (linode19) and reboot it
 "@type": "BlogPosting",
 "headline": "August, 2019",
 "url": "https:\/\/alanorth.github.io\/cgspace-notes\/2019-08\/",
- "wordCount": "1744",
+ "wordCount": "1924",
 "datePublished": "2019-08-03T12:39:51\x2b03:00",
- "dateModified": "2019-08-21T17:14:49\x2b03:00",
+ "dateModified": "2019-08-23T13:38:38\x2b03:00",
 "author": {
 "@type": "Person",
 "name": "Alan Orth"
@@ -447,6 +447,43 @@ return os.path.basename(value)
Peter sent 2,943 corrections to the author dump I had originally sent him on 2019-05-27
+ +Apply the corrections to my local dev machine in preparation for CGSpace:
+ +$ ./fix-metadata-values.py -i ~/Downloads/2019-08-26-Peter-authors.csv -db dspace -u dspace -p 'fuuu' -f dc.contributor.author -m 3 -t correct
+
Apply the corrections on CGSpace and DSpace Test
+ +After that I started a full Discovery re-indexing on both servers:
+ +$ time schedtool -B -e ionice -c2 -n7 nice -n19 dspace index-discovery -b
+
Peter asked me to add related citation aka `cg.link.citation` to the item view
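Add the ability to skip certain fields from the csv-metadata-quality script using `--exclude-fields`
For example, when I'm working on the author corrections I want to do the basic checks on the corrected fields, but not on the original fields, so I would use `--exclude-fields dc.contributor.author`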
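
The two ideas from these notes (a check for a missing space after a comma, as in "Adamou,A.", and skipping some fields during checks) can be sketched together. This is only a rough illustration and not the actual csv-metadata-quality implementation: the function names `fix_comma_spacing` and `clean_rows` are hypothetical, and the sample CSV assumes the corrected values live in a column named `correct`, mirroring the `-t correct` argument above.

```python
import csv
import io
import re

# Hypothetical pattern: a comma followed directly by a non-space character,
# e.g. "Adamou,A." (note this would also match digits, as in "2,943")
MISSING_SPACE_AFTER_COMMA = re.compile(r",(?=\S)")


def fix_comma_spacing(value):
    """Insert a space after any comma that is not already followed by whitespace."""
    return MISSING_SPACE_AFTER_COMMA.sub(", ", value)


def clean_rows(rows, exclude_fields=()):
    """Apply the comma check to every field except those named in exclude_fields."""
    excluded = set(exclude_fields)
    cleaned = []
    for row in rows:
        cleaned.append(
            {
                field: value if field in excluded else fix_comma_spacing(value)
                for field, value in row.items()
            }
        )
    return cleaned


# Assumed layout: original author value plus a "correct" column with the fix
sample = io.StringIO(
    "dc.contributor.author,correct\n"
    '"Adamou,A.","Adamou,A."\n'
)
rows = list(csv.DictReader(sample))

# Skip the original field so only the corrected column is checked
result = clean_rows(rows, exclude_fields=["dc.contributor.author"])
print(result)
```

Excluding `dc.contributor.author` leaves the original value untouched, which matters when that column is later used to match existing metadata, while the `correct` column still gets the spacing fix. A real check would probably need to be limited to name-like fields, since the pattern also rewrites numbers that contain commas.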