From f835a78d30b30a6a74ec3412541217d1ba262e2e Mon Sep 17 00:00:00 2001
From: Alan Orth
Date: Tue, 4 Sep 2018 17:31:20 +0300
Subject: [PATCH] Update notes for 2018-09-04

---
 content/posts/2018-09.md |  4 ++--
 docs/2018-09/index.html  |  8 ++++----
 docs/sitemap.xml         | 10 +++++-----
 3 files changed, 11 insertions(+), 11 deletions(-)

diff --git a/content/posts/2018-09.md b/content/posts/2018-09.md
index aa7d1ec86..beb0221b0 100644
--- a/content/posts/2018-09.md
+++ b/content/posts/2018-09.md
@@ -56,11 +56,11 @@ Caused by: java.lang.RuntimeException: Failed to startup the DSpace Service Mana
 - This makes it super annoying to do the checks and cleanup, so I will merge them (also time consuming)
 - Five items had `dc.date.issued` values like `2013-5` so I corrected them to be `2013-05`
 - Several metadata fields had values with newlines in them (even in some titles!), which I fixed by trimming the consecutive whitespaces in Open Refine
-- Many (196!) items from before 2011 are indicated as having a CRP, but CRPs didn't exist then so this is impossible
+- Many (91!) items from before 2011 are indicated as having a CRP, but CRPs didn't exist then so this is impossible
 - I got all items that were from 2011 and onwards using a custom facet with this GREL on the `dc.date.issued` column: `isNotNull(value.match(/201[1-8].*/))` and then blanking their CRPs
 - Some affiliations with only one separator (|) for multiple values
 - I replaced smart quotes like `’` with plain ones
-- Some inconsitencies in `cg.subject.iita` like COWPEA and COWPEAS, and YAM and YAMS, etc, as well as some spelling mistakes like IMPACT ASSESSMENTN
+- Some inconsistencies in `cg.subject.iita` like COWPEA and COWPEAS, and YAM and YAMS, etc, as well as some spelling mistakes like IMPACT ASSESSMENTN
 - Some values in the `dc.identifier.isbn` are actually ISSNs so I moved them to the `dc.identifier.issn` column
 - I found one invalid ISSN using a custom text facet with the regex from the [ISSN page on Wikipedia](https://en.wikipedia.org/wiki/International_Standard_Serial_Number#Code_format): `isNotBlank(value.match(/^\d{4}-\d{3}[\dxX]$/))`
 - One invalid value for `dc.type`
diff --git a/docs/2018-09/index.html b/docs/2018-09/index.html
index d52e532df..2eddb951f 100644
--- a/docs/2018-09/index.html
+++ b/docs/2018-09/index.html
@@ -18,7 +18,7 @@ I’m testing the new DSpace 5.8 branch in my Ubuntu 18.04 environment and I
 " />
-
+
  • This makes it super annoying to do the checks and cleanup, so I will merge them (also time consuming)
  • Five items had dc.date.issued values like 2013-5 so I corrected them to be 2013-05
  • Several metadata fields had values with newlines in them (even in some titles!), which I fixed by trimming the consecutive whitespaces in Open Refine
- • Many (196!) items from before 2011 are indicated as having a CRP, but CRPs didn’t exist then so this is impossible
+ • Many (91!) items from before 2011 are indicated as having a CRP, but CRPs didn’t exist then so this is impossible
  • I got all items that were from 2011 and onwards using a custom facet with this GREL on the dc.date.issued column: isNotNull(value.match(/201[1-8].*/)) and then blanking their CRPs
  • Some affiliations with only one separator (|) for multiple values
  • I replaced smart quotes like ’ with plain ones
- • Some inconsitencies in cg.subject.iita like COWPEA and COWPEAS, and YAM and YAMS, etc, as well as some spelling mistakes like IMPACT ASSESSMENTN
+ • Some inconsistencies in cg.subject.iita like COWPEA and COWPEAS, and YAM and YAMS, etc, as well as some spelling mistakes like IMPACT ASSESSMENTN
  • Some values in the dc.identifier.isbn are actually ISSNs so I moved them to the dc.identifier.issn column
  • I found one invalid ISSN using a custom text facet with the regex from the ISSN page on Wikipedia: isNotBlank(value.match(/^\d{4}-\d{3}[\dxX]$/))
  • One invalid value for dc.type
diff --git a/docs/sitemap.xml b/docs/sitemap.xml
index 496215727..2bb88ce60 100644
--- a/docs/sitemap.xml
+++ b/docs/sitemap.xml
@@ -4,7 +4,7 @@
 https://alanorth.github.io/cgspace-notes/2018-09/
-2018-09-04T13:25:13+03:00
+2018-09-04T17:08:34+03:00
@@ -184,7 +184,7 @@
 https://alanorth.github.io/cgspace-notes/
-2018-09-04T13:25:13+03:00
+2018-09-04T17:08:34+03:00
 0
@@ -195,7 +195,7 @@
 https://alanorth.github.io/cgspace-notes/tags/notes/
-2018-09-04T13:25:13+03:00
+2018-09-04T17:08:34+03:00
 0
@@ -207,13 +207,13 @@
 https://alanorth.github.io/cgspace-notes/posts/
-2018-09-04T13:25:13+03:00
+2018-09-04T17:08:34+03:00
 0
 https://alanorth.github.io/cgspace-notes/tags/
-2018-09-04T13:25:13+03:00
+2018-09-04T17:08:34+03:00
 0
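
The notes in this patch do these checks with GREL facets in Open Refine. As a rough sketch only, the same three checks (the ISSN format test, the 2011-and-onwards date filter, and the `2013-5` to `2013-05` correction) could be reproduced outside Open Refine in Python. The function names below are hypothetical and are not part of the notes or of any DSpace or Open Refine API, and the ISSN check only tests the format, not the check digit.

```python
import re

# Same pattern as the GREL facet in the notes; fullmatch() requires the
# whole value to match, as value.match() does in GREL.
ISSN_FORMAT = re.compile(r"\d{4}-\d{3}[\dxX]")


def has_issn_format(value: str) -> bool:
    """Return True if value looks like an ISSN (format only, no check digit)."""
    return ISSN_FORMAT.fullmatch(value) is not None


def is_2011_or_later(date_issued: str) -> bool:
    """Mirror of the 201[1-8].* facet on dc.date.issued (years 2011 to 2018)."""
    return re.fullmatch(r"201[1-8].*", date_issued) is not None


def pad_month(date_issued: str) -> str:
    """Turn a value like 2013-5 into 2013-05; leave anything else unchanged."""
    match = re.fullmatch(r"(\d{4})-(\d)", date_issued)
    if match is None:
        return date_issued
    return f"{match.group(1)}-0{match.group(2)}"


if __name__ == "__main__":
    for sample in ["0378-5955", "978-3-16-148410-0", "2013-5", "2009-11"]:
        print(sample, has_issn_format(sample), is_2011_or_later(sample), pad_month(sample))
```

Items for which `is_2011_or_later` returns False are the candidates for having their CRPs blanked, assuming every `dc.date.issued` value starts with a four-digit year.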