diff --git a/content/post/2017-08.md b/content/post/2017-08.md index e02ab1a06..ba3b1997c 100644 --- a/content/post/2017-08.md +++ b/content/post/2017-08.md @@ -176,3 +176,7 @@ $ grep -rsI SQLException dspace-xmlui | wc -l ## 2017-08-15 - Increase the nginx upload limit on CGSpace (linode18) so Sisay can upload 23 CIAT reports +- Do some last minute cleanups and de-duplications of the CGIAR Library data, as I need to send it to Peter this week +- Metadata fields like `dc.contributor.author`, `dc.publisher`, `dc.type`, and a few others had somehow been duplicated along the line +- Also, a few dozen `dc.description.abstract` fields still had various HTML tags and entities in them +- Also, a bunch of `dc.subject` fields that were not AGROVOC had not been moved properly to `cg.system.subject` diff --git a/public/2017-08/index.html b/public/2017-08/index.html index 66654791a..7fecefd22 100644 --- a/public/2017-08/index.html +++ b/public/2017-08/index.html @@ -37,7 +37,7 @@ Then I cleaned up the author authorities and HTML characters in OpenRefine and s - + @@ -85,9 +85,9 @@ Then I cleaned up the author authorities and HTML characters in OpenRefine and s "@type": "BlogPosting", "headline": "August, 2017", "url": "https://alanorth.github.io/cgspace-notes/2017-08/", - "wordCount": "1877", + "wordCount": "1948", "datePublished": "2017-08-01T11:51:52+03:00", - "dateModified": "2017-08-14T14:41:36+03:00", + "dateModified": "2017-08-15T11:56:35+03:00", "author": { "@type": "Person", "name": "Alan Orth" @@ -361,6 +361,10 @@ $ grep -rsI SQLException dspace-xmlui | wc -l
dc.contributor.author
, dc.publisher
, dc.type
, and a few others had somehow been duplicated along the linedc.description.abstract
fields still had various HTML tags and entities in themdc.subject
fields that were not AGROVOC had not been moved properly to cg.system.subject