mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2024-11-22 14:45:03 +01:00
Update notes for 2018-09-04
parent
f37fb89092
commit
f835a78d30
@@ -56,11 +56,11 @@ Caused by: java.lang.RuntimeException: Failed to startup the DSpace Service Mana
 - This makes it super annoying to do the checks and cleanup, so I will merge them (also time consuming)
 - Five items had `dc.date.issued` values like `2013-5` so I corrected them to be `2013-05`
 - Several metadata fields had values with newlines in them (even in some titles!), which I fixed by trimming the consecutive whitespaces in Open Refine
-- Many (196!) items from before 2011 are indicated as having a CRP, but CRPs didn't exist then so this is impossible
+- Many (91!) items from before 2011 are indicated as having a CRP, but CRPs didn't exist then so this is impossible
 - I got all items that were from 2011 and onwards using a custom facet with this GREL on the `dc.date.issued` column: `isNotNull(value.match(/201[1-8].*/))` and then blanking their CRPs
 - Some affiliations with only one separator (|) for multiple values
 - I replaced smart quotes like `’` with plain ones
-- Some inconsitencies in `cg.subject.iita` like COWPEA and COWPEAS, and YAM and YAMS, etc, as well as some spelling mistakes like IMPACT ASSESSMENTN
+- Some inconsistencies in `cg.subject.iita` like COWPEA and COWPEAS, and YAM and YAMS, etc, as well as some spelling mistakes like IMPACT ASSESSMENTN
 - Some values in the `dc.identifier.isbn` are actually ISSNs so I moved them to the `dc.identifier.issn` column
 - I found one invalid ISSN using a custom text facet with the regex from the [ISSN page on Wikipedia](https://en.wikipedia.org/wiki/International_Standard_Serial_Number#Code_format): `isNotBlank(value.match(/^\d{4}-\d{3}[\dxX]$/))`
 - One invalid value for `dc.type`
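The cleanup steps in the notes above can also be sketched outside Open Refine. The Python below is a minimal illustration, not the commands that were actually run. All function names are hypothetical, and it assumes the GREL `match` calls test the whole cell value, which is why `re.fullmatch` is used here.

```python
import re

# Same shape as the ISSN regex quoted in the notes (anchors are implied by fullmatch).
ISSN_FORMAT = re.compile(r"\d{4}-\d{3}[\dxX]", re.ASCII)
# A year followed by a one- or two-digit month, e.g. 2013-5 or 2013-05.
YEAR_MONTH = re.compile(r"(\d{4})-(\d{1,2})", re.ASCII)
# Mirrors the facet expression /201[1-8].*/ from the notes.
ISSUED_2011_2018 = re.compile(r"201[1-8].*")


def normalize_issued(value: str) -> str:
    """Zero-pad a single-digit month; leave any other value unchanged."""
    m = YEAR_MONTH.fullmatch(value.strip())
    if m is None:
        return value
    year, month = m.groups()
    return f"{year}-{int(month):02d}"


def collapse_whitespace(value: str) -> str:
    """Replace runs of whitespace (including newlines) with a single space."""
    return " ".join(value.split())


def looks_like_issn(value: str) -> bool:
    """Check only the NNNN-NNNC format, not the check digit."""
    return ISSN_FORMAT.fullmatch(value) is not None


def issued_2011_to_2018(value: str) -> bool:
    """True when the issue date starts with a year from 2011 to 2018."""
    return ISSUED_2011_2018.fullmatch(value) is not None
```

For example, `normalize_issued("2013-5")` gives `"2013-05"`, and `looks_like_issn` rejects an ISBN-style value because it does not fit the four-digit, hyphen, four-character pattern.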
@@ -18,7 +18,7 @@ I’m testing the new DSpace 5.8 branch in my Ubuntu 18.04 environment and I
 " />
 <meta property="og:type" content="article" />
 <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2018-09/" /><meta property="article:published_time" content="2018-09-02T09:55:54+03:00"/>
-<meta property="article:modified_time" content="2018-09-04T13:25:13+03:00"/>
+<meta property="article:modified_time" content="2018-09-04T17:08:34+03:00"/>
 <meta name="twitter:card" content="summary"/>
 <meta name="twitter:title" content="September, 2018"/>
 <meta name="twitter:description" content="2018-09-02
@@ -43,7 +43,7 @@ I’m testing the new DSpace 5.8 branch in my Ubuntu 18.04 environment and I
 "url": "https://alanorth.github.io/cgspace-notes/2018-09/",
 "wordCount": "668",
 "datePublished": "2018-09-02T09:55:54+03:00",
-"dateModified": "2018-09-04T13:25:13+03:00",
+"dateModified": "2018-09-04T17:08:34+03:00",
 "author": {
 "@type": "Person",
 "name": "Alan Orth"
@@ -168,11 +168,11 @@ Caused by: java.lang.RuntimeException: Failed to startup the DSpace Service Mana
 <li>This makes it super annoying to do the checks and cleanup, so I will merge them (also time consuming)</li>
 <li>Five items had <code>dc.date.issued</code> values like <code>2013-5</code> so I corrected them to be <code>2013-05</code></li>
 <li>Several metadata fields had values with newlines in them (even in some titles!), which I fixed by trimming the consecutive whitespaces in Open Refine</li>
-<li>Many (196!) items from before 2011 are indicated as having a CRP, but CRPs didn’t exist then so this is impossible</li>
+<li>Many (91!) items from before 2011 are indicated as having a CRP, but CRPs didn’t exist then so this is impossible</li>
 <li>I got all items that were from 2011 and onwards using a custom facet with this GREL on the <code>dc.date.issued</code> column: <code>isNotNull(value.match(/201[1-8].*/))</code> and then blanking their CRPs</li>
 <li>Some affiliations with only one separator (|) for multiple values</li>
 <li>I replaced smart quotes like <code>’</code> with plain ones</li>
-<li>Some inconsitencies in <code>cg.subject.iita</code> like COWPEA and COWPEAS, and YAM and YAMS, etc, as well as some spelling mistakes like IMPACT ASSESSMENTN</li>
+<li>Some inconsistencies in <code>cg.subject.iita</code> like COWPEA and COWPEAS, and YAM and YAMS, etc, as well as some spelling mistakes like IMPACT ASSESSMENTN</li>
 <li>Some values in the <code>dc.identifier.isbn</code> are actually ISSNs so I moved them to the <code>dc.identifier.issn</code> column</li>
 <li>I found one invalid ISSN using a custom text facet with the regex from the <a href="https://en.wikipedia.org/wiki/International_Standard_Serial_Number#Code_format">ISSN page on Wikipedia</a>: <code>isNotBlank(value.match(/^\d{4}-\d{3}[\dxX]$/))</code></li>
 <li>One invalid value for <code>dc.type</code></li>
@@ -4,7 +4,7 @@
 
 <url>
 <loc>https://alanorth.github.io/cgspace-notes/2018-09/</loc>
-<lastmod>2018-09-04T13:25:13+03:00</lastmod>
+<lastmod>2018-09-04T17:08:34+03:00</lastmod>
 </url>
 
 <url>
@@ -184,7 +184,7 @@
 
 <url>
 <loc>https://alanorth.github.io/cgspace-notes/</loc>
-<lastmod>2018-09-04T13:25:13+03:00</lastmod>
+<lastmod>2018-09-04T17:08:34+03:00</lastmod>
 <priority>0</priority>
 </url>
 
@@ -195,7 +195,7 @@
 
 <url>
 <loc>https://alanorth.github.io/cgspace-notes/tags/notes/</loc>
-<lastmod>2018-09-04T13:25:13+03:00</lastmod>
+<lastmod>2018-09-04T17:08:34+03:00</lastmod>
 <priority>0</priority>
 </url>
 
@@ -207,13 +207,13 @@
 
 <url>
 <loc>https://alanorth.github.io/cgspace-notes/posts/</loc>
-<lastmod>2018-09-04T13:25:13+03:00</lastmod>
+<lastmod>2018-09-04T17:08:34+03:00</lastmod>
 <priority>0</priority>
 </url>
 
 <url>
 <loc>https://alanorth.github.io/cgspace-notes/tags/</loc>
-<lastmod>2018-09-04T13:25:13+03:00</lastmod>
+<lastmod>2018-09-04T17:08:34+03:00</lastmod>
 <priority>0</priority>
 </url>
 