mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2024-11-22 14:45:03 +01:00
Update notes for 2017-12-17
parent
226a940ff3
commit
7016d8b3f1
@@ -202,4 +202,13 @@ UPDATE 1
 - The dates are in super long ISO8601 format (from Excel?) like `2016-02-07T00:00:00Z` so I converted them to simpler forms in GREL: `value.toString("yyyy-MM-dd")`
 - I trimmed the whitespaces in a few fields but it wasn't many
 - Rename her thumbnail column to filename, and format it so SAFBuilder adds the files to the thumbnail bundle with this GREL in OpenRefine: `value + "__bundle:THUMBNAIL"`
+- Rename dc.identifier.status and dc.identifier.url columns to cg.identifier.status and cg.identifier.url
+- Item 4 has weird characters in citation, ie: Nagoya et de Trait
+- Some author names need normalization, ie: `Aggarwal, Pramod` and `Aggarwal, Pramod K.`
+- Something weird going on with duplicate authors that have the same text value, like `Berto, Jayson C.` and `Balmeo, Katherine P.`
 - I will send her feedback on some author names like UNEP and ICRISAT and ask her for the missing thumbnail11.jpg
+- I did a test import of the data locally after building with SAFBuilder but for some reason I had to specify the collection (even though the collections were specified in the `collection` field)
+
+```
+$ JAVA_OPTS="-Xmx512m -Dfile.encoding=UTF-8" ~/dspace/bin/dspace import --add --eperson=aorth@mjanja.ch --collection=10568/89338 --source /Users/aorth/Downloads/2016\ bulk\ upload\ thumbnails/SimpleArchiveFormat --mapfile=/tmp/ccafs.map &> /tmp/ccafs.log
+```
@@ -23,7 +23,7 @@ The list of connections to XMLUI and REST API for today:
 
 <meta property="article:published_time" content="2017-12-01T13:53:54+03:00"/>
 
-<meta property="article:modified_time" content="2017-12-17T09:55:04+02:00"/>
+<meta property="article:modified_time" content="2017-12-17T10:56:56+02:00"/>
 
 
 
@@ -56,9 +56,9 @@ The list of connections to XMLUI and REST API for today:
 "@type": "BlogPosting",
 "headline": "December, 2017",
 "url": "https://alanorth.github.io/cgspace-notes/2017-12/",
-"wordCount": "1128",
+"wordCount": "1231",
 "datePublished": "2017-12-01T13:53:54+03:00",
-"dateModified": "2017-12-17T09:55:04+02:00",
+"dateModified": "2017-12-17T10:56:56+02:00",
 "author": {
 "@type": "Person",
 "name": "Alan Orth"
@@ -347,10 +347,18 @@ UPDATE 1
 <li>The dates are in super long ISO8601 format (from Excel?) like <code>2016-02-07T00:00:00Z</code> so I converted them to simpler forms in GREL: <code>value.toString("yyyy-MM-dd")</code></li>
 <li>I trimmed the whitespaces in a few fields but it wasn’t many</li>
 <li>Rename her thumbnail column to filename, and format it so SAFBuilder adds the files to the thumbnail bundle with this GREL in OpenRefine: <code>value + "__bundle:THUMBNAIL"</code></li>
+<li>Rename dc.identifier.status and dc.identifier.url columns to cg.identifier.status and cg.identifier.url</li>
+<li>Item 4 has weird characters in citation, ie: Nagoya et de Trait</li>
+<li>Some author names need normalization, ie: <code>Aggarwal, Pramod</code> and <code>Aggarwal, Pramod K.</code></li>
+<li>Something weird going on with duplicate authors that have the same text value, like <code>Berto, Jayson C.</code> and <code>Balmeo, Katherine P.</code></li>
 <li>I will send her feedback on some author names like UNEP and ICRISAT and ask her for the missing thumbnail11.jpg</li>
 </ul></li>
+<li>I did a test import of the data locally after building with SAFBuilder but for some reason I had to specify the collection (even though the collections were specified in the <code>collection</code> field)</li>
 </ul>
 
+<pre><code>$ JAVA_OPTS="-Xmx512m -Dfile.encoding=UTF-8" ~/dspace/bin/dspace import --add --eperson=aorth@mjanja.ch --collection=10568/89338 --source /Users/aorth/Downloads/2016\ bulk\ upload\ thumbnails/SimpleArchiveFormat --mapfile=/tmp/ccafs.map &> /tmp/ccafs.log
+</code></pre>
+
 
 
 
@@ -4,7 +4,7 @@
 
 <url>
 <loc>https://alanorth.github.io/cgspace-notes/2017-12/</loc>
-<lastmod>2017-12-17T09:55:04+02:00</lastmod>
+<lastmod>2017-12-17T10:56:56+02:00</lastmod>
 </url>
 
 <url>
@@ -139,7 +139,7 @@
 
 <url>
 <loc>https://alanorth.github.io/cgspace-notes/</loc>
-<lastmod>2017-12-17T09:55:04+02:00</lastmod>
+<lastmod>2017-12-17T10:56:56+02:00</lastmod>
 <priority>0</priority>
 </url>
 
@@ -150,7 +150,7 @@
 
 <url>
 <loc>https://alanorth.github.io/cgspace-notes/tags/notes/</loc>
-<lastmod>2017-12-17T09:55:04+02:00</lastmod>
+<lastmod>2017-12-17T10:56:56+02:00</lastmod>
 <priority>0</priority>
 </url>
 
@@ -162,13 +162,13 @@
 
 <url>
 <loc>https://alanorth.github.io/cgspace-notes/post/</loc>
-<lastmod>2017-12-17T09:55:04+02:00</lastmod>
+<lastmod>2017-12-17T10:56:56+02:00</lastmod>
 <priority>0</priority>
 </url>
 
 <url>
 <loc>https://alanorth.github.io/cgspace-notes/tags/</loc>
-<lastmod>2017-12-17T09:55:04+02:00</lastmod>
+<lastmod>2017-12-17T10:56:56+02:00</lastmod>
 <priority>0</priority>
 </url>
 
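The two cell transforms mentioned in the notes (shortening the ISO8601 dates and tagging the thumbnail filenames for SAFBuilder) can also be sketched outside OpenRefine. This is only a rough Python equivalent, not what was run for this commit: the function names `simplify_date` and `thumbnail_filename` are made up for illustration, and it assumes every date cell is a string in exactly the `2016-02-07T00:00:00Z` form.

```python
from datetime import datetime


def simplify_date(value: str) -> str:
    """Shorten a date like 2016-02-07T00:00:00Z to 2016-02-07.

    Assumes the exact format quoted in the notes; anything else
    raises ValueError rather than being guessed at.
    """
    parsed = datetime.strptime(value, "%Y-%m-%dT%H:%M:%SZ")
    return parsed.strftime("%Y-%m-%d")


def thumbnail_filename(value: str) -> str:
    """Append the bundle suffix quoted in the notes, mirroring the
    GREL expression value + "__bundle:THUMBNAIL"."""
    return value + "__bundle:THUMBNAIL"


if __name__ == "__main__":
    print(simplify_date("2016-02-07T00:00:00Z"))
    print(thumbnail_filename("thumbnail11.jpg"))
```

Failing loudly on an unexpected date format is a deliberate choice here, since a bulk upload with silently mangled dates is harder to spot after import than a script that stops.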