mirror of https://github.com/alanorth/cgspace-notes.git (synced 2024-12-25 06:24:29 +01:00)
Add notes for 2017-06-20
parent 4756e9025b
commit 41ba0acca9
@@ -91,3 +91,19 @@ $ JAVA_OPTS="-Xmx1024m -Dfile.encoding=UTF-8" [dspace]/bin/dspace import --add -
 
 - Redeploy CGSpace with latest changes from `5_x-prod`, run system updates, and reboot the server
 - Continue working on ansible infrastructure changes for CGIAR Library
+
+## 2017-06-20
+
+- Import Abenet and Peter's changes to the CGIAR Library CRP community
+- Due to them using Windows and renaming some columns there were formatting, encoding, and duplicate metadata value issues
+- I had to remove some fields from the CSV and rename some back to, ie, `dc.subject[en_US]` just so DSpace would detect changes properly
+- Now it looks much better: https://dspacetest.cgiar.org/handle/10947/2517
+- Removing the HTML tags and HTML/XML entities using the following GREL:
+- `replace(value,/<\/?\w+((\s+\w+(\s*=\s*(?:".*?"|'.*?'|[^'">\s]+))?)+\s*|\s*)\/?>/,'')`
+- `value.unescape("html").unescape("xml")`
+- Finally import 914 CIAT Book Chapters to CGSpace in two batches:
+
+```
+$ JAVA_OPTS="-Xmx1024m -Dfile.encoding=UTF-8" [dspace]/bin/dspace import --add --eperson=aorth@mjanja.ch --collection=10568/35701 --source /home/aorth/CIAT-Books/SimpleArchiveFormat/ --mapfile=/tmp/ciat-books.map &> /tmp/ciat-books.log
+$ JAVA_OPTS="-Xmx1024m -Dfile.encoding=UTF-8" [dspace]/bin/dspace import --add --eperson=aorth@mjanja.ch --collection=10568/35701 --source /home/aorth/CIAT-Books/SimpleArchiveFormat/ --mapfile=/tmp/ciat-books2.map &> /tmp/ciat-books2.log
+```
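The two GREL steps recorded in the notes above (strip HTML tags, then unescape HTML/XML entities) can be approximated outside OpenRefine. Below is a minimal Python sketch, assuming plain string values. The function name `strip_markup` is hypothetical, and Python's `html.unescape` stands in for GREL's two `unescape` calls, so edge cases may differ from what OpenRefine produces.

```python
import html
import re

# Same tag-matching pattern as the GREL replace() expression in the notes.
TAG_RE = re.compile(
    r"""<\/?\w+((\s+\w+(\s*=\s*(?:".*?"|'.*?'|[^'">\s]+))?)+\s*|\s*)\/?>"""
)


def strip_markup(value: str) -> str:
    """Remove HTML tags, then decode HTML/XML character entities.

    Hypothetical helper for illustration; not part of DSpace or OpenRefine.
    """
    without_tags = TAG_RE.sub("", value)
    # html.unescape decodes named and numeric references (&amp;, &#39;, ...).
    return html.unescape(without_tags)


if __name__ == "__main__":
    print(strip_markup('<p>Rice &amp; beans in <a href="#x">Kenya</a></p>'))
```

The order follows the notes: tags are removed first and entities decoded second, so an escaped sequence such as `&lt;` survives as a literal `<` rather than being treated as the start of a tag.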
@@ -13,7 +13,7 @@
 
 
 <meta property="article:published_time" content="2017-06-01T10:14:52+03:00"/>
-<meta property="article:modified_time" content="2017-06-07T18:12:09+03:00"/>
+<meta property="article:modified_time" content="2017-06-18T14:53:20+03:00"/>
 
 
 
@@ -45,9 +45,9 @@
 "@type": "BlogPosting",
 "headline": "June, 2017",
 "url": "https://alanorth.github.io/cgspace-notes/2017-06/",
-"wordCount": "892",
+"wordCount": "1001",
 "datePublished": "2017-06-01T10:14:52+03:00",
-"dateModified": "2017-06-07T18:12:09+03:00",
+"dateModified": "2017-06-18T14:53:20+03:00",
 "author": {
 "@type": "Person",
 "name": "Alan Orth"
@@ -223,6 +223,26 @@
 <li>Continue working on ansible infrastructure changes for CGIAR Library</li>
 </ul>
 
+<h2 id="2017-06-20">2017-06-20</h2>
+
+<ul>
+<li>Import Abenet and Peter’s changes to the CGIAR Library CRP community</li>
+<li>Due to them using Windows and renaming some columns there were formatting, encoding, and duplicate metadata value issues</li>
+<li>I had to remove some fields from the CSV and rename some back to, ie, <code>dc.subject[en_US]</code> just so DSpace would detect changes properly</li>
+<li>Now it looks much better: <a href="https://dspacetest.cgiar.org/handle/10947/2517">https://dspacetest.cgiar.org/handle/10947/2517</a></li>
+<li>Removing the HTML tags and HTML/XML entities using the following GREL:
+
+<ul>
+<li><code>replace(value,/<\/?\w+((\s+\w+(\s*=\s*(?:".*?"|'.*?'|[^'">\s]+))?)+\s*|\s*)\/?>/,'')</code></li>
+<li><code>value.unescape("html").unescape("xml")</code></li>
+</ul></li>
+<li>Finally import 914 CIAT Book Chapters to CGSpace in two batches:</li>
+</ul>
+
+<pre><code>$ JAVA_OPTS="-Xmx1024m -Dfile.encoding=UTF-8" [dspace]/bin/dspace import --add --eperson=aorth@mjanja.ch --collection=10568/35701 --source /home/aorth/CIAT-Books/SimpleArchiveFormat/ --mapfile=/tmp/ciat-books.map &> /tmp/ciat-books.log
+$ JAVA_OPTS="-Xmx1024m -Dfile.encoding=UTF-8" [dspace]/bin/dspace import --add --eperson=aorth@mjanja.ch --collection=10568/35701 --source /home/aorth/CIAT-Books/SimpleArchiveFormat/ --mapfile=/tmp/ciat-books2.map &> /tmp/ciat-books2.log
+</code></pre>
+
 
 
 
@@ -4,7 +4,7 @@
 
 <url>
 <loc>https://alanorth.github.io/cgspace-notes/2017-06/</loc>
-<lastmod>2017-06-07T18:12:09+03:00</lastmod>
+<lastmod>2017-06-18T14:53:20+03:00</lastmod>
 </url>
 
 <url>
@@ -104,7 +104,7 @@
 
 <url>
 <loc>https://alanorth.github.io/cgspace-notes/</loc>
-<lastmod>2017-06-07T18:12:09+03:00</lastmod>
+<lastmod>2017-06-18T14:53:20+03:00</lastmod>
 <priority>0</priority>
 </url>
 
@@ -115,19 +115,19 @@
 
 <url>
 <loc>https://alanorth.github.io/cgspace-notes/tags/notes/</loc>
-<lastmod>2017-06-07T18:12:09+03:00</lastmod>
+<lastmod>2017-06-18T14:53:20+03:00</lastmod>
 <priority>0</priority>
 </url>
 
 <url>
 <loc>https://alanorth.github.io/cgspace-notes/post/</loc>
-<lastmod>2017-06-07T18:12:09+03:00</lastmod>
+<lastmod>2017-06-18T14:53:20+03:00</lastmod>
 <priority>0</priority>
 </url>
 
 <url>
 <loc>https://alanorth.github.io/cgspace-notes/tags/</loc>
-<lastmod>2017-06-07T18:12:09+03:00</lastmod>
+<lastmod>2017-06-18T14:53:20+03:00</lastmod>
 <priority>0</priority>
 </url>
 