Update notes for 2017-09-24

This commit is contained in:
Alan Orth 2017-09-25 00:38:30 +03:00
parent f220c9a7a8
commit a37eaf795e
Signed by: alanorth
GPG Key ID: 0FB860CC9C45B1B9
4 changed files with 34 additions and 13 deletions

View File

@ -463,3 +463,13 @@ $ schedtool -D -e ionice -c2 -n7 nice -n19 dspace filter-media -f -i 10947/1 -p
- Peter asked if we could map all the items of type `Journal Article` in [ILRI Archive](https://cgspace.cgiar.org/handle/10568/2703) to [ILRI articles in journals and newsletters](https://cgspace.cgiar.org/handle/10568/3) - Peter asked if we could map all the items of type `Journal Article` in [ILRI Archive](https://cgspace.cgiar.org/handle/10568/2703) to [ILRI articles in journals and newsletters](https://cgspace.cgiar.org/handle/10568/3)
- It is easy to do via CSV using OpenRefine but I noticed that on CGSpace ~1,000 of the expected 2,500 are already mapped, while on DSpace Test they were not - It is easy to do via CSV using OpenRefine but I noticed that on CGSpace ~1,000 of the expected 2,500 are already mapped, while on DSpace Test they were not
- I've asked Peter if he knows what's going on (or who mapped them) - I've asked Peter if he knows what's going on (or who mapped them)
- Turns out he had already mapped some, but requested that I finish the rest
- With this GREL in OpenRefine I can find items that are mapped, ie they have `10568/3||` or `10568/3$` in their `collection` field:
```
isNotNull(value.match(/.+?10568\/3(\|\|.+|$)/))
```
- Peter also made a lot of changes to the data in the Archives collections while I was attempting to import the changes, so we were essentially competing for PostgreSQL and Solr connections
- I ended up having to kill the import and wait until he was done
- I exported a clean CSV and applied the changes from that one, which was a hundred or two less than I thought there should be (at least compared to the current state of DSpace Test, which is a few months old)

View File

@ -25,7 +25,7 @@ Ask Sisay to clean up the WLE approvers a bit, as Marianne’s user account
<meta property="article:published_time" content="2017-09-07T16:54:52&#43;07:00"/> <meta property="article:published_time" content="2017-09-07T16:54:52&#43;07:00"/>
<meta property="article:modified_time" content="2017-09-24T11:43:57&#43;03:00"/> <meta property="article:modified_time" content="2017-09-24T17:10:15&#43;03:00"/>
@ -61,9 +61,9 @@ Ask Sisay to clean up the WLE approvers a bit, as Marianne&rsquo;s user account
"@type": "BlogPosting", "@type": "BlogPosting",
"headline": "September, 2017", "headline": "September, 2017",
"url": "https://alanorth.github.io/cgspace-notes/2017-09/", "url": "https://alanorth.github.io/cgspace-notes/2017-09/",
"wordCount": "3456", "wordCount": "3580",
"datePublished": "2017-09-07T16:54:52&#43;07:00", "datePublished": "2017-09-07T16:54:52&#43;07:00",
"dateModified": "2017-09-24T11:43:57&#43;03:00", "dateModified": "2017-09-24T17:10:15&#43;03:00",
"author": { "author": {
"@type": "Person", "@type": "Person",
"name": "Alan Orth" "name": "Alan Orth"
@ -649,6 +649,17 @@ DELETE 207
<li>Peter asked if we could map all the items of type <code>Journal Article</code> in <a href="https://cgspace.cgiar.org/handle/10568/2703">ILRI Archive</a> to <a href="https://cgspace.cgiar.org/handle/10568/3">ILRI articles in journals and newsletters</a></li> <li>Peter asked if we could map all the items of type <code>Journal Article</code> in <a href="https://cgspace.cgiar.org/handle/10568/2703">ILRI Archive</a> to <a href="https://cgspace.cgiar.org/handle/10568/3">ILRI articles in journals and newsletters</a></li>
<li>It is easy to do via CSV using OpenRefine but I noticed that on CGSpace ~1,000 of the expected 2,500 are already mapped, while on DSpace Test they were not</li> <li>It is easy to do via CSV using OpenRefine but I noticed that on CGSpace ~1,000 of the expected 2,500 are already mapped, while on DSpace Test they were not</li>
<li>I&rsquo;ve asked Peter if he knows what&rsquo;s going on (or who mapped them)</li> <li>I&rsquo;ve asked Peter if he knows what&rsquo;s going on (or who mapped them)</li>
<li>Turns out he had already mapped some, but requested that I finish the rest</li>
<li>With this GREL in OpenRefine I can find items that are mapped, ie they have <code>10568/3||</code> or <code>10568/3$</code> in their <code>collection</code> field:</li>
</ul>
<pre><code>isNotNull(value.match(/.+?10568\/3(\|\|.+|$)/))
</code></pre>
<ul>
<li>Peter also made a lot of changes to the data in the Archives collections while I was attempting to import the changes, so we were essentially competing for PostgreSQL and Solr connections</li>
<li>I ended up having to kill the import and wait until he was done</li>
<li>I exported a clean CSV and applied the changes from that one, which was a hundred or two less than I thought there should be (at least compared to the current state of DSpace Test, which is a few months old)</li>
</ul> </ul>

View File

@ -27,7 +27,7 @@ Disallow: /cgspace-notes/2015-12/
Disallow: /cgspace-notes/2015-11/ Disallow: /cgspace-notes/2015-11/
Disallow: /cgspace-notes/ Disallow: /cgspace-notes/
Disallow: /cgspace-notes/categories/ Disallow: /cgspace-notes/categories/
Disallow: /cgspace-notes/tags/notes/
Disallow: /cgspace-notes/categories/notes/ Disallow: /cgspace-notes/categories/notes/
Disallow: /cgspace-notes/tags/notes/
Disallow: /cgspace-notes/post/ Disallow: /cgspace-notes/post/
Disallow: /cgspace-notes/tags/ Disallow: /cgspace-notes/tags/

View File

@ -9,7 +9,7 @@
<url> <url>
<loc>https://alanorth.github.io/cgspace-notes/2017-09/</loc> <loc>https://alanorth.github.io/cgspace-notes/2017-09/</loc>
<lastmod>2017-09-24T11:43:57+03:00</lastmod> <lastmod>2017-09-24T17:10:15+03:00</lastmod>
</url> </url>
<url> <url>
@ -133,27 +133,27 @@
<priority>0</priority> <priority>0</priority>
</url> </url>
<url>
<loc>https://alanorth.github.io/cgspace-notes/tags/notes/</loc>
<lastmod>2017-09-24T11:43:57+03:00</lastmod>
<priority>0</priority>
</url>
<url> <url>
<loc>https://alanorth.github.io/cgspace-notes/categories/notes/</loc> <loc>https://alanorth.github.io/cgspace-notes/categories/notes/</loc>
<lastmod>2017-09-19T22:23:37+03:00</lastmod> <lastmod>2017-09-19T22:23:37+03:00</lastmod>
<priority>0</priority> <priority>0</priority>
</url> </url>
<url>
<loc>https://alanorth.github.io/cgspace-notes/tags/notes/</loc>
<lastmod>2017-09-24T17:10:15+03:00</lastmod>
<priority>0</priority>
</url>
<url> <url>
<loc>https://alanorth.github.io/cgspace-notes/post/</loc> <loc>https://alanorth.github.io/cgspace-notes/post/</loc>
<lastmod>2017-09-24T11:43:57+03:00</lastmod> <lastmod>2017-09-24T17:10:15+03:00</lastmod>
<priority>0</priority> <priority>0</priority>
</url> </url>
<url> <url>
<loc>https://alanorth.github.io/cgspace-notes/tags/</loc> <loc>https://alanorth.github.io/cgspace-notes/tags/</loc>
<lastmod>2017-09-24T11:43:57+03:00</lastmod> <lastmod>2017-09-24T17:10:15+03:00</lastmod>
<priority>0</priority> <priority>0</priority>
</url> </url>