Update notes for 2017-08-16

This commit is contained in:
Alan Orth 2017-08-16 12:50:03 +03:00
parent 08f89e683f
commit 720c15124b
Signed by: alanorth
GPG Key ID: 0FB860CC9C45B1B9
3 changed files with 27 additions and 8 deletions

View File

@ -224,3 +224,12 @@ isNotNull(value.match(/(CGIAR .+?)\|\|\1/))
``` ```
- This would be true if the authors were like `CGIAR System Management Office||CGIAR System Management Office`, which some of the CGIAR Library's were - This would be true if the authors were like `CGIAR System Management Office||CGIAR System Management Office`, which some of the CGIAR Library's were
- Unfortunately when you fix these in OpenRefine and then submit the metadata to DSpace it doesn't detect any changes, so you have to edit them all manually via DSpace's "Edit Item"
- Ooh! And an even more interesting regex would match _any_ duplicated author:
```
isNotNull(value.match(/(.+?)\|\|\1/))
```
- Which means it can also be used to find items with duplicate `dc.subject` fields...
- Finally sent Peter the final dump of the CGIAR System Organization community so he can have a last look at it

View File

@ -37,7 +37,7 @@ Then I cleaned up the author authorities and HTML characters in OpenRefine and s
<meta property="article:published_time" content="2017-08-01T11:51:52&#43;03:00"/> <meta property="article:published_time" content="2017-08-01T11:51:52&#43;03:00"/>
<meta property="article:modified_time" content="2017-08-15T16:44:59&#43;03:00"/> <meta property="article:modified_time" content="2017-08-16T12:00:37&#43;03:00"/>
@ -85,9 +85,9 @@ Then I cleaned up the author authorities and HTML characters in OpenRefine and s
"@type": "BlogPosting", "@type": "BlogPosting",
"headline": "August, 2017", "headline": "August, 2017",
"url": "https://alanorth.github.io/cgspace-notes/2017-08/", "url": "https://alanorth.github.io/cgspace-notes/2017-08/",
"wordCount": "2449", "wordCount": "2528",
"datePublished": "2017-08-01T11:51:52&#43;03:00", "datePublished": "2017-08-01T11:51:52&#43;03:00",
"dateModified": "2017-08-15T16:44:59&#43;03:00", "dateModified": "2017-08-16T12:00:37&#43;03:00",
"author": { "author": {
"@type": "Person", "@type": "Person",
"name": "Alan Orth" "name": "Alan Orth"
@ -417,6 +417,16 @@ UPDATE 4899
<ul> <ul>
<li>This would be true if the authors were like <code>CGIAR System Management Office||CGIAR System Management Office</code>, which some of the CGIAR Library&rsquo;s were</li> <li>This would be true if the authors were like <code>CGIAR System Management Office||CGIAR System Management Office</code>, which some of the CGIAR Library&rsquo;s were</li>
<li>Unfortunately when you fix these in OpenRefine and then submit the metadata to DSpace it doesn&rsquo;t detect any changes, so you have to edit them all manually via DSpace&rsquo;s &ldquo;Edit Item&rdquo;</li>
<li>Ooh! And an even more interesting regex would match <em>any</em> duplicated author:</li>
</ul>
<pre><code>isNotNull(value.match(/(.+?)\|\|\1/))
</code></pre>
<ul>
<li>Which means it can also be used to find items with duplicate <code>dc.subject</code> fields&hellip;</li>
<li>Finally sent Peter the final dump of the CGIAR System Organization community so he can have a last look at it</li>
</ul> </ul>

View File

@ -4,7 +4,7 @@
<url> <url>
<loc>https://alanorth.github.io/cgspace-notes/2017-08/</loc> <loc>https://alanorth.github.io/cgspace-notes/2017-08/</loc>
<lastmod>2017-08-15T16:44:59+03:00</lastmod> <lastmod>2017-08-16T12:00:37+03:00</lastmod>
</url> </url>
<url> <url>
@ -114,7 +114,7 @@
<url> <url>
<loc>https://alanorth.github.io/cgspace-notes/</loc> <loc>https://alanorth.github.io/cgspace-notes/</loc>
<lastmod>2017-08-15T16:44:59+03:00</lastmod> <lastmod>2017-08-16T12:00:37+03:00</lastmod>
<priority>0</priority> <priority>0</priority>
</url> </url>
@ -125,19 +125,19 @@
<url> <url>
<loc>https://alanorth.github.io/cgspace-notes/tags/notes/</loc> <loc>https://alanorth.github.io/cgspace-notes/tags/notes/</loc>
<lastmod>2017-08-15T16:44:59+03:00</lastmod> <lastmod>2017-08-16T12:00:37+03:00</lastmod>
<priority>0</priority> <priority>0</priority>
</url> </url>
<url> <url>
<loc>https://alanorth.github.io/cgspace-notes/post/</loc> <loc>https://alanorth.github.io/cgspace-notes/post/</loc>
<lastmod>2017-08-15T16:44:59+03:00</lastmod> <lastmod>2017-08-16T12:00:37+03:00</lastmod>
<priority>0</priority> <priority>0</priority>
</url> </url>
<url> <url>
<loc>https://alanorth.github.io/cgspace-notes/tags/</loc> <loc>https://alanorth.github.io/cgspace-notes/tags/</loc>
<lastmod>2017-08-15T16:44:59+03:00</lastmod> <lastmod>2017-08-16T12:00:37+03:00</lastmod>
<priority>0</priority> <priority>0</priority>
</url> </url>