mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2024-11-26 08:28:18 +01:00
Update notes for 2017-08-16
This commit is contained in:
parent
08f89e683f
commit
720c15124b
@ -224,3 +224,12 @@ isNotNull(value.match(/(CGIAR .+?)\|\|\1/))
|
|||||||
```
|
```
|
||||||
|
|
||||||
- This would be true if the authors were like `CGIAR System Management Office||CGIAR System Management Office`, which some of the CGIAR Library's were
|
- This would be true if the authors were like `CGIAR System Management Office||CGIAR System Management Office`, which some of the CGIAR Library's were
|
||||||
|
- Unfortunately when you fix these in OpenRefine and then submit the metadata to DSpace it doesn't detect any changes, so you have to edit them all manually via DSpace's "Edit Item"
|
||||||
|
- Ooh! And an even more interesting regex would match _any_ duplicated author:
|
||||||
|
|
||||||
|
```
|
||||||
|
isNotNull(value.match(/(.+?)\|\|\1/))
|
||||||
|
```
|
||||||
|
|
||||||
|
- Which means it can also be used to find items with duplicate `dc.subject` fields...
|
||||||
|
- Finally sent Peter the final dump of the CGIAR System Organization community so he can have a last look at it
|
||||||
|
@ -37,7 +37,7 @@ Then I cleaned up the author authorities and HTML characters in OpenRefine and s
|
|||||||
|
|
||||||
|
|
||||||
<meta property="article:published_time" content="2017-08-01T11:51:52+03:00"/>
|
<meta property="article:published_time" content="2017-08-01T11:51:52+03:00"/>
|
||||||
<meta property="article:modified_time" content="2017-08-15T16:44:59+03:00"/>
|
<meta property="article:modified_time" content="2017-08-16T12:00:37+03:00"/>
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
@ -85,9 +85,9 @@ Then I cleaned up the author authorities and HTML characters in OpenRefine and s
|
|||||||
"@type": "BlogPosting",
|
"@type": "BlogPosting",
|
||||||
"headline": "August, 2017",
|
"headline": "August, 2017",
|
||||||
"url": "https://alanorth.github.io/cgspace-notes/2017-08/",
|
"url": "https://alanorth.github.io/cgspace-notes/2017-08/",
|
||||||
"wordCount": "2449",
|
"wordCount": "2528",
|
||||||
"datePublished": "2017-08-01T11:51:52+03:00",
|
"datePublished": "2017-08-01T11:51:52+03:00",
|
||||||
"dateModified": "2017-08-15T16:44:59+03:00",
|
"dateModified": "2017-08-16T12:00:37+03:00",
|
||||||
"author": {
|
"author": {
|
||||||
"@type": "Person",
|
"@type": "Person",
|
||||||
"name": "Alan Orth"
|
"name": "Alan Orth"
|
||||||
@ -417,6 +417,16 @@ UPDATE 4899
|
|||||||
|
|
||||||
<ul>
|
<ul>
|
||||||
<li>This would be true if the authors were like <code>CGIAR System Management Office||CGIAR System Management Office</code>, which some of the CGIAR Library’s were</li>
|
<li>This would be true if the authors were like <code>CGIAR System Management Office||CGIAR System Management Office</code>, which some of the CGIAR Library’s were</li>
|
||||||
|
<li>Unfortunately when you fix these in OpenRefine and then submit the metadata to DSpace it doesn’t detect any changes, so you have to edit them all manually via DSpace’s “Edit Item”</li>
|
||||||
|
<li>Ooh! And an even more interesting regex would match <em>any</em> duplicated author:</li>
|
||||||
|
</ul>
|
||||||
|
|
||||||
|
<pre><code>isNotNull(value.match(/(.+?)\|\|\1/))
|
||||||
|
</code></pre>
|
||||||
|
|
||||||
|
<ul>
|
||||||
|
<li>Which means it can also be used to find items with duplicate <code>dc.subject</code> fields…</li>
|
||||||
|
<li>Finally sent Peter the final dump of the CGIAR System Organization community so he can have a last look at it</li>
|
||||||
</ul>
|
</ul>
|
||||||
|
|
||||||
|
|
||||||
|
@ -4,7 +4,7 @@
|
|||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/2017-08/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/2017-08/</loc>
|
||||||
<lastmod>2017-08-15T16:44:59+03:00</lastmod>
|
<lastmod>2017-08-16T12:00:37+03:00</lastmod>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
<url>
|
<url>
|
||||||
@ -114,7 +114,7 @@
|
|||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/</loc>
|
||||||
<lastmod>2017-08-15T16:44:59+03:00</lastmod>
|
<lastmod>2017-08-16T12:00:37+03:00</lastmod>
|
||||||
<priority>0</priority>
|
<priority>0</priority>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
@ -125,19 +125,19 @@
|
|||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/tags/notes/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/tags/notes/</loc>
|
||||||
<lastmod>2017-08-15T16:44:59+03:00</lastmod>
|
<lastmod>2017-08-16T12:00:37+03:00</lastmod>
|
||||||
<priority>0</priority>
|
<priority>0</priority>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/post/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/post/</loc>
|
||||||
<lastmod>2017-08-15T16:44:59+03:00</lastmod>
|
<lastmod>2017-08-16T12:00:37+03:00</lastmod>
|
||||||
<priority>0</priority>
|
<priority>0</priority>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/tags/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/tags/</loc>
|
||||||
<lastmod>2017-08-15T16:44:59+03:00</lastmod>
|
<lastmod>2017-08-16T12:00:37+03:00</lastmod>
|
||||||
<priority>0</priority>
|
<priority>0</priority>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
|
Loading…
Reference in New Issue
Block a user