Add notes for 2022-05-30

This commit is contained in:
Alan Orth 2022-05-30 16:00:02 +03:00
parent b5642c03f2
commit e478850def
Signed by: alanorth
GPG Key ID: 0FB860CC9C45B1B9
28 changed files with 74 additions and 33 deletions

View File

@ -232,4 +232,25 @@ Would fix 34 occurences of: Martha Lilia Del Río Duque: 0000-0002-0879-0292
- Start a harvest on AReS
## 2022-05-30
- Help IITA with some collection authorization issues on CGSpace
- Finally looking into Peter's Altmetric export from 2022-02
- We want to try to compare some of the information about open access status with that in CGSpace
- I created a new column for all items that have CGSpace handles using this GREL:
```console
"https://hdl.handle.net/" + value.match(/.*?(10568\/\d+).*?/)[0]
```
- With that I can do a join on the CGSpace metadata and perhaps clean up some items
```console
$ ./bin/dspace metadata-export -f 2022-05-30-cgspace.csv
$ csvcut -c 'id,dc.identifier.uri[en_US],dcterms.accessRights[en_US],dcterms.license[en_US]' 2022-05-30-cgspace.csv | sed '1 s/dc\.identifier\.uri\[en_US\]/dc.identifier.uri/' > /tmp/cgspace.csv
$ csvjoin -c 'dc.identifier.uri' ~/Downloads/2022-05-30-Altmetric-Research-Outputs-CGSpace.csv /tmp/cgspace.csv > /tmp/cgspace-altmetric.csv
```
- Examining the data in OpenRefine I spot checked a few records where Altmetric and CGSpace disagree and in most cases I found Altmetric to be wrong...
<!-- vim: set sw=2 ts=2: -->

View File

@ -35,7 +35,7 @@ I purged 93,974 hits from these IPs using my check-spider-ip-hits.sh script
<meta property="og:type" content="article" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2022-05/" />
<meta property="article:published_time" content="2022-05-04T09:13:39+03:00" />
<meta property="article:modified_time" content="2022-05-27T16:46:18+03:00" />
<meta property="article:modified_time" content="2022-05-28T18:25:00+03:00" />
@ -76,9 +76,9 @@ I purged 93,974 hits from these IPs using my check-spider-ip-hits.sh script
"@type": "BlogPosting",
"headline": "May, 2022",
"url": "https://alanorth.github.io/cgspace-notes/2022-05/",
"wordCount": "1552",
"wordCount": "1673",
"datePublished": "2022-05-04T09:13:39+03:00",
"dateModified": "2022-05-27T16:46:18+03:00",
"dateModified": "2022-05-28T18:25:00+03:00",
"author": {
"@type": "Person",
"name": "Alan Orth"
@ -404,6 +404,26 @@ I purged 93,974 hits from these IPs using my check-spider-ip-hits.sh script
<ul>
<li>Start a harvest on AReS</li>
</ul>
<h2 id="2022-05-30">2022-05-30</h2>
<ul>
<li>Help IITA with some collection authorization issues on CGSpace</li>
<li>Finally looking into Peter&rsquo;s Altmetric export from 2022-02
<ul>
<li>We want to try to compare some of the information about open access status with that in CGSpace</li>
<li>I created a new column for all items that have CGSpace handles using this GREL:</li>
</ul>
</li>
</ul>
<div class="highlight"><pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;"><code class="language-console" data-lang="console"><span style="display:flex;"><span>&#34;https://hdl.handle.net/&#34; + value.match(/.*?(10568\/\d+).*?/)[0]
</span></span></code></pre></div><ul>
<li>With that I can do a join on the CGSpace metadata and perhaps clean up some items</li>
</ul>
<div class="highlight"><pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;"><code class="language-console" data-lang="console"><span style="display:flex;"><span>$ ./bin/dspace metadata-export -f 2022-05-30-cgspace.csv
</span></span><span style="display:flex;"><span>$ csvcut -c <span style="color:#e6db74">&#39;id,dc.identifier.uri[en_US],dcterms.accessRights[en_US],dcterms.license[en_US]&#39;</span> 2022-05-30-cgspace.csv | sed <span style="color:#e6db74">&#39;1 s/dc\.identifier\.uri\[en_US\]/dc.identifier.uri/&#39;</span> &gt; /tmp/cgspace.csv
</span></span><span style="display:flex;"><span>$ csvjoin -c <span style="color:#e6db74">&#39;dc.identifier.uri&#39;</span> ~/Downloads/2022-05-30-Altmetric-Research-Outputs-CGSpace.csv /tmp/cgspace.csv &gt; /tmp/cgspace-altmetric.csv
</span></span></code></pre></div><ul>
<li>Examining the data in OpenRefine I spot checked a few records where Altmetric and CGSpace disagree and in most cases I found Altmetric to be wrong&hellip;</li>
</ul>
<!-- raw HTML omitted -->

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/" />
<meta property="og:updated_time" content="2022-05-27T16:46:18+03:00" />
<meta property="og:updated_time" content="2022-05-28T18:25:00+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2022-05-27T16:46:18+03:00" />
<meta property="og:updated_time" content="2022-05-28T18:25:00+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2022-05-27T16:46:18+03:00" />
<meta property="og:updated_time" content="2022-05-28T18:25:00+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2022-05-27T16:46:18+03:00" />
<meta property="og:updated_time" content="2022-05-28T18:25:00+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2022-05-27T16:46:18+03:00" />
<meta property="og:updated_time" content="2022-05-28T18:25:00+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2022-05-27T16:46:18+03:00" />
<meta property="og:updated_time" content="2022-05-28T18:25:00+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2022-05-27T16:46:18+03:00" />
<meta property="og:updated_time" content="2022-05-28T18:25:00+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-05-27T16:46:18+03:00" />
<meta property="og:updated_time" content="2022-05-28T18:25:00+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-05-27T16:46:18+03:00" />
<meta property="og:updated_time" content="2022-05-28T18:25:00+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-05-27T16:46:18+03:00" />
<meta property="og:updated_time" content="2022-05-28T18:25:00+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-05-27T16:46:18+03:00" />
<meta property="og:updated_time" content="2022-05-28T18:25:00+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-05-27T16:46:18+03:00" />
<meta property="og:updated_time" content="2022-05-28T18:25:00+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-05-27T16:46:18+03:00" />
<meta property="og:updated_time" content="2022-05-28T18:25:00+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-05-27T16:46:18+03:00" />
<meta property="og:updated_time" content="2022-05-28T18:25:00+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-05-27T16:46:18+03:00" />
<meta property="og:updated_time" content="2022-05-28T18:25:00+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-05-27T16:46:18+03:00" />
<meta property="og:updated_time" content="2022-05-28T18:25:00+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-05-27T16:46:18+03:00" />
<meta property="og:updated_time" content="2022-05-28T18:25:00+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-05-27T16:46:18+03:00" />
<meta property="og:updated_time" content="2022-05-28T18:25:00+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-05-27T16:46:18+03:00" />
<meta property="og:updated_time" content="2022-05-28T18:25:00+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-05-27T16:46:18+03:00" />
<meta property="og:updated_time" content="2022-05-28T18:25:00+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-05-27T16:46:18+03:00" />
<meta property="og:updated_time" content="2022-05-28T18:25:00+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-05-27T16:46:18+03:00" />
<meta property="og:updated_time" content="2022-05-28T18:25:00+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-05-27T16:46:18+03:00" />
<meta property="og:updated_time" content="2022-05-28T18:25:00+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-05-27T16:46:18+03:00" />
<meta property="og:updated_time" content="2022-05-28T18:25:00+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-05-27T16:46:18+03:00" />
<meta property="og:updated_time" content="2022-05-28T18:25:00+03:00" />

View File

@ -3,19 +3,19 @@
xmlns:xhtml="http://www.w3.org/1999/xhtml">
<url>
<loc>https://alanorth.github.io/cgspace-notes/categories/</loc>
<lastmod>2022-05-27T16:46:18+03:00</lastmod>
<lastmod>2022-05-28T18:25:00+03:00</lastmod>
</url><url>
<loc>https://alanorth.github.io/cgspace-notes/</loc>
<lastmod>2022-05-27T16:46:18+03:00</lastmod>
<lastmod>2022-05-28T18:25:00+03:00</lastmod>
</url><url>
<loc>https://alanorth.github.io/cgspace-notes/2022-05/</loc>
<lastmod>2022-05-27T16:46:18+03:00</lastmod>
<lastmod>2022-05-28T18:25:00+03:00</lastmod>
</url><url>
<loc>https://alanorth.github.io/cgspace-notes/categories/notes/</loc>
<lastmod>2022-05-27T16:46:18+03:00</lastmod>
<lastmod>2022-05-28T18:25:00+03:00</lastmod>
</url><url>
<loc>https://alanorth.github.io/cgspace-notes/posts/</loc>
<lastmod>2022-05-27T16:46:18+03:00</lastmod>
<lastmod>2022-05-28T18:25:00+03:00</lastmod>
</url><url>
<loc>https://alanorth.github.io/cgspace-notes/2022-04/</loc>
<lastmod>2022-05-04T11:09:45+03:00</lastmod>