Add notes for 2022-08-23

This commit is contained in:
Alan Orth 2022-08-23 12:14:14 -07:00
parent 8e6c83a5e1
commit 64d5b998f9
Signed by: alanorth
GPG Key ID: 0FB860CC9C45B1B9
29 changed files with 114 additions and 34 deletions

View File

@ -209,4 +209,43 @@ return "||".join(res)
- Then I finished adding the `dcterms.relation` and CRP metadata flagged by Peter on the Gender presentations
- I'm waiting for him to send me the PDFs and then I will upload them to DSpace Test
## 2022-08-21
- Start indexing on AReS
- The load on CGSpace was around 5.0 today, and now that I started the harvesting it's over 10 for an hour now, sigh...
- I'm going to try an experiment to block Googlebot, bingbot, and Yandex for a week to see if the load goes down
## 2022-08-22
- I tried to re-generate the SAF bundle for the MARLO Innovations after improving the AGROVOC subjects and the v3 PDFs, but six are missing from the v3 zip that are present in the original zip:
- ProjectInnovationSummary-WLE-P500-I78.pdf
- ProjectInnovationSummary-WLE-P452-I699.pdf
- ProjectInnovationSummary-WLE-P518-I696.pdf
- ProjectInnovationSummary-WLE-P442-I740.pdf
- ProjectInnovationSummary-WLE-P516-I647.pdf
- ProjectInnovationSummary-WLE-P438-I585.pdf
- I downloaded them manually using the URLs in the original CSV
- I also uploaded a new version of the MELIAs to DSpace Test
## 2022-08-23
- Checking the number of items on CGSpace so we can keep an eye on the 100,000 number:
```console
dspace=# SELECT COUNT(uuid) FROM item WHERE in_archive='t';
count
-------
95716
(1 row)
```
- If I check OAI I see more, but perhaps that counts mapped items multiple times
- Peter said the 303 Gender PPTs were good to go, so I updated the collection mappings and IDs in OpenRefine and then uploaded them to CGSpace:
```console
$ dspace import --add --eperson=fuu@fuu.com --source /tmp/SimpleArchiveFormat --mapfile=./2022-08-23-gender-ppts.map
```
- I created a [GitHub issue for OpenRXV compatibility issues with DSpace 7](https://github.com/ilri/OpenRXV/issues/133)
<!-- vim: set sw=2 ts=2: -->

View File

@ -14,7 +14,7 @@ Our request to add CC-BY-3.0-IGO to SPDX was approved a few weeks ago
<meta property="og:type" content="article" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2022-08/" />
<meta property="article:published_time" content="2022-08-01T10:22:36+03:00" />
<meta property="article:modified_time" content="2022-08-19T21:55:36-07:00" />
<meta property="article:modified_time" content="2022-08-20T22:37:35-07:00" />
@ -34,9 +34,9 @@ Our request to add CC-BY-3.0-IGO to SPDX was approved a few weeks ago
"@type": "BlogPosting",
"headline": "August, 2022",
"url": "https://alanorth.github.io/cgspace-notes/2022-08/",
"wordCount": "1862",
"wordCount": "2068",
"datePublished": "2022-08-01T10:22:36+03:00",
"dateModified": "2022-08-19T21:55:36-07:00",
"dateModified": "2022-08-20T22:37:35-07:00",
"author": {
"@type": "Person",
"name": "Alan Orth"
@ -354,6 +354,47 @@ Our request to add CC-BY-3.0-IGO to SPDX was approved a few weeks ago
</ul>
</li>
</ul>
<h2 id="2022-08-21">2022-08-21</h2>
<ul>
<li>Start indexing on AReS</li>
<li>The load on CGSpace was around 5.0 today, and now that I started the harvesting it&rsquo;s over 10 for an hour now, sigh&hellip;
<ul>
<li>I&rsquo;m going to try an experiment to block Googlebot, bingbot, and Yandex for a week to see if the load goes down</li>
</ul>
</li>
</ul>
<h2 id="2022-08-22">2022-08-22</h2>
<ul>
<li>I tried to re-generate the SAF bundle for the MARLO Innovations after improving the AGROVOC subjects and the v3 PDFs, but six are missing from the v3 zip that are present in the original zip:
<ul>
<li>ProjectInnovationSummary-WLE-P500-I78.pdf</li>
<li>ProjectInnovationSummary-WLE-P452-I699.pdf</li>
<li>ProjectInnovationSummary-WLE-P518-I696.pdf</li>
<li>ProjectInnovationSummary-WLE-P442-I740.pdf</li>
<li>ProjectInnovationSummary-WLE-P516-I647.pdf</li>
<li>ProjectInnovationSummary-WLE-P438-I585.pdf</li>
</ul>
</li>
<li>I downloaded them manually using the URLs in the original CSV</li>
<li>I also uploaded a new version of the MELIAs to DSpace Test</li>
</ul>
<h2 id="2022-08-23">2022-08-23</h2>
<ul>
<li>Checking the number of items on CGSpace so we can keep an eye on the 100,000 number:</li>
</ul>
<div class="highlight"><pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;"><code class="language-console" data-lang="console"><span style="display:flex;"><span>dspace=# SELECT COUNT(uuid) FROM item WHERE in_archive=&#39;t&#39;;
</span></span><span style="display:flex;"><span> count
</span></span><span style="display:flex;"><span>-------
</span></span><span style="display:flex;"><span> 95716
</span></span><span style="display:flex;"><span>(1 row)
</span></span></code></pre></div><ul>
<li>If I check OAI I see more, but perhaps that counts mapped items multiple times</li>
<li>Peter said the 303 Gender PPTs were good to go, so I updated the collection mappings and IDs in OpenRefine and then uploaded them to CGSpace:</li>
</ul>
<div class="highlight"><pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;"><code class="language-console" data-lang="console"><span style="display:flex;"><span>$ dspace import --add --eperson<span style="color:#f92672">=</span>fuu@fuu.com --source /tmp/SimpleArchiveFormat --mapfile<span style="color:#f92672">=</span>./2022-08-23-gender-ppts.map
</span></span></code></pre></div><ul>
<li>I created a <a href="https://github.com/ilri/OpenRXV/issues/133">GitHub issue for OpenRXV compatibility issues with DSpace 7</a></li>
</ul>
<!-- raw HTML omitted -->

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-08-19T21:55:36-07:00" />
<meta property="og:updated_time" content="2022-08-20T22:37:35-07:00" />

View File

@ -3,19 +3,19 @@
xmlns:xhtml="http://www.w3.org/1999/xhtml">
<url>
<loc>https://alanorth.github.io/cgspace-notes/2022-08/</loc>
<lastmod>2022-08-19T21:55:36-07:00</lastmod>
<lastmod>2022-08-20T22:37:35-07:00</lastmod>
</url><url>
<loc>https://alanorth.github.io/cgspace-notes/categories/</loc>
<lastmod>2022-08-19T21:55:36-07:00</lastmod>
<lastmod>2022-08-20T22:37:35-07:00</lastmod>
</url><url>
<loc>https://alanorth.github.io/cgspace-notes/</loc>
<lastmod>2022-08-19T21:55:36-07:00</lastmod>
<lastmod>2022-08-20T22:37:35-07:00</lastmod>
</url><url>
<loc>https://alanorth.github.io/cgspace-notes/categories/notes/</loc>
<lastmod>2022-08-19T21:55:36-07:00</lastmod>
<lastmod>2022-08-20T22:37:35-07:00</lastmod>
</url><url>
<loc>https://alanorth.github.io/cgspace-notes/posts/</loc>
<lastmod>2022-08-19T21:55:36-07:00</lastmod>
<lastmod>2022-08-20T22:37:35-07:00</lastmod>
</url><url>
<loc>https://alanorth.github.io/cgspace-notes/2022-07/</loc>
<lastmod>2022-07-31T15:49:35+03:00</lastmod>