Add notes for 2022-08-23

This commit is contained in:
2022-08-23 12:14:14 -07:00
parent 8e6c83a5e1
commit 64d5b998f9
29 changed files with 114 additions and 34 deletions

View File

@ -14,7 +14,7 @@ Our request to add CC-BY-3.0-IGO to SPDX was approved a few weeks ago
<meta property="og:type" content="article" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2022-08/" />
<meta property="article:published_time" content="2022-08-01T10:22:36+03:00" />
<meta property="article:modified_time" content="2022-08-19T21:55:36-07:00" />
<meta property="article:modified_time" content="2022-08-20T22:37:35-07:00" />
@ -34,9 +34,9 @@ Our request to add CC-BY-3.0-IGO to SPDX was approved a few weeks ago
"@type": "BlogPosting",
"headline": "August, 2022",
"url": "https://alanorth.github.io/cgspace-notes/2022-08/",
"wordCount": "1862",
"wordCount": "2068",
"datePublished": "2022-08-01T10:22:36+03:00",
"dateModified": "2022-08-19T21:55:36-07:00",
"dateModified": "2022-08-20T22:37:35-07:00",
"author": {
"@type": "Person",
"name": "Alan Orth"
@ -354,6 +354,47 @@ Our request to add CC-BY-3.0-IGO to SPDX was approved a few weeks ago
</ul>
</li>
</ul>
<h2 id="2022-08-21">2022-08-21</h2>
<ul>
<li>Start indexing on AReS</li>
<li>The load on CGSpace was around 5.0 today, and now that I started the harvesting it&rsquo;s over 10 for an hour now, sigh&hellip;
<ul>
<li>I&rsquo;m going to try an experiment to block Googlebot, bingbot, and Yandex for a week to see if the load goes down</li>
</ul>
</li>
</ul>
<h2 id="2022-08-22">2022-08-22</h2>
<ul>
<li>I tried to re-generate the SAF bundle for the MARLO Innovations after improving the AGROVOC subjects and the v3 PDFs, but six are missing from the v3 zip that are present in the original zip:
<ul>
<li>ProjectInnovationSummary-WLE-P500-I78.pdf</li>
<li>ProjectInnovationSummary-WLE-P452-I699.pdf</li>
<li>ProjectInnovationSummary-WLE-P518-I696.pdf</li>
<li>ProjectInnovationSummary-WLE-P442-I740.pdf</li>
<li>ProjectInnovationSummary-WLE-P516-I647.pdf</li>
<li>ProjectInnovationSummary-WLE-P438-I585.pdf</li>
</ul>
</li>
<li>I downloaded them manually using the URLs in the original CSV</li>
<li>I also uploaded a new version of the MELIAs to DSpace Test</li>
</ul>
<h2 id="2022-08-23">2022-08-23</h2>
<ul>
<li>Checking the number of items on CGSpace so we can keep an eye on the 100,000 number:</li>
</ul>
<div class="highlight"><pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;"><code class="language-console" data-lang="console"><span style="display:flex;"><span>dspace=# SELECT COUNT(uuid) FROM item WHERE in_archive=&#39;t&#39;;
</span></span><span style="display:flex;"><span> count
</span></span><span style="display:flex;"><span>-------
</span></span><span style="display:flex;"><span> 95716
</span></span><span style="display:flex;"><span>(1 row)
</span></span></code></pre></div><ul>
<li>If I check OAI I see more, but perhaps that counts mapped items multiple times</li>
<li>Peter said the 303 Gender PPTs were good to go, so I updated the collection mappings and IDs in OpenRefine and then uploaded them to CGSpace:</li>
</ul>
<div class="highlight"><pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;"><code class="language-console" data-lang="console"><span style="display:flex;"><span>$ dspace import --add --eperson<span style="color:#f92672">=</span>fuu@fuu.com --source /tmp/SimpleArchiveFormat --mapfile<span style="color:#f92672">=</span>./2022-08-23-gender-ppts.map
</span></span></code></pre></div><ul>
<li>I created a <a href="https://github.com/ilri/OpenRXV/issues/133">GitHub issue for OpenRXV compatibility issues with DSpace 7</a></li>
</ul>
<!-- raw HTML omitted -->