Add notes for 2022-08-30

2022-08-30 17:45:35 +03:00
parent 49af872267
commit d9e2669a3d
30 changed files with 109 additions and 37 deletions


@ -14,7 +14,7 @@ Our request to add CC-BY-3.0-IGO to SPDX was approved a few weeks ago
<meta property="og:type" content="article" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2022-08/" />
<meta property="article:published_time" content="2022-08-01T10:22:36+03:00" />
<meta property="article:modified_time" content="2022-08-24T21:24:07-07:00" />
<meta property="article:modified_time" content="2022-08-29T04:54:12+03:00" />
@ -34,9 +34,9 @@ Our request to add CC-BY-3.0-IGO to SPDX was approved a few weeks ago
"@type": "BlogPosting",
"headline": "August, 2022",
"url": "https://alanorth.github.io/cgspace-notes/2022-08/",
"wordCount": "2353",
"wordCount": "2651",
"datePublished": "2022-08-01T10:22:36+03:00",
"dateModified": "2022-08-24T21:24:07-07:00",
"dateModified": "2022-08-29T04:54:12+03:00",
"author": {
"@type": "Person",
"name": "Alan Orth"
@ -164,7 +164,7 @@ Our request to add CC-BY-3.0-IGO to SPDX was approved a few weeks ago
<li>Then I used SAFBuilder to create a SimpleItemArchive and import to DSpace Test:</li>
</ul>
<div class="highlight"><pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;"><code class="language-console" data-lang="console"><span style="display:flex;"><span>$ export JAVA_OPTS<span style="color:#f92672">=</span><span style="color:#e6db74">&#34;-Dfile.encoding=UTF-8 -Xmx2048m&#34;</span>
</span></span><span style="display:flex;"><span>$ dspace import --add --eperson<span style="color:#f92672">=</span>aorth@mjanja.ch --source /tmp/SimpleArchiveFormat --mapfile<span style="color:#f92672">=</span>./2022-08-03-innovations.map
</span></span><span style="display:flex;"><span>$ dspace import --add --eperson<span style="color:#f92672">=</span>fuuu@fuuu.com --source /tmp/SimpleArchiveFormat --mapfile<span style="color:#f92672">=</span>./2022-08-03-innovations.map
</span></span></code></pre></div><ul>
<li>Meeting with Mohammed Salem about harmonizing MEL and CGSpace metadata fields
<ul>
@ -454,6 +454,45 @@ Our request to add CC-BY-3.0-IGO to SPDX was approved a few weeks ago
<h2 id="2022-08-29">2022-08-29</h2>
<ul>
<li>Start a harvest on AReS</li>
<li>Meeting with Peter and Abenet about CGSpace issues</li>
<li>I mapped the one MARLO OICR duplicate from the CCAFS Reports collection and deleted it from the OICRs CSV</li>
</ul>
<h2 id="2022-08-30">2022-08-30</h2>
<ul>
<li>Manuel from the &ldquo;Alianza SIDALC&rdquo; in South America contacted me asking for permission to harvest CGSpace and include our content in their system
<ul>
<li>I responded that we would be glad if they harvested us, and that they should use a descriptive user agent so we can contact them in case of any issues or changes on the server</li>
</ul>
</li>
<li>I emailed ILRI ICT to ask how Abenet and I can use the CGSpace Support email address in our email applications because we haven&rsquo;t checked that account in years
<ul>
<li>I tried to log in on office365.com but it gave an error</li>
<li>I got access to the account and cleaned up the inbox, unsubscribed from a bunch of Microsoft and Yammer feeds, etc</li>
</ul>
</li>
<li>Remind Dani, Tariku, and Andrea about the legacy links that we want to update on ILRI&rsquo;s website:
<ul>
<li><a href="http://mahider.ilri.org">http://mahider.ilri.org</a> → <a href="https://cgspace.cgiar.org">https://cgspace.cgiar.org</a></li>
<li><a href="http://mahider.ilri.org/handle/10568/xxxxx">http://mahider.ilri.org/handle/10568/xxxxx</a> → <a href="https://hdl.handle.net/10568/xxxxx">https://hdl.handle.net/10568/xxxxx</a></li>
<li><a href="http://www.ilri.org/ilrinews/index.php/archives/xxxx">http://www.ilri.org/ilrinews/index.php/archives/xxxx</a> → <a href="https://newsarchive.ilri.org/archives/xxxx">https://newsarchive.ilri.org/archives/xxxx</a></li>
</ul>
</li>
<li>Join the OICRs with their relations that I processed a few days ago (minus the second id column and some others):</li>
</ul>
<div class="highlight"><pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;"><code class="language-console" data-lang="console"><span style="display:flex;"><span>$ xsv join --left id ~/Downloads/2022-08-24-OICRs.csv id ~/Downloads/oicrs-matches-csv.csv | xsv <span style="color:#66d9ef">select</span> <span style="color:#e6db74">&#39;!id[1],Your Title,Their Title,Similarity,Your Date,Their Date,datediff&#39;</span> &gt; /tmp/oicrs-with-relations.csv
</span></span></code></pre></div><ul>
<li>Then I cleaned them with csv-metadata-quality to catch some duplicates, add regions, etc and re-imported to OpenRefine
<ul>
<li>I flagged a few duplicates for Jose and he&rsquo;ll let me know what to do with them</li>
</ul>
</li>
<li>I imported the OICRs to DSpace Test:</li>
</ul>
<div class="highlight"><pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;"><code class="language-console" data-lang="console"><span style="display:flex;"><span>$ export JAVA_OPTS<span style="color:#f92672">=</span><span style="color:#e6db74">&#34;-Dfile.encoding=UTF-8 -Xmx2048m&#34;</span>
</span></span><span style="display:flex;"><span>$ dspace import --add --eperson<span style="color:#f92672">=</span>fuuuu@fuuu.com --source /tmp/SimpleArchiveFormat-oicrs --mapfile<span style="color:#f92672">=</span>./2022-08-30-OICRs.map
</span></span></code></pre></div><ul>
<li>Meeting with Marie-Angelique, Abenet, Valentina, Sara, and Margarita about Types</li>
<li>I am testing the <code>org.apache.cocoon.uploads.autosave=false</code> setting for XMLUI so that files posted via multi-part forms get memory mapped instead of written to disk</li>
</ul>
<!-- raw HTML omitted -->