Add notes for 2022-08-30

This commit is contained in:
Alan Orth 2022-08-30 17:45:35 +03:00
parent 49af872267
commit d9e2669a3d
Signed by: alanorth
GPG Key ID: 0FB860CC9C45B1B9
30 changed files with 109 additions and 37 deletions

View File

@ -56,7 +56,7 @@ $ csvjoin --left -c dc.title ~/Downloads/2022-08-03-Innovations-Cleaned.csv ~/Do
```console
$ export JAVA_OPTS="-Dfile.encoding=UTF-8 -Xmx2048m"
$ dspace import --add --eperson=aorth@mjanja.ch --source /tmp/SimpleArchiveFormat --mapfile=./2022-08-03-innovations.map
$ dspace import --add --eperson=fuuu@fuuu.com --source /tmp/SimpleArchiveFormat --mapfile=./2022-08-03-innovations.map
```
- Meeting with Mohammed Salem about harmonizing MEL and CGSpace metadata fields
@ -305,5 +305,36 @@ $ ./ilri/check-duplicates.py -i ~/Downloads/2022-08-24-OICRs.csv -u dspace -db d
## 2022-08-29
- Start a harvest on AReS
- Meeting with Peter and Abenet about CGSpace issues
- I mapped the one MARLO OICR duplicate from the CCAFS Reports collection and deleted it from the OICRs CSV
## 2022-08-30
- Manuel from the "Alianza SIDALC" in South America contacted me asking for permission to harvest CGSpace and include our content in their system
- I responded that we would be glad if they harvested us, and that they should use a useful user agent so we can contact them incase of any issues or changes on the server
- I emailed ILRI ICT to ask how Abenet and I can use the CGSpace Support email address in our email applications because we haven't checked that account in years
- I tried to log in on office365.com but it gave an error
- I got access to the account and cleaned up the inbox, unsubscribed from a bunch of Microsoft and Yammer feeds, etc
- Remind Dani, Tariku, and Andrea about the legacy links that we want to update on ILRI's website:
- http://mahider.ilri.org → https://cgspace.cgiar.org
- http://mahider.ilri.org/handle/10568/xxxxx → https://hdl.handle.net/10568/xxxxx
- http://www.ilri.org/ilrinews/index.php/archives/xxxx → https://newsarchive.ilri.org/archives/xxxx
- Join the OICRs with their relations that I processed a few days ago (minus the second id column and some others):
```console
$ xsv join --left id ~/Downloads/2022-08-24-OICRs.csv id ~/Downloads/oicrs-matches-csv.csv | xsv select '!id[1],Your Title,Their Title,Similarity,Your Date,Their Date,datediff' > /tmp/oicrs-with-relations.csv
```
- Then I cleaned them with csv-metadata-quality to catch some duplicates, add regions, etc and re-imported to OpenRefine
- I flagged a few duplicates for Jose and he'll let me know what to do with them
- I imported the OICRs to DSpace Test:
```console
$ export JAVA_OPTS="-Dfile.encoding=UTF-8 -Xmx2048m"
$ dspace import --add --eperson=fuuuu@fuuu.com --source /tmp/SimpleArchiveFormat-oicrs --mapfile=./2022-08-30-OICRs.map
```
- Meeting with Marie-Angelique, Abenet, Valentina, Sara, and Margarita about Types
- I am testing the `org.apache.cocoon.uploads.autosave=false` setting for XMLUI so that files posted via multi-part forms get memory mapped instead of written to disk
<!-- vim: set sw=2 ts=2: -->

View File

@ -60,7 +60,7 @@ I don&rsquo;t see anything interesting in the web server logs around that time t
"@type": "BlogPosting",
"headline": "January, 2019",
"url": "https://alanorth.github.io/cgspace-notes/2019-01/",
"wordCount": "5509",
"wordCount": "5531",
"datePublished": "2019-01-02T09:48:30+02:00",
"dateModified": "2022-03-22T22:03:59+03:00",
"author": {
@ -949,6 +949,8 @@ $ http &#39;http://localhost:8081/solr/statistics/select?indent=on&amp;rows=0&am
<ul>
<li>Peter noticed that some goo.gl links in our tweets from Feedburner are broken, for example this one from last week:</li>
</ul>
<blockquote class="twitter-tweet"><p lang="en" dir="ltr"><a href="https://twitter.com/hashtag/ILRI?src=hash&amp;ref_src=twsrc%5Etfw">#ILRI</a> research: Towards unlocking the potential of the hides and skins value chain in Somaliland <a href="https://t.co/EZH7ALW4dp">https://t.co/EZH7ALW4dp</a></p>&mdash; ILRI.org (@ILRI) <a href="https://twitter.com/ILRI/status/1086330519904673793?ref_src=twsrc%5Etfw">January 18, 2019</a></blockquote>
<script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script>
<ul>
<li>The shortened link is <a href="goo.gl/fb/VRj9Gq">goo.gl/fb/VRj9Gq</a> and it shows a &ldquo;Dynamic Link not found&rdquo; error from Firebase:</li>

View File

@ -14,7 +14,7 @@ Our request to add CC-BY-3.0-IGO to SPDX was approved a few weeks ago
<meta property="og:type" content="article" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2022-08/" />
<meta property="article:published_time" content="2022-08-01T10:22:36+03:00" />
<meta property="article:modified_time" content="2022-08-24T21:24:07-07:00" />
<meta property="article:modified_time" content="2022-08-29T04:54:12+03:00" />
@ -34,9 +34,9 @@ Our request to add CC-BY-3.0-IGO to SPDX was approved a few weeks ago
"@type": "BlogPosting",
"headline": "August, 2022",
"url": "https://alanorth.github.io/cgspace-notes/2022-08/",
"wordCount": "2353",
"wordCount": "2651",
"datePublished": "2022-08-01T10:22:36+03:00",
"dateModified": "2022-08-24T21:24:07-07:00",
"dateModified": "2022-08-29T04:54:12+03:00",
"author": {
"@type": "Person",
"name": "Alan Orth"
@ -164,7 +164,7 @@ Our request to add CC-BY-3.0-IGO to SPDX was approved a few weeks ago
<li>Then I used SAFBuilder to create a SimpleItemArchive and import to DSpace Test:</li>
</ul>
<div class="highlight"><pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;"><code class="language-console" data-lang="console"><span style="display:flex;"><span>$ export JAVA_OPTS<span style="color:#f92672">=</span><span style="color:#e6db74">&#34;-Dfile.encoding=UTF-8 -Xmx2048m&#34;</span>
</span></span><span style="display:flex;"><span>$ dspace import --add --eperson<span style="color:#f92672">=</span>aorth@mjanja.ch --source /tmp/SimpleArchiveFormat --mapfile<span style="color:#f92672">=</span>./2022-08-03-innovations.map
</span></span><span style="display:flex;"><span>$ dspace import --add --eperson<span style="color:#f92672">=</span>fuuu@fuuu.com --source /tmp/SimpleArchiveFormat --mapfile<span style="color:#f92672">=</span>./2022-08-03-innovations.map
</span></span></code></pre></div><ul>
<li>Meeting with Mohammed Salem about harmonizing MEL and CGSpace metadata fields
<ul>
@ -454,6 +454,45 @@ Our request to add CC-BY-3.0-IGO to SPDX was approved a few weeks ago
<h2 id="2022-08-29">2022-08-29</h2>
<ul>
<li>Start a harvest on AReS</li>
<li>Meeting with Peter and Abenet about CGSpace issues</li>
<li>I mapped the one MARLO OICR duplicate from the CCAFS Reports collection and deleted it from the OICRs CSV</li>
</ul>
<h2 id="2022-08-30">2022-08-30</h2>
<ul>
<li>Manuel from the &ldquo;Alianza SIDALC&rdquo; in South America contacted me asking for permission to harvest CGSpace and include our content in their system
<ul>
<li>I responded that we would be glad if they harvested us, and that they should use a useful user agent so we can contact them incase of any issues or changes on the server</li>
</ul>
</li>
<li>I emailed ILRI ICT to ask how Abenet and I can use the CGSpace Support email address in our email applications because we haven&rsquo;t checked that account in years
<ul>
<li>I tried to log in on office365.com but it gave an error</li>
<li>I got access to the account and cleaned up the inbox, unsubscribed from a bunch of Microsoft and Yammer feeds, etc</li>
</ul>
</li>
<li>Remind Dani, Tariku, and Andrea about the legacy links that we want to update on ILRI&rsquo;s website:
<ul>
<li><a href="http://mahider.ilri.org">http://mahider.ilri.org</a><a href="https://cgspace.cgiar.org">https://cgspace.cgiar.org</a></li>
<li><a href="http://mahider.ilri.org/handle/10568/xxxxx">http://mahider.ilri.org/handle/10568/xxxxx</a><a href="https://hdl.handle.net/10568/xxxxx">https://hdl.handle.net/10568/xxxxx</a></li>
<li><a href="http://www.ilri.org/ilrinews/index.php/archives/xxxx">http://www.ilri.org/ilrinews/index.php/archives/xxxx</a><a href="https://newsarchive.ilri.org/archives/xxxx">https://newsarchive.ilri.org/archives/xxxx</a></li>
</ul>
</li>
<li>Join the OICRs with their relations that I processed a few days ago (minus the second id column and some others):</li>
</ul>
<div class="highlight"><pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;"><code class="language-console" data-lang="console"><span style="display:flex;"><span>$ xsv join --left id ~/Downloads/2022-08-24-OICRs.csv id ~/Downloads/oicrs-matches-csv.csv | xsv <span style="color:#66d9ef">select</span> <span style="color:#e6db74">&#39;!id[1],Your Title,Their Title,Similarity,Your Date,Their Date,datediff&#39;</span> &gt; /tmp/oicrs-with-relations.csv
</span></span></code></pre></div><ul>
<li>Then I cleaned them with csv-metadata-quality to catch some duplicates, add regions, etc and re-imported to OpenRefine
<ul>
<li>I flagged a few duplicates for Jose and he&rsquo;ll let me know what to do with them</li>
</ul>
</li>
<li>I imported the OICRs to DSpace Test:</li>
</ul>
<div class="highlight"><pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;"><code class="language-console" data-lang="console"><span style="display:flex;"><span>$ export JAVA_OPTS<span style="color:#f92672">=</span><span style="color:#e6db74">&#34;-Dfile.encoding=UTF-8 -Xmx2048m&#34;</span>
</span></span><span style="display:flex;"><span>$ dspace import --add --eperson<span style="color:#f92672">=</span>fuuuu@fuuu.com --source /tmp/SimpleArchiveFormat-oicrs --mapfile<span style="color:#f92672">=</span>./2022-08-30-OICRs.map
</span></span></code></pre></div><ul>
<li>Meeting with Marie-Angelique, Abenet, Valentina, Sara, and Margarita about Types</li>
<li>I am testing the <code>org.apache.cocoon.uploads.autosave=false</code> setting for XMLUI so that files posted via multi-part forms get memory mapped instead of written to disk</li>
</ul>
<!-- raw HTML omitted -->

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2022-08-24T21:24:07-07:00" />
<meta property="og:updated_time" content="2022-08-29T04:54:12+03:00" />

View File

@ -3,19 +3,19 @@
xmlns:xhtml="http://www.w3.org/1999/xhtml">
<url>
<loc>https://alanorth.github.io/cgspace-notes/2022-08/</loc>
<lastmod>2022-08-24T21:24:07-07:00</lastmod>
<lastmod>2022-08-29T04:54:12+03:00</lastmod>
</url><url>
<loc>https://alanorth.github.io/cgspace-notes/categories/</loc>
<lastmod>2022-08-24T21:24:07-07:00</lastmod>
<lastmod>2022-08-29T04:54:12+03:00</lastmod>
</url><url>
<loc>https://alanorth.github.io/cgspace-notes/</loc>
<lastmod>2022-08-24T21:24:07-07:00</lastmod>
<lastmod>2022-08-29T04:54:12+03:00</lastmod>
</url><url>
<loc>https://alanorth.github.io/cgspace-notes/categories/notes/</loc>
<lastmod>2022-08-24T21:24:07-07:00</lastmod>
<lastmod>2022-08-29T04:54:12+03:00</lastmod>
</url><url>
<loc>https://alanorth.github.io/cgspace-notes/posts/</loc>
<lastmod>2022-08-24T21:24:07-07:00</lastmod>
<lastmod>2022-08-29T04:54:12+03:00</lastmod>
</url><url>
<loc>https://alanorth.github.io/cgspace-notes/2022-07/</loc>
<lastmod>2022-07-31T15:49:35+03:00</lastmod>