Add notes

This commit is contained in:
2023-11-08 08:20:31 +03:00
parent c6d514bef9
commit 01fb17950b
145 changed files with 1724 additions and 1787 deletions

View File

@ -10,14 +10,14 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2023-10-24T23:26:01+03:00" />
<meta property="og:updated_time" content="2023-11-02T20:58:43+03:00" />
<meta name="twitter:card" content="summary"/>
<meta name="twitter:title" content="Notes"/>
<meta name="twitter:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."/>
<meta name="generator" content="Hugo 0.119.0">
<meta name="generator" content="Hugo 0.120.3">
@ -81,6 +81,36 @@
<article class="blog-post">
<header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2023-11/">November, 2023</a></h2>
<p class="blog-post-meta"><time datetime="2023-11-02T12:59:36+03:00">Thu Nov 02, 2023</time> by Alan Orth in
<span class="fas fa-folder" aria-hidden="true"></span>&nbsp;<a href="/categories/notes/" rel="category tag">Notes</a>
</p>
</header>
<h2 id="2023-11-01">2023-11-01</h2>
<ul>
<li>Work a bit on the ETL pipeline for the CGIAR Climate Change Synthesis
<ul>
<li>I improved the filtering and wrote some Python using pandas to merge my sources more reliably</li>
</ul>
</li>
</ul>
<h2 id="2023-11-02">2023-11-02</h2>
<ul>
<li>Export CGSpace to check missing Initiative collection mappings</li>
<li>Start a harvest on AReS</li>
</ul>
<a href='https://alanorth.github.io/cgspace-notes/2023-11/'>Read more →</a>
</article>
<article class="blog-post">
<header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2023-10/">October, 2023</a></h2>
@ -311,32 +341,6 @@
<article class="blog-post">
<header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2023-01/">January, 2023</a></h2>
<p class="blog-post-meta"><time datetime="2023-01-01T08:44:36+03:00">Sun Jan 01, 2023</time> by Alan Orth in
<span class="fas fa-folder" aria-hidden="true"></span>&nbsp;<a href="/categories/notes/" rel="category tag">Notes</a>
</p>
</header>
<h2 id="2023-01-01">2023-01-01</h2>
<ul>
<li>Apply some more ORCID identifiers to items on CGSpace using my <code>2022-09-22-add-orcids.csv</code> file
<ul>
<li>I want to update all ORCID names and refresh them in the database</li>
<li>I see we have some new ones that aren&rsquo;t in our list if I combine with this file:</li>
</ul>
</li>
</ul>
<a href='https://alanorth.github.io/cgspace-notes/2023-01/'>Read more →</a>
</article>
<nav class="blog-pagination">
@ -361,6 +365,8 @@
<ol class="list-unstyled">
<li><a href="/cgspace-notes/2023-11/">November, 2023</a></li>
<li><a href="/cgspace-notes/2023-10/">October, 2023</a></li>
<li><a href="/cgspace-notes/2023-09/">September, 2023</a></li>
@ -369,8 +375,6 @@
<li><a href="/cgspace-notes/2023-07/">July, 2023</a></li>
<li><a href="/cgspace-notes/2023-06/">June, 2023</a></li>
</ol>
</section>

View File

@ -6,12 +6,31 @@
<description>Recent content in Notes on CGSpace Notes</description>
<generator>Hugo -- gohugo.io</generator>
<language>en-us</language>
<lastBuildDate>Mon, 02 Oct 2023 09:05:36 +0300</lastBuildDate><atom:link href="https://alanorth.github.io/cgspace-notes/categories/notes/index.xml" rel="self" type="application/rss+xml" />
<lastBuildDate>Thu, 02 Nov 2023 12:59:36 +0300</lastBuildDate>
<atom:link href="https://alanorth.github.io/cgspace-notes/categories/notes/index.xml" rel="self" type="application/rss+xml" />
<item>
<title>November, 2023</title>
<link>https://alanorth.github.io/cgspace-notes/2023-11/</link>
<pubDate>Thu, 02 Nov 2023 12:59:36 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2023-11/</guid>
<description>&lt;h2 id=&#34;2023-11-01&#34;&gt;2023-11-01&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Work a bit on the ETL pipeline for the CGIAR Climate Change Synthesis
&lt;ul&gt;
&lt;li&gt;I improved the filtering and wrote some Python using pandas to merge my sources more reliably&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;2023-11-02&#34;&gt;2023-11-02&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Export CGSpace to check missing Initiative collection mappings&lt;/li&gt;
&lt;li&gt;Start a harvest on AReS&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>October, 2023</title>
<link>https://alanorth.github.io/cgspace-notes/2023-10/</link>
<pubDate>Mon, 02 Oct 2023 09:05:36 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2023-10/</guid>
<description>&lt;h2 id=&#34;2023-10-02&#34;&gt;2023-10-02&lt;/h2&gt;
&lt;ul&gt;
@ -24,12 +43,10 @@
&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>September, 2023</title>
<link>https://alanorth.github.io/cgspace-notes/2023-09/</link>
<pubDate>Sat, 02 Sep 2023 17:29:36 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2023-09/</guid>
<description>&lt;h2 id=&#34;2023-09-02&#34;&gt;2023-09-02&lt;/h2&gt;
&lt;ul&gt;
@ -37,12 +54,10 @@
&lt;li&gt;Start a harvest on AReS&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>August, 2023</title>
<link>https://alanorth.github.io/cgspace-notes/2023-08/</link>
<pubDate>Thu, 03 Aug 2023 11:18:36 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2023-08/</guid>
<description>&lt;h2 id=&#34;2023-08-03&#34;&gt;2023-08-03&lt;/h2&gt;
&lt;ul&gt;
@ -54,21 +69,17 @@
&lt;li&gt;Start working on some batch uploads for IFPRI&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>July, 2023</title>
<link>https://alanorth.github.io/cgspace-notes/2023-07/</link>
<pubDate>Sat, 01 Jul 2023 17:14:36 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2023-07/</guid>
<description>2023-07-01 Export CGSpace to check for missing Initiative collection mappings Start harvesting on AReS 2023-07-02 Minor edits to the crossref_doi_lookup.py script while running some checks from 22,000 CGSpace DOIs 2023-07-03 I analyzed the licenses declared by Crossref and found with high confidence that ~400 of ours were incorrect I took the more accurate ones from Crossref and updated the items on CGSpace I took a few hundred ISBNs as well for where we were missing them I also tagged ~4,700 items with missing licenses as &amp;ldquo;Copyrighted; all rights reserved&amp;rdquo; based on their Crossref license status being TDM, mostly from Elsevier, Wiley, and Springer Checking a dozen or so manually, I confirmed that if Crossref only has a TDM license then it&amp;rsquo;s usually copyrighted (could still be open access, but we can&amp;rsquo;t tell via Crossref) I would be curious to write a script to check the Unpaywall API for open access status&amp;hellip; In the past I found that their license status was not very accurate, but the open access status might be more reliable More minor work on the DSpace 7 item views I learned some new Angular template syntax I created a custom component to show Creative Commons licenses on the simple item page I also decided that I don&amp;rsquo;t like the Impact Area icons as a component because they don&amp;rsquo;t have any visual meaning 2023-07-04 Focus group meeting with CGSpace partners about DSpace 7 I added a themed file selection component to the CGSpace theme It displays the bistream description instead of the file name, just like we did in DSpace 6 XMLUI I added a custom component to show share icons 2023-07-05 I spent some time trying to update OpenRXV from Angular 9 to 10 to 11 to 12 to 13 Most things work but there are some minor bugs it seems Mishell from CIP emailed me to say she was having problems approving an item on CGSpace Looking at PostgreSQL I saw there were a dozen or so locks that were several hours and even over one day old so I killed those processes and told her to try again 2023-07-06 Types meeting I wrote a Python script to check Unpaywall for some information about DOIs 2023-07-7 Continue exploring Unpaywall data for some of our DOIs In the past I&amp;rsquo;ve found their licensing information to not be very reliable (preferring Crossref), but I think their open access status is more reliable, especially when the provider is listed as being the publisher Even so, sometimes the version can be &amp;ldquo;acceptedVersion&amp;rdquo;, which is presumably the author&amp;rsquo;s version, as opposed to the &amp;ldquo;publishedVersion&amp;rdquo;, which means it&amp;rsquo;s available as open access on the publisher&amp;rsquo;s website I did some quality assurance and found ~100 that were marked as Limited Access, but should have been Open Access, and fixed a handful of licenses Delete duplicate metadata as describe in my DSpace issue from last year: https://github.</description>
</item>
<item>
<title>June, 2023</title>
<link>https://alanorth.github.io/cgspace-notes/2023-06/</link>
<pubDate>Fri, 02 Jun 2023 10:29:36 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2023-06/</guid>
<description>&lt;h2 id=&#34;2023-06-02&#34;&gt;2023-06-02&lt;/h2&gt;
&lt;ul&gt;
@ -85,12 +96,10 @@
&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>May, 2023</title>
<link>https://alanorth.github.io/cgspace-notes/2023-05/</link>
<pubDate>Wed, 03 May 2023 08:53:36 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2023-05/</guid>
<description>&lt;h2 id=&#34;2023-05-03&#34;&gt;2023-05-03&lt;/h2&gt;
&lt;ul&gt;
@ -108,12 +117,10 @@
&lt;li&gt;Work on cleaning, proofing, and uploading twenty-seven records for IFPRI to CGSpace&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>April, 2023</title>
<link>https://alanorth.github.io/cgspace-notes/2023-04/</link>
<pubDate>Sun, 02 Apr 2023 08:19:36 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2023-04/</guid>
<description>&lt;h2 id=&#34;2023-04-02&#34;&gt;2023-04-02&lt;/h2&gt;
&lt;ul&gt;
@ -126,12 +133,10 @@
&lt;li&gt;Start a harvest on AReS&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>March, 2023</title>
<link>https://alanorth.github.io/cgspace-notes/2023-03/</link>
<pubDate>Wed, 01 Mar 2023 07:58:36 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2023-03/</guid>
<description>&lt;h2 id=&#34;2023-03-01&#34;&gt;2023-03-01&lt;/h2&gt;
&lt;ul&gt;
@ -140,12 +145,10 @@
&lt;li&gt;I finally got through with porting the input form from DSpace 6 to DSpace 7&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>February, 2023</title>
<link>https://alanorth.github.io/cgspace-notes/2023-02/</link>
<pubDate>Wed, 01 Feb 2023 10:57:36 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2023-02/</guid>
<description>&lt;h2 id=&#34;2023-02-01&#34;&gt;2023-02-01&lt;/h2&gt;
&lt;ul&gt;
@ -156,12 +159,10 @@
&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>January, 2023</title>
<link>https://alanorth.github.io/cgspace-notes/2023-01/</link>
<pubDate>Sun, 01 Jan 2023 08:44:36 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2023-01/</guid>
<description>&lt;h2 id=&#34;2023-01-01&#34;&gt;2023-01-01&lt;/h2&gt;
&lt;ul&gt;
@ -173,12 +174,10 @@
&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>December, 2022</title>
<link>https://alanorth.github.io/cgspace-notes/2022-12/</link>
<pubDate>Thu, 01 Dec 2022 08:52:36 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2022-12/</guid>
<description>&lt;h2 id=&#34;2022-12-01&#34;&gt;2022-12-01&lt;/h2&gt;
&lt;ul&gt;
@ -191,12 +190,10 @@
&lt;li&gt;Replace &amp;ldquo;East Asia&amp;rdquo; with &amp;ldquo;Eastern Asia&amp;rdquo; region on CGSpace (UN M.49 region)&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>November, 2022</title>
<link>https://alanorth.github.io/cgspace-notes/2022-11/</link>
<pubDate>Tue, 01 Nov 2022 09:11:36 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2022-11/</guid>
<description>&lt;h2 id=&#34;2022-11-01&#34;&gt;2022-11-01&lt;/h2&gt;
&lt;ul&gt;
@ -213,12 +210,10 @@
&lt;li&gt;I reverted the Cocoon autosave change because it was more of a nuissance that Peter can&amp;rsquo;t upload CSVs from the web interface and is a very low severity security issue&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>October, 2022</title>
<link>https://alanorth.github.io/cgspace-notes/2022-10/</link>
<pubDate>Sat, 01 Oct 2022 19:45:36 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2022-10/</guid>
<description>&lt;h2 id=&#34;2022-10-01&#34;&gt;2022-10-01&lt;/h2&gt;
&lt;ul&gt;
@ -231,12 +226,10 @@
&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>September, 2022</title>
<link>https://alanorth.github.io/cgspace-notes/2022-09/</link>
<pubDate>Thu, 01 Sep 2022 09:41:36 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2022-09/</guid>
<description>&lt;h2 id=&#34;2022-09-01&#34;&gt;2022-09-01&lt;/h2&gt;
&lt;ul&gt;
@ -254,24 +247,20 @@
&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>August, 2022</title>
<link>https://alanorth.github.io/cgspace-notes/2022-08/</link>
<pubDate>Mon, 01 Aug 2022 10:22:36 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2022-08/</guid>
<description>&lt;h2 id=&#34;2022-08-01&#34;&gt;2022-08-01&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Our request to add &lt;a href=&#34;https://github.com/spdx/license-list-XML/issues/1525&#34;&gt;CC-BY-3.0-IGO to SPDX&lt;/a&gt; was approved a few weeks ago&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>July, 2022</title>
<link>https://alanorth.github.io/cgspace-notes/2022-07/</link>
<pubDate>Sat, 02 Jul 2022 14:07:36 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2022-07/</guid>
<description>&lt;h2 id=&#34;2022-07-02&#34;&gt;2022-07-02&lt;/h2&gt;
&lt;ul&gt;
@ -283,12 +272,10 @@
&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>June, 2022</title>
<link>https://alanorth.github.io/cgspace-notes/2022-06/</link>
<pubDate>Mon, 06 Jun 2022 09:01:36 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2022-06/</guid>
<description>&lt;h2 id=&#34;2022-06-06&#34;&gt;2022-06-06&lt;/h2&gt;
&lt;ul&gt;
@ -307,12 +294,10 @@
&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>May, 2022</title>
<link>https://alanorth.github.io/cgspace-notes/2022-05/</link>
<pubDate>Wed, 04 May 2022 09:13:39 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2022-05/</guid>
<description>&lt;h2 id=&#34;2022-05-04&#34;&gt;2022-05-04&lt;/h2&gt;
&lt;ul&gt;
@ -340,21 +325,17 @@
&lt;li&gt;I purged 93,974 hits from these IPs using my &lt;code&gt;check-spider-ip-hits.sh&lt;/code&gt; script&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>April, 2022</title>
<link>https://alanorth.github.io/cgspace-notes/2022-04/</link>
<pubDate>Fri, 01 Apr 2022 10:53:39 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2022-04/</guid>
<description>2022-04-01 I did G1GC tests on DSpace Test (linode26) to compliment the CMS tests I did yesterday The Discovery indexing took this long: real 334m33.625s user 227m51.331s sys 3m43.037s 2022-04-04 Start a full harvest on AReS Help Marianne with submit/approve access on a new collection on CGSpace Go back in Gaia&amp;rsquo;s batch reports to find records that she indicated for replacing on CGSpace (ie, those with better new copies, new versions, etc) Looking at the Solr statistics for 2022-03 on CGSpace I see 54.</description>
</item>
<item>
<title>March, 2022</title>
<link>https://alanorth.github.io/cgspace-notes/2022-03/</link>
<pubDate>Tue, 01 Mar 2022 16:46:54 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2022-03/</guid>
<description>&lt;h2 id=&#34;2022-03-01&#34;&gt;2022-03-01&lt;/h2&gt;
&lt;ul&gt;
@ -366,12 +347,10 @@
&lt;/span&gt;&lt;/span&gt;&lt;span style=&#34;display:flex;&#34;&gt;&lt;span&gt;$ csvjoin -c id /tmp/2022-03-01-tac-batch4-701-980.csv /tmp/tac4-filenames.csv &amp;gt; /tmp/2022-03-01-tac-batch4-701-980-filenames.csv
&lt;/span&gt;&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;</description>
</item>
<item>
<title>February, 2022</title>
<link>https://alanorth.github.io/cgspace-notes/2022-02/</link>
<pubDate>Tue, 01 Feb 2022 14:06:54 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2022-02/</guid>
<description>&lt;h2 id=&#34;2022-02-01&#34;&gt;2022-02-01&lt;/h2&gt;
&lt;ul&gt;
@ -385,24 +364,20 @@
&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>January, 2022</title>
<link>https://alanorth.github.io/cgspace-notes/2022-01/</link>
<pubDate>Sat, 01 Jan 2022 15:20:54 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2022-01/</guid>
<description>&lt;h2 id=&#34;2022-01-01&#34;&gt;2022-01-01&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Start a full harvest on AReS&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>December, 2021</title>
<link>https://alanorth.github.io/cgspace-notes/2021-12/</link>
<pubDate>Wed, 01 Dec 2021 16:07:07 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2021-12/</guid>
<description>&lt;h2 id=&#34;2021-12-01&#34;&gt;2021-12-01&lt;/h2&gt;
&lt;ul&gt;
@ -417,12 +392,10 @@
&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style=&#34;display:flex;&#34;&gt;&lt;span&gt;&lt;span style=&#34;color:#960050;background-color:#1e0010&#34;&gt;&lt;/span&gt;Total number of bot hits purged: 3679
&lt;/span&gt;&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;</description>
</item>
<item>
<title>November, 2021</title>
<link>https://alanorth.github.io/cgspace-notes/2021-11/</link>
<pubDate>Tue, 02 Nov 2021 22:27:07 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2021-11/</guid>
<description>&lt;h2 id=&#34;2021-11-02&#34;&gt;2021-11-02&lt;/h2&gt;
&lt;ul&gt;
@ -433,12 +406,10 @@
&lt;/span&gt;&lt;/span&gt;&lt;span style=&#34;display:flex;&#34;&gt;&lt;span&gt;$ zstd statistics-2019.json
&lt;/span&gt;&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;</description>
</item>
<item>
<title>October, 2021</title>
<link>https://alanorth.github.io/cgspace-notes/2021-10/</link>
<pubDate>Fri, 01 Oct 2021 11:14:07 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2021-10/</guid>
<description>&lt;h2 id=&#34;2021-10-01&#34;&gt;2021-10-01&lt;/h2&gt;
&lt;ul&gt;
@ -456,12 +427,10 @@
&lt;li&gt;So we have 1879/7100 (26.46%) matching already&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>September, 2021</title>
<link>https://alanorth.github.io/cgspace-notes/2021-09/</link>
<pubDate>Wed, 01 Sep 2021 09:14:07 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2021-09/</guid>
<description>&lt;h2 id=&#34;2021-09-02&#34;&gt;2021-09-02&lt;/h2&gt;
&lt;ul&gt;
@ -480,12 +449,10 @@
&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>August, 2021</title>
<link>https://alanorth.github.io/cgspace-notes/2021-08/</link>
<pubDate>Sun, 01 Aug 2021 09:01:07 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2021-08/</guid>
<description>&lt;h2 id=&#34;2021-08-01&#34;&gt;2021-08-01&lt;/h2&gt;
&lt;ul&gt;
@ -496,12 +463,10 @@
&lt;li&gt;I decided to upgrade linode20 from Ubuntu 18.04 to 20.04&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>July, 2021</title>
<link>https://alanorth.github.io/cgspace-notes/2021-07/</link>
<pubDate>Thu, 01 Jul 2021 08:53:07 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2021-07/</guid>
<description>&lt;h2 id=&#34;2021-07-01&#34;&gt;2021-07-01&lt;/h2&gt;
&lt;ul&gt;
@ -511,12 +476,10 @@
&lt;/span&gt;&lt;/span&gt;&lt;span style=&#34;display:flex;&#34;&gt;&lt;span&gt;COPY 20994
&lt;/span&gt;&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;</description>
</item>
<item>
<title>June, 2021</title>
<link>https://alanorth.github.io/cgspace-notes/2021-06/</link>
<pubDate>Tue, 01 Jun 2021 10:51:07 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2021-06/</guid>
<description>&lt;h2 id=&#34;2021-06-01&#34;&gt;2021-06-01&lt;/h2&gt;
&lt;ul&gt;
@ -529,12 +492,10 @@
&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>May, 2021</title>
<link>https://alanorth.github.io/cgspace-notes/2021-05/</link>
<pubDate>Sun, 02 May 2021 09:50:54 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2021-05/</guid>
<description>&lt;h2 id=&#34;2021-05-01&#34;&gt;2021-05-01&lt;/h2&gt;
&lt;ul&gt;
@ -547,12 +508,10 @@
&lt;li&gt;I will add the RI/1.0 pattern to our DSpace agents overload and purge them from Solr (we had previously seen this agent with 9,000 hits or so in 2020-09), but I think I will leave the Microsoft Word one&amp;hellip; as that&amp;rsquo;s an actual user&amp;hellip;&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>April, 2021</title>
<link>https://alanorth.github.io/cgspace-notes/2021-04/</link>
<pubDate>Thu, 01 Apr 2021 09:50:54 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2021-04/</guid>
<description>&lt;h2 id=&#34;2021-04-01&#34;&gt;2021-04-01&lt;/h2&gt;
&lt;ul&gt;
@ -569,12 +528,10 @@
&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>March, 2021</title>
<link>https://alanorth.github.io/cgspace-notes/2021-03/</link>
<pubDate>Mon, 01 Mar 2021 10:13:54 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2021-03/</guid>
<description>&lt;h2 id=&#34;2021-03-01&#34;&gt;2021-03-01&lt;/h2&gt;
&lt;ul&gt;
@ -586,22 +543,18 @@
&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>CGSpace CG Core v2 Migration</title>
<link>https://alanorth.github.io/cgspace-notes/cgspace-cgcorev2-migration/</link>
<pubDate>Sun, 21 Feb 2021 13:27:35 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/cgspace-cgcorev2-migration/</guid>
<description>&lt;p&gt;Changes to CGSpace metadata fields to align more with DC, QDC, and DCTERMS as well as CG Core v2. Implemented on 2021-02-21.&lt;/p&gt;
&lt;p&gt;With reference to &lt;a href=&#34;https://agriculturalsemantics.github.io/cg-core/cgcore.html&#34;&gt;CG Core v2 draft standard&lt;/a&gt; by Marie-Angélique as well as &lt;a href=&#34;http://www.dublincore.org/specifications/dublin-core/dcmi-terms/&#34;&gt;DCMI DCTERMS&lt;/a&gt;.&lt;/p&gt;</description>
</item>
<item>
<title>February, 2021</title>
<link>https://alanorth.github.io/cgspace-notes/2021-02/</link>
<pubDate>Mon, 01 Feb 2021 10:13:54 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2021-02/</guid>
<description>&lt;h2 id=&#34;2021-02-01&#34;&gt;2021-02-01&lt;/h2&gt;
&lt;ul&gt;
@ -626,12 +579,10 @@
&lt;/span&gt;&lt;/span&gt;&lt;span style=&#34;display:flex;&#34;&gt;&lt;span&gt;}
&lt;/span&gt;&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;</description>
</item>
<item>
<title>January, 2021</title>
<link>https://alanorth.github.io/cgspace-notes/2021-01/</link>
<pubDate>Sun, 03 Jan 2021 10:13:54 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2021-01/</guid>
<description>&lt;h2 id=&#34;2021-01-03&#34;&gt;2021-01-03&lt;/h2&gt;
&lt;ul&gt;
@ -651,12 +602,10 @@
&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>December, 2020</title>
<link>https://alanorth.github.io/cgspace-notes/2020-12/</link>
<pubDate>Tue, 01 Dec 2020 11:32:54 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2020-12/</guid>
<description>&lt;h2 id=&#34;2020-12-01&#34;&gt;2020-12-01&lt;/h2&gt;
&lt;ul&gt;
@ -669,21 +618,17 @@
&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>CGSpace DSpace 6 Upgrade</title>
<link>https://alanorth.github.io/cgspace-notes/cgspace-dspace6-upgrade/</link>
<pubDate>Sun, 15 Nov 2020 13:27:35 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/cgspace-dspace6-upgrade/</guid>
<description>&lt;p&gt;Notes about the DSpace 6 upgrade on CGSpace in 2020-11.&lt;/p&gt;</description>
</item>
<item>
<title>November, 2020</title>
<link>https://alanorth.github.io/cgspace-notes/2020-11/</link>
<pubDate>Sun, 01 Nov 2020 13:11:54 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2020-11/</guid>
<description>&lt;h2 id=&#34;2020-11-01&#34;&gt;2020-11-01&lt;/h2&gt;
&lt;ul&gt;
@ -694,12 +639,10 @@
&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>October, 2020</title>
<link>https://alanorth.github.io/cgspace-notes/2020-10/</link>
<pubDate>Tue, 06 Oct 2020 16:55:54 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2020-10/</guid>
<description>&lt;h2 id=&#34;2020-10-06&#34;&gt;2020-10-06&lt;/h2&gt;
&lt;ul&gt;
@ -716,12 +659,10 @@
&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>September, 2020</title>
<link>https://alanorth.github.io/cgspace-notes/2020-09/</link>
<pubDate>Wed, 02 Sep 2020 15:35:54 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2020-09/</guid>
<description>&lt;h2 id=&#34;2020-09-02&#34;&gt;2020-09-02&lt;/h2&gt;
&lt;ul&gt;
@ -740,12 +681,10 @@
&lt;li&gt;I filed an issue on OpenRXV to make some minor edits to the admin UI: &lt;a href=&#34;https://github.com/ilri/OpenRXV/issues/40&#34;&gt;https://github.com/ilri/OpenRXV/issues/40&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>August, 2020</title>
<link>https://alanorth.github.io/cgspace-notes/2020-08/</link>
<pubDate>Sun, 02 Aug 2020 15:35:54 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2020-08/</guid>
<description>&lt;h2 id=&#34;2020-08-02&#34;&gt;2020-08-02&lt;/h2&gt;
&lt;ul&gt;
@ -758,12 +697,10 @@
&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>July, 2020</title>
<link>https://alanorth.github.io/cgspace-notes/2020-07/</link>
<pubDate>Wed, 01 Jul 2020 10:53:54 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2020-07/</guid>
<description>&lt;h2 id=&#34;2020-07-01&#34;&gt;2020-07-01&lt;/h2&gt;
&lt;ul&gt;
@ -777,12 +714,10 @@
&lt;li&gt;Since I was restarting Tomcat anyways I decided to redeploy the latest changes from the &lt;code&gt;5_x-prod&lt;/code&gt; branch and I added a note about COVID-19 items to the CGSpace frontpage at Peter&amp;rsquo;s request&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>June, 2020</title>
<link>https://alanorth.github.io/cgspace-notes/2020-06/</link>
<pubDate>Mon, 01 Jun 2020 13:55:39 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2020-06/</guid>
<description>&lt;h2 id=&#34;2020-06-01&#34;&gt;2020-06-01&lt;/h2&gt;
&lt;ul&gt;
@ -795,12 +730,10 @@
&lt;li&gt;I tried to build the OAI registry on the freshly migrated DSpace 6 on DSpace Test and I get an error:&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>May, 2020</title>
<link>https://alanorth.github.io/cgspace-notes/2020-05/</link>
<pubDate>Sat, 02 May 2020 09:52:04 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2020-05/</guid>
<description>&lt;h2 id=&#34;2020-05-02&#34;&gt;2020-05-02&lt;/h2&gt;
&lt;ul&gt;
@ -812,12 +745,10 @@
&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>April, 2020</title>
<link>https://alanorth.github.io/cgspace-notes/2020-04/</link>
<pubDate>Thu, 02 Apr 2020 10:53:24 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2020-04/</guid>
<description>&lt;h2 id=&#34;2020-04-02&#34;&gt;2020-04-02&lt;/h2&gt;
&lt;ul&gt;
@ -836,12 +767,10 @@
&lt;li&gt;On the same note, the &lt;a href=&#34;https://hdl.handle.net/10568/106573&#34;&gt;one item&lt;/a&gt; Abenet pointed out last week now has a donut with score of 104 after I &lt;a href=&#34;https://twitter.com/mralanorth/status/1243163710241345536&#34;&gt;tweeted it&lt;/a&gt; last week&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>March, 2020</title>
<link>https://alanorth.github.io/cgspace-notes/2020-03/</link>
<pubDate>Mon, 02 Mar 2020 12:31:30 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2020-03/</guid>
<description>&lt;h2 id=&#34;2020-03-02&#34;&gt;2020-03-02&lt;/h2&gt;
&lt;ul&gt;
@ -857,12 +786,10 @@
&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>February, 2020</title>
<link>https://alanorth.github.io/cgspace-notes/2020-02/</link>
<pubDate>Sun, 02 Feb 2020 11:56:30 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2020-02/</guid>
<description>&lt;h2 id=&#34;2020-02-02&#34;&gt;2020-02-02&lt;/h2&gt;
&lt;ul&gt;
@ -876,12 +803,10 @@
&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>January, 2020</title>
<link>https://alanorth.github.io/cgspace-notes/2020-01/</link>
<pubDate>Mon, 06 Jan 2020 10:48:30 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2020-01/</guid>
<description>&lt;h2 id=&#34;2020-01-06&#34;&gt;2020-01-06&lt;/h2&gt;
&lt;ul&gt;
@ -904,12 +829,10 @@
&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>December, 2019</title>
<link>https://alanorth.github.io/cgspace-notes/2019-12/</link>
<pubDate>Sun, 01 Dec 2019 11:22:30 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2019-12/</guid>
<description>&lt;h2 id=&#34;2019-12-01&#34;&gt;2019-12-01&lt;/h2&gt;
&lt;ul&gt;
@ -927,12 +850,10 @@
# reboot
&lt;/code&gt;&lt;/pre&gt;</description>
</item>
<item>
<title>November, 2019</title>
<link>https://alanorth.github.io/cgspace-notes/2019-11/</link>
<pubDate>Mon, 04 Nov 2019 12:20:30 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2019-11/</guid>
<description>&lt;h2 id=&#34;2019-11-04&#34;&gt;2019-11-04&lt;/h2&gt;
&lt;ul&gt;
@ -956,21 +877,17 @@
106781
&lt;/code&gt;&lt;/pre&gt;</description>
</item>
<item>
<title>October, 2019</title>
<link>https://alanorth.github.io/cgspace-notes/2019-10/</link>
<pubDate>Tue, 01 Oct 2019 13:20:51 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2019-10/</guid>
<description>2019-10-01 Udana from IWMI asked me for a CSV export of their community on CGSpace I exported it, but a quick run through the csv-metadata-quality tool shows that there are some low-hanging fruits we can fix before I send him the data I will limit the scope to the titles, regions, subregions, and river basins for now to manually fix some non-breaking spaces (U+00A0) there that would otherwise be removed by the csv-metadata-quality script&amp;rsquo;s &amp;ldquo;unneccesary Unicode&amp;rdquo; fix: $ csvcut -c &amp;#39;id,dc.</description>
</item>
<item>
<title>September, 2019</title>
<link>https://alanorth.github.io/cgspace-notes/2019-09/</link>
<pubDate>Sun, 01 Sep 2019 10:17:51 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2019-09/</guid>
<description>&lt;h2 id=&#34;2019-09-01&#34;&gt;2019-09-01&lt;/h2&gt;
&lt;ul&gt;
@ -1001,12 +918,10 @@
9124 45.5.186.2
&lt;/code&gt;&lt;/pre&gt;</description>
</item>
<item>
<title>August, 2019</title>
<link>https://alanorth.github.io/cgspace-notes/2019-08/</link>
<pubDate>Sat, 03 Aug 2019 12:39:51 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2019-08/</guid>
<description>&lt;h2 id=&#34;2019-08-03&#34;&gt;2019-08-03&lt;/h2&gt;
&lt;ul&gt;
@ -1024,12 +939,10 @@
&lt;li&gt;Run system updates on DSpace Test (linode19) and reboot it&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>July, 2019</title>
<link>https://alanorth.github.io/cgspace-notes/2019-07/</link>
<pubDate>Mon, 01 Jul 2019 12:13:51 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2019-07/</guid>
<description>&lt;h2 id=&#34;2019-07-01&#34;&gt;2019-07-01&lt;/h2&gt;
&lt;ul&gt;
@ -1043,12 +956,10 @@
&lt;li&gt;Abenet had another similar issue a few days ago when trying to find the stats for 2018 in the RTB community&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>June, 2019</title>
<link>https://alanorth.github.io/cgspace-notes/2019-06/</link>
<pubDate>Sun, 02 Jun 2019 10:57:51 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2019-06/</guid>
<description>&lt;h2 id=&#34;2019-06-02&#34;&gt;2019-06-02&lt;/h2&gt;
&lt;ul&gt;
@ -1060,12 +971,10 @@
&lt;li&gt;Skype with Marie-Angélique and Abenet about &lt;a href=&#34;https://agriculturalsemantics.github.io/cg-core/cgcore.html&#34;&gt;CG Core v2&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>May, 2019</title>
<link>https://alanorth.github.io/cgspace-notes/2019-05/</link>
<pubDate>Wed, 01 May 2019 07:37:43 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2019-05/</guid>
<description>&lt;h2 id=&#34;2019-05-01&#34;&gt;2019-05-01&lt;/h2&gt;
&lt;ul&gt;
@ -1084,12 +993,10 @@ DELETE 1
&lt;li&gt;But after this I tried to delete the item from the XMLUI and it is &lt;em&gt;still&lt;/em&gt; present&amp;hellip;&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>April, 2019</title>
<link>https://alanorth.github.io/cgspace-notes/2019-04/</link>
<pubDate>Mon, 01 Apr 2019 09:00:43 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2019-04/</guid>
<description>&lt;h2 id=&#34;2019-04-01&#34;&gt;2019-04-01&lt;/h2&gt;
&lt;ul&gt;
@ -1116,12 +1023,10 @@ $ ./delete-metadata-values.py -i /tmp/2019-02-21-delete-2-countries.csv -db dspa
$ ./delete-metadata-values.py -i /tmp/2019-02-21-delete-1-region.csv -db dspace -u dspace -p &amp;#39;fuuu&amp;#39; -m 231 -f cg.coverage.region -d
&lt;/code&gt;&lt;/pre&gt;</description>
</item>
<item>
<title>March, 2019</title>
<link>https://alanorth.github.io/cgspace-notes/2019-03/</link>
<pubDate>Fri, 01 Mar 2019 12:16:30 +0100</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2019-03/</guid>
<description>&lt;h2 id=&#34;2019-03-01&#34;&gt;2019-03-01&lt;/h2&gt;
&lt;ul&gt;
@ -1139,12 +1044,10 @@ $ ./delete-metadata-values.py -i /tmp/2019-02-21-delete-1-region.csv -db dspace
&lt;li&gt;I think I will need to ask Udana to re-copy and paste the abstracts with more care using Google Docs&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>February, 2019</title>
<link>https://alanorth.github.io/cgspace-notes/2019-02/</link>
<pubDate>Fri, 01 Feb 2019 21:37:30 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2019-02/</guid>
<description>&lt;h2 id=&#34;2019-02-01&#34;&gt;2019-02-01&lt;/h2&gt;
&lt;ul&gt;
@ -1175,12 +1078,10 @@ user 0m22.203s
sys 0m1.979s
&lt;/code&gt;&lt;/pre&gt;</description>
</item>
<item>
<title>January, 2019</title>
<link>https://alanorth.github.io/cgspace-notes/2019-01/</link>
<pubDate>Wed, 02 Jan 2019 09:48:30 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2019-01/</guid>
<description>&lt;h2 id=&#34;2019-01-02&#34;&gt;2019-01-02&lt;/h2&gt;
&lt;ul&gt;
@ -1200,12 +1101,10 @@ sys 0m1.979s
903 54.70.40.11
&lt;/code&gt;&lt;/pre&gt;</description>
</item>
<item>
<title>December, 2018</title>
<link>https://alanorth.github.io/cgspace-notes/2018-12/</link>
<pubDate>Sun, 02 Dec 2018 02:09:30 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2018-12/</guid>
<description>&lt;h2 id=&#34;2018-12-01&#34;&gt;2018-12-01&lt;/h2&gt;
&lt;ul&gt;
@ -1218,12 +1117,10 @@ sys 0m1.979s
&lt;li&gt;I noticed that there is another issue with PDF thumbnails on CGSpace, and I see there was another &lt;a href=&#34;https://usn.ubuntu.com/3831-1/&#34;&gt;Ghostscript vulnerability last week&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>November, 2018</title>
<link>https://alanorth.github.io/cgspace-notes/2018-11/</link>
<pubDate>Thu, 01 Nov 2018 16:41:30 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2018-11/</guid>
<description>&lt;h2 id=&#34;2018-11-01&#34;&gt;2018-11-01&lt;/h2&gt;
&lt;ul&gt;
@ -1236,12 +1133,10 @@ sys 0m1.979s
&lt;li&gt;Today these are the top 10 IPs:&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>October, 2018</title>
<link>https://alanorth.github.io/cgspace-notes/2018-10/</link>
<pubDate>Mon, 01 Oct 2018 22:31:54 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2018-10/</guid>
<description>&lt;h2 id=&#34;2018-10-01&#34;&gt;2018-10-01&lt;/h2&gt;
&lt;ul&gt;
@ -1249,12 +1144,10 @@ sys 0m1.979s
&lt;li&gt;I created a GitHub issue to track this &lt;a href=&#34;https://github.com/ilri/DSpace/issues/389&#34;&gt;#389&lt;/a&gt;, because I&amp;rsquo;m super busy in Nairobi right now&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>September, 2018</title>
<link>https://alanorth.github.io/cgspace-notes/2018-09/</link>
<pubDate>Sun, 02 Sep 2018 09:55:54 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2018-09/</guid>
<description>&lt;h2 id=&#34;2018-09-02&#34;&gt;2018-09-02&lt;/h2&gt;
&lt;ul&gt;
@ -1264,12 +1157,10 @@ sys 0m1.979s
&lt;li&gt;I&amp;rsquo;m testing the new DSpace 5.8 branch in my Ubuntu 18.04 environment and I&amp;rsquo;m getting those autowire errors in Tomcat 8.5.30 again:&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>August, 2018</title>
<link>https://alanorth.github.io/cgspace-notes/2018-08/</link>
<pubDate>Wed, 01 Aug 2018 11:52:54 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2018-08/</guid>
<description>&lt;h2 id=&#34;2018-08-01&#34;&gt;2018-08-01&lt;/h2&gt;
&lt;ul&gt;
@ -1287,12 +1178,10 @@ sys 0m1.979s
&lt;li&gt;I ran all system updates on DSpace Test and rebooted it&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>July, 2018</title>
<link>https://alanorth.github.io/cgspace-notes/2018-07/</link>
<pubDate>Sun, 01 Jul 2018 12:56:54 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2018-07/</guid>
<description>&lt;h2 id=&#34;2018-07-01&#34;&gt;2018-07-01&lt;/h2&gt;
&lt;ul&gt;
@ -1305,12 +1194,10 @@ sys 0m1.979s
&lt;pre tabindex=&#34;0&#34;&gt;&lt;code&gt;There is insufficient memory for the Java Runtime Environment to continue.
&lt;/code&gt;&lt;/pre&gt;</description>
</item>
<item>
<title>June, 2018</title>
<link>https://alanorth.github.io/cgspace-notes/2018-06/</link>
<pubDate>Mon, 04 Jun 2018 19:49:54 -0700</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2018-06/</guid>
<description>&lt;h2 id=&#34;2018-06-04&#34;&gt;2018-06-04&lt;/h2&gt;
&lt;ul&gt;
@ -1334,12 +1221,10 @@ user 8m5.056s
sys 2m7.289s
&lt;/code&gt;&lt;/pre&gt;</description>
</item>
<item>
<title>May, 2018</title>
<link>https://alanorth.github.io/cgspace-notes/2018-05/</link>
<pubDate>Tue, 01 May 2018 16:43:54 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2018-05/</guid>
<description>&lt;h2 id=&#34;2018-05-01&#34;&gt;2018-05-01&lt;/h2&gt;
&lt;ul&gt;
@ -1353,12 +1238,10 @@ sys 2m7.289s
&lt;li&gt;Also, I switched it to use OpenJDK instead of Oracle Java, as well as re-worked the &lt;a href=&#34;https://github.com/ilri/rmg-ansible-public&#34;&gt;Ansible infrastructure scripts&lt;/a&gt; to support hosts choosing which distribution they want to use&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>April, 2018</title>
<link>https://alanorth.github.io/cgspace-notes/2018-04/</link>
<pubDate>Sun, 01 Apr 2018 16:13:54 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2018-04/</guid>
<description>&lt;h2 id=&#34;2018-04-01&#34;&gt;2018-04-01&lt;/h2&gt;
&lt;ul&gt;
@ -1366,24 +1249,20 @@ sys 2m7.289s
&lt;li&gt;Catalina logs at least show some memory errors yesterday:&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>March, 2018</title>
<link>https://alanorth.github.io/cgspace-notes/2018-03/</link>
<pubDate>Fri, 02 Mar 2018 16:07:54 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2018-03/</guid>
<description>&lt;h2 id=&#34;2018-03-02&#34;&gt;2018-03-02&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Export a CSV of the IITA community metadata for Martin Mueller&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>February, 2018</title>
<link>https://alanorth.github.io/cgspace-notes/2018-02/</link>
<pubDate>Thu, 01 Feb 2018 16:28:54 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2018-02/</guid>
<description>&lt;h2 id=&#34;2018-02-01&#34;&gt;2018-02-01&lt;/h2&gt;
&lt;ul&gt;
@ -1393,12 +1272,10 @@ sys 2m7.289s
&lt;li&gt;I copied the logic in the &lt;code&gt;jmx_tomcat_dbpools&lt;/code&gt; provided by Ubuntu&amp;rsquo;s &lt;code&gt;munin-plugins-java&lt;/code&gt; package and used the stuff I discovered about JMX &lt;a href=&#34;https://alanorth.github.io/cgspace-notes/cgspace-notes/2018-01/&#34;&gt;in 2018-01&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>January, 2018</title>
<link>https://alanorth.github.io/cgspace-notes/2018-01/</link>
<pubDate>Tue, 02 Jan 2018 08:35:54 -0800</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2018-01/</guid>
<description>&lt;h2 id=&#34;2018-01-02&#34;&gt;2018-01-02&lt;/h2&gt;
&lt;ul&gt;
@ -1468,12 +1345,10 @@ dspace.log.2018-01-02:34
&lt;li&gt;Danny wrote to ask for help renewing the wildcard ilri.org certificate and I advised that we should probably use Let&amp;rsquo;s Encrypt if it&amp;rsquo;s just a handful of domains&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>December, 2017</title>
<link>https://alanorth.github.io/cgspace-notes/2017-12/</link>
<pubDate>Fri, 01 Dec 2017 13:53:54 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2017-12/</guid>
<description>&lt;h2 id=&#34;2017-12-01&#34;&gt;2017-12-01&lt;/h2&gt;
&lt;ul&gt;
@ -1483,12 +1358,10 @@ dspace.log.2018-01-02:34
&lt;li&gt;The list of connections to XMLUI and REST API for today:&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>November, 2017</title>
<link>https://alanorth.github.io/cgspace-notes/2017-11/</link>
<pubDate>Thu, 02 Nov 2017 09:37:54 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2017-11/</guid>
<description>&lt;h2 id=&#34;2017-11-01&#34;&gt;2017-11-01&lt;/h2&gt;
&lt;ul&gt;
@ -1507,12 +1380,10 @@ dspace.log.2018-01-02:34
COPY 54701
&lt;/code&gt;&lt;/pre&gt;</description>
</item>
<item>
<title>October, 2017</title>
<link>https://alanorth.github.io/cgspace-notes/2017-10/</link>
<pubDate>Sun, 01 Oct 2017 08:07:54 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2017-10/</guid>
<description>&lt;h2 id=&#34;2017-10-01&#34;&gt;2017-10-01&lt;/h2&gt;
&lt;ul&gt;
@ -1524,15 +1395,12 @@ COPY 54701
&lt;li&gt;Add Katherine Lutz to the groups for content submission and edit steps of the CGIAR System collections&lt;/li&gt;
&lt;/ul&gt;</description>
</item>
<item>
<title>CGIAR Library Migration</title>
<link>https://alanorth.github.io/cgspace-notes/cgiar-library-migration/</link>
<pubDate>Mon, 18 Sep 2017 16:38:35 +0300</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/cgiar-library-migration/</guid>
<description>&lt;p&gt;Rough notes for importing the CGIAR Library content. It was decided that this content would go to a new top-level community called &lt;em&gt;CGIAR System Organization&lt;/em&gt;.&lt;/p&gt;</description>
</item>
</channel>
</rss>

View File

@ -10,14 +10,14 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2023-10-24T23:26:01+03:00" />
<meta property="og:updated_time" content="2023-11-02T20:58:43+03:00" />
<meta name="twitter:card" content="summary"/>
<meta name="twitter:title" content="Notes"/>
<meta name="twitter:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."/>
<meta name="generator" content="Hugo 0.119.0">
<meta name="generator" content="Hugo 0.120.3">
@ -81,6 +81,32 @@
<article class="blog-post">
<header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2023-01/">January, 2023</a></h2>
<p class="blog-post-meta"><time datetime="2023-01-01T08:44:36+03:00">Sun Jan 01, 2023</time> by Alan Orth in
<span class="fas fa-folder" aria-hidden="true"></span>&nbsp;<a href="/categories/notes/" rel="category tag">Notes</a>
</p>
</header>
<h2 id="2023-01-01">2023-01-01</h2>
<ul>
<li>Apply some more ORCID identifiers to items on CGSpace using my <code>2022-09-22-add-orcids.csv</code> file
<ul>
<li>I want to update all ORCID names and refresh them in the database</li>
<li>I see we have some new ones that aren&rsquo;t in our list if I combine with this file:</li>
</ul>
</li>
</ul>
<a href='https://alanorth.github.io/cgspace-notes/2023-01/'>Read more →</a>
</article>
<article class="blog-post">
<header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2022-12/">December, 2022</a></h2>
@ -337,32 +363,6 @@
<article class="blog-post">
<header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2022-03/">March, 2022</a></h2>
<p class="blog-post-meta"><time datetime="2022-03-01T16:46:54+03:00">Tue Mar 01, 2022</time> by Alan Orth in
<span class="fas fa-folder" aria-hidden="true"></span>&nbsp;<a href="/categories/notes/" rel="category tag">Notes</a>
</p>
</header>
<h2 id="2022-03-01">2022-03-01</h2>
<ul>
<li>Send Gaia the last batch of potential duplicates for items 701 to 980:</li>
</ul>
<div class="highlight"><pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;"><code class="language-console" data-lang="console"><span style="display:flex;"><span>$ csvcut -c id,dc.title,dcterms.issued,dcterms.type ~/Downloads/2022-03-01-CGSpace-TAC-ICW-batch4-701-980.csv &gt; /tmp/tac4.csv
</span></span><span style="display:flex;"><span>$ ./ilri/check-duplicates.py -i /tmp/tac4.csv -db dspace -u dspace -p <span style="color:#e6db74">&#39;fuuu&#39;</span> -o /tmp/2022-03-01-tac-batch4-701-980.csv
</span></span><span style="display:flex;"><span>$ csvcut -c id,filename ~/Downloads/2022-03-01-CGSpace-TAC-ICW-batch4-701-980.csv &gt; /tmp/tac4-filenames.csv
</span></span><span style="display:flex;"><span>$ csvjoin -c id /tmp/2022-03-01-tac-batch4-701-980.csv /tmp/tac4-filenames.csv &gt; /tmp/2022-03-01-tac-batch4-701-980-filenames.csv
</span></span></code></pre></div>
<a href='https://alanorth.github.io/cgspace-notes/2022-03/'>Read more →</a>
</article>
<nav class="blog-pagination">
<a class="btn btn-outline-primary" href="/cgspace-notes/categories/notes/" rel="prev" role="button">Previous page</a>
@ -387,6 +387,8 @@
<ol class="list-unstyled">
<li><a href="/cgspace-notes/2023-11/">November, 2023</a></li>
<li><a href="/cgspace-notes/2023-10/">October, 2023</a></li>
<li><a href="/cgspace-notes/2023-09/">September, 2023</a></li>
@ -395,8 +397,6 @@
<li><a href="/cgspace-notes/2023-07/">July, 2023</a></li>
<li><a href="/cgspace-notes/2023-06/">June, 2023</a></li>
</ol>
</section>

View File

@ -10,14 +10,14 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2023-10-24T23:26:01+03:00" />
<meta property="og:updated_time" content="2023-11-02T20:58:43+03:00" />
<meta name="twitter:card" content="summary"/>
<meta name="twitter:title" content="Notes"/>
<meta name="twitter:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."/>
<meta name="generator" content="Hugo 0.119.0">
<meta name="generator" content="Hugo 0.120.3">
@ -81,6 +81,32 @@
<article class="blog-post">
<header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2022-03/">March, 2022</a></h2>
<p class="blog-post-meta"><time datetime="2022-03-01T16:46:54+03:00">Tue Mar 01, 2022</time> by Alan Orth in
<span class="fas fa-folder" aria-hidden="true"></span>&nbsp;<a href="/categories/notes/" rel="category tag">Notes</a>
</p>
</header>
<h2 id="2022-03-01">2022-03-01</h2>
<ul>
<li>Send Gaia the last batch of potential duplicates for items 701 to 980:</li>
</ul>
<div class="highlight"><pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;"><code class="language-console" data-lang="console"><span style="display:flex;"><span>$ csvcut -c id,dc.title,dcterms.issued,dcterms.type ~/Downloads/2022-03-01-CGSpace-TAC-ICW-batch4-701-980.csv &gt; /tmp/tac4.csv
</span></span><span style="display:flex;"><span>$ ./ilri/check-duplicates.py -i /tmp/tac4.csv -db dspace -u dspace -p <span style="color:#e6db74">&#39;fuuu&#39;</span> -o /tmp/2022-03-01-tac-batch4-701-980.csv
</span></span><span style="display:flex;"><span>$ csvcut -c id,filename ~/Downloads/2022-03-01-CGSpace-TAC-ICW-batch4-701-980.csv &gt; /tmp/tac4-filenames.csv
</span></span><span style="display:flex;"><span>$ csvjoin -c id /tmp/2022-03-01-tac-batch4-701-980.csv /tmp/tac4-filenames.csv &gt; /tmp/2022-03-01-tac-batch4-701-980-filenames.csv
</span></span></code></pre></div>
<a href='https://alanorth.github.io/cgspace-notes/2022-03/'>Read more →</a>
</article>
<article class="blog-post">
<header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2022-02/">February, 2022</a></h2>
@ -324,33 +350,6 @@
<article class="blog-post">
<header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2021-05/">May, 2021</a></h2>
<p class="blog-post-meta"><time datetime="2021-05-02T09:50:54+03:00">Sun May 02, 2021</time> by Alan Orth in
<span class="fas fa-folder" aria-hidden="true"></span>&nbsp;<a href="/categories/notes/" rel="category tag">Notes</a>
</p>
</header>
<h2 id="2021-05-01">2021-05-01</h2>
<ul>
<li>I looked at the top user agents and IPs in the Solr statistics for last month and I see these user agents:
<ul>
<li>&ldquo;RI/1.0&rdquo;, 1337</li>
<li>&ldquo;Microsoft Office Word 2014&rdquo;, 941</li>
</ul>
</li>
<li>I will add the RI/1.0 pattern to our DSpace agents overload and purge them from Solr (we had previously seen this agent with 9,000 hits or so in 2020-09), but I think I will leave the Microsoft Word one&hellip; as that&rsquo;s an actual user&hellip;</li>
</ul>
<a href='https://alanorth.github.io/cgspace-notes/2021-05/'>Read more →</a>
</article>
<nav class="blog-pagination">
<a class="btn btn-outline-primary" href="/cgspace-notes/categories/notes/page/2/" rel="prev" role="button">Previous page</a>
@ -375,6 +374,8 @@
<ol class="list-unstyled">
<li><a href="/cgspace-notes/2023-11/">November, 2023</a></li>
<li><a href="/cgspace-notes/2023-10/">October, 2023</a></li>
<li><a href="/cgspace-notes/2023-09/">September, 2023</a></li>
@ -383,8 +384,6 @@
<li><a href="/cgspace-notes/2023-07/">July, 2023</a></li>
<li><a href="/cgspace-notes/2023-06/">June, 2023</a></li>
</ol>
</section>

View File

@ -10,14 +10,14 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2023-10-24T23:26:01+03:00" />
<meta property="og:updated_time" content="2023-11-02T20:58:43+03:00" />
<meta name="twitter:card" content="summary"/>
<meta name="twitter:title" content="Notes"/>
<meta name="twitter:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."/>
<meta name="generator" content="Hugo 0.119.0">
<meta name="generator" content="Hugo 0.120.3">
@ -81,6 +81,33 @@
<article class="blog-post">
<header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2021-05/">May, 2021</a></h2>
<p class="blog-post-meta"><time datetime="2021-05-02T09:50:54+03:00">Sun May 02, 2021</time> by Alan Orth in
<span class="fas fa-folder" aria-hidden="true"></span>&nbsp;<a href="/categories/notes/" rel="category tag">Notes</a>
</p>
</header>
<h2 id="2021-05-01">2021-05-01</h2>
<ul>
<li>I looked at the top user agents and IPs in the Solr statistics for last month and I see these user agents:
<ul>
<li>&ldquo;RI/1.0&rdquo;, 1337</li>
<li>&ldquo;Microsoft Office Word 2014&rdquo;, 941</li>
</ul>
</li>
<li>I will add the RI/1.0 pattern to our DSpace agents overload and purge them from Solr (we had previously seen this agent with 9,000 hits or so in 2020-09), but I think I will leave the Microsoft Word one&hellip; as that&rsquo;s an actual user&hellip;</li>
</ul>
<a href='https://alanorth.github.io/cgspace-notes/2021-05/'>Read more →</a>
</article>
<article class="blog-post">
<header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2021-04/">April, 2021</a></h2>
@ -334,39 +361,6 @@
<article class="blog-post">
<header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2020-09/">September, 2020</a></h2>
<p class="blog-post-meta"><time datetime="2020-09-02T15:35:54+03:00">Wed Sep 02, 2020</time> by Alan Orth in
<span class="fas fa-folder" aria-hidden="true"></span>&nbsp;<a href="/categories/notes/" rel="category tag">Notes</a>
</p>
</header>
<h2 id="2020-09-02">2020-09-02</h2>
<ul>
<li>Replace Marissa van Epp for Rhys Bucknall in the CCAFS groups on CGSpace because Marissa no longer works at CCAFS</li>
<li>The AReS Explorer hasn&rsquo;t updated its index since 2020-08-22 when I last forced it
<ul>
<li>I restarted it again now and told Moayad that the automatic indexing isn&rsquo;t working</li>
</ul>
</li>
<li>Add <code>Alliance of Bioversity International and CIAT</code> to affiliations on CGSpace</li>
<li>Abenet told me that the general search text on AReS doesn&rsquo;t get reset when you use the &ldquo;Reset Filters&rdquo; button
<ul>
<li>I filed a bug on OpenRXV: <a href="https://github.com/ilri/OpenRXV/issues/39">https://github.com/ilri/OpenRXV/issues/39</a></li>
</ul>
</li>
<li>I filed an issue on OpenRXV to make some minor edits to the admin UI: <a href="https://github.com/ilri/OpenRXV/issues/40">https://github.com/ilri/OpenRXV/issues/40</a></li>
</ul>
<a href='https://alanorth.github.io/cgspace-notes/2020-09/'>Read more →</a>
</article>
<nav class="blog-pagination">
<a class="btn btn-outline-primary" href="/cgspace-notes/categories/notes/page/3/" rel="prev" role="button">Previous page</a>
@ -391,6 +385,8 @@
<ol class="list-unstyled">
<li><a href="/cgspace-notes/2023-11/">November, 2023</a></li>
<li><a href="/cgspace-notes/2023-10/">October, 2023</a></li>
<li><a href="/cgspace-notes/2023-09/">September, 2023</a></li>
@ -399,8 +395,6 @@
<li><a href="/cgspace-notes/2023-07/">July, 2023</a></li>
<li><a href="/cgspace-notes/2023-06/">June, 2023</a></li>
</ol>
</section>

View File

@ -10,14 +10,14 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2023-10-24T23:26:01+03:00" />
<meta property="og:updated_time" content="2023-11-02T20:58:43+03:00" />
<meta name="twitter:card" content="summary"/>
<meta name="twitter:title" content="Notes"/>
<meta name="twitter:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."/>
<meta name="generator" content="Hugo 0.119.0">
<meta name="generator" content="Hugo 0.120.3">
@ -81,6 +81,39 @@
<article class="blog-post">
<header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2020-09/">September, 2020</a></h2>
<p class="blog-post-meta"><time datetime="2020-09-02T15:35:54+03:00">Wed Sep 02, 2020</time> by Alan Orth in
<span class="fas fa-folder" aria-hidden="true"></span>&nbsp;<a href="/categories/notes/" rel="category tag">Notes</a>
</p>
</header>
<h2 id="2020-09-02">2020-09-02</h2>
<ul>
<li>Replace Marissa van Epp for Rhys Bucknall in the CCAFS groups on CGSpace because Marissa no longer works at CCAFS</li>
<li>The AReS Explorer hasn&rsquo;t updated its index since 2020-08-22 when I last forced it
<ul>
<li>I restarted it again now and told Moayad that the automatic indexing isn&rsquo;t working</li>
</ul>
</li>
<li>Add <code>Alliance of Bioversity International and CIAT</code> to affiliations on CGSpace</li>
<li>Abenet told me that the general search text on AReS doesn&rsquo;t get reset when you use the &ldquo;Reset Filters&rdquo; button
<ul>
<li>I filed a bug on OpenRXV: <a href="https://github.com/ilri/OpenRXV/issues/39">https://github.com/ilri/OpenRXV/issues/39</a></li>
</ul>
</li>
<li>I filed an issue on OpenRXV to make some minor edits to the admin UI: <a href="https://github.com/ilri/OpenRXV/issues/40">https://github.com/ilri/OpenRXV/issues/40</a></li>
</ul>
<a href='https://alanorth.github.io/cgspace-notes/2020-09/'>Read more →</a>
</article>
<article class="blog-post">
<header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2020-08/">August, 2020</a></h2>
@ -348,44 +381,6 @@
<article class="blog-post">
<header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2019-11/">November, 2019</a></h2>
<p class="blog-post-meta"><time datetime="2019-11-04T12:20:30+02:00">Mon Nov 04, 2019</time> by Alan Orth in
<span class="fas fa-folder" aria-hidden="true"></span>&nbsp;<a href="/categories/notes/" rel="category tag">Notes</a>
</p>
</header>
<h2 id="2019-11-04">2019-11-04</h2>
<ul>
<li>Peter noticed that there were 5.2 million hits on CGSpace in 2019-10 according to the Atmire usage statistics
<ul>
<li>I looked in the nginx logs and see 4.6 million in the access logs, and 1.2 million in the API logs:</li>
</ul>
</li>
</ul>
<pre tabindex="0"><code># zcat --force /var/log/nginx/*access.log.*.gz | grep -cE &#34;[0-9]{1,2}/Oct/2019&#34;
4671942
# zcat --force /var/log/nginx/{rest,oai,statistics}.log.*.gz | grep -cE &#34;[0-9]{1,2}/Oct/2019&#34;
1277694
</code></pre><ul>
<li>So 4.6 million from XMLUI and another 1.2 million from API requests</li>
<li>Let&rsquo;s see how many of the REST API requests were for bitstreams (because they are counted in Solr stats):</li>
</ul>
<pre tabindex="0"><code># zcat --force /var/log/nginx/rest.log.*.gz | grep -c -E &#34;[0-9]{1,2}/Oct/2019&#34;
1183456
# zcat --force /var/log/nginx/rest.log.*.gz | grep -E &#34;[0-9]{1,2}/Oct/2019&#34; | grep -c -E &#34;/rest/bitstreams&#34;
106781
</code></pre>
<a href='https://alanorth.github.io/cgspace-notes/2019-11/'>Read more →</a>
</article>
<nav class="blog-pagination">
<a class="btn btn-outline-primary" href="/cgspace-notes/categories/notes/page/4/" rel="prev" role="button">Previous page</a>
@ -410,6 +405,8 @@
<ol class="list-unstyled">
<li><a href="/cgspace-notes/2023-11/">November, 2023</a></li>
<li><a href="/cgspace-notes/2023-10/">October, 2023</a></li>
<li><a href="/cgspace-notes/2023-09/">September, 2023</a></li>
@ -418,8 +415,6 @@
<li><a href="/cgspace-notes/2023-07/">July, 2023</a></li>
<li><a href="/cgspace-notes/2023-06/">June, 2023</a></li>
</ol>
</section>

View File

@ -10,14 +10,14 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2023-10-24T23:26:01+03:00" />
<meta property="og:updated_time" content="2023-11-02T20:58:43+03:00" />
<meta name="twitter:card" content="summary"/>
<meta name="twitter:title" content="Notes"/>
<meta name="twitter:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."/>
<meta name="generator" content="Hugo 0.119.0">
<meta name="generator" content="Hugo 0.120.3">
@ -81,6 +81,44 @@
<article class="blog-post">
<header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2019-11/">November, 2019</a></h2>
<p class="blog-post-meta"><time datetime="2019-11-04T12:20:30+02:00">Mon Nov 04, 2019</time> by Alan Orth in
<span class="fas fa-folder" aria-hidden="true"></span>&nbsp;<a href="/categories/notes/" rel="category tag">Notes</a>
</p>
</header>
<h2 id="2019-11-04">2019-11-04</h2>
<ul>
<li>Peter noticed that there were 5.2 million hits on CGSpace in 2019-10 according to the Atmire usage statistics
<ul>
<li>I looked in the nginx logs and see 4.6 million in the access logs, and 1.2 million in the API logs:</li>
</ul>
</li>
</ul>
<pre tabindex="0"><code># zcat --force /var/log/nginx/*access.log.*.gz | grep -cE &#34;[0-9]{1,2}/Oct/2019&#34;
4671942
# zcat --force /var/log/nginx/{rest,oai,statistics}.log.*.gz | grep -cE &#34;[0-9]{1,2}/Oct/2019&#34;
1277694
</code></pre><ul>
<li>So 4.6 million from XMLUI and another 1.2 million from API requests</li>
<li>Let&rsquo;s see how many of the REST API requests were for bitstreams (because they are counted in Solr stats):</li>
</ul>
<pre tabindex="0"><code># zcat --force /var/log/nginx/rest.log.*.gz | grep -c -E &#34;[0-9]{1,2}/Oct/2019&#34;
1183456
# zcat --force /var/log/nginx/rest.log.*.gz | grep -E &#34;[0-9]{1,2}/Oct/2019&#34; | grep -c -E &#34;/rest/bitstreams&#34;
106781
</code></pre>
<a href='https://alanorth.github.io/cgspace-notes/2019-11/'>Read more →</a>
</article>
<article class="blog-post">
<header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2019-10/">October, 2019</a></h2>
@ -380,40 +418,6 @@ sys 0m1.979s
<article class="blog-post">
<header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2019-01/">January, 2019</a></h2>
<p class="blog-post-meta"><time datetime="2019-01-02T09:48:30+02:00">Wed Jan 02, 2019</time> by Alan Orth in
<span class="fas fa-folder" aria-hidden="true"></span>&nbsp;<a href="/categories/notes/" rel="category tag">Notes</a>
</p>
</header>
<h2 id="2019-01-02">2019-01-02</h2>
<ul>
<li>Linode alerted that CGSpace (linode18) had a higher outbound traffic rate than normal early this morning</li>
<li>I don&rsquo;t see anything interesting in the web server logs around that time though:</li>
</ul>
<pre tabindex="0"><code># zcat --force /var/log/nginx/*.log /var/log/nginx/*.log.1 | grep -E &#34;02/Jan/2019:0(1|2|3)&#34; | awk &#39;{print $1}&#39; | sort | uniq -c | sort -n | tail -n 10
92 40.77.167.4
99 210.7.29.100
120 38.126.157.45
177 35.237.175.180
177 40.77.167.32
216 66.249.75.219
225 18.203.76.93
261 46.101.86.248
357 207.46.13.1
903 54.70.40.11
</code></pre>
<a href='https://alanorth.github.io/cgspace-notes/2019-01/'>Read more →</a>
</article>
<nav class="blog-pagination">
<a class="btn btn-outline-primary" href="/cgspace-notes/categories/notes/page/5/" rel="prev" role="button">Previous page</a>
@ -438,6 +442,8 @@ sys 0m1.979s
<ol class="list-unstyled">
<li><a href="/cgspace-notes/2023-11/">November, 2023</a></li>
<li><a href="/cgspace-notes/2023-10/">October, 2023</a></li>
<li><a href="/cgspace-notes/2023-09/">September, 2023</a></li>
@ -446,8 +452,6 @@ sys 0m1.979s
<li><a href="/cgspace-notes/2023-07/">July, 2023</a></li>
<li><a href="/cgspace-notes/2023-06/">June, 2023</a></li>
</ol>
</section>

View File

@ -10,14 +10,14 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2023-10-24T23:26:01+03:00" />
<meta property="og:updated_time" content="2023-11-02T20:58:43+03:00" />
<meta name="twitter:card" content="summary"/>
<meta name="twitter:title" content="Notes"/>
<meta name="twitter:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."/>
<meta name="generator" content="Hugo 0.119.0">
<meta name="generator" content="Hugo 0.120.3">
@ -81,6 +81,40 @@
<article class="blog-post">
<header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2019-01/">January, 2019</a></h2>
<p class="blog-post-meta"><time datetime="2019-01-02T09:48:30+02:00">Wed Jan 02, 2019</time> by Alan Orth in
<span class="fas fa-folder" aria-hidden="true"></span>&nbsp;<a href="/categories/notes/" rel="category tag">Notes</a>
</p>
</header>
<h2 id="2019-01-02">2019-01-02</h2>
<ul>
<li>Linode alerted that CGSpace (linode18) had a higher outbound traffic rate than normal early this morning</li>
<li>I don&rsquo;t see anything interesting in the web server logs around that time though:</li>
</ul>
<pre tabindex="0"><code># zcat --force /var/log/nginx/*.log /var/log/nginx/*.log.1 | grep -E &#34;02/Jan/2019:0(1|2|3)&#34; | awk &#39;{print $1}&#39; | sort | uniq -c | sort -n | tail -n 10
92 40.77.167.4
99 210.7.29.100
120 38.126.157.45
177 35.237.175.180
177 40.77.167.32
216 66.249.75.219
225 18.203.76.93
261 46.101.86.248
357 207.46.13.1
903 54.70.40.11
</code></pre>
<a href='https://alanorth.github.io/cgspace-notes/2019-01/'>Read more →</a>
</article>
<article class="blog-post">
<header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2018-12/">December, 2018</a></h2>
@ -327,27 +361,6 @@ sys 2m7.289s
<article class="blog-post">
<header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2018-03/">March, 2018</a></h2>
<p class="blog-post-meta"><time datetime="2018-03-02T16:07:54+02:00">Fri Mar 02, 2018</time> by Alan Orth in
<span class="fas fa-folder" aria-hidden="true"></span>&nbsp;<a href="/categories/notes/" rel="category tag">Notes</a>
</p>
</header>
<h2 id="2018-03-02">2018-03-02</h2>
<ul>
<li>Export a CSV of the IITA community metadata for Martin Mueller</li>
</ul>
<a href='https://alanorth.github.io/cgspace-notes/2018-03/'>Read more →</a>
</article>
<nav class="blog-pagination">
<a class="btn btn-outline-primary" href="/cgspace-notes/categories/notes/page/6/" rel="prev" role="button">Previous page</a>
@ -372,6 +385,8 @@ sys 2m7.289s
<ol class="list-unstyled">
<li><a href="/cgspace-notes/2023-11/">November, 2023</a></li>
<li><a href="/cgspace-notes/2023-10/">October, 2023</a></li>
<li><a href="/cgspace-notes/2023-09/">September, 2023</a></li>
@ -380,8 +395,6 @@ sys 2m7.289s
<li><a href="/cgspace-notes/2023-07/">July, 2023</a></li>
<li><a href="/cgspace-notes/2023-06/">June, 2023</a></li>
</ol>
</section>

View File

@ -10,14 +10,14 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2023-10-24T23:26:01+03:00" />
<meta property="og:updated_time" content="2023-11-02T20:58:43+03:00" />
<meta name="twitter:card" content="summary"/>
<meta name="twitter:title" content="Notes"/>
<meta name="twitter:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."/>
<meta name="generator" content="Hugo 0.119.0">
<meta name="generator" content="Hugo 0.120.3">
@ -81,6 +81,27 @@
<article class="blog-post">
<header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2018-03/">March, 2018</a></h2>
<p class="blog-post-meta"><time datetime="2018-03-02T16:07:54+02:00">Fri Mar 02, 2018</time> by Alan Orth in
<span class="fas fa-folder" aria-hidden="true"></span>&nbsp;<a href="/categories/notes/" rel="category tag">Notes</a>
</p>
</header>
<h2 id="2018-03-02">2018-03-02</h2>
<ul>
<li>Export a CSV of the IITA community metadata for Martin Mueller</li>
</ul>
<a href='https://alanorth.github.io/cgspace-notes/2018-03/'>Read more →</a>
</article>
<article class="blog-post">
<header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2018-02/">February, 2018</a></h2>
@ -315,6 +336,8 @@ COPY 54701
<ol class="list-unstyled">
<li><a href="/cgspace-notes/2023-11/">November, 2023</a></li>
<li><a href="/cgspace-notes/2023-10/">October, 2023</a></li>
<li><a href="/cgspace-notes/2023-09/">September, 2023</a></li>
@ -323,8 +346,6 @@ COPY 54701
<li><a href="/cgspace-notes/2023-07/">July, 2023</a></li>
<li><a href="/cgspace-notes/2023-06/">June, 2023</a></li>
</ol>
</section>