Add notes for 2022-12-18

This commit is contained in:
2022-12-19 07:03:13 +02:00
parent 54769fcb04
commit 8354acdbdd
29 changed files with 61 additions and 34 deletions

View File

@ -20,7 +20,7 @@ Replace “East Asia” with “Eastern Asia” region on CGSpac
<meta property="og:type" content="article" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2022-12/" />
<meta property="article:published_time" content="2022-12-01T08:52:36+03:00" />
<meta property="article:modified_time" content="2022-12-14T22:14:03+03:00" />
<meta property="article:modified_time" content="2022-12-15T16:41:04+03:00" />
@ -46,9 +46,9 @@ Replace &ldquo;East Asia&rdquo; with &ldquo;Eastern Asia&rdquo; region on CGSpac
"@type": "BlogPosting",
"headline": "December, 2022",
"url": "https://alanorth.github.io/cgspace-notes/2022-12/",
"wordCount": "1507",
"wordCount": "1588",
"datePublished": "2022-12-01T08:52:36+03:00",
"dateModified": "2022-12-14T22:14:03+03:00",
"dateModified": "2022-12-15T16:41:04+03:00",
"author": {
"@type": "Person",
"name": "Alan Orth"
@ -370,6 +370,20 @@ Replace &ldquo;East Asia&rdquo; with &ldquo;Eastern Asia&rdquo; region on CGSpac
<ul>
<li>I did some metadata quality checks on the Initiatives collection, adding some missing regions and removing a few duplicate ones</li>
</ul>
<h2 id="2022-12-18">2022-12-18</h2>
<ul>
<li>Load on the server is a bit high
<ul>
<li>Looking at the nginx logs I see someone from the University of Chicago (128.135.98.29) is using RStudio Desktop to query and scrape CGSpace</li>
</ul>
</li>
</ul>
<pre tabindex="0"><code># grep -c &#39;RStudio Desktop&#39; /var/log/nginx/access.log
5570
</code></pre><ul>
<li>RStudio is already in the ILRI bot overrides for DSpace so it shouldn&rsquo;t be causing any extra hits, but I&rsquo;ll put an HTTP 403 in the nginx config to tell the user to use the REST API</li>
<li>Start a harvest on AReS</li>
</ul>
<!-- raw HTML omitted -->