Add notes for 2020-05-29

This commit is contained in:
2020-05-29 10:25:41 +03:00
parent 4d0c002a89
commit 8d01f31ddc
92 changed files with 161 additions and 120 deletions

View File

@ -18,7 +18,7 @@ I see that CGSpace (linode18) is still using PostgreSQL JDBC driver version 42.2
<meta property="og:type" content="article" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2020-05/" />
<meta property="article:published_time" content="2020-05-02T09:52:04+03:00" />
<meta property="article:modified_time" content="2020-05-20T09:44:36+03:00" />
<meta property="article:modified_time" content="2020-05-25T11:52:28+03:00" />
<meta name="twitter:card" content="summary"/>
<meta name="twitter:title" content="May, 2020"/>
@ -31,7 +31,7 @@ I see that CGSpace (linode18) is still using PostgreSQL JDBC driver version 42.2
"/>
<meta name="generator" content="Hugo 0.71.0" />
<meta name="generator" content="Hugo 0.71.1" />
@ -41,9 +41,9 @@ I see that CGSpace (linode18) is still using PostgreSQL JDBC driver version 42.2
"@type": "BlogPosting",
"headline": "May, 2020",
"url": "https://alanorth.github.io/cgspace-notes/2020-05/",
"wordCount": "1288",
"wordCount": "1473",
"datePublished": "2020-05-02T09:52:04+03:00",
"dateModified": "2020-05-20T09:44:36+03:00",
"dateModified": "2020-05-25T11:52:28+03:00",
"author": {
"@type": "Person",
"name": "Alan Orth"
@ -316,6 +316,30 @@ $ ./add-orcid-identifiers-csv.py -i 2020-05-25-add-orcids.csv -db dspace -u dspa
</ul>
</li>
</ul>
<h2 id="2020-05-29">2020-05-29</h2>
<ul>
<li>Linode alerted to say that the CPU load on CGSpace (linode18) was high for a few hours this morning
<ul>
<li>Looking at the nginx logs for this morning with goaccess:</li>
</ul>
</li>
</ul>
<pre><code># cat /var/log/nginx/*.log.1 | grep -E &quot;29/May/2020:(02|03|04|05)&quot; | goaccess --log-format=COMBINED -
</code></pre><ul>
<li>The top is 172.104.229.92, which is the AReS harvester (still not using a user agent, but it&rsquo;s tagged as a bot in the nginx mapping)</li>
<li>Second is 188.134.31.88, which is a Russian host that we also saw in the last few weeks, using a browser user agent and hitting the XMLUI (but it is tagged as a bot in nginx as well)</li>
<li>Another one is 51.158.106.4, which is some Scaleway IP making requests to XMLUI with different browser user agents that I am pretty sure I have seen before but never blocked
<ul>
<li>According to Solr it has made about 800 requests this year, but still&hellip; it&rsquo;s a bot.</li>
</ul>
</li>
<li>One I don&rsquo;t think I&rsquo;ve seen before is 95.217.58.146, which is making requests to XMLUI with a Drupal user agent
<ul>
<li>According to <a href="https://viewdns.info/reverseip/?host=95.217.58.146&amp;t=1">viewdns.info</a> it belongs to <a href="https://landvoc.org/">landvoc.org</a></li>
<li>I should add Drupal to the list of bots&hellip;</li>
</ul>
</li>
</ul>
<!-- raw HTML omitted -->