mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-27 05:49:12 +01:00
Add notes for 2020-05-29
This commit is contained in:
@ -18,7 +18,7 @@ I see that CGSpace (linode18) is still using PostgreSQL JDBC driver version 42.2
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2020-05/" />
|
||||
<meta property="article:published_time" content="2020-05-02T09:52:04+03:00" />
|
||||
<meta property="article:modified_time" content="2020-05-20T09:44:36+03:00" />
|
||||
<meta property="article:modified_time" content="2020-05-25T11:52:28+03:00" />
|
||||
|
||||
<meta name="twitter:card" content="summary"/>
|
||||
<meta name="twitter:title" content="May, 2020"/>
|
||||
@ -31,7 +31,7 @@ I see that CGSpace (linode18) is still using PostgreSQL JDBC driver version 42.2
|
||||
|
||||
|
||||
"/>
|
||||
<meta name="generator" content="Hugo 0.71.0" />
|
||||
<meta name="generator" content="Hugo 0.71.1" />
|
||||
|
||||
|
||||
|
||||
@ -41,9 +41,9 @@ I see that CGSpace (linode18) is still using PostgreSQL JDBC driver version 42.2
|
||||
"@type": "BlogPosting",
|
||||
"headline": "May, 2020",
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2020-05/",
|
||||
"wordCount": "1288",
|
||||
"wordCount": "1473",
|
||||
"datePublished": "2020-05-02T09:52:04+03:00",
|
||||
"dateModified": "2020-05-20T09:44:36+03:00",
|
||||
"dateModified": "2020-05-25T11:52:28+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -316,6 +316,30 @@ $ ./add-orcid-identifiers-csv.py -i 2020-05-25-add-orcids.csv -db dspace -u dspa
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<h2 id="2020-05-29">2020-05-29</h2>
|
||||
<ul>
|
||||
<li>Linode alerted to say that the CPU load on CGSpace (linode18) was high for a few hours this morning
|
||||
<ul>
|
||||
<li>Looking at the nginx logs for this morning with goaccess:</li>
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<pre><code># cat /var/log/nginx/*.log.1 | grep -E "29/May/2020:(02|03|04|05)" | goaccess --log-format=COMBINED -
|
||||
</code></pre><ul>
|
||||
<li>The top is 172.104.229.92, which is the AReS harvester (still not using a user agent, but it’s tagged as a bot in the nginx mapping)</li>
|
||||
<li>Second is 188.134.31.88, which is a Russian host that we also saw in the last few weeks, using a browser user agent and hitting the XMLUI (but it is tagged as a bot in nginx as well)</li>
|
||||
<li>Another one is 51.158.106.4, which is some Scaleway IP making requests to XMLUI with different browser user agents that I am pretty sure I have seen before but never blocked
|
||||
<ul>
|
||||
<li>According to Solr it has made about 800 requests this year, but still… it’s a bot.</li>
|
||||
</ul>
|
||||
</li>
|
||||
<li>One I don’t think I’ve seen before is 95.217.58.146, which is making requests to XMLUI with a Drupal user agent
|
||||
<ul>
|
||||
<li>According to <a href="https://viewdns.info/reverseip/?host=95.217.58.146&t=1">viewdns.info</a> it belongs to <a href="https://landvoc.org/">landvoc.org</a></li>
|
||||
<li>I should add Drupal to the list of bots…</li>
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<!-- raw HTML omitted -->
|
||||
|
||||
|
||||
|
Reference in New Issue
Block a user