Add notes for 2017-12-30

2017-12-31 06:47:41 -08:00
parent 164748b5d2
commit b51bbdb91c
3 changed files with 74 additions and 8 deletions

@ -23,7 +23,7 @@ The list of connections to XMLUI and REST API for today:
<meta property="article:published_time" content="2017-12-01T13:53:54&#43;03:00"/>
<meta property="article:modified_time" content="2017-12-25T22:26:01&#43;02:00"/>
<meta property="article:modified_time" content="2017-12-29T18:45:09&#43;02:00"/>
@ -56,9 +56,9 @@ The list of connections to XMLUI and REST API for today:
"@type": "BlogPosting",
"headline": "December, 2017",
"url": "https://alanorth.github.io/cgspace-notes/2017-12/",
"wordCount": "3897",
"wordCount": "4050",
"datePublished": "2017-12-01T13:53:54&#43;03:00",
"dateModified": "2017-12-25T22:26:01&#43;02:00",
"dateModified": "2017-12-29T18:45:09&#43;02:00",
"author": {
"@type": "Person",
"name": "Alan Orth"
@ -904,6 +904,41 @@ DELETE 20
<li>I need to figure out why we have records with language <code>in</code> because that&rsquo;s not a language!</li>
</ul>
<h2 id="2017-12-30">2017-12-30</h2>
<ul>
<li>Linode alerted that CGSpace was using 259% CPU from 4 to 6 AM</li>
<li>Uptime Robot noticed that the server went down for 1 minute a few hours later, around 9 AM</li>
<li>Here are the XMLUI logs:</li>
</ul>
<pre><code># cat /var/log/nginx/access.log /var/log/nginx/access.log.1 /var/log/nginx/library-access.log /var/log/nginx/library-access.log.1 | grep -E &quot;30/Dec/2017&quot; | awk '{print $1}' | sort -n | uniq -c | sort -h | tail
637 207.46.13.106
641 157.55.39.186
715 68.180.229.254
924 104.196.152.243
1012 66.249.64.95
1060 216.244.66.245
1120 54.175.208.220
1287 66.249.64.93
1586 66.249.64.78
3653 66.249.64.91
</code></pre>
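<ul>
<li>The count above covers the whole day. A minimal sketch for narrowing it to the 4 to 6 AM window of the CPU alert follows; the <code>top_ips</code> helper is hypothetical, and it assumes the nginx timestamps are in the same timezone as the Linode alert and that the client IP is the first field of each line:</li>
</ul>

```shell
# Hypothetical helper (not part of the original notes): list the busiest
# client IPs among the log lines whose timestamp matches a pattern.
#   stdin: nginx access log lines (client IP assumed to be field 1)
#   $1:    extended regex for the timestamp, e.g. '30/Dec/2017:0[45]:'
#          to cover 04:00 through 05:59
#   $2:    how many IPs to show (default: 10)
top_ips() {
    grep -E "$1" \
        | awk '{ count[$1]++ } END { for (ip in count) print count[ip], ip }' \
        | sort -n \
        | tail -n "${2:-10}"
}

# Example, using two of the log paths from the command above:
# cat /var/log/nginx/access.log /var/log/nginx/access.log.1 | top_ips '30/Dec/2017:0[45]:'
```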
<ul>
<li>Looks pretty normal actually, but I don&rsquo;t know who 54.175.208.220 is</li>
<li>They identify as &ldquo;com.plumanalytics&rdquo;, which Google says is associated with Elsevier</li>
<li>They seem to have used only one Tomcat session, which is good, so I guess I don&rsquo;t need to add them to the Tomcat Crawler Session Manager valve:</li>
</ul>
<pre><code>$ grep 54.175.208.220 dspace.log.2017-12-30 | grep -o -E 'session_id=[A-Z0-9]{32}' | sort -n | uniq | wc -l
1
</code></pre>
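<ul>
<li>The same check can be repeated for other addresses. A rough sketch, assuming the DSpace log lines carry a <code>session_id=</code> field as in the command above; the <code>session_count</code> helper is hypothetical, and it matches the IP as a fixed string so the dots are not treated as regex wildcards:</li>
</ul>

```shell
# Hypothetical helper (not part of the original notes): count the distinct
# Tomcat session IDs seen for one client IP.
#   stdin: DSpace log lines containing session_id=<32 characters>
#   $1:    the IP address to look for
session_count() {
    # -F: treat the IP as a literal string; -w: do not match inside a longer number
    grep -F -w "$1" \
        | grep -o -E 'session_id=[A-Z0-9]{32}' \
        | sort -u \
        | awk 'END { print NR }'
}

# Example, using the log file from the command above:
# session_count 54.175.208.220 < dspace.log.2017-12-30
```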
<ul>
<li>216.244.66.245 seems to be moz.com&rsquo;s DotBot</li>
</ul>