Add notes for 2017-12-30

This commit is contained in:
Alan Orth 2017-12-31 06:47:41 -08:00
parent 164748b5d2
commit b51bbdb91c
Signed by: alanorth
GPG Key ID: 0FB860CC9C45B1B9
3 changed files with 74 additions and 8 deletions

View File

@ -699,3 +699,34 @@ DELETE 20
``` ```
- I need to figure out why we have records with language `in` because that's not a language! - I need to figure out why we have records with language `in` because that's not a language!
## 2017-12-30
- Linode alerted that CGSpace was using 259% CPU from 4 to 6 AM
- Uptime Robot noticed that the server went down for 1 minute a few hours later, around 9AM
- Here's the XMLUI logs:
```
# cat /var/log/nginx/access.log /var/log/nginx/access.log.1 /var/log/nginx/library-access.log /var/log/nginx/library-access.log.1 | grep -E "30/Dec/2017" | awk '{print $1}' | sort -n | uniq -c | sort -h | tail
637 207.46.13.106
641 157.55.39.186
715 68.180.229.254
924 104.196.152.243
1012 66.249.64.95
1060 216.244.66.245
1120 54.175.208.220
1287 66.249.64.93
1586 66.249.64.78
3653 66.249.64.91
```
- Looks pretty normal actually, but I don't know who 54.175.208.220 is
- They identify as "com.plumanalytics", which Google says is associated with Elsevier
- They only seem to have used one Tomcat session so that's good, I guess I don't need to add them to the Tomcat Crawler Session Manager valve:
```
$ grep 54.175.208.220 dspace.log.2017-12-30 | grep -o -E 'session_id=[A-Z0-9]{32}' | sort -n | uniq | wc -l
1
```
- 216.244.66.245 seems to be moz.com's DotBot

View File

@ -23,7 +23,7 @@ The list of connections to XMLUI and REST API for today:
<meta property="article:published_time" content="2017-12-01T13:53:54&#43;03:00"/> <meta property="article:published_time" content="2017-12-01T13:53:54&#43;03:00"/>
<meta property="article:modified_time" content="2017-12-25T22:26:01&#43;02:00"/> <meta property="article:modified_time" content="2017-12-29T18:45:09&#43;02:00"/>
@ -56,9 +56,9 @@ The list of connections to XMLUI and REST API for today:
"@type": "BlogPosting", "@type": "BlogPosting",
"headline": "December, 2017", "headline": "December, 2017",
"url": "https://alanorth.github.io/cgspace-notes/2017-12/", "url": "https://alanorth.github.io/cgspace-notes/2017-12/",
"wordCount": "3897", "wordCount": "4050",
"datePublished": "2017-12-01T13:53:54&#43;03:00", "datePublished": "2017-12-01T13:53:54&#43;03:00",
"dateModified": "2017-12-25T22:26:01&#43;02:00", "dateModified": "2017-12-29T18:45:09&#43;02:00",
"author": { "author": {
"@type": "Person", "@type": "Person",
"name": "Alan Orth" "name": "Alan Orth"
@ -904,6 +904,41 @@ DELETE 20
<li>I need to figure out why we have records with language <code>in</code> because that&rsquo;s not a language!</li> <li>I need to figure out why we have records with language <code>in</code> because that&rsquo;s not a language!</li>
</ul> </ul>
<h2 id="2017-12-30">2017-12-30</h2>
<ul>
<li>Linode alerted that CGSpace was using 259% CPU from 4 to 6 AM</li>
<li>Uptime Robot noticed that the server went down for 1 minute a few hours later, around 9AM</li>
<li>Here&rsquo;s the XMLUI logs:</li>
</ul>
<pre><code># cat /var/log/nginx/access.log /var/log/nginx/access.log.1 /var/log/nginx/library-access.log /var/log/nginx/library-access.log.1 | grep -E &quot;30/Dec/2017&quot; | awk '{print $1}' | sort -n | uniq -c | sort -h | tail
637 207.46.13.106
641 157.55.39.186
715 68.180.229.254
924 104.196.152.243
1012 66.249.64.95
1060 216.244.66.245
1120 54.175.208.220
1287 66.249.64.93
1586 66.249.64.78
3653 66.249.64.91
</code></pre>
<ul>
<li>Looks pretty normal actually, but I don&rsquo;t know who 54.175.208.220 is</li>
<li>They identify as &ldquo;com.plumanalytics&rdquo;, which Google says is associated with Elsevier</li>
<li>They only seem to have used one Tomcat session so that&rsquo;s good, I guess I don&rsquo;t need to add them to the Tomcat Crawler Session Manager valve:</li>
</ul>
<pre><code>$ grep 54.175.208.220 dspace.log.2017-12-30 | grep -o -E 'session_id=[A-Z0-9]{32}' | sort -n | uniq | wc -l
1
</code></pre>
<ul>
<li>216.244.66.245 seems to be moz.com&rsquo;s DotBot</li>
</ul>

View File

@ -4,7 +4,7 @@
<url> <url>
<loc>https://alanorth.github.io/cgspace-notes/2017-12/</loc> <loc>https://alanorth.github.io/cgspace-notes/2017-12/</loc>
<lastmod>2017-12-25T22:26:01+02:00</lastmod> <lastmod>2017-12-29T18:45:09+02:00</lastmod>
</url> </url>
<url> <url>
@ -139,7 +139,7 @@
<url> <url>
<loc>https://alanorth.github.io/cgspace-notes/</loc> <loc>https://alanorth.github.io/cgspace-notes/</loc>
<lastmod>2017-12-25T22:26:01+02:00</lastmod> <lastmod>2017-12-29T18:45:09+02:00</lastmod>
<priority>0</priority> <priority>0</priority>
</url> </url>
@ -150,7 +150,7 @@
<url> <url>
<loc>https://alanorth.github.io/cgspace-notes/tags/notes/</loc> <loc>https://alanorth.github.io/cgspace-notes/tags/notes/</loc>
<lastmod>2017-12-25T22:26:01+02:00</lastmod> <lastmod>2017-12-29T18:45:09+02:00</lastmod>
<priority>0</priority> <priority>0</priority>
</url> </url>
@ -162,13 +162,13 @@
<url> <url>
<loc>https://alanorth.github.io/cgspace-notes/post/</loc> <loc>https://alanorth.github.io/cgspace-notes/post/</loc>
<lastmod>2017-12-25T22:26:01+02:00</lastmod> <lastmod>2017-12-29T18:45:09+02:00</lastmod>
<priority>0</priority> <priority>0</priority>
</url> </url>
<url> <url>
<loc>https://alanorth.github.io/cgspace-notes/tags/</loc> <loc>https://alanorth.github.io/cgspace-notes/tags/</loc>
<lastmod>2017-12-25T22:26:01+02:00</lastmod> <lastmod>2017-12-29T18:45:09+02:00</lastmod>
<priority>0</priority> <priority>0</priority>
</url> </url>