mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2024-11-29 18:08:20 +01:00
Add notes for 2017-12-30
This commit is contained in:
parent
164748b5d2
commit
b51bbdb91c
@ -699,3 +699,34 @@ DELETE 20
|
|||||||
```
|
```
|
||||||
|
|
||||||
- I need to figure out why we have records with language `in` because that's not a language!
|
- I need to figure out why we have records with language `in` because that's not a language!
|
||||||
|
|
||||||
|
## 2017-12-30
|
||||||
|
|
||||||
|
- Linode alerted that CGSpace was using 259% CPU from 4 to 6 AM
|
||||||
|
- Uptime Robot noticed that the server went down for 1 minute a few hours later, around 9AM
|
||||||
|
- Here's the XMLUI logs:
|
||||||
|
|
||||||
|
```
|
||||||
|
# cat /var/log/nginx/access.log /var/log/nginx/access.log.1 /var/log/nginx/library-access.log /var/log/nginx/library-access.log.1 | grep -E "30/Dec/2017" | awk '{print $1}' | sort -n | uniq -c | sort -h | tail
|
||||||
|
637 207.46.13.106
|
||||||
|
641 157.55.39.186
|
||||||
|
715 68.180.229.254
|
||||||
|
924 104.196.152.243
|
||||||
|
1012 66.249.64.95
|
||||||
|
1060 216.244.66.245
|
||||||
|
1120 54.175.208.220
|
||||||
|
1287 66.249.64.93
|
||||||
|
1586 66.249.64.78
|
||||||
|
3653 66.249.64.91
|
||||||
|
```
|
||||||
|
|
||||||
|
- Looks pretty normal actually, but I don't know who 54.175.208.220 is
|
||||||
|
- They identify as "com.plumanalytics", which Google says is associated with Elsevier
|
||||||
|
- They only seem to have used one Tomcat session so that's good, I guess I don't need to add them to the Tomcat Crawler Session Manager valve:
|
||||||
|
|
||||||
|
```
|
||||||
|
$ grep 54.175.208.220 dspace.log.2017-12-30 | grep -o -E 'session_id=[A-Z0-9]{32}' | sort -n | uniq | wc -l
|
||||||
|
1
|
||||||
|
```
|
||||||
|
|
||||||
|
- 216.244.66.245 seems to be moz.com's DotBot
|
||||||
|
@ -23,7 +23,7 @@ The list of connections to XMLUI and REST API for today:
|
|||||||
|
|
||||||
<meta property="article:published_time" content="2017-12-01T13:53:54+03:00"/>
|
<meta property="article:published_time" content="2017-12-01T13:53:54+03:00"/>
|
||||||
|
|
||||||
<meta property="article:modified_time" content="2017-12-25T22:26:01+02:00"/>
|
<meta property="article:modified_time" content="2017-12-29T18:45:09+02:00"/>
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
@ -56,9 +56,9 @@ The list of connections to XMLUI and REST API for today:
|
|||||||
"@type": "BlogPosting",
|
"@type": "BlogPosting",
|
||||||
"headline": "December, 2017",
|
"headline": "December, 2017",
|
||||||
"url": "https://alanorth.github.io/cgspace-notes/2017-12/",
|
"url": "https://alanorth.github.io/cgspace-notes/2017-12/",
|
||||||
"wordCount": "3897",
|
"wordCount": "4050",
|
||||||
"datePublished": "2017-12-01T13:53:54+03:00",
|
"datePublished": "2017-12-01T13:53:54+03:00",
|
||||||
"dateModified": "2017-12-25T22:26:01+02:00",
|
"dateModified": "2017-12-29T18:45:09+02:00",
|
||||||
"author": {
|
"author": {
|
||||||
"@type": "Person",
|
"@type": "Person",
|
||||||
"name": "Alan Orth"
|
"name": "Alan Orth"
|
||||||
@ -904,6 +904,41 @@ DELETE 20
|
|||||||
<li>I need to figure out why we have records with language <code>in</code> because that’s not a language!</li>
|
<li>I need to figure out why we have records with language <code>in</code> because that’s not a language!</li>
|
||||||
</ul>
|
</ul>
|
||||||
|
|
||||||
|
<h2 id="2017-12-30">2017-12-30</h2>
|
||||||
|
|
||||||
|
<ul>
|
||||||
|
<li>Linode alerted that CGSpace was using 259% CPU from 4 to 6 AM</li>
|
||||||
|
<li>Uptime Robot noticed that the server went down for 1 minute a few hours later, around 9AM</li>
|
||||||
|
<li>Here’s the XMLUI logs:</li>
|
||||||
|
</ul>
|
||||||
|
|
||||||
|
<pre><code># cat /var/log/nginx/access.log /var/log/nginx/access.log.1 /var/log/nginx/library-access.log /var/log/nginx/library-access.log.1 | grep -E "30/Dec/2017" | awk '{print $1}' | sort -n | uniq -c | sort -h | tail
|
||||||
|
637 207.46.13.106
|
||||||
|
641 157.55.39.186
|
||||||
|
715 68.180.229.254
|
||||||
|
924 104.196.152.243
|
||||||
|
1012 66.249.64.95
|
||||||
|
1060 216.244.66.245
|
||||||
|
1120 54.175.208.220
|
||||||
|
1287 66.249.64.93
|
||||||
|
1586 66.249.64.78
|
||||||
|
3653 66.249.64.91
|
||||||
|
</code></pre>
|
||||||
|
|
||||||
|
<ul>
|
||||||
|
<li>Looks pretty normal actually, but I don’t know who 54.175.208.220 is</li>
|
||||||
|
<li>They identify as “com.plumanalytics”, which Google says is associated with Elsevier</li>
|
||||||
|
<li>They only seem to have used one Tomcat session so that’s good, I guess I don’t need to add them to the Tomcat Crawler Session Manager valve:</li>
|
||||||
|
</ul>
|
||||||
|
|
||||||
|
<pre><code>$ grep 54.175.208.220 dspace.log.2017-12-30 | grep -o -E 'session_id=[A-Z0-9]{32}' | sort -n | uniq | wc -l
|
||||||
|
1
|
||||||
|
</code></pre>
|
||||||
|
|
||||||
|
<ul>
|
||||||
|
<li>216.244.66.245 seems to be moz.com’s DotBot</li>
|
||||||
|
</ul>
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
|
@ -4,7 +4,7 @@
|
|||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/2017-12/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/2017-12/</loc>
|
||||||
<lastmod>2017-12-25T22:26:01+02:00</lastmod>
|
<lastmod>2017-12-29T18:45:09+02:00</lastmod>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
<url>
|
<url>
|
||||||
@ -139,7 +139,7 @@
|
|||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/</loc>
|
||||||
<lastmod>2017-12-25T22:26:01+02:00</lastmod>
|
<lastmod>2017-12-29T18:45:09+02:00</lastmod>
|
||||||
<priority>0</priority>
|
<priority>0</priority>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
@ -150,7 +150,7 @@
|
|||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/tags/notes/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/tags/notes/</loc>
|
||||||
<lastmod>2017-12-25T22:26:01+02:00</lastmod>
|
<lastmod>2017-12-29T18:45:09+02:00</lastmod>
|
||||||
<priority>0</priority>
|
<priority>0</priority>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
@ -162,13 +162,13 @@
|
|||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/post/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/post/</loc>
|
||||||
<lastmod>2017-12-25T22:26:01+02:00</lastmod>
|
<lastmod>2017-12-29T18:45:09+02:00</lastmod>
|
||||||
<priority>0</priority>
|
<priority>0</priority>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/tags/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/tags/</loc>
|
||||||
<lastmod>2017-12-25T22:26:01+02:00</lastmod>
|
<lastmod>2017-12-29T18:45:09+02:00</lastmod>
|
||||||
<priority>0</priority>
|
<priority>0</priority>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
|
Loading…
Reference in New Issue
Block a user