mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2024-11-26 08:28:18 +01:00
Add notes for 2017-12-18
This commit is contained in:
parent
7ac8d252d0
commit
dad1df6263
@ -238,4 +238,87 @@ Ended: 1513521858573
|
|||||||
Elapsed time: 2 secs (2559 msecs)
|
Elapsed time: 2 secs (2559 msecs)
|
||||||
```
|
```
|
||||||
|
|
||||||
|
- I even tried to debug it by adding verbose logging to the `JAVA_OPTS`:
|
||||||
|
|
||||||
|
```
|
||||||
|
-Dlog4j.configuration=file:/Users/aorth/dspace/config/log4j-console.properties -Ddspace.log.init.disable=true
|
||||||
|
```
|
||||||
|
|
||||||
|
- ... but the error message was the same, just with more INFO noise around it
|
||||||
- For now I'll import into a collection in DSpace Test but I'm really not sure what's up with this!
|
- For now I'll import into a collection in DSpace Test but I'm really not sure what's up with this!
|
||||||
|
- Linode alerted that CGSpace was using high CPU from 4 to 6 PM
|
||||||
|
- The logs for today show the CORE bot (137.108.70.7) being active in XMLUI:
|
||||||
|
|
||||||
|
```
|
||||||
|
# cat /var/log/nginx/access.log /var/log/nginx/access.log.1 /var/log/nginx/library-access.log /var/log/nginx/library-access.log.1 | grep -E "17/Dec/2017" | awk '{print $1}' | sort -n | uniq -c | sort -h | tail
|
||||||
|
671 66.249.66.70
|
||||||
|
885 95.108.181.88
|
||||||
|
904 157.55.39.96
|
||||||
|
923 157.55.39.179
|
||||||
|
1159 207.46.13.107
|
||||||
|
1184 104.196.152.243
|
||||||
|
1230 66.249.66.91
|
||||||
|
1414 68.180.229.254
|
||||||
|
4137 66.249.66.90
|
||||||
|
46401 137.108.70.7
|
||||||
|
```
|
||||||
|
|
||||||
|
- And then some CIAT bot (45.5.184.196) is actively hitting API endpoints:
|
||||||
|
|
||||||
|
```
|
||||||
|
# cat /var/log/nginx/rest.log /var/log/nginx/rest.log.1 /var/log/nginx/oai.log /var/log/nginx/oai.log.1 | grep -E "17/Dec/2017" | awk '{print $1}' | sort -n | uniq -c | sort -h | tail
|
||||||
|
33 68.180.229.254
|
||||||
|
48 157.55.39.96
|
||||||
|
51 157.55.39.179
|
||||||
|
56 207.46.13.107
|
||||||
|
102 104.196.152.243
|
||||||
|
102 66.249.66.90
|
||||||
|
691 137.108.70.7
|
||||||
|
1531 50.116.102.77
|
||||||
|
4014 70.32.83.92
|
||||||
|
11030 45.5.184.196
|
||||||
|
```
|
||||||
|
|
||||||
|
- That's probably ok, as I don't think the REST API connections use up a Tomcat session...
|
||||||
|
- CIP emailed a few days ago to ask about unique IDs for authors and organizations, and if we can provide them via an API
|
||||||
|
- Regarding the import issue above it seems to be a known issue that has a patch in DSpace 5.7:
|
||||||
|
- https://jira.duraspace.org/browse/DS-2633
|
||||||
|
- https://jira.duraspace.org/browse/DS-3583
|
||||||
|
- We're on DSpace 5.5 but there is a one-word fix to the addItem() function here: https://github.com/DSpace/DSpace/pull/1731
|
||||||
|
- I will apply it on our branch but I need to make a note to NOT cherry-pick it when I rebase on to the latest 5.x upstream later
|
||||||
|
- Pull request: [#351](https://github.com/ilri/DSpace/pull/351)
|
||||||
|
|
||||||
|
## 2017-12-18
|
||||||
|
|
||||||
|
- Linode alerted this morning that there was high outbound traffic from 6 to 8 AM
|
||||||
|
- The XMLUI logs show that the CORE bot from last night (137.108.70.7) is very active still:
|
||||||
|
|
||||||
|
```
|
||||||
|
# cat /var/log/nginx/access.log /var/log/nginx/access.log.1 /var/log/nginx/library-access.log /var/log/nginx/library-access.log.1 | grep -E "18/Dec/2017" | awk '{print $1}' | sort -n | uniq -c | sort -h | tail
|
||||||
|
190 207.46.13.146
|
||||||
|
191 197.210.168.174
|
||||||
|
202 86.101.203.216
|
||||||
|
268 157.55.39.134
|
||||||
|
297 66.249.66.91
|
||||||
|
314 213.55.99.121
|
||||||
|
402 66.249.66.90
|
||||||
|
532 68.180.229.254
|
||||||
|
644 104.196.152.243
|
||||||
|
32220 137.108.70.7
|
||||||
|
```
|
||||||
|
|
||||||
|
- On the API side (REST and OAI) there is still the same CIAT bot (45.5.184.196) from last night making quite a number of requests this morning:
|
||||||
|
|
||||||
|
```
|
||||||
|
# cat /var/log/nginx/rest.log /var/log/nginx/rest.log.1 /var/log/nginx/oai.log /var/log/nginx/oai.log.1 | grep -E "18/Dec/2017" | awk '{print $1}' | sort -n | uniq -c | sort -h | tail
|
||||||
|
7 104.198.9.108
|
||||||
|
8 185.29.8.111
|
||||||
|
8 40.77.167.176
|
||||||
|
9 66.249.66.91
|
||||||
|
9 68.180.229.254
|
||||||
|
10 157.55.39.134
|
||||||
|
15 66.249.66.90
|
||||||
|
59 104.196.152.243
|
||||||
|
4014 70.32.83.92
|
||||||
|
8619 45.5.184.196
|
||||||
|
```
|
||||||
|
@ -23,7 +23,7 @@ The list of connections to XMLUI and REST API for today:
|
|||||||
|
|
||||||
<meta property="article:published_time" content="2017-12-01T13:53:54+03:00"/>
|
<meta property="article:published_time" content="2017-12-01T13:53:54+03:00"/>
|
||||||
|
|
||||||
<meta property="article:modified_time" content="2017-12-17T11:22:21+02:00"/>
|
<meta property="article:modified_time" content="2017-12-17T17:18:06+02:00"/>
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
@ -56,9 +56,9 @@ The list of connections to XMLUI and REST API for today:
|
|||||||
"@type": "BlogPosting",
|
"@type": "BlogPosting",
|
||||||
"headline": "December, 2017",
|
"headline": "December, 2017",
|
||||||
"url": "https://alanorth.github.io/cgspace-notes/2017-12/",
|
"url": "https://alanorth.github.io/cgspace-notes/2017-12/",
|
||||||
"wordCount": "1330",
|
"wordCount": "1743",
|
||||||
"datePublished": "2017-12-01T13:53:54+03:00",
|
"datePublished": "2017-12-01T13:53:54+03:00",
|
||||||
"dateModified": "2017-12-17T11:22:21+02:00",
|
"dateModified": "2017-12-17T17:18:06+02:00",
|
||||||
"author": {
|
"author": {
|
||||||
"@type": "Person",
|
"@type": "Person",
|
||||||
"name": "Alan Orth"
|
"name": "Alan Orth"
|
||||||
@ -386,9 +386,100 @@ Elapsed time: 2 secs (2559 msecs)
|
|||||||
</code></pre>
|
</code></pre>
|
||||||
|
|
||||||
<ul>
|
<ul>
|
||||||
<li>For now I’ll import into a collection in DSpace Test but I’m really not sure what’s up with this!</li>
|
<li>I even tried to debug it by adding verbose logging to the <code>JAVA_OPTS</code>:</li>
|
||||||
</ul>
|
</ul>
|
||||||
|
|
||||||
|
<pre><code>-Dlog4j.configuration=file:/Users/aorth/dspace/config/log4j-console.properties -Ddspace.log.init.disable=true
|
||||||
|
</code></pre>
|
||||||
|
|
||||||
|
<ul>
|
||||||
|
<li>… but the error message was the same, just with more INFO noise around it</li>
|
||||||
|
<li>For now I’ll import into a collection in DSpace Test but I’m really not sure what’s up with this!</li>
|
||||||
|
<li>Linode alerted that CGSpace was using high CPU from 4 to 6 PM</li>
|
||||||
|
<li>The logs for today show the CORE bot (137.108.70.7) being active in XMLUI:</li>
|
||||||
|
</ul>
|
||||||
|
|
||||||
|
<pre><code># cat /var/log/nginx/access.log /var/log/nginx/access.log.1 /var/log/nginx/library-access.log /var/log/nginx/library-access.log.1 | grep -E "17/Dec/2017" | awk '{print $1}' | sort -n | uniq -c | sort -h | tail
|
||||||
|
671 66.249.66.70
|
||||||
|
885 95.108.181.88
|
||||||
|
904 157.55.39.96
|
||||||
|
923 157.55.39.179
|
||||||
|
1159 207.46.13.107
|
||||||
|
1184 104.196.152.243
|
||||||
|
1230 66.249.66.91
|
||||||
|
1414 68.180.229.254
|
||||||
|
4137 66.249.66.90
|
||||||
|
46401 137.108.70.7
|
||||||
|
</code></pre>
|
||||||
|
|
||||||
|
<ul>
|
||||||
|
<li>And then some CIAT bot (45.5.184.196) is actively hitting API endpoints:</li>
|
||||||
|
</ul>
|
||||||
|
|
||||||
|
<pre><code># cat /var/log/nginx/rest.log /var/log/nginx/rest.log.1 /var/log/nginx/oai.log /var/log/nginx/oai.log.1 | grep -E "17/Dec/2017" | awk '{print $1}' | sort -n | uniq -c | sort -h | tail
|
||||||
|
33 68.180.229.254
|
||||||
|
48 157.55.39.96
|
||||||
|
51 157.55.39.179
|
||||||
|
56 207.46.13.107
|
||||||
|
102 104.196.152.243
|
||||||
|
102 66.249.66.90
|
||||||
|
691 137.108.70.7
|
||||||
|
1531 50.116.102.77
|
||||||
|
4014 70.32.83.92
|
||||||
|
11030 45.5.184.196
|
||||||
|
</code></pre>
|
||||||
|
|
||||||
|
<ul>
|
||||||
|
<li>That’s probably ok, as I don’t think the REST API connections use up a Tomcat session…</li>
|
||||||
|
<li>CIP emailed a few days ago to ask about unique IDs for authors and organizations, and if we can provide them via an API</li>
|
||||||
|
<li>Regarding the import issue above it seems to be a known issue that has a patch in DSpace 5.7:
|
||||||
|
|
||||||
|
<ul>
|
||||||
|
<li><a href="https://jira.duraspace.org/browse/DS-2633">https://jira.duraspace.org/browse/DS-2633</a></li>
|
||||||
|
<li><a href="https://jira.duraspace.org/browse/DS-3583">https://jira.duraspace.org/browse/DS-3583</a></li>
|
||||||
|
</ul></li>
|
||||||
|
<li>We’re on DSpace 5.5 but there is a one-word fix to the addItem() function here: <a href="https://github.com/DSpace/DSpace/pull/1731">https://github.com/DSpace/DSpace/pull/1731</a></li>
|
||||||
|
<li>I will apply it on our branch but I need to make a note to NOT cherry-pick it when I rebase on to the latest 5.x upstream later</li>
|
||||||
|
<li>Pull request: <a href="https://github.com/ilri/DSpace/pull/351">#351</a></li>
|
||||||
|
</ul>
|
||||||
|
|
||||||
|
<h2 id="2017-12-18">2017-12-18</h2>
|
||||||
|
|
||||||
|
<ul>
|
||||||
|
<li>Linode alerted this morning that there was high outbound traffic from 6 to 8 AM</li>
|
||||||
|
<li>The XMLUI logs show that the CORE bot from last night (137.108.70.7) is very active still:</li>
|
||||||
|
</ul>
|
||||||
|
|
||||||
|
<pre><code># cat /var/log/nginx/access.log /var/log/nginx/access.log.1 /var/log/nginx/library-access.log /var/log/nginx/library-access.log.1 | grep -E "18/Dec/2017" | awk '{print $1}' | sort -n | uniq -c | sort -h | tail
|
||||||
|
190 207.46.13.146
|
||||||
|
191 197.210.168.174
|
||||||
|
202 86.101.203.216
|
||||||
|
268 157.55.39.134
|
||||||
|
297 66.249.66.91
|
||||||
|
314 213.55.99.121
|
||||||
|
402 66.249.66.90
|
||||||
|
532 68.180.229.254
|
||||||
|
644 104.196.152.243
|
||||||
|
32220 137.108.70.7
|
||||||
|
</code></pre>
|
||||||
|
|
||||||
|
<ul>
|
||||||
|
<li>On the API side (REST and OAI) there is still the same CIAT bot (45.5.184.196) from last night making quite a number of requests this morning:</li>
|
||||||
|
</ul>
|
||||||
|
|
||||||
|
<pre><code># cat /var/log/nginx/rest.log /var/log/nginx/rest.log.1 /var/log/nginx/oai.log /var/log/nginx/oai.log.1 | grep -E "18/Dec/2017" | awk '{print $1}' | sort -n | uniq -c | sort -h | tail
|
||||||
|
7 104.198.9.108
|
||||||
|
8 185.29.8.111
|
||||||
|
8 40.77.167.176
|
||||||
|
9 66.249.66.91
|
||||||
|
9 68.180.229.254
|
||||||
|
10 157.55.39.134
|
||||||
|
15 66.249.66.90
|
||||||
|
59 104.196.152.243
|
||||||
|
4014 70.32.83.92
|
||||||
|
8619 45.5.184.196
|
||||||
|
</code></pre>
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
|
@ -4,7 +4,7 @@
|
|||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/2017-12/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/2017-12/</loc>
|
||||||
<lastmod>2017-12-17T11:22:21+02:00</lastmod>
|
<lastmod>2017-12-17T17:18:06+02:00</lastmod>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
<url>
|
<url>
|
||||||
@ -139,7 +139,7 @@
|
|||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/</loc>
|
||||||
<lastmod>2017-12-17T11:22:21+02:00</lastmod>
|
<lastmod>2017-12-17T17:18:06+02:00</lastmod>
|
||||||
<priority>0</priority>
|
<priority>0</priority>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
@ -150,7 +150,7 @@
|
|||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/tags/notes/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/tags/notes/</loc>
|
||||||
<lastmod>2017-12-17T11:22:21+02:00</lastmod>
|
<lastmod>2017-12-17T17:18:06+02:00</lastmod>
|
||||||
<priority>0</priority>
|
<priority>0</priority>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
@ -162,13 +162,13 @@
|
|||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/post/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/post/</loc>
|
||||||
<lastmod>2017-12-17T11:22:21+02:00</lastmod>
|
<lastmod>2017-12-17T17:18:06+02:00</lastmod>
|
||||||
<priority>0</priority>
|
<priority>0</priority>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
<url>
|
<url>
|
||||||
<loc>https://alanorth.github.io/cgspace-notes/tags/</loc>
|
<loc>https://alanorth.github.io/cgspace-notes/tags/</loc>
|
||||||
<lastmod>2017-12-17T11:22:21+02:00</lastmod>
|
<lastmod>2017-12-17T17:18:06+02:00</lastmod>
|
||||||
<priority>0</priority>
|
<priority>0</priority>
|
||||||
</url>
|
</url>
|
||||||
|
|
||||||
|
Loading…
Reference in New Issue
Block a user