---
title: "February, 2019"
date: 2019-02-01T21:37:30+02:00
author: "Alan Orth"
tags: ["Notes"]
---

## 2019-02-01

- Linode has alerted a few times since last night that the CPU usage on CGSpace (linode18) was high, despite me increasing the alert threshold last week from 250% to 275%. I might need to increase it again!
- The top IPs before, during, and after this latest alert tonight were:

```
# zcat --force /var/log/nginx/*.log /var/log/nginx/*.log.1 | grep -E "01/Feb/2019:(17|18|19|20|21)" | awk '{print $1}' | sort | uniq -c | sort -n | tail -n 10
    245 207.46.13.5
    332 54.70.40.11
    385 5.143.231.38
    405 207.46.13.173
    405 207.46.13.75
   1117 66.249.66.219
   1121 35.237.175.180
   1546 5.9.6.51
   2474 45.5.186.2
   5490 85.25.237.71
```
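
Since the same pipeline gets re-run for every alert window, it could be wrapped in a small shell function. This is only a sketch: `top_ips` is a hypothetical name, and it assumes the client address is the first whitespace-separated field of each log line, as in the command above. It counts with `awk` instead of `sort | uniq -c`, so the counts are printed without padding.

```shell
# top_ips PATTERN [N]: read access log lines on stdin, keep the ones that
# match the extended regex PATTERN, and print the N busiest client
# addresses as "count address", busiest last.
# Assumption: the client address is the first field of each line.
top_ips() {
  local pattern="$1" limit="${2:-10}"
  grep -E "$pattern" \
    | awk '{ count[$1]++ } END { for (ip in count) print count[ip], ip }' \
    | sort -n \
    | tail -n "$limit"
}
```

It would be used like `zcat --force /var/log/nginx/*.log /var/log/nginx/*.log.1 | top_ips "01/Feb/2019:(17|18|19|20|21)"`.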

- `85.25.237.71` is the "Linguee Bot" that I first saw last month
- The Solr statistics for the past few months have been very high, and I was wondering if the web server logs also showed an increase
- There were just over 3 million accesses in the nginx logs last month:

```
# time zcat --force /var/log/nginx/* | grep -cE "[0-9]{1,2}/Jan/2019"
3018243

real    0m19.873s
user    0m22.203s
sys     0m1.979s
```
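
To see whether a monthly total like this is spread evenly or driven by a few busy days, the count could be broken down per day. The following is a rough sketch, not something from the original notes: `requests_per_day` is a hypothetical helper, and it assumes each log line carries a timestamp of the form `[01/Jan/2019:10:00:00 +0100]`.

```shell
# requests_per_day MONTH_YEAR: read access log lines on stdin and print
# "DD/Mon/YYYY count" for every day of the given month, e.g. Jan/2019.
# Assumption: timestamps look like [01/Jan/2019:10:00:00 +0100].
requests_per_day() {
  awk -v month="$1" '
    # Find the opening of the timestamp, e.g. "[01/Jan/2019:"
    match($0, /\[[0-9][0-9]?\/[A-Za-z]+\/[0-9]+:/) {
      day = substr($0, RSTART + 1, RLENGTH - 2)    # e.g. 01/Jan/2019
      # Keep the line only if the part after the day number is the month
      if (substr(day, index(day, "/") + 1) == month) count[day]++
    }
    END { for (d in count) print d, count[d] }
  ' | sort
}
```

For example: `zcat --force /var/log/nginx/* | requests_per_day Jan/2019`.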

<!--more-->

- Normally I'd say this was very high, but [about this time last year]({{< relref "2018-02.md" >}}) I remember thinking the same thing when we had 3.1 million...
- I will have to keep an eye on this to see if there is some error in Solr...
- Atmire sent their [pull request to re-enable the Metadata Quality Module (MQM) on our `5_x-dev` branch](https://github.com/ilri/DSpace/pull/407) today
- I will test it next week and send them feedback

## 2019-02-02

- Another alert from Linode about CGSpace (linode18) this morning. Here are the top IPs in the web server logs before, during, and after that time:

```
# zcat --force /var/log/nginx/*.log /var/log/nginx/*.log.1 | grep -E "02/Feb/2019:0(1|2|3|4|5)" | awk '{print $1}' | sort | uniq -c | sort -n | tail -n 10
    284 18.195.78.144
    329 207.46.13.32
    417 35.237.175.180
    448 34.218.226.147
    694 2a01:4f8:13b:1296::2
    718 2a01:4f8:140:3192::2
    786 137.108.70.14
   1002 5.9.6.51
   6077 85.25.237.71
   8726 45.5.184.2
```
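
One way to work out who is behind an address in a listing like this is to tally the user agents it sent. The sketch below is only an illustration: `agents_for_ip` is a hypothetical helper, and it assumes the logs use nginx's `combined` format, where the user agent is the last double-quoted field on the line. A custom log format with extra quoted fields at the end would need a different field index.

```shell
# agents_for_ip IP: read access log lines on stdin and print
# "count user-agent" for one client address, most frequent last.
# Assumption: "combined" log format, i.e. the client address is the first
# field and the user agent is the last double-quoted field.
agents_for_ip() {
  awk -v ip="$1" -F'"' '
    {
      split($1, head, " ")                      # head[1] is the client address
      if (head[1] == ip) count[$(NF - 1)]++     # $(NF - 1) is the user agent
    }
    END { for (agent in count) print count[agent], agent }
  ' | sort -n
}
```

For example: `zcat --force /var/log/nginx/*.log /var/log/nginx/*.log.1 | agents_for_ip 85.25.237.71`.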

- `45.5.184.2` is CIAT and `85.25.237.71` is the new Linguee bot that I first noticed a few days ago
- I will increase the Linode alert threshold from 275% to 300% because this is becoming too much!
- I tested the Atmire Metadata Quality Module (MQM)'s duplicate checker on some [WLE items](https://dspacetest.cgiar.org/handle/10568/81268) that I helped Udana with a few months ago on DSpace Test (linode19) and indeed it found many duplicates!

<!-- vim: set sw=2 ts=2: -->