Add notes for 2020-08-13

This commit is contained in:
2020-08-13 17:56:39 +03:00
parent ccecd63eb0
commit eafe422984
20 changed files with 54 additions and 25 deletions

View File

@ -385,4 +385,17 @@ dspace=# SELECT count(text_value) FROM metadatavalue WHERE metadata_field_id = 2
- I noticed a bunch of user agents with "Crawl" in the Solr stats, which is strange because the DSpace spider agents file has had "crawl" for a long time (and it is case insensitive)
- In any case I will purge them and add them to the Tomcat Crawler Session Manager Valve so that at least their sessions get re-used
## 2020-08-13
- Linode keeps sending mails that the load and outgoing bandwidth is above the threshold
- I took a look briefly and found two IPs with the "Delphi 2009" user agent
- Then there is 88.99.115.53 which made 82,000 requests in 2020 so far with no user agent
- 64.62.202.73 has made 7,000 requests with this user agent `Mozilla/5.0 (compatible; +centuryb.o.t9[at]gmail.com)`
- I had added it to the Tomcat Crawler Session Manager Valve last week but never purged the hits from Solr
- 195.54.160.163 is making thousands of requests with user agents liket this:
`(CASE WHEN 2850=9474 THEN 2850 ELSE NULL END)`
- I purged 150,000 hits from 2020 and 2020 from these user agents and hosts
<!-- vim: set sw=2 ts=2: -->