diff --git a/content/posts/2020-07.md b/content/posts/2020-07.md new file mode 100644 index 000000000..42bd23417 --- /dev/null +++ b/content/posts/2020-07.md @@ -0,0 +1,186 @@ +--- +title: "July, 2020" +date: 2020-07-01T10:53:54+03:00 +author: "Alan Orth" +categories: ["Notes"] +--- + +## 2020-07-01 + +- A few users noticed that CGSpace wasn't loading items today, item pages seem blank + - I looked at the PostgreSQL locks but they don't seem unusual + - I guess this is the same "blank item page" issue that we had a few times in 2019 that we never solved + - I restarted Tomcat and PostgreSQL and the issue was gone +- Since I was restarting Tomcat anyways I decided to redeploy the latest changes from the `5_x-prod` branch and I added a note about COVID-19 items to the CGSpace frontpage at Peter's request + + + +- Also, Linode is alerting that we had high outbound traffic rate early this morning around midnight AND high CPU load later in the morning +- First looking at the traffic in the morning: + +``` +# cat /var/log/nginx/*.log.1 /var/log/nginx/*.log | grep -E "01/Jul/2020:(00|01|02|03|04)" | goaccess --log-format=COMBINED - +... +9659 33.56% 1 0.08% 340.94 MiB 64.39.99.13 +3317 11.53% 1 0.08% 871.71 MiB 199.47.87.140 +2986 10.38% 1 0.08% 17.39 MiB 199.47.87.144 +2286 7.94% 1 0.08% 13.04 MiB 199.47.87.142 +``` + +- 64.39.99.13 belongs to Qualys, but I see they are using a normal desktop user agent: + +``` +Mozilla/5.0 (Macintosh; Intel Mac OS X 10_14_5) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/12.1.1 Safari/605.1.15 +``` + +- I will purge hits from that IP from Solr +- The 199.47.87.x IPs belong to Turnitin, and apparently they are NOT marked as bots and we have 40,000 hits from them in 2020 statistics alone: + +``` +$ curl -s "http://localhost:8081/solr/statistics/select" -d "q=userAgent:/Turnitin.*/&rows=0" | grep -oE 'numFound="[0-9]+"' +numFound="41694" +``` + +- They used to be "TurnitinBot"... hhmmmm, seems they use both: https://turnitin.com/robot/crawlerinfo.html +- I will add Turnitin to the DSpace bot user agent list, but I see they are reqesting `robots.txt` and only requesting item pages, so that's impressive! I don't need to add them to the "bad bot" rate limit list in nginx +- While looking at the logs I noticed eighty-one IPs in the range 185.152.250.x making little requests this user agent: + +``` +Mozilla/5.0 (Macintosh; Intel Mac OS X 10.14; rv:76.0) Gecko/20100101 Firefox/76.0 +``` + +- The IPs all belong to HostRoyale: + +``` +# cat /var/log/nginx/*.log /var/log/nginx/*.log.1 | grep '01/Jul/2020' | awk '{print $1}' | grep 185.152.250. | sort | uniq | wc -l +81 +# cat /var/log/nginx/*.log /var/log/nginx/*.log.1 | grep '01/Jul/2020' | awk '{print $1}' | grep 185.152.250. | sort | uniq | sort -h +185.152.250.1 +185.152.250.101 +185.152.250.103 +185.152.250.105 +185.152.250.107 +185.152.250.111 +185.152.250.115 +185.152.250.119 +185.152.250.121 +185.152.250.123 +185.152.250.125 +185.152.250.129 +185.152.250.13 +185.152.250.131 +185.152.250.133 +185.152.250.135 +185.152.250.137 +185.152.250.141 +185.152.250.145 +185.152.250.149 +185.152.250.153 +185.152.250.155 +185.152.250.157 +185.152.250.159 +185.152.250.161 +185.152.250.163 +185.152.250.165 +185.152.250.167 +185.152.250.17 +185.152.250.171 +185.152.250.183 +185.152.250.189 +185.152.250.191 +185.152.250.197 +185.152.250.201 +185.152.250.205 +185.152.250.209 +185.152.250.21 +185.152.250.213 +185.152.250.217 +185.152.250.219 +185.152.250.221 +185.152.250.223 +185.152.250.225 +185.152.250.227 +185.152.250.229 +185.152.250.231 +185.152.250.233 +185.152.250.235 +185.152.250.239 +185.152.250.243 +185.152.250.247 +185.152.250.249 +185.152.250.25 +185.152.250.251 +185.152.250.253 +185.152.250.255 +185.152.250.27 +185.152.250.29 +185.152.250.3 +185.152.250.31 +185.152.250.39 +185.152.250.41 +185.152.250.47 +185.152.250.5 +185.152.250.59 +185.152.250.63 +185.152.250.65 +185.152.250.67 +185.152.250.7 +185.152.250.71 +185.152.250.73 +185.152.250.77 +185.152.250.81 +185.152.250.85 +185.152.250.89 +185.152.250.9 +185.152.250.93 +185.152.250.95 +185.152.250.97 +185.152.250.99 +``` + +- It's only a few hundred requests each, but I am very suspicious so I will record it here and purge their IPs from Solr +- Then I see 185.187.30.14 and 185.187.30.13 making requests also, with several different "normal" user agents + - They are both apparently in France, belonging to Scalair FR hosting + - I will purge their requests from Solr too +- Now I see some other new bots I hadn't noticed before: + - `Mozilla/5.0 (compatible; MSIE 10.0; Windows NT 6.1; Trident/6.0) LinkCheck by Siteimprove.com` + - `Consilio (WebHare Platform 4.28.2-dev); LinkChecker)`, which appears to be a [university CMS](https://www.utwente.nl/en/websites/webhare/) + - I will add `LinkCheck`, `Consilio`, and `WebHare` to the list of DSpace bot agents and purge them from Solr stats + - COUNTER-Robots list already has `link.?check` but for some reason DSpace didn't match that and I see hits for some of these... + - Maybe I should add `[Ll]ink.?[Cc]heck.?` to a custom list for now? + - For now I added `Turnitin` to the [new bots pull request on COUNTER-Robots](https://github.com/atmire/COUNTER-Robots/pull/34) +- I purged 20,000 hits from IPs and 45,000 hits from user agents +- I will revert the default "example" agents file back to the upstream master branch of COUNTER-Robots, and then add all my custom ones that are pending in pull requests they haven't merged yet: + +``` +$ diff --unchanged-line-format= --old-line-format= --new-line-format='%L' dspace/config/spiders/agents/example ~/src/git/COUNTER-Robots/COUNTER_Robots_list.txt +Citoid +ecointernet +GigablastOpenSource +Jersey\/\d +MarcEdit +OgScrper +okhttp +^Pattern\/\d +ReactorNetty\/\d +sqlmap +Typhoeus +7siters +``` + +- Just a note that I *still* can't deploy the `6_x-dev-atmire-modules` branch as it fails at ant update: + +``` + [java] java.lang.RuntimeException: Failed to startup the DSpace Service Manager: failure starting up spring service manager: Error crea +ting bean with name 'DefaultStorageUpdateConfig': Injection of autowired dependencies failed; nested exception is org.springframework.beans.factory.BeanCreationException: Could not autowire method: public void com.atmire.statistics.util.StorageReportsUpdater.setStorageReportServi +ces(java.util.List); nested exception is org.springframework.beans.factory.BeanCreationException: Error creating bean with name 'cuaEPersonStorageReportService': Injection of autowired dependencies failed; nested exception is org.springframework.beans.factory.BeanCreationExceptio +n: Could not autowire field: private com.atmire.dspace.cua.dao.storage.CUAEPersonStorageReportDAO com.atmire.dspace.cua.CUAStorageReportServ +iceImpl$CUAEPersonStorageReportServiceImpl.CUAEPersonStorageReportDAO; nested exception is org.springframework.beans.factory.NoUniqueBeanDef +initionException: No qualifying bean of type [com.atmire.dspace.cua.dao.storage.CUAEPersonStorageReportDAO] is defined: expected single matc +hing bean but found 2: com.atmire.dspace.cua.dao.impl.CUAStorageReportDAOImpl$CUAEPersonStorageReportDAOImpl#0,com.atmire.dspace.cua.dao.imp +l.CUAStorageReportDAOImpl$CUAEPersonStorageReportDAOImpl#1 +``` + +- I had told Atmire about this several weeks ago... but I reminded them again in the ticket + + diff --git a/docs/2015-11/index.html b/docs/2015-11/index.html index 27f805324..41dc7b4fc 100644 --- a/docs/2015-11/index.html +++ b/docs/2015-11/index.html @@ -239,6 +239,8 @@ db.statementpool = true
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -247,8 +249,6 @@ db.statementpool = true
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2015-12/index.html b/docs/2015-12/index.html index 008ac611c..c868203ed 100644 --- a/docs/2015-12/index.html +++ b/docs/2015-12/index.html @@ -261,6 +261,8 @@ $ curl -o /dev/null -s -w %{time_total}\\n https://cgspace.cgiar.org/rest/handle
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -269,8 +271,6 @@ $ curl -o /dev/null -s -w %{time_total}\\n https://cgspace.cgiar.org/rest/handle
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2016-01/index.html b/docs/2016-01/index.html index d7f1d5255..212831ab1 100644 --- a/docs/2016-01/index.html +++ b/docs/2016-01/index.html @@ -197,6 +197,8 @@ $ find SimpleArchiveForBio/ -iname “*.pdf” -exec basename {} ; | sor
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -205,8 +207,6 @@ $ find SimpleArchiveForBio/ -iname “*.pdf” -exec basename {} ; | sor
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2016-02/index.html b/docs/2016-02/index.html index dff24eb78..bd7e22c16 100644 --- a/docs/2016-02/index.html +++ b/docs/2016-02/index.html @@ -375,6 +375,8 @@ Bitstream: tést señora alimentación.pdf
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -383,8 +385,6 @@ Bitstream: tést señora alimentación.pdf
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2016-03/index.html b/docs/2016-03/index.html index 2f711102a..dfcd5dc01 100644 --- a/docs/2016-03/index.html +++ b/docs/2016-03/index.html @@ -313,6 +313,8 @@ Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Ja
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -321,8 +323,6 @@ Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Ja
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2016-04/index.html b/docs/2016-04/index.html index 191dccb98..bab077700 100644 --- a/docs/2016-04/index.html +++ b/docs/2016-04/index.html @@ -492,6 +492,8 @@ dspace.log.2016-04-27:7271
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -500,8 +502,6 @@ dspace.log.2016-04-27:7271
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2016-05/index.html b/docs/2016-05/index.html index 42851cc9d..63759c61b 100644 --- a/docs/2016-05/index.html +++ b/docs/2016-05/index.html @@ -368,6 +368,8 @@ sys 0m20.540s
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -376,8 +378,6 @@ sys 0m20.540s
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2016-06/index.html b/docs/2016-06/index.html index 9d5510d83..e7aa02f86 100644 --- a/docs/2016-06/index.html +++ b/docs/2016-06/index.html @@ -406,6 +406,8 @@ $ ./delete-metadata-values.py -f dc.contributor.corporate -i Corporate-Authors-D
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -414,8 +416,6 @@ $ ./delete-metadata-values.py -f dc.contributor.corporate -i Corporate-Authors-D
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2016-07/index.html b/docs/2016-07/index.html index 9efe989c9..bdd390d2b 100644 --- a/docs/2016-07/index.html +++ b/docs/2016-07/index.html @@ -322,6 +322,8 @@ discovery.index.authority.ignore-variants=true
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -330,8 +332,6 @@ discovery.index.authority.ignore-variants=true
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2016-08/index.html b/docs/2016-08/index.html index a5ba742ec..b018b7ce9 100644 --- a/docs/2016-08/index.html +++ b/docs/2016-08/index.html @@ -386,6 +386,8 @@ $ JAVA_OPTS="-Dfile.encoding=UTF-8 -Xmx512m" /home/cgspace.cgiar.org/b
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -394,8 +396,6 @@ $ JAVA_OPTS="-Dfile.encoding=UTF-8 -Xmx512m" /home/cgspace.cgiar.org/b
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2016-09/index.html b/docs/2016-09/index.html index b83d6e94d..cdfef1438 100644 --- a/docs/2016-09/index.html +++ b/docs/2016-09/index.html @@ -603,6 +603,8 @@ $ ./delete-metadata-values.py -i ilrisubjects-delete-13.csv -f cg.subject.ilri -
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -611,8 +613,6 @@ $ ./delete-metadata-values.py -i ilrisubjects-delete-13.csv -f cg.subject.ilri -
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2016-10/index.html b/docs/2016-10/index.html index 930753f24..c0d572fe8 100644 --- a/docs/2016-10/index.html +++ b/docs/2016-10/index.html @@ -369,6 +369,8 @@ dspace=# update metadatavalue set text_value = regexp_replace(text_value, 'http:
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -377,8 +379,6 @@ dspace=# update metadatavalue set text_value = regexp_replace(text_value, 'http:
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2016-11/index.html b/docs/2016-11/index.html index 8588e95d3..3584a611b 100644 --- a/docs/2016-11/index.html +++ b/docs/2016-11/index.html @@ -545,6 +545,8 @@ org.dspace.discovery.SearchServiceException: Error executing query
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -553,8 +555,6 @@ org.dspace.discovery.SearchServiceException: Error executing query
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2016-12/index.html b/docs/2016-12/index.html index b20fa03f4..17d265736 100644 --- a/docs/2016-12/index.html +++ b/docs/2016-12/index.html @@ -781,6 +781,8 @@ $ exit
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -789,8 +791,6 @@ $ exit
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2017-01/index.html b/docs/2017-01/index.html index f568e906b..78f5025a4 100644 --- a/docs/2017-01/index.html +++ b/docs/2017-01/index.html @@ -366,6 +366,8 @@ $ gs -sDEVICE=pdfwrite -dCompatibilityLevel=1.4 -dPDFSETTINGS=/ebook -dNOPAUSE -
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -374,8 +376,6 @@ $ gs -sDEVICE=pdfwrite -dCompatibilityLevel=1.4 -dPDFSETTINGS=/ebook -dNOPAUSE -
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2017-02/index.html b/docs/2017-02/index.html index e5d5a6ca8..6a47a31c6 100644 --- a/docs/2017-02/index.html +++ b/docs/2017-02/index.html @@ -421,6 +421,8 @@ COPY 1968
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -429,8 +431,6 @@ COPY 1968
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2017-03/index.html b/docs/2017-03/index.html index f5409fe49..e810e87bb 100644 --- a/docs/2017-03/index.html +++ b/docs/2017-03/index.html @@ -352,6 +352,8 @@ $ ./delete-metadata-values.py -i Investors-Delete-121.csv -f dc.description.spon
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -360,8 +362,6 @@ $ ./delete-metadata-values.py -i Investors-Delete-121.csv -f dc.description.spon
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2017-04/index.html b/docs/2017-04/index.html index b026a67cf..37b59ff4e 100644 --- a/docs/2017-04/index.html +++ b/docs/2017-04/index.html @@ -582,6 +582,8 @@ $ gem install compass -v 1.0.3
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -590,8 +592,6 @@ $ gem install compass -v 1.0.3
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2017-05/index.html b/docs/2017-05/index.html index b58e80a03..9f9358abb 100644 --- a/docs/2017-05/index.html +++ b/docs/2017-05/index.html @@ -388,6 +388,8 @@ UPDATE 187
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -396,8 +398,6 @@ UPDATE 187
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2017-06/index.html b/docs/2017-06/index.html index 5f3a51712..1a2de44f4 100644 --- a/docs/2017-06/index.html +++ b/docs/2017-06/index.html @@ -267,6 +267,8 @@ $ JAVA_OPTS="-Xmx1024m -Dfile.encoding=UTF-8" [dspace]/bin/dspace impo
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -275,8 +277,6 @@ $ JAVA_OPTS="-Xmx1024m -Dfile.encoding=UTF-8" [dspace]/bin/dspace impo
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2017-07/index.html b/docs/2017-07/index.html index 663bef437..ab8f4349c 100644 --- a/docs/2017-07/index.html +++ b/docs/2017-07/index.html @@ -272,6 +272,8 @@ delete from metadatavalue where resource_type_id=2 and metadata_field_id=235 and
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -280,8 +282,6 @@ delete from metadatavalue where resource_type_id=2 and metadata_field_id=235 and
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2017-08/index.html b/docs/2017-08/index.html index b2ea6c6a8..d44acf4e9 100644 --- a/docs/2017-08/index.html +++ b/docs/2017-08/index.html @@ -514,6 +514,8 @@ org.apache.commons.dbcp.SQLNestedException: Cannot get a connection, pool error
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -522,8 +524,6 @@ org.apache.commons.dbcp.SQLNestedException: Cannot get a connection, pool error
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2017-09/index.html b/docs/2017-09/index.html index 4c23c0ba7..f8730e3e9 100644 --- a/docs/2017-09/index.html +++ b/docs/2017-09/index.html @@ -656,6 +656,8 @@ Cert Status: good
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -664,8 +666,6 @@ Cert Status: good
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2017-10/index.html b/docs/2017-10/index.html index 239d30f8a..fabc06053 100644 --- a/docs/2017-10/index.html +++ b/docs/2017-10/index.html @@ -440,6 +440,8 @@ session_id=6C30F10B4351A4ED83EC6ED50AFD6B6A
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -448,8 +450,6 @@ session_id=6C30F10B4351A4ED83EC6ED50AFD6B6A
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2017-11/index.html b/docs/2017-11/index.html index 9fe942a2d..1b27e04c9 100644 --- a/docs/2017-11/index.html +++ b/docs/2017-11/index.html @@ -941,6 +941,8 @@ $ cat dspace.log.2017-11-28 | grep -o -E 'session_id=[A-Z0-9]{32}' | sort -n | u
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -949,8 +951,6 @@ $ cat dspace.log.2017-11-28 | grep -o -E 'session_id=[A-Z0-9]{32}' | sort -n | u
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2017-12/index.html b/docs/2017-12/index.html index 5f84d32f0..f6442a801 100644 --- a/docs/2017-12/index.html +++ b/docs/2017-12/index.html @@ -780,6 +780,8 @@ DELETE 20
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -788,8 +790,6 @@ DELETE 20
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2018-01/index.html b/docs/2018-01/index.html index c526f704a..1e7666109 100644 --- a/docs/2018-01/index.html +++ b/docs/2018-01/index.html @@ -1449,6 +1449,8 @@ Catalina:type=Manager,context=/,host=localhost activeSessions 8
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -1457,8 +1459,6 @@ Catalina:type=Manager,context=/,host=localhost activeSessions 8
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2018-02/index.html b/docs/2018-02/index.html index 69d902cee..e5995248e 100644 --- a/docs/2018-02/index.html +++ b/docs/2018-02/index.html @@ -1036,6 +1036,8 @@ UPDATE 3
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -1044,8 +1046,6 @@ UPDATE 3
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2018-03/index.html b/docs/2018-03/index.html index eaf5a901f..7585a2d56 100644 --- a/docs/2018-03/index.html +++ b/docs/2018-03/index.html @@ -582,6 +582,8 @@ Fixed 5 occurences of: GENEBANKS
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -590,8 +592,6 @@ Fixed 5 occurences of: GENEBANKS
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2018-04/index.html b/docs/2018-04/index.html index 6031adbac..50ec083f4 100644 --- a/docs/2018-04/index.html +++ b/docs/2018-04/index.html @@ -591,6 +591,8 @@ $ pg_restore -O -U dspacetest -d dspacetest -W -h localhost /tmp/dspace_2018-04-
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -599,8 +601,6 @@ $ pg_restore -O -U dspacetest -d dspacetest -W -h localhost /tmp/dspace_2018-04-
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2018-05/index.html b/docs/2018-05/index.html index 330ee64a0..7740196a4 100644 --- a/docs/2018-05/index.html +++ b/docs/2018-05/index.html @@ -520,6 +520,8 @@ $ psql -h localhost -U postgres dspacetest
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -528,8 +530,6 @@ $ psql -h localhost -U postgres dspacetest
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2018-06/index.html b/docs/2018-06/index.html index 8e29315f6..99af9aa76 100644 --- a/docs/2018-06/index.html +++ b/docs/2018-06/index.html @@ -514,6 +514,8 @@ $ sed '/^id/d' 10568-*.csv | csvcut -c 1,2 > map-to-cifor-archive.csv
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -522,8 +524,6 @@ $ sed '/^id/d' 10568-*.csv | csvcut -c 1,2 > map-to-cifor-archive.csv
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2018-07/index.html b/docs/2018-07/index.html index 855622b5f..c0ef5955c 100644 --- a/docs/2018-07/index.html +++ b/docs/2018-07/index.html @@ -566,6 +566,8 @@ dspace=# select count(text_value) from metadatavalue where resource_type_id=2 an
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -574,8 +576,6 @@ dspace=# select count(text_value) from metadatavalue where resource_type_id=2 an
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2018-08/index.html b/docs/2018-08/index.html index 5f673505f..43700c7af 100644 --- a/docs/2018-08/index.html +++ b/docs/2018-08/index.html @@ -439,6 +439,8 @@ $ dspace database migrate ignored
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -447,8 +449,6 @@ $ dspace database migrate ignored
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2018-09/index.html b/docs/2018-09/index.html index 33ed34e7c..457628968 100644 --- a/docs/2018-09/index.html +++ b/docs/2018-09/index.html @@ -745,6 +745,8 @@ UPDATE metadatavalue SET text_value='ja' WHERE resource_type_id=2 AND metadata_f
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -753,8 +755,6 @@ UPDATE metadatavalue SET text_value='ja' WHERE resource_type_id=2 AND metadata_f
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2018-10/index.html b/docs/2018-10/index.html index 058612a6d..c51e1a9bd 100644 --- a/docs/2018-10/index.html +++ b/docs/2018-10/index.html @@ -653,6 +653,8 @@ $ curl -X GET -H "Content-Type: application/json" -H "Accept: app
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -661,8 +663,6 @@ $ curl -X GET -H "Content-Type: application/json" -H "Accept: app
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2018-11/index.html b/docs/2018-11/index.html index b3e9778eb..385f838e9 100644 --- a/docs/2018-11/index.html +++ b/docs/2018-11/index.html @@ -550,6 +550,8 @@ $ dspace dsrun org.dspace.eperson.Groomer -a -b 11/27/2016 -d
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -558,8 +560,6 @@ $ dspace dsrun org.dspace.eperson.Groomer -a -b 11/27/2016 -d
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2018-12/index.html b/docs/2018-12/index.html index 2bb878ce6..f22a5cd62 100644 --- a/docs/2018-12/index.html +++ b/docs/2018-12/index.html @@ -591,6 +591,8 @@ UPDATE 1
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -599,8 +601,6 @@ UPDATE 1
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2019-01/index.html b/docs/2019-01/index.html index 2a6eb93c2..fa7a75ced 100644 --- a/docs/2019-01/index.html +++ b/docs/2019-01/index.html @@ -1261,6 +1261,8 @@ identify: CorruptImageProfile `xmp' @ warning/profile.c/SetImageProfileInternal/
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -1269,8 +1271,6 @@ identify: CorruptImageProfile `xmp' @ warning/profile.c/SetImageProfileInternal/
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2019-02/index.html b/docs/2019-02/index.html index 2e340d993..551cb833d 100644 --- a/docs/2019-02/index.html +++ b/docs/2019-02/index.html @@ -1341,6 +1341,8 @@ Please see the DSpace documentation for assistance.
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -1349,8 +1351,6 @@ Please see the DSpace documentation for assistance.
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2019-03/index.html b/docs/2019-03/index.html index a86d3684b..43410728c 100644 --- a/docs/2019-03/index.html +++ b/docs/2019-03/index.html @@ -1205,6 +1205,8 @@ sys 0m2.551s
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -1213,8 +1215,6 @@ sys 0m2.551s
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2019-04/index.html b/docs/2019-04/index.html index 7faa5cf5f..eacf90f3a 100644 --- a/docs/2019-04/index.html +++ b/docs/2019-04/index.html @@ -1296,6 +1296,8 @@ UPDATE 14
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -1304,8 +1306,6 @@ UPDATE 14
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2019-05/index.html b/docs/2019-05/index.html index 4b62886e6..f7800bbc4 100644 --- a/docs/2019-05/index.html +++ b/docs/2019-05/index.html @@ -628,6 +628,8 @@ COPY 64871
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -636,8 +638,6 @@ COPY 64871
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2019-06/index.html b/docs/2019-06/index.html index 9b5aa570d..c947eba8d 100644 --- a/docs/2019-06/index.html +++ b/docs/2019-06/index.html @@ -314,6 +314,8 @@ UPDATE 2
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -322,8 +324,6 @@ UPDATE 2
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2019-07/index.html b/docs/2019-07/index.html index 7bbac007e..12e4c1b45 100644 --- a/docs/2019-07/index.html +++ b/docs/2019-07/index.html @@ -551,6 +551,8 @@ issn.validate('1020-3362')
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -559,8 +561,6 @@ issn.validate('1020-3362')
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2019-08/index.html b/docs/2019-08/index.html index 6df28d9a8..d6a0c5e00 100644 --- a/docs/2019-08/index.html +++ b/docs/2019-08/index.html @@ -570,6 +570,8 @@ sys 2m27.496s
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -578,8 +580,6 @@ sys 2m27.496s
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2019-09/index.html b/docs/2019-09/index.html index 086f36371..79bd0966c 100644 --- a/docs/2019-09/index.html +++ b/docs/2019-09/index.html @@ -578,6 +578,8 @@ $ csv-metadata-quality -i /tmp/clarisa-institutions.csv -o /tmp/clarisa-institut
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -586,8 +588,6 @@ $ csv-metadata-quality -i /tmp/clarisa-institutions.csv -o /tmp/clarisa-institut
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2019-10/index.html b/docs/2019-10/index.html index 312de5af4..e92511f01 100644 --- a/docs/2019-10/index.html +++ b/docs/2019-10/index.html @@ -382,6 +382,8 @@ $ dspace import -a -c 10568/104057 -e fuu@cgiar.org -m 2019-10-15-Bioversity.map
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -390,8 +392,6 @@ $ dspace import -a -c 10568/104057 -e fuu@cgiar.org -m 2019-10-15-Bioversity.map
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2019-11/index.html b/docs/2019-11/index.html index a23f4eeec..55827ea30 100644 --- a/docs/2019-11/index.html +++ b/docs/2019-11/index.html @@ -689,6 +689,8 @@ $ tidy -xml -utf8 -iq -m -w 0 dspace/config/controlled-vocabularies/cg-creator-i
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -697,8 +699,6 @@ $ tidy -xml -utf8 -iq -m -w 0 dspace/config/controlled-vocabularies/cg-creator-i
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2019-12/index.html b/docs/2019-12/index.html index a0785ca17..47beb4140 100644 --- a/docs/2019-12/index.html +++ b/docs/2019-12/index.html @@ -401,6 +401,8 @@ UPDATE 1
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -409,8 +411,6 @@ UPDATE 1
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2020-01/index.html b/docs/2020-01/index.html index e25f83a8f..8da9fa3cd 100644 --- a/docs/2020-01/index.html +++ b/docs/2020-01/index.html @@ -601,6 +601,8 @@ COPY 2900
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -609,8 +611,6 @@ COPY 2900
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2020-02/index.html b/docs/2020-02/index.html index 42c9aff3b..060bcdef4 100644 --- a/docs/2020-02/index.html +++ b/docs/2020-02/index.html @@ -1272,6 +1272,8 @@ Moving: 21993 into core statistics-2019
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -1280,8 +1282,6 @@ Moving: 21993 into core statistics-2019
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2020-03/index.html b/docs/2020-03/index.html index d47879cb6..139c329d5 100644 --- a/docs/2020-03/index.html +++ b/docs/2020-03/index.html @@ -481,6 +481,8 @@ $ tidy -xml -utf8 -iq -m -w 0 dspace/config/controlled-vocabularies/cg-creator-i
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -489,8 +491,6 @@ $ tidy -xml -utf8 -iq -m -w 0 dspace/config/controlled-vocabularies/cg-creator-i
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2020-04/index.html b/docs/2020-04/index.html index b89ed4bb6..e553f078a 100644 --- a/docs/2020-04/index.html +++ b/docs/2020-04/index.html @@ -655,6 +655,8 @@ $ psql -c 'select * from pg_stat_activity' | wc -l
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -663,8 +665,6 @@ $ psql -c 'select * from pg_stat_activity' | wc -l
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2020-05/index.html b/docs/2020-05/index.html index b127c1d70..1c262c5e4 100644 --- a/docs/2020-05/index.html +++ b/docs/2020-05/index.html @@ -474,6 +474,8 @@ Caused by: java.lang.NullPointerException
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -482,8 +484,6 @@ Caused by: java.lang.NullPointerException
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2020-06/index.html b/docs/2020-06/index.html index bf921c3b8..cffdb4b5f 100644 --- a/docs/2020-06/index.html +++ b/docs/2020-06/index.html @@ -19,7 +19,7 @@ I tried to build the OAI registry on the freshly migrated DSpace 6 on DSpace Tes - + @@ -45,7 +45,7 @@ I tried to build the OAI registry on the freshly migrated DSpace 6 on DSpace Tes "url": "https://alanorth.github.io/cgspace-notes/2020-06/", "wordCount": "4788", "datePublished": "2020-06-01T13:55:39+03:00", - "dateModified": "2020-06-30T15:47:18+03:00", + "dateModified": "2020-06-30T19:21:50+03:00", "author": { "@type": "Person", "name": "Alan Orth" @@ -808,6 +808,8 @@ $ csvcut -c 'id,cg.subject.ilri[],cg.subject.ilri[en_US],dc.subject[en_US]' /tmp
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -816,8 +818,6 @@ $ csvcut -c 'id,cg.subject.ilri[],cg.subject.ilri[en_US],dc.subject[en_US]' /tmp
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/2020-07/index.html b/docs/2020-07/index.html new file mode 100644 index 000000000..34ac33588 --- /dev/null +++ b/docs/2020-07/index.html @@ -0,0 +1,368 @@ + + + + + + + + + + + + + + + + + + + + + + + + + + + + July, 2020 | CGSpace Notes + + + + + + + + + + + + + + + + + + + + + + + +
+
+ +
+
+ + + + +
+
+

CGSpace Notes

+

Documenting day-to-day work on the CGSpace repository.

+
+
+ + + + +
+
+
+ + + + +
+
+

July, 2020

+ +
+

2020-07-01

+
    +
  • A few users noticed that CGSpace wasn’t loading items today, item pages seem blank +
      +
    • I looked at the PostgreSQL locks but they don’t seem unusual
    • +
    • I guess this is the same “blank item page” issue that we had a few times in 2019 that we never solved
    • +
    • I restarted Tomcat and PostgreSQL and the issue was gone
    • +
    +
  • +
  • Since I was restarting Tomcat anyways I decided to redeploy the latest changes from the 5_x-prod branch and I added a note about COVID-19 items to the CGSpace frontpage at Peter’s request
  • +
+
    +
  • Also, Linode is alerting that we had high outbound traffic rate early this morning around midnight AND high CPU load later in the morning
  • +
  • First looking at the traffic in the morning:
  • +
+
# cat /var/log/nginx/*.log.1 /var/log/nginx/*.log | grep -E "01/Jul/2020:(00|01|02|03|04)" | goaccess --log-format=COMBINED -
+...
+9659 33.56%    1  0.08% 340.94 MiB 64.39.99.13
+3317 11.53%    1  0.08% 871.71 MiB 199.47.87.140
+2986 10.38%    1  0.08%  17.39 MiB 199.47.87.144
+2286  7.94%    1  0.08%  13.04 MiB 199.47.87.142
+
    +
  • 64.39.99.13 belongs to Qualys, but I see they are using a normal desktop user agent:
  • +
+
Mozilla/5.0 (Macintosh; Intel Mac OS X 10_14_5) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/12.1.1 Safari/605.1.15
+
    +
  • I will purge hits from that IP from Solr
  • +
  • The 199.47.87.x IPs belong to Turnitin, and apparently they are NOT marked as bots and we have 40,000 hits from them in 2020 statistics alone:
  • +
+
$ curl -s "http://localhost:8081/solr/statistics/select" -d "q=userAgent:/Turnitin.*/&rows=0" | grep -oE 'numFound="[0-9]+"'
+numFound="41694"
+
    +
  • They used to be “TurnitinBot”… hhmmmm, seems they use both: https://turnitin.com/robot/crawlerinfo.html
  • +
  • I will add Turnitin to the DSpace bot user agent list, but I see they are reqesting robots.txt and only requesting item pages, so that’s impressive! I don’t need to add them to the “bad bot” rate limit list in nginx
  • +
  • While looking at the logs I noticed eighty-one IPs in the range 185.152.250.x making little requests this user agent:
  • +
+
Mozilla/5.0 (Macintosh; Intel Mac OS X 10.14; rv:76.0) Gecko/20100101 Firefox/76.0
+
    +
  • The IPs all belong to HostRoyale:
  • +
+
# cat /var/log/nginx/*.log /var/log/nginx/*.log.1 | grep '01/Jul/2020' | awk '{print $1}' | grep 185.152.250. | sort | uniq | wc -l
+81
+# cat /var/log/nginx/*.log /var/log/nginx/*.log.1 | grep '01/Jul/2020' | awk '{print $1}' | grep 185.152.250. | sort | uniq | sort -h
+185.152.250.1
+185.152.250.101
+185.152.250.103
+185.152.250.105
+185.152.250.107
+185.152.250.111
+185.152.250.115
+185.152.250.119
+185.152.250.121
+185.152.250.123
+185.152.250.125
+185.152.250.129
+185.152.250.13
+185.152.250.131
+185.152.250.133
+185.152.250.135
+185.152.250.137
+185.152.250.141
+185.152.250.145
+185.152.250.149
+185.152.250.153
+185.152.250.155
+185.152.250.157
+185.152.250.159
+185.152.250.161
+185.152.250.163
+185.152.250.165
+185.152.250.167
+185.152.250.17
+185.152.250.171
+185.152.250.183
+185.152.250.189
+185.152.250.191
+185.152.250.197
+185.152.250.201
+185.152.250.205
+185.152.250.209
+185.152.250.21
+185.152.250.213
+185.152.250.217
+185.152.250.219
+185.152.250.221
+185.152.250.223
+185.152.250.225
+185.152.250.227
+185.152.250.229
+185.152.250.231
+185.152.250.233
+185.152.250.235
+185.152.250.239
+185.152.250.243
+185.152.250.247
+185.152.250.249
+185.152.250.25
+185.152.250.251
+185.152.250.253
+185.152.250.255
+185.152.250.27
+185.152.250.29
+185.152.250.3
+185.152.250.31
+185.152.250.39
+185.152.250.41
+185.152.250.47
+185.152.250.5
+185.152.250.59
+185.152.250.63
+185.152.250.65
+185.152.250.67
+185.152.250.7
+185.152.250.71
+185.152.250.73
+185.152.250.77
+185.152.250.81
+185.152.250.85
+185.152.250.89
+185.152.250.9
+185.152.250.93
+185.152.250.95
+185.152.250.97
+185.152.250.99
+
    +
  • It’s only a few hundred requests each, but I am very suspicious so I will record it here and purge their IPs from Solr
  • +
  • Then I see 185.187.30.14 and 185.187.30.13 making requests also, with several different “normal” user agents +
      +
    • They are both apparently in France, belonging to Scalair FR hosting
    • +
    • I will purge their requests from Solr too
    • +
    +
  • +
  • Now I see some other new bots I hadn’t noticed before: +
      +
    • Mozilla/5.0 (compatible; MSIE 10.0; Windows NT 6.1; Trident/6.0) LinkCheck by Siteimprove.com
    • +
    • Consilio (WebHare Platform 4.28.2-dev); LinkChecker), which appears to be a university CMS
    • +
    • I will add LinkCheck, Consilio, and WebHare to the list of DSpace bot agents and purge them from Solr stats
    • +
    • COUNTER-Robots list already has link.?check but for some reason DSpace didn’t match that and I see hits for some of these…
    • +
    • Maybe I should add [Ll]ink.?[Cc]heck.? to a custom list for now?
    • +
    • For now I added Turnitin to the new bots pull request on COUNTER-Robots
    • +
    +
  • +
  • I purged 20,000 hits from IPs and 45,000 hits from user agents
  • +
  • I will revert the default “example” agents file back to the upstream master branch of COUNTER-Robots, and then add all my custom ones that are pending in pull requests they haven’t merged yet:
  • +
+
$ diff --unchanged-line-format= --old-line-format= --new-line-format='%L' dspace/config/spiders/agents/example ~/src/git/COUNTER-Robots/COUNTER_Robots_list.txt
+Citoid
+ecointernet
+GigablastOpenSource
+Jersey\/\d
+MarcEdit
+OgScrper
+okhttp
+^Pattern\/\d
+ReactorNetty\/\d
+sqlmap
+Typhoeus
+7siters
+
    +
  • Just a note that I still can’t deploy the 6_x-dev-atmire-modules branch as it fails at ant update:
  • +
+
     [java] java.lang.RuntimeException: Failed to startup the DSpace Service Manager: failure starting up spring service manager: Error crea
+ting bean with name 'DefaultStorageUpdateConfig': Injection of autowired dependencies failed; nested exception is org.springframework.beans.factory.BeanCreationException: Could not autowire method: public void com.atmire.statistics.util.StorageReportsUpdater.setStorageReportServi
+ces(java.util.List); nested exception is org.springframework.beans.factory.BeanCreationException: Error creating bean with name 'cuaEPersonStorageReportService': Injection of autowired dependencies failed; nested exception is org.springframework.beans.factory.BeanCreationExceptio
+n: Could not autowire field: private com.atmire.dspace.cua.dao.storage.CUAEPersonStorageReportDAO com.atmire.dspace.cua.CUAStorageReportServ
+iceImpl$CUAEPersonStorageReportServiceImpl.CUAEPersonStorageReportDAO; nested exception is org.springframework.beans.factory.NoUniqueBeanDef
+initionException: No qualifying bean of type [com.atmire.dspace.cua.dao.storage.CUAEPersonStorageReportDAO] is defined: expected single matc
+hing bean but found 2: com.atmire.dspace.cua.dao.impl.CUAStorageReportDAOImpl$CUAEPersonStorageReportDAOImpl#0,com.atmire.dspace.cua.dao.imp
+l.CUAStorageReportDAOImpl$CUAEPersonStorageReportDAOImpl#1
+
    +
  • I had told Atmire about this several weeks ago… but I reminded them again in the ticket
  • +
+ + + + + + +
+ + + +
+ + + + +
+
+ + + + + + + + + diff --git a/docs/404.html b/docs/404.html index b75b1ade1..3f953015d 100644 --- a/docs/404.html +++ b/docs/404.html @@ -94,6 +94,8 @@
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -102,8 +104,6 @@
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/categories/index.html b/docs/categories/index.html index ebbe60a8a..9fa2d6c35 100644 --- a/docs/categories/index.html +++ b/docs/categories/index.html @@ -9,7 +9,7 @@ - + @@ -83,7 +83,7 @@

Notes

- +
Read more → @@ -107,6 +107,8 @@
    +
  1. July, 2020
  2. +
  3. June, 2020
  4. May, 2020
  5. @@ -115,8 +117,6 @@
  6. March, 2020
  7. -
  8. February, 2020
  9. -
diff --git a/docs/categories/index.xml b/docs/categories/index.xml index 3f32ae948..bba300c3d 100644 --- a/docs/categories/index.xml +++ b/docs/categories/index.xml @@ -6,7 +6,7 @@ Recent content in Categories on CGSpace Notes Hugo -- gohugo.io en-us - Mon, 01 Jun 2020 13:55:39 +0300 + Wed, 01 Jul 2020 10:53:54 +0300 @@ -14,7 +14,7 @@ Notes https://alanorth.github.io/cgspace-notes/categories/notes/ - Mon, 01 Jun 2020 13:55:39 +0300 + Wed, 01 Jul 2020 10:53:54 +0300 https://alanorth.github.io/cgspace-notes/categories/notes/ diff --git a/docs/categories/notes/index.html b/docs/categories/notes/index.html index 447e2bcf7..763cd30ca 100644 --- a/docs/categories/notes/index.html +++ b/docs/categories/notes/index.html @@ -9,7 +9,7 @@ - + @@ -80,6 +80,34 @@ +
+
+

July, 2020

+ +
+

2020-07-01

+
    +
  • A few users noticed that CGSpace wasn’t loading items today, item pages seem blank +
      +
    • I looked at the PostgreSQL locks but they don’t seem unusual
    • +
    • I guess this is the same “blank item page” issue that we had a few times in 2019 that we never solved
    • +
    • I restarted Tomcat and PostgreSQL and the issue was gone
    • +
    +
  • +
  • Since I was restarting Tomcat anyways I decided to redeploy the latest changes from the 5_x-prod branch and I added a note about COVID-19 items to the CGSpace frontpage at Peter’s request
  • +
+ Read more → +
+ + + + + +

June, 2020

@@ -351,24 +379,6 @@ - -
-
-

October, 2019

- -
- 2019-10-01 Udana from IWMI asked me for a CSV export of their community on CGSpace I exported it, but a quick run through the csv-metadata-quality tool shows that there are some low-hanging fruits we can fix before I send him the data I will limit the scope to the titles, regions, subregions, and river basins for now to manually fix some non-breaking spaces (U+00A0) there that would otherwise be removed by the csv-metadata-quality script’s “unneccesary Unicode” fix: $ csvcut -c 'id,dc. - Read more → -
- - - - -