diff --git a/content/posts/2021-03.md b/content/posts/2021-03.md index 924db2c47..790107ee6 100644 --- a/content/posts/2021-03.md +++ b/content/posts/2021-03.md @@ -500,4 +500,131 @@ $ curl -s 'http://localhost:9200/_nodes/jvm?human' | python -m json.tool $ curl -XPUT 'http://localhost:9200/openrxv-items-temp' ``` +## 2021-03-24 + +- Atmire responded to the [ticket about the Duplicate Checker](https://tracker.atmire.com/tickets-cgiar-ilri/view-ticket?id=934) + - He says it works for him in Firefox, so I checked and it seems to have been an issue with my LocalCDN addon +- I re-deployed DSpace Test (linode26) from the latest CGSpace (linode18) data + - I want to try to finish up processing the duplicates in Solr that [Atmire advised on last month](https://tracker.atmire.com/tickets-cgiar-ilri/view-ticket?id=839) + - The current statistics core is 57861236 kilobytes: + +```console +# du -s /home/dspacetest.cgiar.org/solr/statistics +57861236 /home/dspacetest.cgiar.org/solr/statistics +``` + +- I applied their changes to `config/spring/api/atmire-cua-update.xml` and started the duplicate processor: + +```console +$ export JAVA_OPTS='-Dfile.encoding=UTF-8 -Xmx4096m' +$ chrt -b 0 dspace dsrun com.atmire.statistics.util.update.atomic.AtomicStatisticsUpdateCLI -r 1000 -c statistics -t 12 +``` + +- The default number of records per query is 10,000, which caused memory issues, so I will try with 1000 (Atmire used 100, but that seems too low!) +- Hah, I still got a memory error after only a few minutes: + +```console +... +Run 1 — 80% — 5,000/6,263 docs — 25s — 6m 31s +Exception: GC overhead limit exceeded +java.lang.OutOfMemoryError: GC overhead limit exceeded +``` + +- I guess we really do have to use `-r 100` +- Now the thing runs for a few minutes and "finishes": + +```console +$ chrt -b 0 dspace dsrun com.atmire.statistics.util.update.atomic.AtomicStatisticsUpdateCLI -r 100 -c statistics -t 12 +Loading @mire database changes for module MQM +Changes have been processed + + +************************* +* Update Script Started * +************************* + +Run 1 +Start updating Solr Storage Reports | Wed Mar 24 14:42:17 CET 2021 +Deleting old storage docs from Solr... | Wed Mar 24 14:42:17 CET 2021 +Done. | Wed Mar 24 14:42:17 CET 2021 +Processing storage reports for type: eperson | Wed Mar 24 14:42:17 CET 2021 +Done. | Wed Mar 24 14:42:41 CET 2021 +Processing storage reports for type: group | Wed Mar 24 14:42:41 CET 2021 +Done. | Wed Mar 24 14:45:46 CET 2021 +Processing storage reports for type: collection | Wed Mar 24 14:45:46 CET 2021 +Done. | Wed Mar 24 14:45:54 CET 2021 +Processing storage reports for type: community | Wed Mar 24 14:45:54 CET 2021 +Done. | Wed Mar 24 14:45:58 CET 2021 +Committing to Solr... | Wed Mar 24 14:45:58 CET 2021 +Done. | Wed Mar 24 14:45:59 CET 2021 +Successfully finished updating Solr Storage Reports | Wed Mar 24 14:45:59 CET 2021 +Run 1 — 2% — 100/4,824 docs — 3m 47s — 3m 47s +Run 1 — 4% — 200/4,824 docs — 2s — 3m 50s +Run 1 — 6% — 300/4,824 docs — 2s — 3m 53s +Run 1 — 8% — 400/4,824 docs — 2s — 3m 55s +Run 1 — 10% — 500/4,824 docs — 2s — 3m 58s +Run 1 — 12% — 600/4,824 docs — 2s — 4m 1s +Run 1 — 15% — 700/4,824 docs — 2s — 4m 3s +Run 1 — 17% — 800/4,824 docs — 2s — 4m 6s +Run 1 — 19% — 900/4,824 docs — 2s — 4m 9s +Run 1 — 21% — 1,000/4,824 docs — 2s — 4m 11s +Run 1 — 23% — 1,100/4,824 docs — 2s — 4m 14s +Run 1 — 25% — 1,200/4,824 docs — 2s — 4m 16s +Run 1 — 27% — 1,300/4,824 docs — 2s — 4m 19s +Run 1 — 29% — 1,400/4,824 docs — 2s — 4m 22s +Run 1 — 31% — 1,500/4,824 docs — 2s — 4m 24s +Run 1 — 33% — 1,600/4,824 docs — 2s — 4m 27s +Run 1 — 35% — 1,700/4,824 docs — 2s — 4m 29s +Run 1 — 37% — 1,800/4,824 docs — 2s — 4m 32s +Run 1 — 39% — 1,900/4,824 docs — 2s — 4m 35s +Run 1 — 41% — 2,000/4,824 docs — 2s — 4m 37s +Run 1 — 44% — 2,100/4,824 docs — 2s — 4m 40s +Run 1 — 46% — 2,200/4,824 docs — 2s — 4m 42s +Run 1 — 48% — 2,300/4,824 docs — 2s — 4m 45s +Run 1 — 50% — 2,400/4,824 docs — 2s — 4m 48s +Run 1 — 52% — 2,500/4,824 docs — 2s — 4m 50s +Run 1 — 54% — 2,600/4,824 docs — 2s — 4m 53s +Run 1 — 56% — 2,700/4,824 docs — 2s — 4m 55s +Run 1 — 58% — 2,800/4,824 docs — 2s — 4m 58s +Run 1 — 60% — 2,900/4,824 docs — 2s — 5m 1s +Run 1 — 62% — 3,000/4,824 docs — 2s — 5m 3s +Run 1 — 64% — 3,100/4,824 docs — 2s — 5m 6s +Run 1 — 66% — 3,200/4,824 docs — 3s — 5m 9s +Run 1 — 68% — 3,300/4,824 docs — 2s — 5m 12s +Run 1 — 70% — 3,400/4,824 docs — 2s — 5m 14s +Run 1 — 73% — 3,500/4,824 docs — 2s — 5m 17s +Run 1 — 75% — 3,600/4,824 docs — 2s — 5m 20s +Run 1 — 77% — 3,700/4,824 docs — 2s — 5m 22s +Run 1 — 79% — 3,800/4,824 docs — 2s — 5m 25s +Run 1 — 81% — 3,900/4,824 docs — 2s — 5m 27s +Run 1 — 83% — 4,000/4,824 docs — 2s — 5m 30s +Run 1 — 85% — 4,100/4,824 docs — 2s — 5m 33s +Run 1 — 87% — 4,200/4,824 docs — 2s — 5m 35s +Run 1 — 89% — 4,300/4,824 docs — 2s — 5m 38s +Run 1 — 91% — 4,400/4,824 docs — 2s — 5m 41s +Run 1 — 93% — 4,500/4,824 docs — 2s — 5m 43s +Run 1 — 95% — 4,600/4,824 docs — 2s — 5m 46s +Run 1 — 97% — 4,700/4,824 docs — 2s — 5m 49s +Run 1 — 100% — 4,800/4,824 docs — 2s — 5m 51s +Run 1 — 100% — 4,824/4,824 docs — 2s — 5m 53s +Run 1 took 5m 53s + + +************************** +* Update Script Finished * +************************** +``` + +- If I run it again it finds the same 4,824 docs and processes them... + - I asked Atmire for feedback on this: https://tracker.atmire.com/tickets-cgiar-ilri/view-ticket?id=839 + +## 2021-03-25 + +- Niroshini from IWMI is still having problems adding metadata during the edit step of the workflow on CGSpace + - I told her to try to register using a private email account and we'll add her to the WLE group so she can try that way + +## 2021-03-28 + +- Make a backup of the `openrxv-items-final` index on AReS Explorer and start a new harvest + diff --git a/docs/2021-03/index.html b/docs/2021-03/index.html index 2b7084ae1..74814a8e7 100644 --- a/docs/2021-03/index.html +++ b/docs/2021-03/index.html @@ -44,7 +44,7 @@ Also, we found some issues building and running OpenRXV currently due to ecosyst "@type": "BlogPosting", "headline": "March, 2021", "url": "https://alanorth.github.io/cgspace-notes/2021-03/", - "wordCount": "2914", + "wordCount": "3964", "datePublished": "2021-03-01T10:13:54+02:00", "dateModified": "2021-03-23T09:34:40+02:00", "author": { @@ -645,7 +645,138 @@ $ curl -s 'http://localhost:9200/_alias/' | python -m json.tool | less
$ curl -XPUT 'http://localhost:9200/openrxv-items-temp'
-
+# du -s /home/dspacetest.cgiar.org/solr/statistics
+57861236 /home/dspacetest.cgiar.org/solr/statistics
+
config/spring/api/atmire-cua-update.xml
and started the duplicate processor:$ export JAVA_OPTS='-Dfile.encoding=UTF-8 -Xmx4096m'
+$ chrt -b 0 dspace dsrun com.atmire.statistics.util.update.atomic.AtomicStatisticsUpdateCLI -r 1000 -c statistics -t 12
+
...
+Run 1 — 80% — 5,000/6,263 docs — 25s — 6m 31s
+Exception: GC overhead limit exceeded
+java.lang.OutOfMemoryError: GC overhead limit exceeded
+
-r 100
$ chrt -b 0 dspace dsrun com.atmire.statistics.util.update.atomic.AtomicStatisticsUpdateCLI -r 100 -c statistics -t 12
+Loading @mire database changes for module MQM
+Changes have been processed
+
+
+*************************
+* Update Script Started *
+*************************
+
+Run 1
+Start updating Solr Storage Reports | Wed Mar 24 14:42:17 CET 2021
+Deleting old storage docs from Solr... | Wed Mar 24 14:42:17 CET 2021
+Done. | Wed Mar 24 14:42:17 CET 2021
+Processing storage reports for type: eperson | Wed Mar 24 14:42:17 CET 2021
+Done. | Wed Mar 24 14:42:41 CET 2021
+Processing storage reports for type: group | Wed Mar 24 14:42:41 CET 2021
+Done. | Wed Mar 24 14:45:46 CET 2021
+Processing storage reports for type: collection | Wed Mar 24 14:45:46 CET 2021
+Done. | Wed Mar 24 14:45:54 CET 2021
+Processing storage reports for type: community | Wed Mar 24 14:45:54 CET 2021
+Done. | Wed Mar 24 14:45:58 CET 2021
+Committing to Solr... | Wed Mar 24 14:45:58 CET 2021
+Done. | Wed Mar 24 14:45:59 CET 2021
+Successfully finished updating Solr Storage Reports | Wed Mar 24 14:45:59 CET 2021
+Run 1 — 2% — 100/4,824 docs — 3m 47s — 3m 47s
+Run 1 — 4% — 200/4,824 docs — 2s — 3m 50s
+Run 1 — 6% — 300/4,824 docs — 2s — 3m 53s
+Run 1 — 8% — 400/4,824 docs — 2s — 3m 55s
+Run 1 — 10% — 500/4,824 docs — 2s — 3m 58s
+Run 1 — 12% — 600/4,824 docs — 2s — 4m 1s
+Run 1 — 15% — 700/4,824 docs — 2s — 4m 3s
+Run 1 — 17% — 800/4,824 docs — 2s — 4m 6s
+Run 1 — 19% — 900/4,824 docs — 2s — 4m 9s
+Run 1 — 21% — 1,000/4,824 docs — 2s — 4m 11s
+Run 1 — 23% — 1,100/4,824 docs — 2s — 4m 14s
+Run 1 — 25% — 1,200/4,824 docs — 2s — 4m 16s
+Run 1 — 27% — 1,300/4,824 docs — 2s — 4m 19s
+Run 1 — 29% — 1,400/4,824 docs — 2s — 4m 22s
+Run 1 — 31% — 1,500/4,824 docs — 2s — 4m 24s
+Run 1 — 33% — 1,600/4,824 docs — 2s — 4m 27s
+Run 1 — 35% — 1,700/4,824 docs — 2s — 4m 29s
+Run 1 — 37% — 1,800/4,824 docs — 2s — 4m 32s
+Run 1 — 39% — 1,900/4,824 docs — 2s — 4m 35s
+Run 1 — 41% — 2,000/4,824 docs — 2s — 4m 37s
+Run 1 — 44% — 2,100/4,824 docs — 2s — 4m 40s
+Run 1 — 46% — 2,200/4,824 docs — 2s — 4m 42s
+Run 1 — 48% — 2,300/4,824 docs — 2s — 4m 45s
+Run 1 — 50% — 2,400/4,824 docs — 2s — 4m 48s
+Run 1 — 52% — 2,500/4,824 docs — 2s — 4m 50s
+Run 1 — 54% — 2,600/4,824 docs — 2s — 4m 53s
+Run 1 — 56% — 2,700/4,824 docs — 2s — 4m 55s
+Run 1 — 58% — 2,800/4,824 docs — 2s — 4m 58s
+Run 1 — 60% — 2,900/4,824 docs — 2s — 5m 1s
+Run 1 — 62% — 3,000/4,824 docs — 2s — 5m 3s
+Run 1 — 64% — 3,100/4,824 docs — 2s — 5m 6s
+Run 1 — 66% — 3,200/4,824 docs — 3s — 5m 9s
+Run 1 — 68% — 3,300/4,824 docs — 2s — 5m 12s
+Run 1 — 70% — 3,400/4,824 docs — 2s — 5m 14s
+Run 1 — 73% — 3,500/4,824 docs — 2s — 5m 17s
+Run 1 — 75% — 3,600/4,824 docs — 2s — 5m 20s
+Run 1 — 77% — 3,700/4,824 docs — 2s — 5m 22s
+Run 1 — 79% — 3,800/4,824 docs — 2s — 5m 25s
+Run 1 — 81% — 3,900/4,824 docs — 2s — 5m 27s
+Run 1 — 83% — 4,000/4,824 docs — 2s — 5m 30s
+Run 1 — 85% — 4,100/4,824 docs — 2s — 5m 33s
+Run 1 — 87% — 4,200/4,824 docs — 2s — 5m 35s
+Run 1 — 89% — 4,300/4,824 docs — 2s — 5m 38s
+Run 1 — 91% — 4,400/4,824 docs — 2s — 5m 41s
+Run 1 — 93% — 4,500/4,824 docs — 2s — 5m 43s
+Run 1 — 95% — 4,600/4,824 docs — 2s — 5m 46s
+Run 1 — 97% — 4,700/4,824 docs — 2s — 5m 49s
+Run 1 — 100% — 4,800/4,824 docs — 2s — 5m 51s
+Run 1 — 100% — 4,824/4,824 docs — 2s — 5m 53s
+Run 1 took 5m 53s
+
+
+**************************
+* Update Script Finished *
+**************************
+
openrxv-items-final
index on AReS Explorer and start a new harvest