diff --git a/content/posts/2020-02.md b/content/posts/2020-02.md index d16bd1158..273d1547a 100644 --- a/content/posts/2020-02.md +++ b/content/posts/2020-02.md @@ -251,4 +251,28 @@ $ ./check-spider-hits.sh -d -p -f /tmp/jersey -s statistics -u http://localhost: $ for year in 2018 2017 2016 2015; do ./check-spider-hits.sh -d -p -f /tmp/jersey -s "statistics-${year}" -u http://localhost:8081/solr; done ``` +- I noticed another user agen in the logs that we should add to the list: + +``` +ReactorNetty/0.9.2.RELEASE +``` + +- I made [an issue on the COUNTER-Robots repository](https://github.com/atmire/COUNTER-Robots/issues/31) +- I found a [nice tool for exporting and importing Solr records](https://github.com/freedev/solr-import-export-json) and it seems to workfor exporting our 2019 stats from the large statistics core! + +``` +$ ./run.sh -s http://localhost:8081/solr/statistics -a export -o /tmp/statistics-2019-01.json -f 'dateYearMonth:2019-01' -k uid +$ ls -lh /tmp/statistics-2019-01.json +-rw-rw-r-- 1 aorth aorth 3.7G Feb 6 09:26 /tmp/statistics-2019-01.json +``` + +- Then I tested importing this by creating a new core in my development environment: + +``` +$ curl 'http://localhost:8080/solr/admin/cores?action=CREATE&name=statistics-2019&instanceDir=/home/aorth/dspace63/solr/statistics&dataDir=/home/aorth/dspace63/solr/statistics-2019/data' +$ ./run.sh -s http://localhost:8080/solr/statistics-2019 -a import -o ~/Downloads/statistics-2019-01.json -k uid +``` + +- This imports the records into the core, but DSpace can't see them, and when I restart Tomcat the core is not seen by Solr... + diff --git a/docs/2020-02/index.html b/docs/2020-02/index.html index 2ed4a4746..3e4cecffe 100644 --- a/docs/2020-02/index.html +++ b/docs/2020-02/index.html @@ -20,7 +20,7 @@ The code finally builds and runs with a fresh install - + @@ -45,9 +45,9 @@ The code finally builds and runs with a fresh install "@type": "BlogPosting", "headline": "February, 2020", "url": "https:\/\/alanorth.github.io\/cgspace-notes\/2020-02\/", - "wordCount": "1801", + "wordCount": "1926", "datePublished": "2020-02-02T11:56:30+02:00", - "dateModified": "2020-02-05T18:58:04+02:00", + "dateModified": "2020-02-06T10:01:17+02:00", "author": { "@type": "Person", "name": "Alan Orth" @@ -378,7 +378,26 @@ dspace63=# \q
$ ./check-spider-hits.sh -d -p -f /tmp/jersey -s statistics -u http://localhost:8081/solr
 $ for year in 2018 2017 2016 2015; do ./check-spider-hits.sh -d -p -f /tmp/jersey -s "statistics-${year}" -u http://localhost:8081/solr; done
-
+ +
ReactorNetty/0.9.2.RELEASE
+
+
$ ./run.sh -s http://localhost:8081/solr/statistics -a export -o /tmp/statistics-2019-01.json -f 'dateYearMonth:2019-01' -k uid
+$ ls -lh /tmp/statistics-2019-01.json
+-rw-rw-r-- 1 aorth aorth 3.7G Feb  6 09:26 /tmp/statistics-2019-01.json
+
+
$ curl 'http://localhost:8080/solr/admin/cores?action=CREATE&name=statistics-2019&instanceDir=/home/aorth/dspace63/solr/statistics&dataDir=/home/aorth/dspace63/solr/statistics-2019/data'
+$ ./run.sh -s http://localhost:8080/solr/statistics-2019 -a import -o ~/Downloads/statistics-2019-01.json -k uid
+
+ diff --git a/docs/sitemap.xml b/docs/sitemap.xml index 0ed0ccbc2..289e122e7 100644 --- a/docs/sitemap.xml +++ b/docs/sitemap.xml @@ -4,27 +4,27 @@ https://alanorth.github.io/cgspace-notes/categories/ - 2020-02-05T18:58:04+02:00 + 2020-02-06T10:01:17+02:00 https://alanorth.github.io/cgspace-notes/ - 2020-02-05T18:58:04+02:00 + 2020-02-06T10:01:17+02:00 https://alanorth.github.io/cgspace-notes/2020-02/ - 2020-02-05T18:58:04+02:00 + 2020-02-06T10:01:17+02:00 https://alanorth.github.io/cgspace-notes/categories/notes/ - 2020-02-05T18:58:04+02:00 + 2020-02-06T10:01:17+02:00 https://alanorth.github.io/cgspace-notes/posts/ - 2020-02-05T18:58:04+02:00 + 2020-02-06T10:01:17+02:00