diff --git a/content/posts/2021-11.md b/content/posts/2021-11.md index 4189b7682..6f31babdd 100644 --- a/content/posts/2021-11.md +++ b/content/posts/2021-11.md @@ -1,12 +1,39 @@ --- title: "November, 2021" -date: 2021-11-01T11:14:07+03:00 +date: 2021-11-02T22:27:07+02:00 author: "Alan Orth" categories: ["Notes"] --- -## 2021-11-01 +## 2021-11-02 + +- I experimented with manually sharding the Solr statistics on DSpace Test +- First I exported all the 2019 stats from CGSpace: + +```console +$ ./run.sh -s http://localhost:8081/solr/statistics -f 'time:2019-*' -a export -o statistics-2019.json -k uid +$ zstd statistics-2019.json +``` +- Then on DSpace Test I created a `statistics-2019` core with the same instance dir as the main `statistics` core (as [illustrated in the DSpace docs](https://wiki.lyrasis.org/display/DSDOC6x/Testing+Solr+Shards)) + +```console +$ mkdir -p /home/dspacetest.cgiar.org/solr/statistics-2019/data +# create core in Solr admin +$ curl -s "http://localhost:8081/solr/statistics/update?softCommit=true" -H "Content-Type: text/xml" --data-binary "time:2019-*" +$ ./run.sh -s http://localhost:8081/solr/statistics-2019 -a import -o statistics-2019.json -k uid +``` + +- The key thing above is that you create the core in the Solr admin UI, but the data directory must already exist so you have to do that first in the file system +- I restarted the server after the import was done to see if the cores would come back up OK + - I remember last time I tried this the manually created statistics cores didn't come back up after I rebooted, but this time they did + +## 2021-11-03 + +- While inspecting the stats for the new statistics-2019 shard on DSpace Test I noticed that I can't find any stats via the DSpace Statistics API for an item that _should_ have some + - I checked on CGSpace's and I can't find them there either, but I see them in Solr when I query in the admin UI + - I need to debug that, but it doesn't seem to be related to the sharding... + diff --git a/docs/2021-11/index.html b/docs/2021-11/index.html index 3b15f060e..75ac9c104 100644 --- a/docs/2021-11/index.html +++ b/docs/2021-11/index.html @@ -7,17 +7,31 @@ - + - - + + - + @@ -28,9 +42,9 @@ "@type": "BlogPosting", "headline": "November, 2021", "url": "https://alanorth.github.io/cgspace-notes/2021-11/", - "wordCount": "1", - "datePublished": "2021-11-01T11:14:07+03:00", - "dateModified": "2021-11-01T11:14:07+03:00", + "wordCount": "238", + "datePublished": "2021-11-02T22:27:07+02:00", + "dateModified": "2021-11-01T10:49:21+02:00", "author": { "@type": "Person", "name": "Alan Orth" @@ -97,14 +111,44 @@

November, 2021

- + in  Notes

-

2021-11-01

+

2021-11-02

+ +
$ ./run.sh -s http://localhost:8081/solr/statistics -f 'time:2019-*' -a export -o statistics-2019.json -k uid
+$ zstd statistics-2019.json
+
+
$ mkdir -p /home/dspacetest.cgiar.org/solr/statistics-2019/data
+# create core in Solr admin
+$ curl -s "http://localhost:8081/solr/statistics/update?softCommit=true" -H "Content-Type: text/xml" --data-binary "<delete><query>time:2019-*</query></delete>"
+$ ./run.sh -s http://localhost:8081/solr/statistics-2019 -a import -o statistics-2019.json -k uid
+
+

2021-11-03

+ diff --git a/docs/categories/index.html b/docs/categories/index.html index 5767e1fb5..b172252f8 100644 --- a/docs/categories/index.html +++ b/docs/categories/index.html @@ -10,7 +10,7 @@ - + @@ -84,7 +84,7 @@

Notes

- +
Read more → diff --git a/docs/categories/index.xml b/docs/categories/index.xml index c4f3adb53..a0df5a095 100644 --- a/docs/categories/index.xml +++ b/docs/categories/index.xml @@ -6,11 +6,11 @@ Recent content in Categories on CGSpace Notes Hugo -- gohugo.io en-us - Mon, 01 Nov 2021 11:14:07 +0300 + Tue, 02 Nov 2021 22:27:07 +0200 Notes https://alanorth.github.io/cgspace-notes/categories/notes/ - Mon, 01 Nov 2021 11:14:07 +0300 + Tue, 02 Nov 2021 22:27:07 +0200 https://alanorth.github.io/cgspace-notes/categories/notes/ diff --git a/docs/categories/notes/index.html b/docs/categories/notes/index.html index 39083d511..2068c03fb 100644 --- a/docs/categories/notes/index.html +++ b/docs/categories/notes/index.html @@ -10,7 +10,7 @@ - + @@ -84,13 +84,20 @@

November, 2021

-
-

2021-11-01

+

2021-11-02

+
    +
  • I experimented with manually sharding the Solr statistics on DSpace Test
  • +
  • First I exported all the 2019 stats from CGSpace:
  • +
+
$ ./run.sh -s http://localhost:8081/solr/statistics -f 'time:2019-*' -a export -o statistics-2019.json -k uid
+$ zstd statistics-2019.json
+
Read more →
diff --git a/docs/categories/notes/index.xml b/docs/categories/notes/index.xml index 46e79e1ca..eba7dec68 100644 --- a/docs/categories/notes/index.xml +++ b/docs/categories/notes/index.xml @@ -6,14 +6,21 @@ Recent content in Notes on CGSpace Notes Hugo -- gohugo.io en-us - Mon, 01 Nov 2021 11:14:07 +0300 + Tue, 02 Nov 2021 22:27:07 +0200 November, 2021 https://alanorth.github.io/cgspace-notes/2021-11/ - Mon, 01 Nov 2021 11:14:07 +0300 + Tue, 02 Nov 2021 22:27:07 +0200 https://alanorth.github.io/cgspace-notes/2021-11/ - <h2 id="2021-11-01">2021-11-01</h2> + <h2 id="2021-11-02">2021-11-02</h2> +<ul> +<li>I experimented with manually sharding the Solr statistics on DSpace Test</li> +<li>First I exported all the 2019 stats from CGSpace:</li> +</ul> +<pre tabindex="0"><code class="language-console" data-lang="console">$ ./run.sh -s http://localhost:8081/solr/statistics -f 'time:2019-*' -a export -o statistics-2019.json -k uid +$ zstd statistics-2019.json +</code></pre> diff --git a/docs/categories/notes/page/2/index.html b/docs/categories/notes/page/2/index.html index 6cf95c5f5..a55d9c86a 100644 --- a/docs/categories/notes/page/2/index.html +++ b/docs/categories/notes/page/2/index.html @@ -10,7 +10,7 @@ - + diff --git a/docs/categories/notes/page/3/index.html b/docs/categories/notes/page/3/index.html index 7e60e8160..8fd006ff7 100644 --- a/docs/categories/notes/page/3/index.html +++ b/docs/categories/notes/page/3/index.html @@ -10,7 +10,7 @@ - + diff --git a/docs/categories/notes/page/4/index.html b/docs/categories/notes/page/4/index.html index eb969e601..38d341bc2 100644 --- a/docs/categories/notes/page/4/index.html +++ b/docs/categories/notes/page/4/index.html @@ -10,7 +10,7 @@ - + diff --git a/docs/categories/notes/page/5/index.html b/docs/categories/notes/page/5/index.html index e00b5a677..766206779 100644 --- a/docs/categories/notes/page/5/index.html +++ b/docs/categories/notes/page/5/index.html @@ -10,7 +10,7 @@ - + diff --git a/docs/categories/notes/page/6/index.html b/docs/categories/notes/page/6/index.html index 886091a6d..3f594d8ff 100644 --- a/docs/categories/notes/page/6/index.html +++ b/docs/categories/notes/page/6/index.html @@ -10,7 +10,7 @@ - + diff --git a/docs/index.html b/docs/index.html index da15db85a..d6497a8ef 100644 --- a/docs/index.html +++ b/docs/index.html @@ -10,7 +10,7 @@ - + @@ -31,7 +31,7 @@ "@type": "Person", "name": "Alan Orth" }, - "dateModified": "2021-11-01T11:14:07+03:00", + "dateModified": "2021-11-02T22:27:07+02:00", "keywords": "notes, migration, notes", "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." } @@ -99,13 +99,20 @@

November, 2021

-
-

2021-11-01

+

2021-11-02

+
    +
  • I experimented with manually sharding the Solr statistics on DSpace Test
  • +
  • First I exported all the 2019 stats from CGSpace:
  • +
+
$ ./run.sh -s http://localhost:8081/solr/statistics -f 'time:2019-*' -a export -o statistics-2019.json -k uid
+$ zstd statistics-2019.json
+
Read more →
diff --git a/docs/index.xml b/docs/index.xml index d87bbc619..fb8ab83ff 100644 --- a/docs/index.xml +++ b/docs/index.xml @@ -6,14 +6,21 @@ Recent content on CGSpace Notes Hugo -- gohugo.io en-us - Mon, 01 Nov 2021 11:14:07 +0300 + Tue, 02 Nov 2021 22:27:07 +0200 November, 2021 https://alanorth.github.io/cgspace-notes/2021-11/ - Mon, 01 Nov 2021 11:14:07 +0300 + Tue, 02 Nov 2021 22:27:07 +0200 https://alanorth.github.io/cgspace-notes/2021-11/ - <h2 id="2021-11-01">2021-11-01</h2> + <h2 id="2021-11-02">2021-11-02</h2> +<ul> +<li>I experimented with manually sharding the Solr statistics on DSpace Test</li> +<li>First I exported all the 2019 stats from CGSpace:</li> +</ul> +<pre tabindex="0"><code class="language-console" data-lang="console">$ ./run.sh -s http://localhost:8081/solr/statistics -f 'time:2019-*' -a export -o statistics-2019.json -k uid +$ zstd statistics-2019.json +</code></pre> diff --git a/docs/page/2/index.html b/docs/page/2/index.html index 29bca9464..908260e67 100644 --- a/docs/page/2/index.html +++ b/docs/page/2/index.html @@ -10,7 +10,7 @@ - + @@ -31,7 +31,7 @@ "@type": "Person", "name": "Alan Orth" }, - "dateModified": "2021-11-01T11:14:07+03:00", + "dateModified": "2021-11-02T22:27:07+02:00", "keywords": "notes, migration, notes", "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." } diff --git a/docs/page/3/index.html b/docs/page/3/index.html index c1d448a55..d11c95475 100644 --- a/docs/page/3/index.html +++ b/docs/page/3/index.html @@ -10,7 +10,7 @@ - + @@ -31,7 +31,7 @@ "@type": "Person", "name": "Alan Orth" }, - "dateModified": "2021-11-01T11:14:07+03:00", + "dateModified": "2021-11-02T22:27:07+02:00", "keywords": "notes, migration, notes", "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." } diff --git a/docs/page/4/index.html b/docs/page/4/index.html index 1b43330d6..38c254a80 100644 --- a/docs/page/4/index.html +++ b/docs/page/4/index.html @@ -10,7 +10,7 @@ - + @@ -31,7 +31,7 @@ "@type": "Person", "name": "Alan Orth" }, - "dateModified": "2021-11-01T11:14:07+03:00", + "dateModified": "2021-11-02T22:27:07+02:00", "keywords": "notes, migration, notes", "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." } diff --git a/docs/page/5/index.html b/docs/page/5/index.html index 4fdd4f60b..4ca901ee7 100644 --- a/docs/page/5/index.html +++ b/docs/page/5/index.html @@ -10,7 +10,7 @@ - + @@ -31,7 +31,7 @@ "@type": "Person", "name": "Alan Orth" }, - "dateModified": "2021-11-01T11:14:07+03:00", + "dateModified": "2021-11-02T22:27:07+02:00", "keywords": "notes, migration, notes", "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." } diff --git a/docs/page/6/index.html b/docs/page/6/index.html index e597f9b79..6f7e1fecf 100644 --- a/docs/page/6/index.html +++ b/docs/page/6/index.html @@ -10,7 +10,7 @@ - + @@ -31,7 +31,7 @@ "@type": "Person", "name": "Alan Orth" }, - "dateModified": "2021-11-01T11:14:07+03:00", + "dateModified": "2021-11-02T22:27:07+02:00", "keywords": "notes, migration, notes", "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." } diff --git a/docs/page/7/index.html b/docs/page/7/index.html index e0f352da9..274a322ec 100644 --- a/docs/page/7/index.html +++ b/docs/page/7/index.html @@ -10,7 +10,7 @@ - + @@ -31,7 +31,7 @@ "@type": "Person", "name": "Alan Orth" }, - "dateModified": "2021-11-01T11:14:07+03:00", + "dateModified": "2021-11-02T22:27:07+02:00", "keywords": "notes, migration, notes", "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." } diff --git a/docs/page/8/index.html b/docs/page/8/index.html index 89e8ddde8..ca5460e33 100644 --- a/docs/page/8/index.html +++ b/docs/page/8/index.html @@ -10,7 +10,7 @@ - + @@ -31,7 +31,7 @@ "@type": "Person", "name": "Alan Orth" }, - "dateModified": "2021-11-01T11:14:07+03:00", + "dateModified": "2021-11-02T22:27:07+02:00", "keywords": "notes, migration, notes", "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." } diff --git a/docs/posts/index.html b/docs/posts/index.html index f40aaca3f..06b3378ef 100644 --- a/docs/posts/index.html +++ b/docs/posts/index.html @@ -10,7 +10,7 @@ - + @@ -31,7 +31,7 @@ "@type": "Person", "name": "Alan Orth" }, - "dateModified": "2021-11-01T11:14:07+03:00", + "dateModified": "2021-11-02T22:27:07+02:00", "keywords": "notes, migration, notes", "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." } @@ -99,13 +99,20 @@

November, 2021

-
-

2021-11-01

+

2021-11-02

+
    +
  • I experimented with manually sharding the Solr statistics on DSpace Test
  • +
  • First I exported all the 2019 stats from CGSpace:
  • +
+
$ ./run.sh -s http://localhost:8081/solr/statistics -f 'time:2019-*' -a export -o statistics-2019.json -k uid
+$ zstd statistics-2019.json
+
Read more →
diff --git a/docs/posts/index.xml b/docs/posts/index.xml index d46969f48..dcbd0cf6b 100644 --- a/docs/posts/index.xml +++ b/docs/posts/index.xml @@ -6,14 +6,21 @@ Recent content in Posts on CGSpace Notes Hugo -- gohugo.io en-us - Mon, 01 Nov 2021 11:14:07 +0300 + Tue, 02 Nov 2021 22:27:07 +0200 November, 2021 https://alanorth.github.io/cgspace-notes/2021-11/ - Mon, 01 Nov 2021 11:14:07 +0300 + Tue, 02 Nov 2021 22:27:07 +0200 https://alanorth.github.io/cgspace-notes/2021-11/ - <h2 id="2021-11-01">2021-11-01</h2> + <h2 id="2021-11-02">2021-11-02</h2> +<ul> +<li>I experimented with manually sharding the Solr statistics on DSpace Test</li> +<li>First I exported all the 2019 stats from CGSpace:</li> +</ul> +<pre tabindex="0"><code class="language-console" data-lang="console">$ ./run.sh -s http://localhost:8081/solr/statistics -f 'time:2019-*' -a export -o statistics-2019.json -k uid +$ zstd statistics-2019.json +</code></pre> diff --git a/docs/posts/page/2/index.html b/docs/posts/page/2/index.html index f88a6cc72..5d412973c 100644 --- a/docs/posts/page/2/index.html +++ b/docs/posts/page/2/index.html @@ -10,7 +10,7 @@ - + @@ -31,7 +31,7 @@ "@type": "Person", "name": "Alan Orth" }, - "dateModified": "2021-11-01T11:14:07+03:00", + "dateModified": "2021-11-02T22:27:07+02:00", "keywords": "notes, migration, notes", "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." } diff --git a/docs/posts/page/3/index.html b/docs/posts/page/3/index.html index 8d69596a3..89369601e 100644 --- a/docs/posts/page/3/index.html +++ b/docs/posts/page/3/index.html @@ -10,7 +10,7 @@ - + @@ -31,7 +31,7 @@ "@type": "Person", "name": "Alan Orth" }, - "dateModified": "2021-11-01T11:14:07+03:00", + "dateModified": "2021-11-02T22:27:07+02:00", "keywords": "notes, migration, notes", "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." } diff --git a/docs/posts/page/4/index.html b/docs/posts/page/4/index.html index 7dcb2d4f0..b6b54cfb8 100644 --- a/docs/posts/page/4/index.html +++ b/docs/posts/page/4/index.html @@ -10,7 +10,7 @@ - + @@ -31,7 +31,7 @@ "@type": "Person", "name": "Alan Orth" }, - "dateModified": "2021-11-01T11:14:07+03:00", + "dateModified": "2021-11-02T22:27:07+02:00", "keywords": "notes, migration, notes", "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." } diff --git a/docs/posts/page/5/index.html b/docs/posts/page/5/index.html index 7b8a635a3..360347e8c 100644 --- a/docs/posts/page/5/index.html +++ b/docs/posts/page/5/index.html @@ -10,7 +10,7 @@ - + @@ -31,7 +31,7 @@ "@type": "Person", "name": "Alan Orth" }, - "dateModified": "2021-11-01T11:14:07+03:00", + "dateModified": "2021-11-02T22:27:07+02:00", "keywords": "notes, migration, notes", "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." } diff --git a/docs/posts/page/6/index.html b/docs/posts/page/6/index.html index 1a464ed92..69fbce8ad 100644 --- a/docs/posts/page/6/index.html +++ b/docs/posts/page/6/index.html @@ -10,7 +10,7 @@ - + @@ -31,7 +31,7 @@ "@type": "Person", "name": "Alan Orth" }, - "dateModified": "2021-11-01T11:14:07+03:00", + "dateModified": "2021-11-02T22:27:07+02:00", "keywords": "notes, migration, notes", "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." } diff --git a/docs/posts/page/7/index.html b/docs/posts/page/7/index.html index d9b7d57b6..a95b1aa3e 100644 --- a/docs/posts/page/7/index.html +++ b/docs/posts/page/7/index.html @@ -10,7 +10,7 @@ - + @@ -31,7 +31,7 @@ "@type": "Person", "name": "Alan Orth" }, - "dateModified": "2021-11-01T11:14:07+03:00", + "dateModified": "2021-11-02T22:27:07+02:00", "keywords": "notes, migration, notes", "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." } diff --git a/docs/posts/page/8/index.html b/docs/posts/page/8/index.html index ea80bd25b..0b5cf2c4c 100644 --- a/docs/posts/page/8/index.html +++ b/docs/posts/page/8/index.html @@ -10,7 +10,7 @@ - + @@ -31,7 +31,7 @@ "@type": "Person", "name": "Alan Orth" }, - "dateModified": "2021-11-01T11:14:07+03:00", + "dateModified": "2021-11-02T22:27:07+02:00", "keywords": "notes, migration, notes", "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." } diff --git a/docs/sitemap.xml b/docs/sitemap.xml index 2b84a11a8..2102d49f2 100644 --- a/docs/sitemap.xml +++ b/docs/sitemap.xml @@ -3,19 +3,19 @@ xmlns:xhtml="http://www.w3.org/1999/xhtml"> https://alanorth.github.io/cgspace-notes/categories/ - 2021-11-01T10:48:13+02:00 + 2021-11-01T10:49:21+02:00 https://alanorth.github.io/cgspace-notes/ - 2021-11-01T10:48:13+02:00 + 2021-11-01T10:49:21+02:00 https://alanorth.github.io/cgspace-notes/categories/notes/ - 2021-11-01T10:48:13+02:00 + 2021-11-01T10:49:21+02:00 https://alanorth.github.io/cgspace-notes/2021-11/ - 2021-11-01T11:14:07+03:00 + 2021-11-01T10:49:21+02:00 https://alanorth.github.io/cgspace-notes/posts/ - 2021-11-01T10:48:13+02:00 + 2021-11-01T10:49:21+02:00 https://alanorth.github.io/cgspace-notes/2021-10/ 2021-11-01T10:48:13+02:00