Add notes for 2021-11-02

This commit is contained in:
Alan Orth 2021-11-03 15:56:15 +02:00
parent b04ec94cbe
commit 2ca9096495
Signed by: alanorth
GPG Key ID: 0FB860CC9C45B1B9
30 changed files with 186 additions and 73 deletions

View File

@ -1,12 +1,39 @@
--- ---
title: "November, 2021" title: "November, 2021"
date: 2021-11-01T11:14:07+03:00 date: 2021-11-02T22:27:07+02:00
author: "Alan Orth" author: "Alan Orth"
categories: ["Notes"] categories: ["Notes"]
--- ---
## 2021-11-01 ## 2021-11-02
- I experimented with manually sharding the Solr statistics on DSpace Test
- First I exported all the 2019 stats from CGSpace:
```console
$ ./run.sh -s http://localhost:8081/solr/statistics -f 'time:2019-*' -a export -o statistics-2019.json -k uid
$ zstd statistics-2019.json
```
<!--more--> <!--more-->
- Then on DSpace Test I created a `statistics-2019` core with the same instance dir as the main `statistics` core (as [illustrated in the DSpace docs](https://wiki.lyrasis.org/display/DSDOC6x/Testing+Solr+Shards))
```console
$ mkdir -p /home/dspacetest.cgiar.org/solr/statistics-2019/data
# create core in Solr admin
$ curl -s "http://localhost:8081/solr/statistics/update?softCommit=true" -H "Content-Type: text/xml" --data-binary "<delete><query>time:2019-*</query></delete>"
$ ./run.sh -s http://localhost:8081/solr/statistics-2019 -a import -o statistics-2019.json -k uid
```
- The key thing above is that you create the core in the Solr admin UI, but the data directory must already exist so you have to do that first in the file system
- I restarted the server after the import was done to see if the cores would come back up OK
- I remember last time I tried this the manually created statistics cores didn't come back up after I rebooted, but this time they did
## 2021-11-03
- While inspecting the stats for the new statistics-2019 shard on DSpace Test I noticed that I can't find any stats via the DSpace Statistics API for an item that _should_ have some
- I checked on CGSpace's and I can't find them there either, but I see them in Solr when I query in the admin UI
- I need to debug that, but it doesn't seem to be related to the sharding...
<!-- vim: set sw=2 ts=2: --> <!-- vim: set sw=2 ts=2: -->

View File

@ -7,17 +7,31 @@
<meta property="og:title" content="November, 2021" /> <meta property="og:title" content="November, 2021" />
<meta property="og:description" content="2021-11-01" /> <meta property="og:description" content="2021-11-02
I experimented with manually sharding the Solr statistics on DSpace Test
First I exported all the 2019 stats from CGSpace:
$ ./run.sh -s http://localhost:8081/solr/statistics -f &#39;time:2019-*&#39; -a export -o statistics-2019.json -k uid
$ zstd statistics-2019.json
" />
<meta property="og:type" content="article" /> <meta property="og:type" content="article" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2021-11/" /> <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2021-11/" />
<meta property="article:published_time" content="2021-11-01T11:14:07+03:00" /> <meta property="article:published_time" content="2021-11-02T22:27:07+02:00" />
<meta property="article:modified_time" content="2021-11-01T11:14:07+03:00" /> <meta property="article:modified_time" content="2021-11-01T10:49:21+02:00" />
<meta name="twitter:card" content="summary"/> <meta name="twitter:card" content="summary"/>
<meta name="twitter:title" content="November, 2021"/> <meta name="twitter:title" content="November, 2021"/>
<meta name="twitter:description" content="2021-11-01"/> <meta name="twitter:description" content="2021-11-02
I experimented with manually sharding the Solr statistics on DSpace Test
First I exported all the 2019 stats from CGSpace:
$ ./run.sh -s http://localhost:8081/solr/statistics -f &#39;time:2019-*&#39; -a export -o statistics-2019.json -k uid
$ zstd statistics-2019.json
"/>
<meta name="generator" content="Hugo 0.88.1" /> <meta name="generator" content="Hugo 0.88.1" />
@ -28,9 +42,9 @@
"@type": "BlogPosting", "@type": "BlogPosting",
"headline": "November, 2021", "headline": "November, 2021",
"url": "https://alanorth.github.io/cgspace-notes/2021-11/", "url": "https://alanorth.github.io/cgspace-notes/2021-11/",
"wordCount": "1", "wordCount": "238",
"datePublished": "2021-11-01T11:14:07+03:00", "datePublished": "2021-11-02T22:27:07+02:00",
"dateModified": "2021-11-01T11:14:07+03:00", "dateModified": "2021-11-01T10:49:21+02:00",
"author": { "author": {
"@type": "Person", "@type": "Person",
"name": "Alan Orth" "name": "Alan Orth"
@ -97,14 +111,44 @@
<header> <header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2021-11/">November, 2021</a></h2> <h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2021-11/">November, 2021</a></h2>
<p class="blog-post-meta"> <p class="blog-post-meta">
<time datetime="2021-11-01T11:14:07+03:00">Mon Nov 01, 2021</time> <time datetime="2021-11-02T22:27:07+02:00">Tue Nov 02, 2021</time>
in in
<span class="fas fa-folder" aria-hidden="true"></span>&nbsp;<a href="/cgspace-notes/categories/notes/" rel="category tag">Notes</a> <span class="fas fa-folder" aria-hidden="true"></span>&nbsp;<a href="/cgspace-notes/categories/notes/" rel="category tag">Notes</a>
</p> </p>
</header> </header>
<h2 id="2021-11-01">2021-11-01</h2> <h2 id="2021-11-02">2021-11-02</h2>
<ul>
<li>I experimented with manually sharding the Solr statistics on DSpace Test</li>
<li>First I exported all the 2019 stats from CGSpace:</li>
</ul>
<pre tabindex="0"><code class="language-console" data-lang="console">$ ./run.sh -s http://localhost:8081/solr/statistics -f 'time:2019-*' -a export -o statistics-2019.json -k uid
$ zstd statistics-2019.json
</code></pre><ul>
<li>Then on DSpace Test I created a <code>statistics-2019</code> core with the same instance dir as the main <code>statistics</code> core (as <a href="https://wiki.lyrasis.org/display/DSDOC6x/Testing+Solr+Shards">illustrated in the DSpace docs</a>)</li>
</ul>
<pre tabindex="0"><code class="language-console" data-lang="console">$ mkdir -p /home/dspacetest.cgiar.org/solr/statistics-2019/data
# create core in Solr admin
$ curl -s &quot;http://localhost:8081/solr/statistics/update?softCommit=true&quot; -H &quot;Content-Type: text/xml&quot; --data-binary &quot;&lt;delete&gt;&lt;query&gt;time:2019-*&lt;/query&gt;&lt;/delete&gt;&quot;
$ ./run.sh -s http://localhost:8081/solr/statistics-2019 -a import -o statistics-2019.json -k uid
</code></pre><ul>
<li>The key thing above is that you create the core in the Solr admin UI, but the data directory must already exist so you have to do that first in the file system</li>
<li>I restarted the server after the import was done to see if the cores would come back up OK
<ul>
<li>I remember last time I tried this the manually created statistics cores didn&rsquo;t come back up after I rebooted, but this time they did</li>
</ul>
</li>
</ul>
<h2 id="2021-11-03">2021-11-03</h2>
<ul>
<li>While inspecting the stats for the new statistics-2019 shard on DSpace Test I noticed that I can&rsquo;t find any stats via the DSpace Statistics API for an item that <em>should</em> have some
<ul>
<li>I checked on CGSpace&rsquo;s and I can&rsquo;t find them there either, but I see them in Solr when I query in the admin UI</li>
<li>I need to debug that, but it doesn&rsquo;t seem to be related to the sharding&hellip;</li>
</ul>
</li>
</ul>
<!-- raw HTML omitted --> <!-- raw HTML omitted -->

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." /> <meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" /> <meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/" /> <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/" />
<meta property="og:updated_time" content="2021-11-01T10:48:13+02:00" /> <meta property="og:updated_time" content="2021-11-01T10:49:21+02:00" />
@ -84,7 +84,7 @@
<article class="blog-post"> <article class="blog-post">
<header> <header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/categories/notes/">Notes</a></h2> <h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/categories/notes/">Notes</a></h2>
<p class="blog-post-meta"><time datetime="2021-11-01T11:14:07+03:00">Mon Nov 01, 2021</time> by Alan Orth</p> <p class="blog-post-meta"><time datetime="2021-11-02T22:27:07+02:00">Tue Nov 02, 2021</time> by Alan Orth</p>
</header> </header>
<a href='https://alanorth.github.io/cgspace-notes/categories/notes/'>Read more →</a> <a href='https://alanorth.github.io/cgspace-notes/categories/notes/'>Read more →</a>

View File

@ -6,11 +6,11 @@
<description>Recent content in Categories on CGSpace Notes</description> <description>Recent content in Categories on CGSpace Notes</description>
<generator>Hugo -- gohugo.io</generator> <generator>Hugo -- gohugo.io</generator>
<language>en-us</language> <language>en-us</language>
<lastBuildDate>Mon, 01 Nov 2021 11:14:07 +0300</lastBuildDate><atom:link href="https://alanorth.github.io/cgspace-notes/categories/index.xml" rel="self" type="application/rss+xml" /> <lastBuildDate>Tue, 02 Nov 2021 22:27:07 +0200</lastBuildDate><atom:link href="https://alanorth.github.io/cgspace-notes/categories/index.xml" rel="self" type="application/rss+xml" />
<item> <item>
<title>Notes</title> <title>Notes</title>
<link>https://alanorth.github.io/cgspace-notes/categories/notes/</link> <link>https://alanorth.github.io/cgspace-notes/categories/notes/</link>
<pubDate>Mon, 01 Nov 2021 11:14:07 +0300</pubDate> <pubDate>Tue, 02 Nov 2021 22:27:07 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/categories/notes/</guid> <guid>https://alanorth.github.io/cgspace-notes/categories/notes/</guid>
<description></description> <description></description>

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." /> <meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" /> <meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" /> <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2021-11-01T10:48:13+02:00" /> <meta property="og:updated_time" content="2021-11-01T10:49:21+02:00" />
@ -84,13 +84,20 @@
<article class="blog-post"> <article class="blog-post">
<header> <header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2021-11/">November, 2021</a></h2> <h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2021-11/">November, 2021</a></h2>
<p class="blog-post-meta"><time datetime="2021-11-01T11:14:07+03:00">Mon Nov 01, 2021</time> by Alan Orth in <p class="blog-post-meta"><time datetime="2021-11-02T22:27:07+02:00">Tue Nov 02, 2021</time> by Alan Orth in
<span class="fas fa-folder" aria-hidden="true"></span>&nbsp;<a href="/cgspace-notes/categories/notes/" rel="category tag">Notes</a> <span class="fas fa-folder" aria-hidden="true"></span>&nbsp;<a href="/cgspace-notes/categories/notes/" rel="category tag">Notes</a>
</p> </p>
</header> </header>
<h2 id="2021-11-01">2021-11-01</h2> <h2 id="2021-11-02">2021-11-02</h2>
<ul>
<li>I experimented with manually sharding the Solr statistics on DSpace Test</li>
<li>First I exported all the 2019 stats from CGSpace:</li>
</ul>
<pre tabindex="0"><code class="language-console" data-lang="console">$ ./run.sh -s http://localhost:8081/solr/statistics -f 'time:2019-*' -a export -o statistics-2019.json -k uid
$ zstd statistics-2019.json
</code></pre>
<a href='https://alanorth.github.io/cgspace-notes/2021-11/'>Read more →</a> <a href='https://alanorth.github.io/cgspace-notes/2021-11/'>Read more →</a>
</article> </article>

View File

@ -6,14 +6,21 @@
<description>Recent content in Notes on CGSpace Notes</description> <description>Recent content in Notes on CGSpace Notes</description>
<generator>Hugo -- gohugo.io</generator> <generator>Hugo -- gohugo.io</generator>
<language>en-us</language> <language>en-us</language>
<lastBuildDate>Mon, 01 Nov 2021 11:14:07 +0300</lastBuildDate><atom:link href="https://alanorth.github.io/cgspace-notes/categories/notes/index.xml" rel="self" type="application/rss+xml" /> <lastBuildDate>Tue, 02 Nov 2021 22:27:07 +0200</lastBuildDate><atom:link href="https://alanorth.github.io/cgspace-notes/categories/notes/index.xml" rel="self" type="application/rss+xml" />
<item> <item>
<title>November, 2021</title> <title>November, 2021</title>
<link>https://alanorth.github.io/cgspace-notes/2021-11/</link> <link>https://alanorth.github.io/cgspace-notes/2021-11/</link>
<pubDate>Mon, 01 Nov 2021 11:14:07 +0300</pubDate> <pubDate>Tue, 02 Nov 2021 22:27:07 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2021-11/</guid> <guid>https://alanorth.github.io/cgspace-notes/2021-11/</guid>
<description>&lt;h2 id=&#34;2021-11-01&#34;&gt;2021-11-01&lt;/h2&gt;</description> <description>&lt;h2 id=&#34;2021-11-02&#34;&gt;2021-11-02&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;I experimented with manually sharding the Solr statistics on DSpace Test&lt;/li&gt;
&lt;li&gt;First I exported all the 2019 stats from CGSpace:&lt;/li&gt;
&lt;/ul&gt;
&lt;pre tabindex=&#34;0&#34;&gt;&lt;code class=&#34;language-console&#34; data-lang=&#34;console&#34;&gt;$ ./run.sh -s http://localhost:8081/solr/statistics -f &#39;time:2019-*&#39; -a export -o statistics-2019.json -k uid
$ zstd statistics-2019.json
&lt;/code&gt;&lt;/pre&gt;</description>
</item> </item>
<item> <item>

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." /> <meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" /> <meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" /> <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2021-11-01T10:48:13+02:00" /> <meta property="og:updated_time" content="2021-11-01T10:49:21+02:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." /> <meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" /> <meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" /> <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2021-11-01T10:48:13+02:00" /> <meta property="og:updated_time" content="2021-11-01T10:49:21+02:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." /> <meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" /> <meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" /> <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2021-11-01T10:48:13+02:00" /> <meta property="og:updated_time" content="2021-11-01T10:49:21+02:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." /> <meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" /> <meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" /> <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2021-11-01T10:48:13+02:00" /> <meta property="og:updated_time" content="2021-11-01T10:49:21+02:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." /> <meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" /> <meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" /> <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
<meta property="og:updated_time" content="2021-11-01T10:48:13+02:00" /> <meta property="og:updated_time" content="2021-11-01T10:49:21+02:00" />

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." /> <meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" /> <meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" /> <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2021-11-01T10:48:13+02:00" /> <meta property="og:updated_time" content="2021-11-01T10:49:21+02:00" />
@ -31,7 +31,7 @@
"@type": "Person", "@type": "Person",
"name": "Alan Orth" "name": "Alan Orth"
}, },
"dateModified": "2021-11-01T11:14:07+03:00", "dateModified": "2021-11-02T22:27:07+02:00",
"keywords": "notes, migration, notes", "keywords": "notes, migration, notes",
"description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."
} }
@ -99,13 +99,20 @@
<article class="blog-post"> <article class="blog-post">
<header> <header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2021-11/">November, 2021</a></h2> <h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2021-11/">November, 2021</a></h2>
<p class="blog-post-meta"><time datetime="2021-11-01T11:14:07+03:00">Mon Nov 01, 2021</time> by Alan Orth in <p class="blog-post-meta"><time datetime="2021-11-02T22:27:07+02:00">Tue Nov 02, 2021</time> by Alan Orth in
<span class="fas fa-folder" aria-hidden="true"></span>&nbsp;<a href="/cgspace-notes/categories/notes/" rel="category tag">Notes</a> <span class="fas fa-folder" aria-hidden="true"></span>&nbsp;<a href="/cgspace-notes/categories/notes/" rel="category tag">Notes</a>
</p> </p>
</header> </header>
<h2 id="2021-11-01">2021-11-01</h2> <h2 id="2021-11-02">2021-11-02</h2>
<ul>
<li>I experimented with manually sharding the Solr statistics on DSpace Test</li>
<li>First I exported all the 2019 stats from CGSpace:</li>
</ul>
<pre tabindex="0"><code class="language-console" data-lang="console">$ ./run.sh -s http://localhost:8081/solr/statistics -f 'time:2019-*' -a export -o statistics-2019.json -k uid
$ zstd statistics-2019.json
</code></pre>
<a href='https://alanorth.github.io/cgspace-notes/2021-11/'>Read more →</a> <a href='https://alanorth.github.io/cgspace-notes/2021-11/'>Read more →</a>
</article> </article>

View File

@ -6,14 +6,21 @@
<description>Recent content on CGSpace Notes</description> <description>Recent content on CGSpace Notes</description>
<generator>Hugo -- gohugo.io</generator> <generator>Hugo -- gohugo.io</generator>
<language>en-us</language> <language>en-us</language>
<lastBuildDate>Mon, 01 Nov 2021 11:14:07 +0300</lastBuildDate><atom:link href="https://alanorth.github.io/cgspace-notes/index.xml" rel="self" type="application/rss+xml" /> <lastBuildDate>Tue, 02 Nov 2021 22:27:07 +0200</lastBuildDate><atom:link href="https://alanorth.github.io/cgspace-notes/index.xml" rel="self" type="application/rss+xml" />
<item> <item>
<title>November, 2021</title> <title>November, 2021</title>
<link>https://alanorth.github.io/cgspace-notes/2021-11/</link> <link>https://alanorth.github.io/cgspace-notes/2021-11/</link>
<pubDate>Mon, 01 Nov 2021 11:14:07 +0300</pubDate> <pubDate>Tue, 02 Nov 2021 22:27:07 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2021-11/</guid> <guid>https://alanorth.github.io/cgspace-notes/2021-11/</guid>
<description>&lt;h2 id=&#34;2021-11-01&#34;&gt;2021-11-01&lt;/h2&gt;</description> <description>&lt;h2 id=&#34;2021-11-02&#34;&gt;2021-11-02&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;I experimented with manually sharding the Solr statistics on DSpace Test&lt;/li&gt;
&lt;li&gt;First I exported all the 2019 stats from CGSpace:&lt;/li&gt;
&lt;/ul&gt;
&lt;pre tabindex=&#34;0&#34;&gt;&lt;code class=&#34;language-console&#34; data-lang=&#34;console&#34;&gt;$ ./run.sh -s http://localhost:8081/solr/statistics -f &#39;time:2019-*&#39; -a export -o statistics-2019.json -k uid
$ zstd statistics-2019.json
&lt;/code&gt;&lt;/pre&gt;</description>
</item> </item>
<item> <item>

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." /> <meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" /> <meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" /> <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2021-11-01T10:48:13+02:00" /> <meta property="og:updated_time" content="2021-11-01T10:49:21+02:00" />
@ -31,7 +31,7 @@
"@type": "Person", "@type": "Person",
"name": "Alan Orth" "name": "Alan Orth"
}, },
"dateModified": "2021-11-01T11:14:07+03:00", "dateModified": "2021-11-02T22:27:07+02:00",
"keywords": "notes, migration, notes", "keywords": "notes, migration, notes",
"description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."
} }

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." /> <meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" /> <meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" /> <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2021-11-01T10:48:13+02:00" /> <meta property="og:updated_time" content="2021-11-01T10:49:21+02:00" />
@ -31,7 +31,7 @@
"@type": "Person", "@type": "Person",
"name": "Alan Orth" "name": "Alan Orth"
}, },
"dateModified": "2021-11-01T11:14:07+03:00", "dateModified": "2021-11-02T22:27:07+02:00",
"keywords": "notes, migration, notes", "keywords": "notes, migration, notes",
"description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."
} }

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." /> <meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" /> <meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" /> <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2021-11-01T10:48:13+02:00" /> <meta property="og:updated_time" content="2021-11-01T10:49:21+02:00" />
@ -31,7 +31,7 @@
"@type": "Person", "@type": "Person",
"name": "Alan Orth" "name": "Alan Orth"
}, },
"dateModified": "2021-11-01T11:14:07+03:00", "dateModified": "2021-11-02T22:27:07+02:00",
"keywords": "notes, migration, notes", "keywords": "notes, migration, notes",
"description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."
} }

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." /> <meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" /> <meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" /> <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2021-11-01T10:48:13+02:00" /> <meta property="og:updated_time" content="2021-11-01T10:49:21+02:00" />
@ -31,7 +31,7 @@
"@type": "Person", "@type": "Person",
"name": "Alan Orth" "name": "Alan Orth"
}, },
"dateModified": "2021-11-01T11:14:07+03:00", "dateModified": "2021-11-02T22:27:07+02:00",
"keywords": "notes, migration, notes", "keywords": "notes, migration, notes",
"description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."
} }

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." /> <meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" /> <meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" /> <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2021-11-01T10:48:13+02:00" /> <meta property="og:updated_time" content="2021-11-01T10:49:21+02:00" />
@ -31,7 +31,7 @@
"@type": "Person", "@type": "Person",
"name": "Alan Orth" "name": "Alan Orth"
}, },
"dateModified": "2021-11-01T11:14:07+03:00", "dateModified": "2021-11-02T22:27:07+02:00",
"keywords": "notes, migration, notes", "keywords": "notes, migration, notes",
"description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."
} }

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." /> <meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" /> <meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" /> <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2021-11-01T10:48:13+02:00" /> <meta property="og:updated_time" content="2021-11-01T10:49:21+02:00" />
@ -31,7 +31,7 @@
"@type": "Person", "@type": "Person",
"name": "Alan Orth" "name": "Alan Orth"
}, },
"dateModified": "2021-11-01T11:14:07+03:00", "dateModified": "2021-11-02T22:27:07+02:00",
"keywords": "notes, migration, notes", "keywords": "notes, migration, notes",
"description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."
} }

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." /> <meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" /> <meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" /> <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
<meta property="og:updated_time" content="2021-11-01T10:48:13+02:00" /> <meta property="og:updated_time" content="2021-11-01T10:49:21+02:00" />
@ -31,7 +31,7 @@
"@type": "Person", "@type": "Person",
"name": "Alan Orth" "name": "Alan Orth"
}, },
"dateModified": "2021-11-01T11:14:07+03:00", "dateModified": "2021-11-02T22:27:07+02:00",
"keywords": "notes, migration, notes", "keywords": "notes, migration, notes",
"description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."
} }

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." /> <meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" /> <meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" /> <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2021-11-01T10:48:13+02:00" /> <meta property="og:updated_time" content="2021-11-01T10:49:21+02:00" />
@ -31,7 +31,7 @@
"@type": "Person", "@type": "Person",
"name": "Alan Orth" "name": "Alan Orth"
}, },
"dateModified": "2021-11-01T11:14:07+03:00", "dateModified": "2021-11-02T22:27:07+02:00",
"keywords": "notes, migration, notes", "keywords": "notes, migration, notes",
"description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."
} }
@ -99,13 +99,20 @@
<article class="blog-post"> <article class="blog-post">
<header> <header>
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2021-11/">November, 2021</a></h2> <h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2021-11/">November, 2021</a></h2>
<p class="blog-post-meta"><time datetime="2021-11-01T11:14:07+03:00">Mon Nov 01, 2021</time> by Alan Orth in <p class="blog-post-meta"><time datetime="2021-11-02T22:27:07+02:00">Tue Nov 02, 2021</time> by Alan Orth in
<span class="fas fa-folder" aria-hidden="true"></span>&nbsp;<a href="/cgspace-notes/categories/notes/" rel="category tag">Notes</a> <span class="fas fa-folder" aria-hidden="true"></span>&nbsp;<a href="/cgspace-notes/categories/notes/" rel="category tag">Notes</a>
</p> </p>
</header> </header>
<h2 id="2021-11-01">2021-11-01</h2> <h2 id="2021-11-02">2021-11-02</h2>
<ul>
<li>I experimented with manually sharding the Solr statistics on DSpace Test</li>
<li>First I exported all the 2019 stats from CGSpace:</li>
</ul>
<pre tabindex="0"><code class="language-console" data-lang="console">$ ./run.sh -s http://localhost:8081/solr/statistics -f 'time:2019-*' -a export -o statistics-2019.json -k uid
$ zstd statistics-2019.json
</code></pre>
<a href='https://alanorth.github.io/cgspace-notes/2021-11/'>Read more →</a> <a href='https://alanorth.github.io/cgspace-notes/2021-11/'>Read more →</a>
</article> </article>

View File

@ -6,14 +6,21 @@
<description>Recent content in Posts on CGSpace Notes</description> <description>Recent content in Posts on CGSpace Notes</description>
<generator>Hugo -- gohugo.io</generator> <generator>Hugo -- gohugo.io</generator>
<language>en-us</language> <language>en-us</language>
<lastBuildDate>Mon, 01 Nov 2021 11:14:07 +0300</lastBuildDate><atom:link href="https://alanorth.github.io/cgspace-notes/posts/index.xml" rel="self" type="application/rss+xml" /> <lastBuildDate>Tue, 02 Nov 2021 22:27:07 +0200</lastBuildDate><atom:link href="https://alanorth.github.io/cgspace-notes/posts/index.xml" rel="self" type="application/rss+xml" />
<item> <item>
<title>November, 2021</title> <title>November, 2021</title>
<link>https://alanorth.github.io/cgspace-notes/2021-11/</link> <link>https://alanorth.github.io/cgspace-notes/2021-11/</link>
<pubDate>Mon, 01 Nov 2021 11:14:07 +0300</pubDate> <pubDate>Tue, 02 Nov 2021 22:27:07 +0200</pubDate>
<guid>https://alanorth.github.io/cgspace-notes/2021-11/</guid> <guid>https://alanorth.github.io/cgspace-notes/2021-11/</guid>
<description>&lt;h2 id=&#34;2021-11-01&#34;&gt;2021-11-01&lt;/h2&gt;</description> <description>&lt;h2 id=&#34;2021-11-02&#34;&gt;2021-11-02&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;I experimented with manually sharding the Solr statistics on DSpace Test&lt;/li&gt;
&lt;li&gt;First I exported all the 2019 stats from CGSpace:&lt;/li&gt;
&lt;/ul&gt;
&lt;pre tabindex=&#34;0&#34;&gt;&lt;code class=&#34;language-console&#34; data-lang=&#34;console&#34;&gt;$ ./run.sh -s http://localhost:8081/solr/statistics -f &#39;time:2019-*&#39; -a export -o statistics-2019.json -k uid
$ zstd statistics-2019.json
&lt;/code&gt;&lt;/pre&gt;</description>
</item> </item>
<item> <item>

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." /> <meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" /> <meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" /> <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2021-11-01T10:48:13+02:00" /> <meta property="og:updated_time" content="2021-11-01T10:49:21+02:00" />
@ -31,7 +31,7 @@
"@type": "Person", "@type": "Person",
"name": "Alan Orth" "name": "Alan Orth"
}, },
"dateModified": "2021-11-01T11:14:07+03:00", "dateModified": "2021-11-02T22:27:07+02:00",
"keywords": "notes, migration, notes", "keywords": "notes, migration, notes",
"description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."
} }

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." /> <meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" /> <meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" /> <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2021-11-01T10:48:13+02:00" /> <meta property="og:updated_time" content="2021-11-01T10:49:21+02:00" />
@ -31,7 +31,7 @@
"@type": "Person", "@type": "Person",
"name": "Alan Orth" "name": "Alan Orth"
}, },
"dateModified": "2021-11-01T11:14:07+03:00", "dateModified": "2021-11-02T22:27:07+02:00",
"keywords": "notes, migration, notes", "keywords": "notes, migration, notes",
"description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."
} }

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." /> <meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" /> <meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" /> <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2021-11-01T10:48:13+02:00" /> <meta property="og:updated_time" content="2021-11-01T10:49:21+02:00" />
@ -31,7 +31,7 @@
"@type": "Person", "@type": "Person",
"name": "Alan Orth" "name": "Alan Orth"
}, },
"dateModified": "2021-11-01T11:14:07+03:00", "dateModified": "2021-11-02T22:27:07+02:00",
"keywords": "notes, migration, notes", "keywords": "notes, migration, notes",
"description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."
} }

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." /> <meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" /> <meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" /> <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2021-11-01T10:48:13+02:00" /> <meta property="og:updated_time" content="2021-11-01T10:49:21+02:00" />
@ -31,7 +31,7 @@
"@type": "Person", "@type": "Person",
"name": "Alan Orth" "name": "Alan Orth"
}, },
"dateModified": "2021-11-01T11:14:07+03:00", "dateModified": "2021-11-02T22:27:07+02:00",
"keywords": "notes, migration, notes", "keywords": "notes, migration, notes",
"description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."
} }

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." /> <meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" /> <meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" /> <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2021-11-01T10:48:13+02:00" /> <meta property="og:updated_time" content="2021-11-01T10:49:21+02:00" />
@ -31,7 +31,7 @@
"@type": "Person", "@type": "Person",
"name": "Alan Orth" "name": "Alan Orth"
}, },
"dateModified": "2021-11-01T11:14:07+03:00", "dateModified": "2021-11-02T22:27:07+02:00",
"keywords": "notes, migration, notes", "keywords": "notes, migration, notes",
"description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."
} }

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." /> <meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" /> <meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" /> <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2021-11-01T10:48:13+02:00" /> <meta property="og:updated_time" content="2021-11-01T10:49:21+02:00" />
@ -31,7 +31,7 @@
"@type": "Person", "@type": "Person",
"name": "Alan Orth" "name": "Alan Orth"
}, },
"dateModified": "2021-11-01T11:14:07+03:00", "dateModified": "2021-11-02T22:27:07+02:00",
"keywords": "notes, migration, notes", "keywords": "notes, migration, notes",
"description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."
} }

View File

@ -10,7 +10,7 @@
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." /> <meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
<meta property="og:type" content="website" /> <meta property="og:type" content="website" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" /> <meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
<meta property="og:updated_time" content="2021-11-01T10:48:13+02:00" /> <meta property="og:updated_time" content="2021-11-01T10:49:21+02:00" />
@ -31,7 +31,7 @@
"@type": "Person", "@type": "Person",
"name": "Alan Orth" "name": "Alan Orth"
}, },
"dateModified": "2021-11-01T11:14:07+03:00", "dateModified": "2021-11-02T22:27:07+02:00",
"keywords": "notes, migration, notes", "keywords": "notes, migration, notes",
"description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." "description":"Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository."
} }

View File

@ -3,19 +3,19 @@
xmlns:xhtml="http://www.w3.org/1999/xhtml"> xmlns:xhtml="http://www.w3.org/1999/xhtml">
<url> <url>
<loc>https://alanorth.github.io/cgspace-notes/categories/</loc> <loc>https://alanorth.github.io/cgspace-notes/categories/</loc>
<lastmod>2021-11-01T10:48:13+02:00</lastmod> <lastmod>2021-11-01T10:49:21+02:00</lastmod>
</url><url> </url><url>
<loc>https://alanorth.github.io/cgspace-notes/</loc> <loc>https://alanorth.github.io/cgspace-notes/</loc>
<lastmod>2021-11-01T10:48:13+02:00</lastmod> <lastmod>2021-11-01T10:49:21+02:00</lastmod>
</url><url> </url><url>
<loc>https://alanorth.github.io/cgspace-notes/categories/notes/</loc> <loc>https://alanorth.github.io/cgspace-notes/categories/notes/</loc>
<lastmod>2021-11-01T10:48:13+02:00</lastmod> <lastmod>2021-11-01T10:49:21+02:00</lastmod>
</url><url> </url><url>
<loc>https://alanorth.github.io/cgspace-notes/2021-11/</loc> <loc>https://alanorth.github.io/cgspace-notes/2021-11/</loc>
<lastmod>2021-11-01T11:14:07+03:00</lastmod> <lastmod>2021-11-01T10:49:21+02:00</lastmod>
</url><url> </url><url>
<loc>https://alanorth.github.io/cgspace-notes/posts/</loc> <loc>https://alanorth.github.io/cgspace-notes/posts/</loc>
<lastmod>2021-11-01T10:48:13+02:00</lastmod> <lastmod>2021-11-01T10:49:21+02:00</lastmod>
</url><url> </url><url>
<loc>https://alanorth.github.io/cgspace-notes/2021-10/</loc> <loc>https://alanorth.github.io/cgspace-notes/2021-10/</loc>
<lastmod>2021-11-01T10:48:13+02:00</lastmod> <lastmod>2021-11-01T10:48:13+02:00</lastmod>