Add notes for 2021-01-18

2021-01-18 16:21:24 +02:00
parent 87fd47e7d2
commit 5c7d01cbc4
26 changed files with 173 additions and 45 deletions

@ -27,7 +27,7 @@ For example, this item has 51 views on CGSpace, but 0 on AReS
<meta property="og:type" content="article" />
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2021-01/" />
<meta property="article:published_time" content="2021-01-03T10:13:54+02:00" />
<meta property="article:modified_time" content="2021-01-13T15:44:47+02:00" />
<meta property="article:modified_time" content="2021-01-14T16:27:57+02:00" />
@ -60,9 +60,9 @@ For example, this item has 51 views on CGSpace, but 0 on AReS
"@type": "BlogPosting",
"headline": "January, 2021",
"url": "https://alanorth.github.io/cgspace-notes/2021-01/",
"wordCount": "1515",
"wordCount": "1834",
"datePublished": "2021-01-03T10:13:54+02:00",
"dateModified": "2021-01-13T15:44:47+02:00",
"dateModified": "2021-01-14T16:27:57+02:00",
"author": {
"@type": "Person",
"name": "Alan Orth"
@ -385,6 +385,74 @@ $ curl -XDELETE 'http://localhost:9200/openrxv-items-temp'
</ul>
</li>
</ul>
<h2 id="2021-01-17">2021-01-17</h2>
<ul>
<li>Start a re-index on AReS
<ul>
<li>First delete the old Elasticsearch temp index:</li>
</ul>
</li>
</ul>
<pre><code class="language-console" data-lang="console">$ curl -XDELETE 'http://localhost:9200/openrxv-items-temp'
# start indexing in AReS
</code></pre><ul>
<li>Then, the next morning when it&rsquo;s done, check the results of the harvesting, back up the current <code>openrxv-items</code> index, and clone the <code>openrxv-items-temp</code> index to <code>openrxv-items</code>:</li>
</ul>
<pre><code class="language-console" data-lang="console">$ curl -s 'http://localhost:9200/openrxv-items-temp/_count?q=*&amp;pretty'
{
&quot;count&quot; : 100540,
&quot;_shards&quot; : {
&quot;total&quot; : 1,
&quot;successful&quot; : 1,
&quot;skipped&quot; : 0,
&quot;failed&quot; : 0
}
}
$ curl -X PUT &quot;localhost:9200/openrxv-items/_settings&quot; -H 'Content-Type: application/json' -d'{&quot;settings&quot;: {&quot;index.blocks.write&quot;: true}}'
$ curl -s -X POST http://localhost:9200/openrxv-items/_clone/openrxv-items-2021-01-18
$ curl -XDELETE 'http://localhost:9200/openrxv-items'
$ curl -X PUT &quot;localhost:9200/openrxv-items-temp/_settings&quot; -H 'Content-Type: application/json' -d'{&quot;settings&quot;: {&quot;index.blocks.write&quot;: true}}'
$ curl -s -X POST http://localhost:9200/openrxv-items-temp/_clone/openrxv-items
$ curl -XDELETE 'http://localhost:9200/openrxv-items-temp'
$ curl -XDELETE 'http://localhost:9200/openrxv-items-2021-01-18'
</code></pre><h2 id="2021-01-18">2021-01-18</h2>
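<ul>
<li>The backup-and-swap steps used for the AReS re-index could be wrapped in a small script. This is only a sketch: the function names, <code>ES_URL</code>, and <code>DRY_RUN</code> are hypothetical, and unlike the manual commands it leaves the dated backup index in place to be deleted by hand after checking the new index:</li>
</ul>

```shell
#!/usr/bin/env bash
# Sketch only: helper names, ES_URL and DRY_RUN are illustrative assumptions.
ES_URL="${ES_URL:-http://localhost:9200}"
DRY_RUN="${DRY_RUN:-1}"   # 1 = print the curl commands instead of running them

# Run a command, or just print it when DRY_RUN=1 (printed form drops quoting).
run() {
    if [ "$DRY_RUN" = "1" ]; then
        printf '%s\n' "$*"
    else
        "$@"
    fi
}

# Block writes on an index; Elasticsearch requires this before a _clone.
block_writes() {
    run curl -s -X PUT "$ES_URL/$1/_settings" \
        -H 'Content-Type: application/json' \
        -d '{"settings": {"index.blocks.write": true}}'
}

# Clone index $1 to a new index named $2.
clone_index() {
    run curl -s -X POST "$ES_URL/$1/_clone/$2"
}

delete_index() {
    run curl -s -X DELETE "$ES_URL/$1"
}

# Back up openrxv-items under today's date, then replace it with the temp index.
swap_indexes() {
    local backup
    backup="openrxv-items-$(date +%F)"
    block_writes openrxv-items
    clone_index openrxv-items "$backup"
    delete_index openrxv-items
    block_writes openrxv-items-temp
    clone_index openrxv-items-temp openrxv-items
    delete_index openrxv-items-temp
    # The dated backup is kept; delete it manually once the new index looks right.
}
```

<ul>
<li>Running <code>swap_indexes</code> with the default <code>DRY_RUN=1</code> only prints the six curl commands for review; <code>DRY_RUN=0 swap_indexes</code> would execute them</li>
</ul>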
<ul>
<li>Finish the indexing on AReS that I started yesterday</li>
<li>Udana from IWMI emailed me to ask why <code>iwmi.csv</code> doesn&rsquo;t include the items he approved on CGSpace this morning
<ul>
<li>I told him it is generated every Sunday night</li>
<li>I regenerated the file manually for him</li>
<li>I adjusted the script to run on Monday and Friday</li>
</ul>
</li>
<li>Meeting with Peter and Abenet about CG Core v2
<ul>
<li>We also need to remove CTA and CPWF subjects from the input form since they are both closed now and no longer submitting items</li>
<li>Peter also wants to create new fields on CGSpace for the SDGs and CGIAR Impact Areas
<ul>
<li>I suggested <code>cg.subject.sdg</code> and <code>cg.subject.impactArea</code></li>
</ul>
</li>
<li>We also agreed to remove the following fields:
<ul>
<li>cg.livestock.agegroup</li>
<li>cg.livestock.function</li>
<li>cg.message.sms</li>
<li>cg.message.voice</li>
</ul>
</li>
<li>I removed them from the input form and the metadata registry, and deleted all of their values in the database:</li>
</ul>
</li>
</ul>
<pre><code>localhost/dspace63= &gt; BEGIN;
localhost/dspace63= &gt; DELETE FROM metadatavalue WHERE metadata_field_id IN (115, 116, 117, 118);
DELETE 27
localhost/dspace63= &gt; COMMIT;
</code></pre><ul>
<li>I submitted <a href="https://github.com/AgriculturalSemantics/cg-core/issues/31">an issue</a> to CG Core v2 to propose standardizing the camel case convention for a few more fields of ours</li>
<li>I submitted <a href="https://github.com/AgriculturalSemantics/cg-core/issues/32">an issue</a> to CG Core v2 to propose removing <code>cg.series</code> and <code>cg.pages</code> in favor of <code>dcterms.isPartOf</code> and <code>dcterms.extent</code>, respectively</li>
</ul>
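<ul>
<li>For future field removals like the one above, the delete transaction could be generated from a list of field IDs. This is a rough sketch and <code>build_delete_sql</code> is a hypothetical helper, not something that exists in DSpace:</li>
</ul>

```shell
#!/usr/bin/env bash
# Sketch only: build_delete_sql is a hypothetical helper name.
# Prints a transaction deleting all metadata values for the given field IDs.
build_delete_sql() {
    local id ids
    if [ "$#" -eq 0 ]; then
        echo "usage: build_delete_sql FIELD_ID..." >&2
        return 1
    fi
    # Accept only plain integers so nothing unexpected ends up in the SQL.
    for id in "$@"; do
        case "$id" in
            ''|*[!0-9]*)
                echo "not a numeric field ID: $id" >&2
                return 1
                ;;
        esac
    done
    ids="$(printf '%s, ' "$@")"
    ids="${ids%, }"   # strip the trailing separator
    printf 'BEGIN;\nDELETE FROM metadatavalue WHERE metadata_field_id IN (%s);\nCOMMIT;\n' "$ids"
}
```

<ul>
<li>The output can be reviewed first and then piped to the database, for example <code>build_delete_sql 115 116 117 118 | psql -v ON_ERROR_STOP=1 -d dspace63</code> (connection options are an assumption here)</li>
</ul>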
<!-- raw HTML omitted -->