mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2024-12-26 06:54:30 +01:00
Add notes for 2021-09-24
This commit is contained in:
parent
992c58601f
commit
bbf478c410
@ -241,4 +241,51 @@ localhost/dspace63= > \COPY (SELECT collection_id,uuid FROM collection WHERE col
COPY 1139
```

## 2021-09-24

- Peter and Abenet agreed that we should consider converting more of our UPPER CASE metadata values to Title Case
  - It seems that these fields are all still using UPPER CASE:
    - cg.subject.alliancebiovciat
    - cg.species.breed
    - cg.subject.bioversity
    - cg.subject.ccafs
    - cg.subject.ciat
    - cg.subject.cip
    - cg.identifier.iitatheme
    - cg.subject.iita
    - cg.subject.ilri
    - cg.subject.pabra
    - cg.river.basin
    - cg.coverage.subregion (done)
    - dcterms.audience (done)
    - cg.subject.wle
  - We can do some of these without even asking anyone, for example `cg.coverage.subregion`, `cg.river.basin`, and `dcterms.audience`
- First, I will look at `cg.coverage.subregion`
  - These should ideally come from ISO 3166-2 subdivisions
  - I will title case them with `INITCAP` and then create a controlled vocabulary from the values that match (and worry about cleaning up the rest later)

```console
localhost/dspace63= > UPDATE metadatavalue SET text_value=INITCAP(text_value) WHERE dspace_object_id IN (SELECT uuid FROM item) AND metadata_field_id=231;
UPDATE 2903
localhost/dspace63= > \COPY (SELECT DISTINCT text_value as "cg.coverage.subregion" FROM metadatavalue WHERE dspace_object_id IN (SELECT uuid FROM item) AND metadata_field_id = 231) to /tmp/2021-09-24-subregions.txt;
COPY 1200
```

- Then I processed the list with my `subdivision-lookup.py` script and extracted only the values that matched:

```console
$ ./ilri/subdivision-lookup.py -i /tmp/2021-09-24-subregions.txt -o /tmp/subregions.csv
$ csvgrep -c matched -m 'true' /tmp/subregions.csv | csvcut -c 1 | sed 1d > /tmp/subregions-matched.txt
$ wc -l /tmp/subregions-matched.txt
81 /tmp/subregions-matched.txt
```
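
- For reference, the `csvgrep | csvcut | sed` pipeline above can be approximated with Python's standard `csv` module. This is only a sketch: the `subdivision` header and the sample rows are made up for illustration, and it assumes the lookup script writes a `matched` column containing `true` or `false`, as the `csvgrep` call implies:

```python
import csv
import io


def matched_values(csv_text, flag_column="matched", flag_value="true"):
    """Return first-column values from rows whose flag column contains flag_value."""
    reader = csv.DictReader(io.StringIO(csv_text))
    # csvcut -c 1 selects the first column, whatever its header is called
    first_column = reader.fieldnames[0]
    # csvgrep -m does a substring match, mirrored here with `in`
    return [row[first_column] for row in reader if flag_value in row[flag_column]]


# Hypothetical sample data, not real output from subdivision-lookup.py
sample = "subdivision,matched\nNairobi,true\nSomewhere Else,false\nOromia,true\n"

for value in matched_values(sample):
    print(value)
```

- With the sample rows this keeps `Nairobi` and `Oromia` and drops the unmatched value; the header row is consumed by `DictReader`, which plays the role of `sed 1d`.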

- Then I updated the controlled vocabulary in the submission forms
- I did the same for `dcterms.audience`, taking special care with a few all-caps values:

```console
localhost/dspace63= > UPDATE metadatavalue SET text_value=INITCAP(text_value) WHERE dspace_object_id IN (SELECT uuid FROM item) AND metadata_field_id=144 AND text_value != 'NGOS' AND text_value != 'CGIAR';
localhost/dspace63= > UPDATE metadatavalue SET text_value='NGOs' WHERE dspace_object_id IN (SELECT uuid FROM item) AND metadata_field_id=144 AND text_value = 'NGOS';
```
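
- To illustrate why the acronyms need special handling: `INITCAP` upper-cases the first character of each word and lower-cases the rest, so `NGOS` would become `Ngos`. A rough Python approximation of the two statements above (ASCII-oriented; `initcap` and `normalize_audience` are hypothetical helper names, not part of DSpace or PostgreSQL):

```python
# Values that must survive unchanged, and values with a fixed replacement
KEEP_AS_IS = {"CGIAR"}
REPLACEMENTS = {"NGOS": "NGOs"}


def initcap(text):
    """Approximate PostgreSQL INITCAP: words are runs of alphanumeric characters."""
    out = []
    start_of_word = True
    for ch in text:
        if ch.isalnum():
            out.append(ch.upper() if start_of_word else ch.lower())
            start_of_word = False
        else:
            # Any non-alphanumeric character ends the current word
            out.append(ch)
            start_of_word = True
    return "".join(out)


def normalize_audience(value):
    """Title case a value unless it is a known acronym."""
    if value in KEEP_AS_IS:
        return value
    if value in REPLACEMENTS:
        return REPLACEMENTS[value]
    return initcap(value)


print(initcap("NGOS"))                      # what a blanket INITCAP would produce
print(normalize_audience("NGOS"))
print(normalize_audience("POLICY MAKERS"))
```

- The exclusion set corresponds to the `text_value != ...` conditions in the first `UPDATE`, and the replacement map corresponds to the second `UPDATE`.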

<!-- vim: set sw=2 ts=2: -->
@ -26,7 +26,7 @@ The syntax Moayad showed me last month doesn’t seem to honor the search qu
|
||||
<meta property="og:type" content="article" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/2021-09/" />
|
||||
<meta property="article:published_time" content="2021-09-01T09:14:07+03:00" />
|
||||
<meta property="article:modified_time" content="2021-09-23T18:19:11+03:00" />
|
||||
<meta property="article:modified_time" content="2021-09-23T18:32:47+03:00" />
|
||||
|
||||
|
||||
|
||||
@ -58,9 +58,9 @@ The syntax Moayad showed me last month doesn’t seem to honor the search qu
|
||||
"@type": "BlogPosting",
|
||||
"headline": "September, 2021",
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2021-09/",
|
||||
"wordCount": "1775",
|
||||
"wordCount": "2030",
|
||||
"datePublished": "2021-09-01T09:14:07+03:00",
|
||||
"dateModified": "2021-09-23T18:19:11+03:00",
|
||||
"dateModified": "2021-09-23T18:32:47+03:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -403,6 +403,55 @@ $ csvcut -c 1 /tmp/2021-09-23-affiliations.csv | sed 1d > /tmp/affiliations.t
|
||||
</ul>
|
||||
<pre tabindex="0"><code class="language-console" data-lang="console">localhost/dspace63= > \COPY (SELECT collection_id,uuid FROM collection WHERE collection_id IS NOT NULL) TO /tmp/2021-09-23-collection-id2uuid.csv WITH CSV HEADER;
|
||||
COPY 1139
|
||||
</code></pre><h2 id="2021-09-24">2021-09-24</h2>
|
||||
<ul>
|
||||
<li>Peter and Abenet agreed that we should consider converting more of our UPPER CASE metadata values to Title Case
|
||||
<ul>
|
||||
<li>It seems that these fields are all still using UPPER CASE:
|
||||
<ul>
|
||||
<li>cg.subject.alliancebiovciat</li>
|
||||
<li>cg.species.breed</li>
|
||||
<li>cg.subject.bioversity</li>
|
||||
<li>cg.subject.ccafs</li>
|
||||
<li>cg.subject.ciat</li>
|
||||
<li>cg.subject.cip</li>
|
||||
<li>cg.identifier.iitatheme</li>
|
||||
<li>cg.subject.iita</li>
|
||||
<li>cg.subject.ilri</li>
|
||||
<li>cg.subject.pabra</li>
|
||||
<li>cg.river.basin</li>
|
||||
<li>cg.coverage.subregion (done)</li>
|
||||
<li>dcterms.audience (done)</li>
|
||||
<li>cg.subject.wle</li>
|
||||
</ul>
|
||||
</li>
|
||||
<li>We can do some of these without even asking anyone, for example <code>cg.coverage.subregion</code>, <code>cg.river.basin</code>, and <code>dcterms.audience</code></li>
|
||||
</ul>
|
||||
</li>
|
||||
<li>First, I will look at <code>cg.coverage.subregion</code>
|
||||
<ul>
|
||||
<li>These should ideally come from ISO 3166-2 subdivisions</li>
|
||||
<li>I will title case them with <code>INITCAP</code> and then create a controlled vocabulary from the values that match (and worry about cleaning up the rest later)</li>
|
||||
</ul>
|
||||
</li>
|
||||
</ul>
|
||||
<pre tabindex="0"><code class="language-console" data-lang="console">localhost/dspace63= > UPDATE metadatavalue SET text_value=INITCAP(text_value) WHERE dspace_object_id IN (SELECT uuid FROM item) AND metadata_field_id=231;
|
||||
UPDATE 2903
|
||||
localhost/dspace63= > \COPY (SELECT DISTINCT text_value as "cg.coverage.subregion" FROM metadatavalue WHERE dspace_object_id IN (SELECT uuid FROM item) AND metadata_field_id = 231) to /tmp/2021-09-24-subregions.txt;
|
||||
COPY 1200
|
||||
</code></pre><ul>
|
||||
<li>Then I processed the list with my <code>subdivision-lookup.py</code> script and extracted only the values that matched:</li>
|
||||
</ul>
|
||||
<pre tabindex="0"><code class="language-console" data-lang="console">$ ./ilri/subdivision-lookup.py -i /tmp/2021-09-24-subregions.txt -o /tmp/subregions.csv
|
||||
$ csvgrep -c matched -m 'true' /tmp/subregions.csv | csvcut -c 1 | sed 1d > /tmp/subregions-matched.txt
|
||||
$ wc -l /tmp/subregions-matched.txt
|
||||
81 /tmp/subregions-matched.txt
|
||||
</code></pre><ul>
|
||||
<li>Then I updated the controlled vocabulary in the submission forms</li>
|
||||
<li>I did the same for <code>dcterms.audience</code>, taking special care with a few all-caps values:</li>
|
||||
</ul>
|
||||
<pre tabindex="0"><code class="language-console" data-lang="console">localhost/dspace63= > UPDATE metadatavalue SET text_value=INITCAP(text_value) WHERE dspace_object_id IN (SELECT uuid FROM item) AND metadata_field_id=144 AND text_value != 'NGOS' AND text_value != 'CGIAR';
|
||||
localhost/dspace63= > UPDATE metadatavalue SET text_value='NGOs' WHERE dspace_object_id IN (SELECT uuid FROM item) AND metadata_field_id=144 AND text_value = 'NGOS';
|
||||
</code></pre><!-- raw HTML omitted -->
|
||||
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:19:11+03:00" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:32:47+03:00" />
|
||||
|
||||
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:19:11+03:00" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:32:47+03:00" />
|
||||
|
||||
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:19:11+03:00" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:32:47+03:00" />
|
||||
|
||||
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:19:11+03:00" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:32:47+03:00" />
|
||||
|
||||
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:19:11+03:00" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:32:47+03:00" />
|
||||
|
||||
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:19:11+03:00" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:32:47+03:00" />
|
||||
|
||||
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/categories/notes/" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:19:11+03:00" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:32:47+03:00" />
|
||||
|
||||
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:19:11+03:00" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:32:47+03:00" />
|
||||
|
||||
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:19:11+03:00" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:32:47+03:00" />
|
||||
|
||||
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:19:11+03:00" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:32:47+03:00" />
|
||||
|
||||
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:19:11+03:00" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:32:47+03:00" />
|
||||
|
||||
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:19:11+03:00" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:32:47+03:00" />
|
||||
|
||||
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:19:11+03:00" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:32:47+03:00" />
|
||||
|
||||
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:19:11+03:00" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:32:47+03:00" />
|
||||
|
||||
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:19:11+03:00" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:32:47+03:00" />
|
||||
|
||||
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:19:11+03:00" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:32:47+03:00" />
|
||||
|
||||
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:19:11+03:00" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:32:47+03:00" />
|
||||
|
||||
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:19:11+03:00" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:32:47+03:00" />
|
||||
|
||||
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:19:11+03:00" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:32:47+03:00" />
|
||||
|
||||
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:19:11+03:00" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:32:47+03:00" />
|
||||
|
||||
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:19:11+03:00" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:32:47+03:00" />
|
||||
|
||||
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:19:11+03:00" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:32:47+03:00" />
|
||||
|
||||
|
||||
|
||||
|
@ -10,7 +10,7 @@
|
||||
<meta property="og:description" content="Documenting day-to-day work on the [CGSpace](https://cgspace.cgiar.org) repository." />
|
||||
<meta property="og:type" content="website" />
|
||||
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/posts/" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:19:11+03:00" />
|
||||
<meta property="og:updated_time" content="2021-09-23T18:32:47+03:00" />
|
||||
|
||||
|
||||
|
||||
|
@ -3,19 +3,19 @@
|
||||
xmlns:xhtml="http://www.w3.org/1999/xhtml">
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/categories/</loc>
|
||||
<lastmod>2021-09-23T18:19:11+03:00</lastmod>
|
||||
<lastmod>2021-09-23T18:32:47+03:00</lastmod>
|
||||
</url><url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/</loc>
|
||||
<lastmod>2021-09-23T18:19:11+03:00</lastmod>
|
||||
<lastmod>2021-09-23T18:32:47+03:00</lastmod>
|
||||
</url><url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/categories/notes/</loc>
|
||||
<lastmod>2021-09-23T18:19:11+03:00</lastmod>
|
||||
<lastmod>2021-09-23T18:32:47+03:00</lastmod>
|
||||
</url><url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/posts/</loc>
|
||||
<lastmod>2021-09-23T18:19:11+03:00</lastmod>
|
||||
<lastmod>2021-09-23T18:32:47+03:00</lastmod>
|
||||
</url><url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2021-09/</loc>
|
||||
<lastmod>2021-09-23T18:19:11+03:00</lastmod>
|
||||
<lastmod>2021-09-23T18:32:47+03:00</lastmod>
|
||||
</url><url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2021-08/</loc>
|
||||
<lastmod>2021-09-02T17:06:28+03:00</lastmod>
|
||||
|