diff --git a/content/2016-06.md b/content/2016-06.md index 133ec7399..97594dca1 100644 --- a/content/2016-06.md +++ b/content/2016-06.md @@ -153,3 +153,25 @@ $ xml sel -t -m '//value-pairs[@value-pairs-name="ilrisubject"]/pair/displayed-v - Atmire explained that the `atmire.orcid.id` field doesn't exist in the schema, as it actually comes from the authority cache during XMLUI run time - This means we don't see it when harvesting via OAI or REST, for example - They opened a feature ticket on the DSpace tracker to ask for support of this: https://jira.duraspace.org/browse/DS-3239 + +## 2016-06-10 + +- Investigating authority confidences +- It looks like the values are documented in `Choices.java` +- Experiment with setting all 960 CCAFS author values to be 500: + +``` +dspacetest=# SELECT authority, confidence FROM metadatavalue WHERE resource_type_id=2 AND metadata_field_id=3 AND text_value = 'CGIAR Research Program on Climate Change, Agriculture and Food Security'; + +dspacetest=# UPDATE metadatavalue set confidence = 500 where resource_type_id=2 AND metadata_field_id=3 AND text_value = 'CGIAR Research Program on Climate Change, Agriculture and Food Security'; +UPDATE 960 +``` + +- After the database edit, I did a full Discovery re-index +- And now there are exactly 960 items in the authors facet for 'CGIAR Research Program on Climate Change, Agriculture and Food Security' +- Now I ran the same on CGSpace +- Merge controlled vocabulary functionality for animal breeds to `5_x-prod` ([#236](https://github.com/ilri/DSpace/pull/236)) +- Write python script to update metadata values in batch via PostgreSQL: [fix-metadata-values.py](https://gist.github.com/alanorth/df92cbfb54d762ba21b28f7cd83b6897) +- We need to use this to correct some pretty ugly values in fields like `dc.description.sponsorship` +- Merge item display tweaks from earlier this week ([#231](https://github.com/ilri/DSpace/pull/231)) +- Merge controlled vocabulary functionality for subregions ([#238](https://github.com/ilri/DSpace/pull/238)) diff --git a/public/2016-06/index.html b/public/2016-06/index.html index 74db1f6c0..de801195c 100644 --- a/public/2016-06/index.html +++ b/public/2016-06/index.html @@ -252,6 +252,31 @@ UPDATE 960
  • Atmire explained that the atmire.orcid.id field doesn’t exist in the schema, as it actually comes from the authority cache during XMLUI run time
  • This means we don’t see it when harvesting via OAI or REST, for example
  • They opened a feature ticket on the DSpace tracker to ask for support of this: https://jira.duraspace.org/browse/DS-3239
  • + + +

    2016-06-10

    + + + +
    dspacetest=# SELECT authority, confidence FROM metadatavalue WHERE resource_type_id=2 AND metadata_field_id=3 AND text_value = 'CGIAR Research Program on Climate Change, Agriculture and Food Security';
    +
    +dspacetest=# UPDATE metadatavalue set confidence = 500 where resource_type_id=2 AND metadata_field_id=3 AND text_value = 'CGIAR Research Program on Climate Change, Agriculture and Food Security';
    +UPDATE 960
    +
    + + diff --git a/public/index.xml b/public/index.xml index f19cab887..7e7326ecc 100644 --- a/public/index.xml +++ b/public/index.xml @@ -191,6 +191,31 @@ UPDATE 960 <li>This means we don&rsquo;t see it when harvesting via OAI or REST, for example</li> <li>They opened a feature ticket on the DSpace tracker to ask for support of this: <a href="https://jira.duraspace.org/browse/DS-3239">https://jira.duraspace.org/browse/DS-3239</a></li> </ul> + +<h2 id="2016-06-10:6783872e82b68b1517e00f494e6b6504">2016-06-10</h2> + +<ul> +<li>Investigating authority confidences</li> +<li>It looks like the values are documented in <code>Choices.java</code></li> +<li>Experiment with setting all 960 CCAFS author values to be 500:</li> +</ul> + +<pre><code>dspacetest=# SELECT authority, confidence FROM metadatavalue WHERE resource_type_id=2 AND metadata_field_id=3 AND text_value = 'CGIAR Research Program on Climate Change, Agriculture and Food Security'; + +dspacetest=# UPDATE metadatavalue set confidence = 500 where resource_type_id=2 AND metadata_field_id=3 AND text_value = 'CGIAR Research Program on Climate Change, Agriculture and Food Security'; +UPDATE 960 +</code></pre> + +<ul> +<li>After the database edit, I did a full Discovery re-index</li> +<li>And now there are exactly 960 items in the authors facet for &lsquo;CGIAR Research Program on Climate Change, Agriculture and Food Security&rsquo;</li> +<li>Now I ran the same on CGSpace</li> +<li>Merge controlled vocabulary functionality for animal breeds to <code>5_x-prod</code> (<a href="https://github.com/ilri/DSpace/pull/236">#236</a>)</li> +<li>Write python script to update metadata values in batch via PostgreSQL: <a href="https://gist.github.com/alanorth/df92cbfb54d762ba21b28f7cd83b6897">fix-metadata-values.py</a></li> +<li>We need to use this to correct some pretty ugly values in fields like <code>dc.description.sponsorship</code></li> +<li>Merge item display tweaks from earlier this week (<a href="https://github.com/ilri/DSpace/pull/231">#231</a>)</li> +<li>Merge controlled vocabulary functionality for subregions (<a href="https://github.com/ilri/DSpace/pull/238">#238</a>)</li> +</ul> diff --git a/public/tags/notes/index.xml b/public/tags/notes/index.xml index 8182bd447..2db0c7460 100644 --- a/public/tags/notes/index.xml +++ b/public/tags/notes/index.xml @@ -191,6 +191,31 @@ UPDATE 960 <li>This means we don&rsquo;t see it when harvesting via OAI or REST, for example</li> <li>They opened a feature ticket on the DSpace tracker to ask for support of this: <a href="https://jira.duraspace.org/browse/DS-3239">https://jira.duraspace.org/browse/DS-3239</a></li> </ul> + +<h2 id="2016-06-10:6783872e82b68b1517e00f494e6b6504">2016-06-10</h2> + +<ul> +<li>Investigating authority confidences</li> +<li>It looks like the values are documented in <code>Choices.java</code></li> +<li>Experiment with setting all 960 CCAFS author values to be 500:</li> +</ul> + +<pre><code>dspacetest=# SELECT authority, confidence FROM metadatavalue WHERE resource_type_id=2 AND metadata_field_id=3 AND text_value = 'CGIAR Research Program on Climate Change, Agriculture and Food Security'; + +dspacetest=# UPDATE metadatavalue set confidence = 500 where resource_type_id=2 AND metadata_field_id=3 AND text_value = 'CGIAR Research Program on Climate Change, Agriculture and Food Security'; +UPDATE 960 +</code></pre> + +<ul> +<li>After the database edit, I did a full Discovery re-index</li> +<li>And now there are exactly 960 items in the authors facet for &lsquo;CGIAR Research Program on Climate Change, Agriculture and Food Security&rsquo;</li> +<li>Now I ran the same on CGSpace</li> +<li>Merge controlled vocabulary functionality for animal breeds to <code>5_x-prod</code> (<a href="https://github.com/ilri/DSpace/pull/236">#236</a>)</li> +<li>Write python script to update metadata values in batch via PostgreSQL: <a href="https://gist.github.com/alanorth/df92cbfb54d762ba21b28f7cd83b6897">fix-metadata-values.py</a></li> +<li>We need to use this to correct some pretty ugly values in fields like <code>dc.description.sponsorship</code></li> +<li>Merge item display tweaks from earlier this week (<a href="https://github.com/ilri/DSpace/pull/231">#231</a>)</li> +<li>Merge controlled vocabulary functionality for subregions (<a href="https://github.com/ilri/DSpace/pull/238">#238</a>)</li> +</ul>