Add notes for 2016-06-10

Alan Orth 2016-06-10 17:02:55 +03:00
parent 06dfd8da30
commit 216ae75b5b
Signed by: alanorth
GPG Key ID: 0FB860CC9C45B1B9
4 changed files with 97 additions and 0 deletions


@@ -153,3 +153,25 @@ $ xml sel -t -m '//value-pairs[@value-pairs-name="ilrisubject"]/pair/displayed-v
- Atmire explained that the `atmire.orcid.id` field doesn't exist in the schema, as it actually comes from the authority cache during XMLUI run time
- This means we don't see it when harvesting via OAI or REST, for example
- They opened a feature ticket on the DSpace tracker to ask for support of this: https://jira.duraspace.org/browse/DS-3239
## 2016-06-10
- Investigating authority confidences
- It looks like the values are documented in `Choices.java`
- Experiment with setting the confidence of all 960 CCAFS author values to 500:
```
dspacetest=# SELECT authority, confidence FROM metadatavalue WHERE resource_type_id=2 AND metadata_field_id=3 AND text_value = 'CGIAR Research Program on Climate Change, Agriculture and Food Security';
dspacetest=# UPDATE metadatavalue set confidence = 500 where resource_type_id=2 AND metadata_field_id=3 AND text_value = 'CGIAR Research Program on Climate Change, Agriculture and Food Security';
UPDATE 960
```
- After the database edit, I did a full Discovery re-index
- And now there are exactly 960 items in the authors facet for 'CGIAR Research Program on Climate Change, Agriculture and Food Security'
- Then I ran the same update on CGSpace
- Merge controlled vocabulary functionality for animal breeds to `5_x-prod` ([#236](https://github.com/ilri/DSpace/pull/236))
- Write Python script to update metadata values in batch via PostgreSQL: [fix-metadata-values.py](https://gist.github.com/alanorth/df92cbfb54d762ba21b28f7cd83b6897)
- We need to use this to correct some pretty ugly values in fields like `dc.description.sponsorship`
- Merge item display tweaks from earlier this week ([#231](https://github.com/ilri/DSpace/pull/231))
- Merge controlled vocabulary functionality for subregions ([#238](https://github.com/ilri/DSpace/pull/238))
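
The confidence update above can be wrapped in a small guard that rolls back when the affected row count is not the expected one. This is only a minimal sketch, not the linked `fix-metadata-values.py`: it uses an in-memory SQLite table as a stand-in for PostgreSQL, and `make_demo_db`, `set_confidence` and the `expected_rows` parameter are hypothetical names introduced for illustration. Only the table and column names come from the queries above.

```python
import sqlite3

CCAFS = "CGIAR Research Program on Climate Change, Agriculture and Food Security"


def make_demo_db(matching_rows=3):
    """Build a throwaway table shaped like the columns used in the notes (assumption)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE metadatavalue ("
        "resource_type_id INTEGER, metadata_field_id INTEGER, "
        "text_value TEXT, authority TEXT, confidence INTEGER)"
    )
    rows = [(2, 3, CCAFS, None, -1)] * matching_rows
    rows.append((2, 3, "Some other author", None, -1))  # must stay untouched
    conn.executemany("INSERT INTO metadatavalue VALUES (?, ?, ?, ?, ?)", rows)
    conn.commit()
    return conn


def set_confidence(conn, field_id, text_value, confidence, expected_rows=None):
    """Set confidence on every matching item-level value; roll back on a count mismatch."""
    cur = conn.cursor()
    cur.execute(
        "UPDATE metadatavalue SET confidence = ? "
        "WHERE resource_type_id = 2 AND metadata_field_id = ? AND text_value = ?",
        (confidence, field_id, text_value),
    )
    if expected_rows is not None and cur.rowcount != expected_rows:
        conn.rollback()  # undo the update instead of committing a surprise
        raise RuntimeError(
            f"expected {expected_rows} rows, matched {cur.rowcount}; rolled back"
        )
    conn.commit()
    return cur.rowcount


if __name__ == "__main__":
    demo = make_demo_db()
    print("rows updated:", set_confidence(demo, 3, CCAFS, 500, expected_rows=3))
```

Against the real database the same idea would go through a PostgreSQL driver, with `expected_rows` taken from a prior `SELECT count(*)` using the same `WHERE` clause.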


@@ -252,6 +252,31 @@ UPDATE 960
<li>Atmire explained that the <code>atmire.orcid.id</code> field doesn&rsquo;t exist in the schema, as it actually comes from the authority cache during XMLUI run time</li>
<li>This means we don&rsquo;t see it when harvesting via OAI or REST, for example</li>
<li>They opened a feature ticket on the DSpace tracker to ask for support of this: <a href="https://jira.duraspace.org/browse/DS-3239">https://jira.duraspace.org/browse/DS-3239</a></li>
</ul>
<h2 id="2016-06-10:6783872e82b68b1517e00f494e6b6504">2016-06-10</h2>
<ul>
<li>Investigating authority confidences</li>
<li>It looks like the values are documented in <code>Choices.java</code></li>
<li>Experiment with setting the confidence of all 960 CCAFS author values to 500:</li>
</ul>
<pre><code>dspacetest=# SELECT authority, confidence FROM metadatavalue WHERE resource_type_id=2 AND metadata_field_id=3 AND text_value = 'CGIAR Research Program on Climate Change, Agriculture and Food Security';
dspacetest=# UPDATE metadatavalue set confidence = 500 where resource_type_id=2 AND metadata_field_id=3 AND text_value = 'CGIAR Research Program on Climate Change, Agriculture and Food Security';
UPDATE 960
</code></pre>
<ul>
<li>After the database edit, I did a full Discovery re-index</li>
<li>And now there are exactly 960 items in the authors facet for &lsquo;CGIAR Research Program on Climate Change, Agriculture and Food Security&rsquo;</li>
<li>Then I ran the same update on CGSpace</li>
<li>Merge controlled vocabulary functionality for animal breeds to <code>5_x-prod</code> (<a href="https://github.com/ilri/DSpace/pull/236">#236</a>)</li>
<li>Write Python script to update metadata values in batch via PostgreSQL: <a href="https://gist.github.com/alanorth/df92cbfb54d762ba21b28f7cd83b6897">fix-metadata-values.py</a></li>
<li>We need to use this to correct some pretty ugly values in fields like <code>dc.description.sponsorship</code></li>
<li>Merge item display tweaks from earlier this week (<a href="https://github.com/ilri/DSpace/pull/231">#231</a>)</li>
<li>Merge controlled vocabulary functionality for subregions (<a href="https://github.com/ilri/DSpace/pull/238">#238</a>)</li>
</ul>
</section>


@@ -191,6 +191,31 @@ UPDATE 960
&lt;li&gt;This means we don&amp;rsquo;t see it when harvesting via OAI or REST, for example&lt;/li&gt;
&lt;li&gt;They opened a feature ticket on the DSpace tracker to ask for support of this: &lt;a href=&#34;https://jira.duraspace.org/browse/DS-3239&#34;&gt;https://jira.duraspace.org/browse/DS-3239&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;2016-06-10:6783872e82b68b1517e00f494e6b6504&#34;&gt;2016-06-10&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Investigating authority confidences&lt;/li&gt;
&lt;li&gt;It looks like the values are documented in &lt;code&gt;Choices.java&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;Experiment with setting the confidence of all 960 CCAFS author values to 500:&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;dspacetest=# SELECT authority, confidence FROM metadatavalue WHERE resource_type_id=2 AND metadata_field_id=3 AND text_value = &#39;CGIAR Research Program on Climate Change, Agriculture and Food Security&#39;;
dspacetest=# UPDATE metadatavalue set confidence = 500 where resource_type_id=2 AND metadata_field_id=3 AND text_value = &#39;CGIAR Research Program on Climate Change, Agriculture and Food Security&#39;;
UPDATE 960
&lt;/code&gt;&lt;/pre&gt;
&lt;ul&gt;
&lt;li&gt;After the database edit, I did a full Discovery re-index&lt;/li&gt;
&lt;li&gt;And now there are exactly 960 items in the authors facet for &amp;lsquo;CGIAR Research Program on Climate Change, Agriculture and Food Security&amp;rsquo;&lt;/li&gt;
&lt;li&gt;Then I ran the same update on CGSpace&lt;/li&gt;
&lt;li&gt;Merge controlled vocabulary functionality for animal breeds to &lt;code&gt;5_x-prod&lt;/code&gt; (&lt;a href=&#34;https://github.com/ilri/DSpace/pull/236&#34;&gt;#236&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;Write Python script to update metadata values in batch via PostgreSQL: &lt;a href=&#34;https://gist.github.com/alanorth/df92cbfb54d762ba21b28f7cd83b6897&#34;&gt;fix-metadata-values.py&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;We need to use this to correct some pretty ugly values in fields like &lt;code&gt;dc.description.sponsorship&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;Merge item display tweaks from earlier this week (&lt;a href=&#34;https://github.com/ilri/DSpace/pull/231&#34;&gt;#231&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;Merge controlled vocabulary functionality for subregions (&lt;a href=&#34;https://github.com/ilri/DSpace/pull/238&#34;&gt;#238&lt;/a&gt;)&lt;/li&gt;
&lt;/ul&gt;
</description>
</item>


@@ -191,6 +191,31 @@ UPDATE 960
&lt;li&gt;This means we don&amp;rsquo;t see it when harvesting via OAI or REST, for example&lt;/li&gt;
&lt;li&gt;They opened a feature ticket on the DSpace tracker to ask for support of this: &lt;a href=&#34;https://jira.duraspace.org/browse/DS-3239&#34;&gt;https://jira.duraspace.org/browse/DS-3239&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;2016-06-10:6783872e82b68b1517e00f494e6b6504&#34;&gt;2016-06-10&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Investigating authority confidences&lt;/li&gt;
&lt;li&gt;It looks like the values are documented in &lt;code&gt;Choices.java&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;Experiment with setting the confidence of all 960 CCAFS author values to 500:&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;dspacetest=# SELECT authority, confidence FROM metadatavalue WHERE resource_type_id=2 AND metadata_field_id=3 AND text_value = &#39;CGIAR Research Program on Climate Change, Agriculture and Food Security&#39;;
dspacetest=# UPDATE metadatavalue set confidence = 500 where resource_type_id=2 AND metadata_field_id=3 AND text_value = &#39;CGIAR Research Program on Climate Change, Agriculture and Food Security&#39;;
UPDATE 960
&lt;/code&gt;&lt;/pre&gt;
&lt;ul&gt;
&lt;li&gt;After the database edit, I did a full Discovery re-index&lt;/li&gt;
&lt;li&gt;And now there are exactly 960 items in the authors facet for &amp;lsquo;CGIAR Research Program on Climate Change, Agriculture and Food Security&amp;rsquo;&lt;/li&gt;
&lt;li&gt;Then I ran the same update on CGSpace&lt;/li&gt;
&lt;li&gt;Merge controlled vocabulary functionality for animal breeds to &lt;code&gt;5_x-prod&lt;/code&gt; (&lt;a href=&#34;https://github.com/ilri/DSpace/pull/236&#34;&gt;#236&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;Write Python script to update metadata values in batch via PostgreSQL: &lt;a href=&#34;https://gist.github.com/alanorth/df92cbfb54d762ba21b28f7cd83b6897&#34;&gt;fix-metadata-values.py&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;We need to use this to correct some pretty ugly values in fields like &lt;code&gt;dc.description.sponsorship&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;Merge item display tweaks from earlier this week (&lt;a href=&#34;https://github.com/ilri/DSpace/pull/231&#34;&gt;#231&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;Merge controlled vocabulary functionality for subregions (&lt;a href=&#34;https://github.com/ilri/DSpace/pull/238&#34;&gt;#238&lt;/a&gt;)&lt;/li&gt;
&lt;/ul&gt;
</description>
</item>