Add notes for 2017-02-28

This commit is contained in:
Alan Orth 2017-02-28 18:57:31 +02:00
parent ff4dca769e
commit a3f0d88945
Signed by: alanorth
GPG Key ID: 0FB860CC9C45B1B9
5 changed files with 107 additions and 1 deletions

View File

@ -290,6 +290,24 @@ $ grep -c "unable to find valid certification path" [dspace]/log/dspace.log.2017
- Regarding the `filter-media` issue I found earlier, it seems that the ImageMagick PDF plugin will also process JPGs if they are in the "Content Files" (aka `ORIGINAL`) bundle
- The problem likely lies in the logic of `ImageMagickThumbnailFilter.java`, as `ImageMagickPdfThumbnailFilter.java` extends it
- Run CIAT corrections on CGSpace
```
dspace=# update metadatavalue set authority='3026b1de-9302-4f3e-85ab-ef48da024eb2', confidence=600 where resource_type_id=2 and metadata_field_id=3 and text_value = 'International Center for Tropical Agriculture';
```
- CGNET has fixed the certificate chain on their LDAP server
- Redeploy CGSpace and DSpace Test to on latest `5_x-prod` branch with fixes for LDAP bind user
- Run all system updates on CGSpace server and reboot
## 2017-02-28
- After running the CIAT corrections and updating the Discovery and authority indexes, there is still no change in the number of items listed for CIAT in Discovery
- Ah, this is probably because some items have the `International Center for Tropical Agriculture` author twice, which I first noticed in 2016-12 but couldn't figure out how to fix
- I think I can do it by first exporting all metadatavalues that have the author `International Center for Tropical Agriculture`
```
dspace=# \copy (select resource_id, metadata_value_id from metadatavalue where resource_type_id=2 and metadata_field_id=3 and text_value='International Center for Tropical Agriculture') to /tmp/ciat.csv with csv;
COPY 1968
```
- And then using awk or uniq to either remove or print the lines that have a duplicate `resource_id` (meaning they belong to the same item in DSpace and are therefore duplicates), and then using the `metadata_value_id` to delete them

View File

@ -90,7 +90,7 @@ Looks like we’ll be using cg.identifier.ccafsprojectpii as the field name
"headline": "February, 2017",
"url": "https://alanorth.github.io/cgspace-notes/2017-02/",
"wordCount": "1862",
"wordCount": "2019",
"datePublished": "2017-02-07T07:04:52-08:00",
@ -498,11 +498,33 @@ Certificate chain
<li>Regarding the <code>filter-media</code> issue I found earlier, it seems that the ImageMagick PDF plugin will also process JPGs if they are in the &ldquo;Content Files&rdquo; (aka <code>ORIGINAL</code>) bundle</li>
<li>The problem likely lies in the logic of <code>ImageMagickThumbnailFilter.java</code>, as <code>ImageMagickPdfThumbnailFilter.java</code> extends it</li>
<li>Run CIAT corrections on CGSpace</li>
</ul>
<pre><code>dspace=# update metadatavalue set authority='3026b1de-9302-4f3e-85ab-ef48da024eb2', confidence=600 where resource_type_id=2 and metadata_field_id=3 and text_value = 'International Center for Tropical Agriculture';
</code></pre>
<ul>
<li>CGNET has fixed the certificate chain on their LDAP server</li>
<li>Redeploy CGSpace and DSpace Test to on latest <code>5_x-prod</code> branch with fixes for LDAP bind user</li>
<li>Run all system updates on CGSpace server and reboot</li>
</ul>
<h2 id="2017-02-28">2017-02-28</h2>
<ul>
<li>After running the CIAT corrections and updating the Discovery and authority indexes, there is still no change in the number of items listed for CIAT in Discovery</li>
<li>Ah, this is probably because some items have the <code>International Center for Tropical Agriculture</code> author twice, which I first noticed in 2016-12 but couldn&rsquo;t figure out how to fix</li>
<li>I think I can do it by first exporting all metadatavalues that have the author <code>International Center for Tropical Agriculture</code></li>
</ul>
<pre><code>dspace=# \copy (select resource_id, metadata_value_id from metadatavalue where resource_type_id=2 and metadata_field_id=3 and text_value='International Center for Tropical Agriculture') to /tmp/ciat.csv with csv;
COPY 1968
</code></pre>
<ul>
<li>And then using awk or uniq to either remove or print the lines that have a duplicate <code>resource_id</code> (meaning they belong to the same item in DSpace and are therefore duplicates), and then using the <code>metadata_value_id</code> to delete them</li>
</ul>

View File

@ -348,9 +348,31 @@ Certificate chain
&lt;li&gt;Regarding the &lt;code&gt;filter-media&lt;/code&gt; issue I found earlier, it seems that the ImageMagick PDF plugin will also process JPGs if they are in the &amp;ldquo;Content Files&amp;rdquo; (aka &lt;code&gt;ORIGINAL&lt;/code&gt;) bundle&lt;/li&gt;
&lt;li&gt;The problem likely lies in the logic of &lt;code&gt;ImageMagickThumbnailFilter.java&lt;/code&gt;, as &lt;code&gt;ImageMagickPdfThumbnailFilter.java&lt;/code&gt; extends it&lt;/li&gt;
&lt;li&gt;Run CIAT corrections on CGSpace&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;dspace=# update metadatavalue set authority=&#39;3026b1de-9302-4f3e-85ab-ef48da024eb2&#39;, confidence=600 where resource_type_id=2 and metadata_field_id=3 and text_value = &#39;International Center for Tropical Agriculture&#39;;
&lt;/code&gt;&lt;/pre&gt;
&lt;ul&gt;
&lt;li&gt;CGNET has fixed the certificate chain on their LDAP server&lt;/li&gt;
&lt;li&gt;Redeploy CGSpace and DSpace Test to on latest &lt;code&gt;5_x-prod&lt;/code&gt; branch with fixes for LDAP bind user&lt;/li&gt;
&lt;li&gt;Run all system updates on CGSpace server and reboot&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;2017-02-28&#34;&gt;2017-02-28&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;After running the CIAT corrections and updating the Discovery and authority indexes, there is still no change in the number of items listed for CIAT in Discovery&lt;/li&gt;
&lt;li&gt;Ah, this is probably because some items have the &lt;code&gt;International Center for Tropical Agriculture&lt;/code&gt; author twice, which I first noticed in 2016-12 but couldn&amp;rsquo;t figure out how to fix&lt;/li&gt;
&lt;li&gt;I think I can do it by first exporting all metadatavalues that have the author &lt;code&gt;International Center for Tropical Agriculture&lt;/code&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;dspace=# \copy (select resource_id, metadata_value_id from metadatavalue where resource_type_id=2 and metadata_field_id=3 and text_value=&#39;International Center for Tropical Agriculture&#39;) to /tmp/ciat.csv with csv;
COPY 1968
&lt;/code&gt;&lt;/pre&gt;
&lt;ul&gt;
&lt;li&gt;And then using awk or uniq to either remove or print the lines that have a duplicate &lt;code&gt;resource_id&lt;/code&gt; (meaning they belong to the same item in DSpace and are therefore duplicates), and then using the &lt;code&gt;metadata_value_id&lt;/code&gt; to delete them&lt;/li&gt;
&lt;/ul&gt;</description>
</item>

View File

@ -348,9 +348,31 @@ Certificate chain
&lt;li&gt;Regarding the &lt;code&gt;filter-media&lt;/code&gt; issue I found earlier, it seems that the ImageMagick PDF plugin will also process JPGs if they are in the &amp;ldquo;Content Files&amp;rdquo; (aka &lt;code&gt;ORIGINAL&lt;/code&gt;) bundle&lt;/li&gt;
&lt;li&gt;The problem likely lies in the logic of &lt;code&gt;ImageMagickThumbnailFilter.java&lt;/code&gt;, as &lt;code&gt;ImageMagickPdfThumbnailFilter.java&lt;/code&gt; extends it&lt;/li&gt;
&lt;li&gt;Run CIAT corrections on CGSpace&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;dspace=# update metadatavalue set authority=&#39;3026b1de-9302-4f3e-85ab-ef48da024eb2&#39;, confidence=600 where resource_type_id=2 and metadata_field_id=3 and text_value = &#39;International Center for Tropical Agriculture&#39;;
&lt;/code&gt;&lt;/pre&gt;
&lt;ul&gt;
&lt;li&gt;CGNET has fixed the certificate chain on their LDAP server&lt;/li&gt;
&lt;li&gt;Redeploy CGSpace and DSpace Test to on latest &lt;code&gt;5_x-prod&lt;/code&gt; branch with fixes for LDAP bind user&lt;/li&gt;
&lt;li&gt;Run all system updates on CGSpace server and reboot&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;2017-02-28&#34;&gt;2017-02-28&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;After running the CIAT corrections and updating the Discovery and authority indexes, there is still no change in the number of items listed for CIAT in Discovery&lt;/li&gt;
&lt;li&gt;Ah, this is probably because some items have the &lt;code&gt;International Center for Tropical Agriculture&lt;/code&gt; author twice, which I first noticed in 2016-12 but couldn&amp;rsquo;t figure out how to fix&lt;/li&gt;
&lt;li&gt;I think I can do it by first exporting all metadatavalues that have the author &lt;code&gt;International Center for Tropical Agriculture&lt;/code&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;dspace=# \copy (select resource_id, metadata_value_id from metadatavalue where resource_type_id=2 and metadata_field_id=3 and text_value=&#39;International Center for Tropical Agriculture&#39;) to /tmp/ciat.csv with csv;
COPY 1968
&lt;/code&gt;&lt;/pre&gt;
&lt;ul&gt;
&lt;li&gt;And then using awk or uniq to either remove or print the lines that have a duplicate &lt;code&gt;resource_id&lt;/code&gt; (meaning they belong to the same item in DSpace and are therefore duplicates), and then using the &lt;code&gt;metadata_value_id&lt;/code&gt; to delete them&lt;/li&gt;
&lt;/ul&gt;</description>
</item>

View File

@ -347,9 +347,31 @@ Certificate chain
&lt;li&gt;Regarding the &lt;code&gt;filter-media&lt;/code&gt; issue I found earlier, it seems that the ImageMagick PDF plugin will also process JPGs if they are in the &amp;ldquo;Content Files&amp;rdquo; (aka &lt;code&gt;ORIGINAL&lt;/code&gt;) bundle&lt;/li&gt;
&lt;li&gt;The problem likely lies in the logic of &lt;code&gt;ImageMagickThumbnailFilter.java&lt;/code&gt;, as &lt;code&gt;ImageMagickPdfThumbnailFilter.java&lt;/code&gt; extends it&lt;/li&gt;
&lt;li&gt;Run CIAT corrections on CGSpace&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;dspace=# update metadatavalue set authority=&#39;3026b1de-9302-4f3e-85ab-ef48da024eb2&#39;, confidence=600 where resource_type_id=2 and metadata_field_id=3 and text_value = &#39;International Center for Tropical Agriculture&#39;;
&lt;/code&gt;&lt;/pre&gt;
&lt;ul&gt;
&lt;li&gt;CGNET has fixed the certificate chain on their LDAP server&lt;/li&gt;
&lt;li&gt;Redeploy CGSpace and DSpace Test to on latest &lt;code&gt;5_x-prod&lt;/code&gt; branch with fixes for LDAP bind user&lt;/li&gt;
&lt;li&gt;Run all system updates on CGSpace server and reboot&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;2017-02-28&#34;&gt;2017-02-28&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;After running the CIAT corrections and updating the Discovery and authority indexes, there is still no change in the number of items listed for CIAT in Discovery&lt;/li&gt;
&lt;li&gt;Ah, this is probably because some items have the &lt;code&gt;International Center for Tropical Agriculture&lt;/code&gt; author twice, which I first noticed in 2016-12 but couldn&amp;rsquo;t figure out how to fix&lt;/li&gt;
&lt;li&gt;I think I can do it by first exporting all metadatavalues that have the author &lt;code&gt;International Center for Tropical Agriculture&lt;/code&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;pre&gt;&lt;code&gt;dspace=# \copy (select resource_id, metadata_value_id from metadatavalue where resource_type_id=2 and metadata_field_id=3 and text_value=&#39;International Center for Tropical Agriculture&#39;) to /tmp/ciat.csv with csv;
COPY 1968
&lt;/code&gt;&lt;/pre&gt;
&lt;ul&gt;
&lt;li&gt;And then using awk or uniq to either remove or print the lines that have a duplicate &lt;code&gt;resource_id&lt;/code&gt; (meaning they belong to the same item in DSpace and are therefore duplicates), and then using the &lt;code&gt;metadata_value_id&lt;/code&gt; to delete them&lt;/li&gt;
&lt;/ul&gt;</description>
</item>