mirror of https://github.com/alanorth/cgspace-notes.git (synced 2025-01-27 05:49:12 +01:00)
Add notes for 2019-12-17
@ -25,7 +25,7 @@ I checked to see if the Solr sharding task that is supposed to run on January 1s
I tested on DSpace Test as well and it doesn't work there either
I asked on the dspace-tech mailing list because it seems to be broken, and actually now I'm not sure if we've ever had the sharding task run successfully over all these years
"/>
<meta name="generator" content="Hugo 0.60.1" />
<meta name="generator" content="Hugo 0.61.0" />
@ -106,13 +106,13 @@ I asked on the dspace-tech mailing list because it seems to be broken, and actua
</p>
</header>
<h2 id="20170102">2017-01-02</h2>
<h2 id="2017-01-02">2017-01-02</h2>
<ul>
<li>I checked to see if the Solr sharding task that is supposed to run on January 1st had run and saw there was an error</li>
<li>I tested on DSpace Test as well and it doesn't work there either</li>
<li>I asked on the dspace-tech mailing list because it seems to be broken, and actually now I'm not sure if we've ever had the sharding task run successfully over all these years</li>
</ul>
<h2 id="20170104">2017-01-04</h2>
<h2 id="2017-01-04">2017-01-04</h2>
<ul>
<li>I tried to shard my local dev instance and it fails the same way:</li>
</ul>
@ -183,17 +183,17 @@ Caused by: java.net.SocketException: Broken pipe (Write failed)
</code></pre><ul>
<li>Very interesting… it creates the core and then fails somehow</li>
</ul>
<h2 id="20170108">2017-01-08</h2>
<h2 id="2017-01-08">2017-01-08</h2>
<ul>
<li>Put Sisay's <code>item-view.xsl</code> code to show mapped collections on CGSpace (<a href="https://github.com/ilri/DSpace/pull/295">#295</a>)</li>
</ul>
<h2 id="20170109">2017-01-09</h2>
<h2 id="2017-01-09">2017-01-09</h2>
<ul>
<li>A user wrote to tell me that the new display of an item's mappings had a crazy bug for at least one item: <a href="https://cgspace.cgiar.org/handle/10568/78596">https://cgspace.cgiar.org/handle/10568/78596</a></li>
<li>She said she only mapped it once, but it appears to be mapped 184 times</li>
</ul>
<p><img src="/cgspace-notes/2017/01/mapping-crazy-duplicate.png" alt="Crazy item mapping"></p>
<h2 id="20170110">2017-01-10</h2>
<h2 id="2017-01-10">2017-01-10</h2>
<ul>
<li>I tried to clean up the duplicate mappings by exporting the item's metadata to CSV, editing, and re-importing, but DSpace said “no changes were detected”</li>
<li>I've asked on the dspace-tech mailing list to see if anyone can help</li>
@ -210,7 +210,7 @@ Caused by: java.net.SocketException: Broken pipe (Write failed)
<li>I will have to ask the DSpace people if this is a valid approach</li>
<li>Finish looking at the Journal Title corrections of the top 500 Journal Titles so we can make a controlled vocabulary from it</li>
</ul>
<h2 id="20170111">2017-01-11</h2>
<h2 id="2017-01-11">2017-01-11</h2>
<ul>
<li>Maria found another item with duplicate mappings: <a href="https://cgspace.cgiar.org/handle/10568/78658">https://cgspace.cgiar.org/handle/10568/78658</a></li>
<li>Error in <code>fix-metadata-values.py</code> when it tries to print the value for Entwicklung & Ländlicher Raum:</li>
@ -238,11 +238,11 @@ UnicodeEncodeError: 'ascii' codec can't encode character u'\xe4' in position 15:
<li>I will have to go through these and fix some more before making the controlled vocabulary</li>
<li>Added 30 more corrections or so, now there are 49 total and I'll have to get the top 500 after applying them</li>
</ul>
<h2 id="20170113">2017-01-13</h2>
<h2 id="2017-01-13">2017-01-13</h2>
<ul>
<li>Add <code>FOOD SYSTEMS</code> to CIAT subjects, waiting to merge: <a href="https://github.com/ilri/DSpace/pull/296">https://github.com/ilri/DSpace/pull/296</a></li>
</ul>
<h2 id="20170116">2017-01-16</h2>
<h2 id="2017-01-16">2017-01-16</h2>
<ul>
<li>Fix the two items Maria found with duplicate mappings with this script:</li>
</ul>
@ -250,7 +250,7 @@ UnicodeEncodeError: 'ascii' codec can't encode character u'\xe4' in position 15:
delete from collection2item where item_id = '80596' and id not in (90792, 90806, 90807);
/* 1 incorrect mapping: https://cgspace.cgiar.org/handle/10568/78658 */
delete from collection2item where id = '91082';
</code></pre><h2 id="20170117">2017-01-17</h2>
</code></pre><h2 id="2017-01-17">2017-01-17</h2>
<ul>
<li>Helping clean up some file names in the 232 CIAT records that Sisay worked on last week</li>
<li>There are about 30 files with <code>%20</code> (space) and Spanish accents in the file name</li>
@ -276,18 +276,18 @@ $ gs -sDEVICE=pdfwrite -dCompatibilityLevel=1.4 -dPDFSETTINGS=/ebook -dNOPAUSE -
</code></pre><ul>
<li>Someone on the Internet suggested using a DPI of 144</li>
</ul>
<h2 id="20170119">2017-01-19</h2>
<h2 id="2017-01-19">2017-01-19</h2>
<ul>
<li>In testing a random sample of CIAT's PDFs for compressibility, it looks like all of these methods generally increase the file size, so we will just import them as they are</li>
<li>Import 232 CIAT records into CGSpace:</li>
</ul>
<pre><code>$ JAVA_OPTS="-Xmx512m -Dfile.encoding=UTF-8" /home/cgspace.cgiar.org/bin/dspace import --add --eperson=aorth@mjanja.ch --collection=10568/68704 --source /home/aorth/CIAT_232/SimpleArchiveFormat/ --mapfile=/tmp/ciat.map &> /tmp/ciat.log
</code></pre><h2 id="20170122">2017-01-22</h2>
</code></pre><h2 id="2017-01-22">2017-01-22</h2>
<ul>
<li>Looking at some records that Sisay is having problems importing into DSpace Test (seems to be because of copious whitespace return characters from Excel's CSV exporter)</li>
<li>There were also some issues with an invalid dc.date.issued field, and I trimmed leading / trailing whitespace and cleaned up some URLs with unneeded parameters like ?show=full</li>
</ul>
<h2 id="20170123">2017-01-23</h2>
<h2 id="2017-01-23">2017-01-23</h2>
<ul>
<li>I merged Atmire's pull request into the development branch so they can deploy it on DSpace Test</li>
<li>Move some old ILRI Program communities to a new subcommunity for former programs (10568/79164):</li>
@ -298,7 +298,7 @@ $ gs -sDEVICE=pdfwrite -dCompatibilityLevel=1.4 -dPDFSETTINGS=/ebook -dNOPAUSE -
</ul>
<pre><code>10568/42161 10568/171 10568/79341
10568/41914 10568/171 10568/79340
</code></pre><h2 id="20170124">2017-01-24</h2>
</code></pre><h2 id="2017-01-24">2017-01-24</h2>
<ul>
<li>Run all updates on DSpace Test and reboot the server</li>
<li>Run fixes for Journal titles on CGSpace:</li>
@ -312,7 +312,7 @@ $ gs -sDEVICE=pdfwrite -dCompatibilityLevel=1.4 -dPDFSETTINGS=/ebook -dNOPAUSE -
<li>Then sort them in OpenRefine and create a controlled vocabulary by manually adding the XML markup, pull request (<a href="https://github.com/ilri/DSpace/pull/298">#298</a>)</li>
<li>This would be the last issue remaining to close the meta issue about switching to controlled vocabularies (<a href="https://github.com/ilri/DSpace/pull/69">#69</a>)</li>
</ul>
<h2 id="20170125">2017-01-25</h2>
<h2 id="2017-01-25">2017-01-25</h2>
<ul>
<li>Atmire says the <code>com.atmire.statistics.util.UpdateSolrStorageReports</code> and <code>com.atmire.utils.ReportSender</code> are no longer necessary because they are using a Spring scheduler for these tasks now</li>
<li>Pull request to remove them from the Ansible templates: <a href="https://github.com/ilri/rmg-ansible-public/pull/80">https://github.com/ilri/rmg-ansible-public/pull/80</a></li>
@ -325,18 +325,18 @@ $ gs -sDEVICE=pdfwrite -dCompatibilityLevel=1.4 -dPDFSETTINGS=/ebook -dNOPAUSE -
</li>
<li>But now we have a new issue with the “Types” in Content statistics not being respected—we only get the defaults, despite having custom settings in <code>dspace/config/modules/atmire-cua.cfg</code></li>
</ul>
<h2 id="20170127">2017-01-27</h2>
<h2 id="2017-01-27">2017-01-27</h2>
<ul>
<li>Magdalena pointed out that somehow the Anonymous group had been added to the Administrators group on CGSpace (!)</li>
<li>Discuss plans to update CCAFS metadata and communities for their new flagships and phase II project identifiers</li>
<li>The flagships are in <code>cg.subject.ccafs</code>, and we probably need to make a new field for the phase II project identifiers</li>
</ul>
<h2 id="20170128">2017-01-28</h2>
<h2 id="2017-01-28">2017-01-28</h2>
<ul>
<li>Merge controlled vocabulary for journal titles (<code>dc.source</code>) into CGSpace (<a href="https://github.com/ilri/DSpace/pull/298">#298</a>)</li>
<li>Merge new CIAT subject into CGSpace (<a href="https://github.com/ilri/DSpace/pull/296">#296</a>)</li>
</ul>
<h2 id="20170129">2017-01-29</h2>
<h2 id="2017-01-29">2017-01-29</h2>
<ul>
<li>Run all system updates on DSpace Test, redeploy DSpace code, and reboot the server</li>
<li>Run all system updates on CGSpace, redeploy DSpace code, and reboot the server</li>