Mirror of https://github.com/alanorth/cgspace-notes.git (synced 2025-01-27 05:49:12 +01:00)
Add notes for 2019-12-17
@@ -43,7 +43,7 @@ After rebooting, all statistics cores were loaded… wow, that's lucky.
 
 Run system updates on DSpace Test (linode19) and reboot it
 "/>
-<meta name="generator" content="Hugo 0.60.1" />
+<meta name="generator" content="Hugo 0.61.0" />
 
 
 
@@ -124,11 +124,11 @@ Run system updates on DSpace Test (linode19) and reboot it
 
 </p>
 </header>
-<h2 id="20190803">2019-08-03</h2>
+<h2 id="2019-08-03">2019-08-03</h2>
 <ul>
 <li>Look at Bioversity's latest migration CSV and now I see that Francesco has cleaned up the extra columns and the newline at the end of the file, but many of the column headers have an extra space in the name…</li>
 </ul>
-<h2 id="20190804">2019-08-04</h2>
+<h2 id="2019-08-04">2019-08-04</h2>
 <ul>
 <li>Deploy ORCID identifier updates requested by Bioversity to CGSpace</li>
 <li>Run system updates on CGSpace (linode18) and reboot it
@@ -139,7 +139,7 @@ Run system updates on DSpace Test (linode19) and reboot it
 </li>
 <li>Run system updates on DSpace Test (linode19) and reboot it</li>
 </ul>
-<h2 id="20190805">2019-08-05</h2>
+<h2 id="2019-08-05">2019-08-05</h2>
 <ul>
 <li>Update Tomcat to 7.0.96 in the <a href="https://github.com/ilri/rmg-ansible-public">Ansible infrastructure playbooks</a></li>
 <li>Update PostgreSQL JDBC driver to 42.2.6 in the <a href="https://github.com/ilri/rmg-ansible-public">Ansible infrastructure playbooks</a></li>
@@ -201,7 +201,7 @@ Run system updates on DSpace Test (linode19) and reboot it
 <li>I tried to extract the filenames and construct a URL to download the PDFs with my <code>generate-thumbnails.py</code> script, but there seem to be several paths for PDFs so I can't guess it properly</li>
 <li>I will have to wait for Francesco to respond about the PDFs, or perhaps proceed with a metadata-only upload so we can do other checks on DSpace Test</li>
 </ul>
-<h2 id="20190806">2019-08-06</h2>
+<h2 id="2019-08-06">2019-08-06</h2>
 <ul>
 <li>Francesca responded to address my feedback yesterday
 <ul>
@@ -213,11 +213,11 @@ Run system updates on DSpace Test (linode19) and reboot it
 </ul>
 </li>
 </ul>
-<h2 id="20190807">2019-08-07</h2>
+<h2 id="2019-08-07">2019-08-07</h2>
 <ul>
 <li>Daniel Haile-Michael asked about using a logical OR with the DSpace OpenSearch, but I looked in the DSpace manual and it does not seem to be possible</li>
 </ul>
-<h2 id="20190808">2019-08-08</h2>
+<h2 id="2019-08-08">2019-08-08</h2>
 <ul>
 <li>Moayad noticed that the HTTPS certificate expired on the AReS dev server (linode20)
 <ul>
@@ -274,7 +274,7 @@ $ ./generate-thumbnails.py -i /tmp/user-upload2.csv -w --url-field-name url -d |
 <li>Though I am really wondering why this happened now, because the configuration has been working for months…</li>
 <li>Improve the output of the suspicious characters check in <a href="https://github.com/alanorth/csv-metadata-quality">csv-metadata-quality</a> script and tag version 0.2.0</li>
 </ul>
-<h2 id="20190809">2019-08-09</h2>
+<h2 id="2019-08-09">2019-08-09</h2>
 <ul>
 <li>Looking at the 128 IITA records (20195TH.xls) that Sisay uploaded to DSpace Test last month: <a href="https://dspacetest.cgiar.org/handle/10568/102361">IITA_July_29</a>
 <ul>
@@ -294,11 +294,11 @@ $ ./generate-thumbnails.py -i /tmp/user-upload2.csv -w --url-field-name url -d |
 </ul>
 </li>
 </ul>
-<h2 id="20190810">2019-08-10</h2>
+<h2 id="2019-08-10">2019-08-10</h2>
 <ul>
 <li>Add checks for uncommon filename extensions and replacements for unnecessary Unicode to the csv-metadata-quality script</li>
 </ul>
-<h2 id="20190812">2019-08-12</h2>
+<h2 id="2019-08-12">2019-08-12</h2>
 <ul>
 <li>Looking at the 128 IITA records again:
 <ul>
@@ -317,7 +317,7 @@ $ ./generate-thumbnails.py -i /tmp/user-upload2.csv -w --url-field-name url -d |
 </ul>
 </li>
 </ul>
-<h2 id="20190813">2019-08-13</h2>
+<h2 id="2019-08-13">2019-08-13</h2>
 <ul>
 <li>Create a test user on DSpace Test for Mohammad Salem to attempt depositing:</li>
 </ul>
@@ -343,7 +343,7 @@ $ ~/dspace/bin/dspace metadata-import -f /tmp/bioversity.csv -e blah@blah.com
 <li>This time it succeeded, and using VisualVM I noticed that the import process used a maximum of 620MB of RAM</li>
 <li>(oops, I realize that actually I forgot to delete items I had flagged as duplicates, so the total should be 1,427 items)</li>
 </ul>
-<h2 id="20190814">2019-08-14</h2>
+<h2 id="2019-08-14">2019-08-14</h2>
 <ul>
 <li>I imported the 1,427 Bioversity records into DSpace Test
 <ul>
@@ -359,11 +359,11 @@ $ dspace metadata-import -f /tmp/bioversity2.csv -e blah@blah.com
 </code></pre><ul>
 <li>The next step is to check these items for duplicates</li>
 </ul>
-<h2 id="20190816">2019-08-16</h2>
+<h2 id="2019-08-16">2019-08-16</h2>
 <ul>
 <li>Email Bioversity to let them know that the 1,427 records are on DSpace Test and that Abenet should look over them</li>
 </ul>
-<h2 id="20190818">2019-08-18</h2>
+<h2 id="2019-08-18">2019-08-18</h2>
 <ul>
 <li>Deploy latest <code>5_x-prod</code> branch on CGSpace (linode18), including the <a href="https://github.com/ilri/DSpace/pull/429">new CCAFS project tags</a></li>
 <li>Deploy Tomcat 7.0.96 and PostgreSQL JDBC 42.2.6 driver on CGSpace (linode18)</li>
@@ -375,7 +375,7 @@ $ dspace metadata-import -f /tmp/bioversity2.csv -e blah@blah.com
 <li>After reboot the statistics-2018 core failed to load so I restarted <code>tomcat7</code> again</li>
 <li>After this last restart all Solr cores seem to be up and running</li>
 </ul>
-<h2 id="20190820">2019-08-20</h2>
+<h2 id="2019-08-20">2019-08-20</h2>
 <ul>
 <li>Francesco sent me a new CSV with the raw filenames and paths for the Bioversity migration
 <ul>
@@ -392,7 +392,7 @@ return os.path.basename(value)
 <li>Then I can try to download all the files again with the script</li>
 <li>I also asked Francesco about the strange filenames (.LCK, .zip, and .7z)</li>
 </ul>
-<h2 id="20190821">2019-08-21</h2>
+<h2 id="2019-08-21">2019-08-21</h2>
 <ul>
 <li>Upload <a href="https://github.com/ilri/csv-metadata-quality">csv-metadata-quality repository to ILRI's GitHub organization</a></li>
 <li>Fix a few invalid countries in IITA's <a href="https://dspacetest.cgiar.org/handle/10568/102361">July 29</a> records (aka “20195TH.xls”)
@@ -402,16 +402,16 @@ return os.path.basename(value)
 </ul>
 </li>
 </ul>
-<h2 id="20190822">2019-08-22</h2>
+<h2 id="2019-08-22">2019-08-22</h2>
 <ul>
 <li>Transfer original <a href="https://github.com/ilri/csv-metadata-quality">csv-metadata-quality</a> repository to ILRI organization on GitHub</li>
 </ul>
-<h2 id="20190823">2019-08-23</h2>
+<h2 id="2019-08-23">2019-08-23</h2>
 <ul>
 <li>Run system updates on AReS / OpenRXV dev server (linode20) and reboot it</li>
 <li>Fix AReS exports on DSpace Test by adding a new nginx proxy pass</li>
 </ul>
-<h2 id="20190826">2019-08-26</h2>
+<h2 id="2019-08-26">2019-08-26</h2>
 <ul>
 <li>Peter sent 2,943 corrections to the author dump I had originally sent him on 2019-05-27
 <ul>
@@ -448,7 +448,7 @@ sys 2m24.715s
 </ul>
 </li>
 </ul>
-<h2 id="20190827">2019-08-27</h2>
+<h2 id="2019-08-27">2019-08-27</h2>
 <ul>
 <li>File <a href="https://github.com/ilri/OpenRXV/issues/11">an issue on OpenRXV</a> for the bug when selecting communities</li>
 <li>Peter approved the related citation changes so I merged the <a href="https://github.com/ilri/DSpace/pull/430">pull request on GitHub</a> and will deploy it to CGSpace this weekend</li>
@@ -461,7 +461,7 @@ sys 2m24.715s
 </li>
 <li>Add a fix for missing space after commas to my <a href="https://github.com/ilri/csv-metadata-quality">csv-metadata-quality</a> script and tag version 0.2.2</li>
 </ul>
-<h2 id="20190828">2019-08-28</h2>
+<h2 id="2019-08-28">2019-08-28</h2>
 <ul>
 <li>Skype with Jane about AReS Phase III priorities</li>
 <li>I did a test to automatically fix some authors in the database using my csv-metadata-quality script
@@ -488,7 +488,7 @@ COPY 65597
 </code></pre><ul>
 <li>I very well might run these on CGSpace soon…</li>
 </ul>
-<h2 id="20190829">2019-08-29</h2>
+<h2 id="2019-08-29">2019-08-29</h2>
 <ul>
 <li>Resume working on the CG Core v2 changes in the <code>5_x-cgcorev2</code> branch again
 <ul>
@@ -522,7 +522,7 @@ COPY 65597
 </code></pre><ul>
 <li>So this is the same issue we had before, where Altmetric <em>knows</em> this Handle is associated with a DOI that has a score, but the client-side JavaScript code doesn't show it because it seems to be a secondary handle or something</li>
 </ul>
-<h2 id="20190831">2019-08-31</h2>
+<h2 id="2019-08-31">2019-08-31</h2>
 <ul>
 <li>Run system updates on DSpace Test (linode19) and reboot the server</li>
 <li>Run the author fixes on DSpace Test and CGSpace and start a full Discovery re-index:</li>