mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-27 05:49:12 +01:00
Add notes for 2020-01-27
@@ -9,7 +9,7 @@
 <meta property="og:description" content="2016-03-02
 
 Looking at issues with author authorities on CGSpace
-For some reason we still have the index-lucene-update cron job active on CGSpace, but I'm pretty sure we don't need it as of the latest few versions of Atmire's Listings and Reports module
+For some reason we still have the index-lucene-update cron job active on CGSpace, but I’m pretty sure we don’t need it as of the latest few versions of Atmire’s Listings and Reports module
 Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Java JDK 1.7 to match environment on CGSpace server
 " />
 <meta property="og:type" content="article" />
@@ -22,10 +22,10 @@ Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Ja
 <meta name="twitter:description" content="2016-03-02
 
 Looking at issues with author authorities on CGSpace
-For some reason we still have the index-lucene-update cron job active on CGSpace, but I'm pretty sure we don't need it as of the latest few versions of Atmire's Listings and Reports module
+For some reason we still have the index-lucene-update cron job active on CGSpace, but I’m pretty sure we don’t need it as of the latest few versions of Atmire’s Listings and Reports module
 Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Java JDK 1.7 to match environment on CGSpace server
 "/>
-<meta name="generator" content="Hugo 0.62.2" />
+<meta name="generator" content="Hugo 0.63.1" />
 
 
 
@@ -55,7 +55,7 @@ Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Ja
 
 <!-- combined, minified CSS -->
 
-<link href="https://alanorth.github.io/cgspace-notes/css/style.a20c1a4367639632cdb341d23c27ca44fedcc75b0f8b3cbea6203010da153d3c.css" rel="stylesheet" integrity="sha256-ogwaQ2djljLNs0HSPCfKRP7cx1sPizy+piAwENoVPTw=" crossorigin="anonymous">
+<link href="https://alanorth.github.io/cgspace-notes/css/style.23e2c3298bcc8c1136c19aba330c211ec94c36f7c4454ea15cf4d3548370042a.css" rel="stylesheet" integrity="sha256-I+LDKYvMjBE2wZq6MwwhHslMNvfERU6hXPTTVINwBCo=" crossorigin="anonymous">
 
 
 <!-- RSS 2.0 feed -->
@@ -103,14 +103,14 @@ Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Ja
 <h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/2016-03/">March, 2016</a></h2>
 <p class="blog-post-meta"><time datetime="2016-03-02T16:50:00+03:00">Wed Mar 02, 2016</time> by Alan Orth in
 
-<i class="fa fa-tag" aria-hidden="true"></i> <a href="/cgspace-notes/tags/notes" rel="tag">Notes</a>
+<span class="fas fa-tag" aria-hidden="true"></span> <a href="/cgspace-notes/tags/notes" rel="tag">Notes</a>
 
 </p>
 </header>
 <h2 id="2016-03-02">2016-03-02</h2>
 <ul>
 <li>Looking at issues with author authorities on CGSpace</li>
-<li>For some reason we still have the <code>index-lucene-update</code> cron job active on CGSpace, but I'm pretty sure we don't need it as of the latest few versions of Atmire's Listings and Reports module</li>
+<li>For some reason we still have the <code>index-lucene-update</code> cron job active on CGSpace, but I’m pretty sure we don’t need it as of the latest few versions of Atmire’s Listings and Reports module</li>
 <li>Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Java JDK 1.7 to match environment on CGSpace server</li>
 </ul>
 <h2 id="2016-03-07">2016-03-07</h2>
@@ -124,16 +124,16 @@ Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Ja
 <pre><code>Exception in thread "Lucene Merge Thread #19" org.apache.lucene.index.MergePolicy$MergeException: java.io.IOException: No space left on device
 </code></pre><h2 id="2016-03-08">2016-03-08</h2>
 <ul>
-<li>Add a few new filters to Atmire's Listings and Reports module (<a href="https://github.com/ilri/DSpace/issues/180">#180</a>)</li>
+<li>Add a few new filters to Atmire’s Listings and Reports module (<a href="https://github.com/ilri/DSpace/issues/180">#180</a>)</li>
 <li>We had also wanted to add a few to the Content and Usage module but I have to ask the editors which ones they were</li>
 </ul>
 <h2 id="2016-03-10">2016-03-10</h2>
 <ul>
-<li>Disable the lucene cron job on CGSpace as it shouldn't be needed anymore</li>
+<li>Disable the lucene cron job on CGSpace as it shouldn’t be needed anymore</li>
 <li>Discuss ORCiD and duplicate authors on Yammer</li>
 <li>Request new documentation for Atmire CUA and L&R modules, as ours are from 2013</li>
 <li>Walk Sisay through some data cleaning workflows in OpenRefine</li>
-<li>Start cleaning up the configuration for Atmire's CUA module (<a href="https://github.com/ilri/DSpace/issues/185">#184</a>)</li>
+<li>Start cleaning up the configuration for Atmire’s CUA module (<a href="https://github.com/ilri/DSpace/issues/185">#184</a>)</li>
 <li>It is very messed up because some labels are incorrect, fields are missing, etc</li>
 </ul>
 <p><img src="/cgspace-notes/2016/03/cua-label-mixup.png" alt="Mixed up label in Atmire CUA"></p>
@@ -143,7 +143,7 @@ Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Ja
 <h2 id="2016-03-11">2016-03-11</h2>
 <ul>
 <li>As I was looking at the CUA config I realized our Discovery config is all messed up and confusing</li>
-<li>I've opened an issue to track some of that work (<a href="https://github.com/ilri/DSpace/issues/186">#186</a>)</li>
+<li>I’ve opened an issue to track some of that work (<a href="https://github.com/ilri/DSpace/issues/186">#186</a>)</li>
 <li>I did some major cleanup work on Discovery and XMLUI stuff related to the <code>dc.type</code> indexes (<a href="https://github.com/ilri/DSpace/pull/187">#187</a>)</li>
 <li>We had been confusing <code>dc.type</code> (a Dublin Core value) with <code>dc.type.output</code> (a value we invented) for a few years and it had permeated all aspects of our data, indexes, item displays, etc.</li>
 <li>There is still some more work to be done to remove references to old <code>outputtype</code> and <code>output</code></li>
@@ -162,7 +162,7 @@ Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Ja
 </ul>
 <h2 id="2016-03-16">2016-03-16</h2>
 <ul>
-<li>Still having problems deploying Atmire's CUA updates and fixes from January!</li>
+<li>Still having problems deploying Atmire’s CUA updates and fixes from January!</li>
 <li>More discussion on the GitHub issue here: <a href="https://github.com/ilri/DSpace/pull/182">https://github.com/ilri/DSpace/pull/182</a></li>
 <li>Clean up Atmire CUA config (<a href="https://github.com/ilri/DSpace/pull/193">#193</a>)</li>
 <li>Help Sisay with some PostgreSQL queries to clean up the incorrect <code>dc.contributor.corporateauthor</code> field</li>
@@ -183,7 +183,7 @@ Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Ja
 1942624 | 35347 | 37 | hi | | 1 | | -1 | 2
 1942639 | 35339 | 37 | hi | | 1 | | -1 | 2
 </code></pre><ul>
-<li>It seems this <code>dc.language</code> field isn't really used, but we should delete these values</li>
+<li>It seems this <code>dc.language</code> field isn’t really used, but we should delete these values</li>
 <li>Also, <code>dc.language.iso</code> has some weird values, like “En” and “English”</li>
 </ul>
 <h2 id="2016-03-17">2016-03-17</h2>
@@ -214,13 +214,13 @@ Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Ja
 </ul>
 <h2 id="2016-03-21">2016-03-21</h2>
 <ul>
-<li>Fix 66 site errors in Google's webmaster tools</li>
+<li>Fix 66 site errors in Google’s webmaster tools</li>
 <li>I looked at a bunch of them and they were old URLs, weird things linked from non-existent items, etc, so I just marked them all as fixed</li>
 <li>We also have 1,300 “soft 404” errors for URLs like: <a href="https://cgspace.cgiar.org/handle/10568/440/browse?type=bioversity">https://cgspace.cgiar.org/handle/10568/440/browse?type=bioversity</a></li>
-<li>I've marked them as fixed as well since the ones I tested were working fine</li>
+<li>I’ve marked them as fixed as well since the ones I tested were working fine</li>
 <li>This raises another question, as many of these pages are linked from Discovery search results and might create a duplicate content problem…</li>
 <li>Results pages like this give items that Google already knows from the sitemap: <a href="https://cgspace.cgiar.org/discover?filtertype=author&filter_relational_operator=equals&filter=Orth%2C+A">https://cgspace.cgiar.org/discover?filtertype=author&filter_relational_operator=equals&filter=Orth%2C+A</a>.</li>
-<li>There are some access denied errors on JSPUI links (of course! we forbid them!), but I'm not sure why Google is trying to index them…</li>
+<li>There are some access denied errors on JSPUI links (of course! we forbid them!), but I’m not sure why Google is trying to index them…</li>
 <li>For example:
 <ul>
 <li>This: <a href="https://cgspace.cgiar.org/jspui/bitstream/10568/809/1/main-page.pdf">https://cgspace.cgiar.org/jspui/bitstream/10568/809/1/main-page.pdf</a></li>
@@ -233,9 +233,9 @@ Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Ja
 </ul>
 <p><img src="/cgspace-notes/2016/03/google-index.png" alt="CGSpace pages in Google index"></p>
 <ul>
-<li>Turns out this is a problem with DSpace's <code>robots.txt</code>, and there's a Jira ticket since December, 2015: <a href="https://jira.duraspace.org/browse/DS-2962">https://jira.duraspace.org/browse/DS-2962</a></li>
+<li>Turns out this is a problem with DSpace’s <code>robots.txt</code>, and there’s a Jira ticket since December, 2015: <a href="https://jira.duraspace.org/browse/DS-2962">https://jira.duraspace.org/browse/DS-2962</a></li>
 <li>I am not sure if I want to apply it yet</li>
-<li>For now I've just set a bunch of these dynamic pages to not appear in search results by using the URL Parameters tool in Webmaster Tools</li>
+<li>For now I’ve just set a bunch of these dynamic pages to not appear in search results by using the URL Parameters tool in Webmaster Tools</li>
 </ul>
 <p><img src="/cgspace-notes/2016/03/url-parameters.png" alt="URL parameters cause millions of dynamic pages">
 <img src="/cgspace-notes/2016/03/url-parameters2.png" alt="Setting pages with the filter_0 param not to show in search results"></p>
@@ -244,7 +244,7 @@ Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Ja
 <li>It seems Feedburner can do HTTPS now, so we might be able to update our feeds and simplify the nginx configs</li>
 <li>De-deploy CGSpace with latest <code>5_x-prod</code> branch</li>
 <li>Run updates on CGSpace and reboot server (new kernel, <code>4.5.0</code>)</li>
-<li>Deploy Let's Encrypt certificate for cgspace.cgiar.org, but still need to work it into the ansible playbooks</li>
+<li>Deploy Let’s Encrypt certificate for cgspace.cgiar.org, but still need to work it into the ansible playbooks</li>
 </ul>
 <h2 id="2016-03-22">2016-03-22</h2>
 <ul>
@@ -257,7 +257,7 @@ Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Ja
 <pre><code>Can't find method org.dspace.app.xmlui.aspect.administrative.FlowGroupUtils.processSaveGroup(org.dspace.core.Context,number,string,[Ljava.lang.String;,[Ljava.lang.String;,org.apache.cocoon.environment.wrapper.RequestWrapper). (resource://aspects/Administrative/administrative.js#967)
 </code></pre><ul>
 <li>I can reproduce the same error on DSpace Test and on my Mac</li>
-<li>Looks to be an issue with the Atmire modules, I've submitted a ticket to their tracker.</li>
+<li>Looks to be an issue with the Atmire modules, I’ve submitted a ticket to their tracker.</li>
 </ul>
 <h2 id="2016-03-24">2016-03-24</h2>
 <ul>
@@ -268,7 +268,7 @@ Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Ja
 <ul>
 <li>Having problems with Listings and Reports, seems to be caused by a rogue reference to <code>dc.type.output</code></li>
 <li>This is the error we get when we proceed to the second page of Listings and Reports: <a href="https://gist.github.com/alanorth/b2d7fb5b82f94898caaf">https://gist.github.com/alanorth/b2d7fb5b82f94898caaf</a></li>
-<li>Commenting out the line works, but I haven't figured out the proper syntax for referring to <code>dc.type.*</code></li>
+<li>Commenting out the line works, but I haven’t figured out the proper syntax for referring to <code>dc.type.*</code></li>
 </ul>
 <h2 id="2016-03-28">2016-03-28</h2>
 <ul>