mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2024-11-22 14:45:03 +01:00
Update notes for 2018-03-21
This commit is contained in:
parent
261a32d353
commit
81d80a1adc
@ -414,4 +414,24 @@ java.lang.OutOfMemoryError: Java heap space
|
||||
- Update [Ansible playbooks](https://github.com/ilri/rmg-ansible-public) to use [PostgreSQL JBDC driver](https://jdbc.postgresql.org/) 42.2.2
|
||||
- Deploy the new JDBC driver on DSpace Test
|
||||
- I'm also curious to see how long the `dspace index-discovery -b` takes on DSpace Test where the DSpace installation directory is on one of Linode's new block storage volumes
|
||||
|
||||
```
|
||||
$ time schedtool -D -e ionice -c2 -n7 nice -n19 dspace index-discovery -b
|
||||
|
||||
real 208m19.155s
|
||||
user 8m39.138s
|
||||
sys 2m45.135s
|
||||
```
|
||||
|
||||
- So that's about three times as long as it took on CGSpace this morning
|
||||
- I should also check the raw read speed with `hdparm -tT /dev/sdc`
|
||||
- Looking at Peter's author corrections there are some mistakes due to Windows 1252 encoding
|
||||
- I need to find a way to filter these easily with OpenRefine
|
||||
- For example, Peter has inadvertantly introduced Unicode character 0xfffd into several fields
|
||||
- I can search for Unicode values by their hex code in OpenRefine using the following GREL expression:
|
||||
|
||||
```
|
||||
isNotNull(value.match(/.*\ufffd.*/))
|
||||
```
|
||||
|
||||
- I need to be able to add many common characters though so that it is useful to copy and paste into a new project to find issues
|
||||
|
@ -20,7 +20,7 @@ Export a CSV of the IITA community metadata for Martin Mueller
|
||||
|
||||
<meta property="article:published_time" content="2018-03-02T16:07:54+02:00"/>
|
||||
|
||||
<meta property="article:modified_time" content="2018-03-21T11:44:06+02:00"/>
|
||||
<meta property="article:modified_time" content="2018-03-21T18:11:22+02:00"/>
|
||||
|
||||
|
||||
|
||||
@ -51,9 +51,9 @@ Export a CSV of the IITA community metadata for Martin Mueller
|
||||
"@type": "BlogPosting",
|
||||
"headline": "March, 2018",
|
||||
"url": "https://alanorth.github.io/cgspace-notes/2018-03/",
|
||||
"wordCount": "2343",
|
||||
"wordCount": "2459",
|
||||
"datePublished": "2018-03-02T16:07:54+02:00",
|
||||
"dateModified": "2018-03-21T11:44:06+02:00",
|
||||
"dateModified": "2018-03-21T18:11:22+02:00",
|
||||
"author": {
|
||||
"@type": "Person",
|
||||
"name": "Alan Orth"
|
||||
@ -587,7 +587,29 @@ java.lang.OutOfMemoryError: Java heap space
|
||||
<li>Update <a href="https://github.com/ilri/rmg-ansible-public">Ansible playbooks</a> to use <a href="https://jdbc.postgresql.org/">PostgreSQL JBDC driver</a> 42.2.2</li>
|
||||
<li>Deploy the new JDBC driver on DSpace Test</li>
|
||||
<li>I’m also curious to see how long the <code>dspace index-discovery -b</code> takes on DSpace Test where the DSpace installation directory is on one of Linode’s new block storage volumes</li>
|
||||
</ul>
|
||||
|
||||
<pre><code>$ time schedtool -D -e ionice -c2 -n7 nice -n19 dspace index-discovery -b
|
||||
|
||||
real 208m19.155s
|
||||
user 8m39.138s
|
||||
sys 2m45.135s
|
||||
</code></pre>
|
||||
|
||||
<ul>
|
||||
<li>So that’s about three times as long as it took on CGSpace this morning</li>
|
||||
<li>I should also check the raw read speed with <code>hdparm -tT /dev/sdc</code></li>
|
||||
<li>Looking at Peter’s author corrections there are some mistakes due to Windows 1252 encoding</li>
|
||||
<li>I need to find a way to filter these easily with OpenRefine</li>
|
||||
<li>For example, Peter has inadvertantly introduced Unicode character 0xfffd into several fields</li>
|
||||
<li>I can search for Unicode values by their hex code in OpenRefine using the following GREL expression:</li>
|
||||
</ul>
|
||||
|
||||
<pre><code>isNotNull(value.match(/.*\ufffd.*/))
|
||||
</code></pre>
|
||||
|
||||
<ul>
|
||||
<li>I need to be able to add many common characters though so that it is useful to copy and paste into a new project to find issues</li>
|
||||
</ul>
|
||||
|
||||
|
||||
|
@ -4,7 +4,7 @@
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/2018-03/</loc>
|
||||
<lastmod>2018-03-21T11:44:06+02:00</lastmod>
|
||||
<lastmod>2018-03-21T18:11:22+02:00</lastmod>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
@ -154,7 +154,7 @@
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/</loc>
|
||||
<lastmod>2018-03-21T11:44:06+02:00</lastmod>
|
||||
<lastmod>2018-03-21T18:11:22+02:00</lastmod>
|
||||
<priority>0</priority>
|
||||
</url>
|
||||
|
||||
@ -165,7 +165,7 @@
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/tags/notes/</loc>
|
||||
<lastmod>2018-03-21T11:44:06+02:00</lastmod>
|
||||
<lastmod>2018-03-21T18:11:22+02:00</lastmod>
|
||||
<priority>0</priority>
|
||||
</url>
|
||||
|
||||
@ -177,13 +177,13 @@
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/posts/</loc>
|
||||
<lastmod>2018-03-21T11:44:06+02:00</lastmod>
|
||||
<lastmod>2018-03-21T18:11:22+02:00</lastmod>
|
||||
<priority>0</priority>
|
||||
</url>
|
||||
|
||||
<url>
|
||||
<loc>https://alanorth.github.io/cgspace-notes/tags/</loc>
|
||||
<lastmod>2018-03-21T11:44:06+02:00</lastmod>
|
||||
<lastmod>2018-03-21T18:11:22+02:00</lastmod>
|
||||
<priority>0</priority>
|
||||
</url>
|
||||
|
||||
|
Loading…
Reference in New Issue
Block a user