mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-27 05:49:12 +01:00
Update notes for 2018-03-21
@@ -414,4 +414,24 @@ java.lang.OutOfMemoryError: Java heap space
- Update [Ansible playbooks](https://github.com/ilri/rmg-ansible-public) to use [PostgreSQL JDBC driver](https://jdbc.postgresql.org/) 42.2.2
- Deploy the new JDBC driver on DSpace Test
- I'm also curious to see how long the `dspace index-discovery -b` takes on DSpace Test where the DSpace installation directory is on one of Linode's new block storage volumes

```
$ time schedtool -D -e ionice -c2 -n7 nice -n19 dspace index-discovery -b

real 208m19.155s
user 8m39.138s
sys 2m45.135s
```

- So that's about three times as long as it took on CGSpace this morning
- I should also check the raw read speed with `hdparm -tT /dev/sdc`
- Looking at Peter's author corrections there are some mistakes due to Windows-1252 encoding
- I need to find a way to filter these easily with OpenRefine
- For example, Peter has inadvertently introduced Unicode character 0xfffd (the replacement character, U+FFFD) into several fields
- I can search for Unicode values by their hex code in OpenRefine using the following GREL expression:

```
isNotNull(value.match(/.*\ufffd.*/))
```

- I need to be able to add many more common characters, though, so that the expression is useful to copy and paste into a new project to find issues
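
As a rough sketch of what a broader check could look like outside OpenRefine, the Python below flags values containing U+FFFD or a C1 control character (U+0080 to U+009F). C1 controls typically appear when Windows-1252 bytes are decoded as Latin-1. The function name `find_suspicious`, the character set, and the sample values are all assumptions for illustration; they are not from these notes or from real CGSpace metadata.

```python
import re

# Hypothetical set of "suspicious" characters (an assumption, extend as needed):
#   U+FFFD         replacement character, left behind by a failed decode
#   U+0080-U+009F  C1 controls, typical when Windows-1252 bytes are read as Latin-1
SUSPICIOUS = re.compile("[\ufffd\u0080-\u009f]")


def find_suspicious(value):
    """Return (index, code point) pairs for each suspicious character in value."""
    return [(m.start(), "U+%04X" % ord(m.group())) for m in SUSPICIOUS.finditer(value)]


if __name__ == "__main__":
    # Made-up sample values, not real metadata
    samples = ["Orth, Alan", "Ram\ufffdrez, J.", "Smith\u0092s"]
    for s in samples:
        print(repr(s), find_suspicious(s))
```

The same character class could probably be used in the GREL expression as `/.*[\ufffd\u0080-\u009f].*/`, since GREL regular expressions follow Java syntax, but I have not tested that in OpenRefine.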