mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-27 05:49:12 +01:00
Update notes for 2017-12-17
@@ -202,4 +202,13 @@ UPDATE 1
- The dates are in super long ISO8601 format (from Excel?) like `2016-02-07T00:00:00Z` so I converted them to simpler forms in GREL: `value.toString("yyyy-MM-dd")`
- I trimmed whitespace in a few fields, but there wasn't much
- Rename her thumbnail column to filename, and format it so SAFBuilder adds the files to the thumbnail bundle with this GREL in OpenRefine: `value + "__bundle:THUMBNAIL"`
- Rename dc.identifier.status and dc.identifier.url columns to cg.identifier.status and cg.identifier.url
- Item 4 has weird characters in its citation, e.g. Nagoya et de Trait
- Some author names need normalization, e.g. `Aggarwal, Pramod` and `Aggarwal, Pramod K.`
- Something weird going on with duplicate authors that have the same text value, like `Berto, Jayson C.` and `Balmeo, Katherine P.`
- I will send her feedback on some author names like UNEP and ICRISAT and ask her for the missing thumbnail11.jpg
- I did a test import of the data locally after building with SAFBuilder, but for some reason I had to specify the collection on the command line (even though the collections were already specified in the `collection` field):
```
$ JAVA_OPTS="-Xmx512m -Dfile.encoding=UTF-8" ~/dspace/bin/dspace import --add --eperson=aorth@mjanja.ch --collection=10568/89338 --source /Users/aorth/Downloads/2016\ bulk\ upload\ thumbnails/SimpleArchiveFormat --mapfile=/tmp/ccafs.map &> /tmp/ccafs.log
```
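
For reference, the cleanup steps above (date shortening, whitespace trimming, the thumbnail bundle suffix, and the column renames) could be sketched outside OpenRefine roughly as follows. This is only an illustration, not what was actually run: the `clean_row` helper, the `dc.date.issued` date column name, and the `thumbnail` source column name are assumptions made for the example.

```python
from datetime import datetime

# Column renames described in the notes. The original name of the
# thumbnail column ("thumbnail") is an assumption for this sketch.
RENAMES = {
    "thumbnail": "filename",
    "dc.identifier.status": "cg.identifier.status",
    "dc.identifier.url": "cg.identifier.url",
}


def clean_row(row, date_field="dc.date.issued"):
    """Apply the cleanup steps to one row (dict of column name -> string).

    date_field is a hypothetical column name; adjust it to the real sheet.
    """
    cleaned = {}
    for column, value in row.items():
        value = value.strip()  # trim leading/trailing whitespace
        if column == date_field and value:
            # 2016-02-07T00:00:00Z -> 2016-02-07
            parsed = datetime.strptime(value, "%Y-%m-%dT%H:%M:%SZ")
            value = parsed.strftime("%Y-%m-%d")
        if column == "thumbnail" and value:
            # Suffix so SAFBuilder adds the file to the THUMBNAIL bundle
            value = value + "__bundle:THUMBNAIL"
        cleaned[RENAMES.get(column, column)] = value
    return cleaned


if __name__ == "__main__":
    example = {
        "dc.date.issued": "2016-02-07T00:00:00Z",
        "thumbnail": " thumbnail11.jpg ",
        "dc.identifier.url": "https://example.org/item",
    }
    print(clean_row(example))
```

Doing this per row with a plain dict keeps it easy to plug into `csv.DictReader` and `csv.DictWriter` if the spreadsheet is exported as CSV first.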