mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-27 05:49:12 +01:00
Add notes for 2017-06-20
@ -91,3 +91,19 @@ $ JAVA_OPTS="-Xmx1024m -Dfile.encoding=UTF-8" [dspace]/bin/dspace import --add -
- Redeploy CGSpace with latest changes from `5_x-prod`, run system updates, and reboot the server
- Continue working on ansible infrastructure changes for CGIAR Library

## 2017-06-20

- Import Abenet and Peter's changes to the CGIAR Library CRP community
- Because they used Windows and renamed some columns, there were formatting, encoding, and duplicate metadata value issues
- I had to remove some fields from the CSV and rename others back to, for example, `dc.subject[en_US]` so that DSpace would detect the changes properly
- Now it looks much better: https://dspacetest.cgiar.org/handle/10947/2517
- Remove the HTML tags and HTML/XML entities using the following GREL:
- `replace(value,/<\/?\w+((\s+\w+(\s*=\s*(?:".*?"|'.*?'|[^'">\s]+))?)+\s*|\s*)\/?>/,'')`
- `value.unescape("html").unescape("xml")`
- Finally import 914 CIAT Book Chapters to CGSpace in two batches:
```
$ JAVA_OPTS="-Xmx1024m -Dfile.encoding=UTF-8" [dspace]/bin/dspace import --add --eperson=aorth@mjanja.ch --collection=10568/35701 --source /home/aorth/CIAT-Books/SimpleArchiveFormat/ --mapfile=/tmp/ciat-books.map &> /tmp/ciat-books.log
$ JAVA_OPTS="-Xmx1024m -Dfile.encoding=UTF-8" [dspace]/bin/dspace import --add --eperson=aorth@mjanja.ch --collection=10568/35701 --source /home/aorth/CIAT-Books/SimpleArchiveFormat/ --mapfile=/tmp/ciat-books2.map &> /tmp/ciat-books2.log
```
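
As a rough illustration of the GREL cleanup noted earlier (strip HTML tags, then unescape HTML/XML entities), the same two steps could be approximated outside OpenRefine in Python. This is only a sketch under stated assumptions: `TAG_RE` and `clean_value` are hypothetical names, the regular expression is the one from the notes carried over unchanged, and `html.unescape` is assumed to be an acceptable stand-in for the two GREL `unescape` calls because it also decodes the XML entities.

```python
import html
import re

# Same pattern as the GREL replace() expression in the notes,
# compiled as a Python regular expression.
TAG_RE = re.compile(
    r"""<\/?\w+((\s+\w+(\s*=\s*(?:".*?"|'.*?'|[^'">\s]+))?)+\s*|\s*)\/?>"""
)


def clean_value(value: str) -> str:
    """Strip HTML tags first, then decode HTML/XML character entities.

    Hypothetical helper, not part of DSpace or OpenRefine.
    """
    without_tags = TAG_RE.sub("", value)
    # Approximates value.unescape("html").unescape("xml") in a single pass.
    return html.unescape(without_tags)


if __name__ == "__main__":
    print(clean_value('<p class="intro">Rice &amp; beans</p>'))
```

Tags are stripped before unescaping, so text that only becomes tag-like after decoding (for example `&lt;b&gt;`) is kept as literal text rather than removed.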