Add notes for 2017-06-20

This commit is contained in:
2017-06-20 12:00:40 +03:00
parent 4756e9025b
commit 41ba0acca9
3 changed files with 44 additions and 8 deletions

View File

@ -91,3 +91,19 @@ $ JAVA_OPTS="-Xmx1024m -Dfile.encoding=UTF-8" [dspace]/bin/dspace import --add -
- Redeploy CGSpace with latest changes from `5_x-prod`, run system updates, and reboot the server
- Continue working on ansible infrastructure changes for CGIAR Library
## 2017-06-20
- Import Abenet and Peter's changes to the CGIAR Library CRP community
- Due to them using Windows and renaming some columns there were formatting, encoding, and duplicate metadata value issues
- I had to remove some fields from the CSV and rename some back to, ie, `dc.subject[en_US]` just so DSpace would detect changes properly
- Now it looks much better: https://dspacetest.cgiar.org/handle/10947/2517
- Removing the HTML tags and HTML/XML entities using the following GREL:
- `replace(value,/<\/?\w+((\s+\w+(\s*=\s*(?:".*?"|'.*?'|[^'">\s]+))?)+\s*|\s*)\/?>/,'')`
- `value.unescape("html").unescape("xml")`
- Finally import 914 CIAT Book Chapters to CGSpace in two batches:
```
$ JAVA_OPTS="-Xmx1024m -Dfile.encoding=UTF-8" [dspace]/bin/dspace import --add --eperson=aorth@mjanja.ch --collection=10568/35701 --source /home/aorth/CIAT-Books/SimpleArchiveFormat/ --mapfile=/tmp/ciat-books.map &> /tmp/ciat-books.log
$ JAVA_OPTS="-Xmx1024m -Dfile.encoding=UTF-8" [dspace]/bin/dspace import --add --eperson=aorth@mjanja.ch --collection=10568/35701 --source /home/aorth/CIAT-Books/SimpleArchiveFormat/ --mapfile=/tmp/ciat-books2.map &> /tmp/ciat-books2.log
```