Add notes for 2019-05-17

2019-05-17 18:46:13 +03:00
parent 856084655e
commit 32830944c2
3 changed files with 50 additions and 8 deletions

@@ -364,4 +364,24 @@ COPY 995
- Fork the [ICARDA AReS v1 repository](https://github.com/icarda-git/AReS) to [ILRI's GitHub](https://github.com/ilri/AReS) and give access to CodeObia guys
- The plan is that we develop the v2 code in the ILRI fork
## 2019-05-17
- Peter sent me a bunch of fixes for investors from yesterday
- I did a quick check in OpenRefine (trim and collapse whitespace, clean smart quotes, etc.) and then applied them on CGSpace:
```
$ ./fix-metadata-values.py -i /tmp/2019-05-16-fix-306-Investors.csv -db dspace -u dspace -p 'fuuu' -f dc.description.sponsorship -m 29 -t correct -d
$ ./delete-metadata-values.py -i /tmp/2019-05-16-delete-297-Investors.csv -db dspace -u dspace -p 'fuuu' -m 29 -f dc.description.sponsorship -d
```
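- For reference, a minimal Python sketch of the kind of cleanup described above (trim and collapse whitespace, replace smart quotes). This is not what OpenRefine runs internally; the function name `clean_value` and the quote mapping are assumptions for illustration only:

```python
import re

# Assumed mapping of "smart" quotes to plain ASCII quotes (illustrative, not exhaustive)
SMART_QUOTES = str.maketrans({
    "\u2018": "'",   # left single quotation mark
    "\u2019": "'",   # right single quotation mark
    "\u201c": '"',   # left double quotation mark
    "\u201d": '"',   # right double quotation mark
})


def clean_value(value: str) -> str:
    """Replace smart quotes, collapse runs of whitespace, and trim both ends."""
    value = value.translate(SMART_QUOTES)
    # \s+ matches spaces, tabs, and newlines, so all of them collapse to one space
    return re.sub(r"\s+", " ", value).strip()
```

- Running every value in the CSV through something like `clean_value` before applying the corrections should avoid creating new variants that differ only by whitespace or quote style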
- Then I started a full Discovery re-indexing:
```
$ export JAVA_OPTS="-Dfile.encoding=UTF-8 -Xmx1024m"
$ time schedtool -B -e ionice -c2 -n7 nice -n19 dspace index-discovery -b
```
- I was going to make a new controlled vocabulary of the top 100 terms after these corrections, but I noticed a bunch of duplicates and variations when I sorted them alphabetically
- Instead, I exported a new list and asked Peter to look at it again
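- A rough sketch of how the duplicates and variations could be flagged programmatically instead of by sorting and eyeballing the list. The key function is loosely modelled on the idea of a "fingerprint" clustering key (lowercase, strip punctuation, sort unique tokens); the names `fingerprint`, `find_variants`, and `top_terms` are hypothetical, and the input is assumed to be a plain list of sponsorship strings:

```python
import re
from collections import Counter, defaultdict


def fingerprint(value: str) -> str:
    """Build a normalized key: casefold, drop punctuation, sort unique tokens."""
    tokens = re.sub(r"[^\w\s]", "", value.casefold()).split()
    return " ".join(sorted(set(tokens)))


def find_variants(values):
    """Group values by fingerprint and keep only groups with more than one spelling."""
    groups = defaultdict(Counter)
    for value in values:
        groups[fingerprint(value)][value] += 1
    return {key: counts for key, counts in groups.items() if len(counts) > 1}


def top_terms(values, n=100):
    """Return the n most frequent values as (value, count) pairs."""
    return Counter(values).most_common(n)
```

- Any group returned by `find_variants` would still need a human decision about which spelling is correct, so this only narrows down what to send for review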
<!-- vim: set sw=2 ts=2: -->