Update notes for 2019-10-12

This commit is contained in:
2019-10-12 23:28:50 +03:00
parent 05433a338d
commit e6d361c7fe
3 changed files with 18 additions and 10 deletions

View File

@ -125,7 +125,11 @@ International Maize and Wheat Improvement Centre,International Maize and Wheat I
$ ./fix-metadata-values.py -i /tmp/affiliations.csv -db dspace -u dspace -p 'fuuu' -f from -m 211 -t to
```
- I did some manual curation of ~227 authors in preparation for telling Peter and Abenet that the migration is almost ready
- I did some manual curation of about 300 authors in OpenRefine in preparation for telling Peter and Abenet that the migration is almost ready
- I would still like to perhaps (re)move institutional authors from `dc.contributor.author` to `cg.contributor.affiliation`, but I will have to run that by Francesca, Carol, and Abenet
- I could use a custom text facet like this in OpenRefine to find authors that likely match the "Last, F." pattern: `isNotNull(value.match(/^.*, \p{Lu}\.?.*$/))`
- The `\p{Lu}` is a cool [regex character class](https://www.regular-expressions.info/unicode.html) to make sure this works for letters with accents
- As cool as that is, it's actually more effective to just search for authors that have "." in them!
- I've decided to add a `cg.contributor.affiliation` column to 1,025 items based on the logic above where the author name is not an actual person
<!-- vim: set sw=2 ts=2: -->