Update notes for 2019-09-21

2019-09-22 01:41:25 +03:00
parent ddf3b1346b
commit 01348c687d
3 changed files with 10 additions and 10 deletions

@@ -303,6 +303,6 @@ $ dspace import -a me@cgiar.org -m 2019-09-20-bioversity2.map -s /home/aorth/Bio
- langid seems to be the best considering the above experiences
- I added very experimental language detection to the [csv-metadata-quality](https://github.com/ilri/csv-metadata-quality) module
- It works by checking the predicted language of the `dc.title` field against the item's `dc.language.iso` field
- I tested it on the Bioversity migration data set and actually managed to correct about eight incorrect language fields in their records!
- I tested it on the Bioversity migration data set and it actually helped me correct eleven language fields in their records!
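
The check described above (predicted language of `dc.title` versus the item's `dc.language.iso`) could be sketched roughly as follows. This is only an illustration, not the actual csv-metadata-quality code: `check_language` and `toy_detect` are hypothetical names, the detector is passed in as a plain function, and language codes are assumed to be two-letter ISO 639-1 values.

```python
from typing import Callable, Optional


def check_language(
    title: str,
    declared: str,
    detect: Callable[[str], str],
) -> Optional[str]:
    """Return the predicted language code if it disagrees with the declared one.

    Returns None when the prediction matches the declared code.
    Hypothetical helper, assuming two-letter ISO 639-1 codes on both sides.
    """
    declared_code = declared.strip().lower()
    predicted_code = detect(title).strip().lower()
    if predicted_code == declared_code:
        return None
    return predicted_code


def toy_detect(text: str) -> str:
    """Stand-in detector for illustration only: guesses Spanish from stopwords."""
    spanish_hints = {"el", "la", "los", "las", "de", "en", "y"}
    words = text.lower().split()
    hits = sum(word in spanish_hints for word in words)
    return "es" if hits >= 2 else "en"


if __name__ == "__main__":
    # A Spanish title that is declared as English gets flagged with "es"
    title = "La conservación de los recursos genéticos en el campo"
    print(check_language(title, "en", toy_detect))
```

With the langid package installed, something like `lambda text: langid.classify(text)[0]` could presumably be passed as `detect` in place of the toy function, since `langid.classify` returns a `(language, score)` pair. Titles are short, so any mismatch is best treated as a hint to review by hand rather than an automatic correction.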
<!-- vim: set sw=2 ts=2: -->