Add notes for 2017-05-15

This commit is contained in:
2017-05-15 07:52:09 +03:00
parent 516e5ecd1d
commit ad891890c7
3 changed files with 20 additions and 8 deletions

View File

@ -132,3 +132,8 @@ dspace=# delete from metadatavalue where resource_type_id=2 and text_value='';
- After quite a bit of troubleshooting with importing cleaned up data as CSV, it seems that there are actually [NUL](https://en.wikipedia.org/wiki/Null_character) characters in the `dc.description.abstract` field (at least) on the lines where CSV importing was failing
- I tried to find a way to remove the characters in vim or Open Refine, but decided it was quicker to just remove the column temporarily and import it
- The import was successful and detected 2022 changes, which should likely be the rest that were failing to import before
## 2017-05-15
- To delete the blank lines that cause isses during import we need to use a regex in vim `g/^$/d`
- After that I started looking in the `dc.subject` field to try to pull countries and regions out, but there are too many values in there