mirror of https://github.com/ilri/csv-metadata-quality.git synced 2024-11-16 11:07:03 +01:00
csv-metadata-quality/data/test.csv
Alan Orth fa4fa3491b
Add check for "suspicious" characters
These standalone characters often indicate issues with encoding or
copy/paste in languages with accents like French and Spanish. For
example: foreˆt should be forêt.

It is not possible to fix these issues automatically, but this will
print a warning so you can notify the owner of the data.
2019-07-29 17:08:49 +03:00
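
The check described in the commit message above could be sketched roughly as follows. This is a minimal illustration, not the project's actual implementation: the names `SUSPICIOUS_CHARACTERS`, `find_suspicious_characters`, and `check_field` are hypothetical, and the set of characters treated as suspicious is an assumption. It only warns and never modifies the value, as the commit message says an automatic fix is not possible.

```python
# Hypothetical set of standalone accent characters that often point to
# encoding or copy/paste damage. This list is an assumption for
# illustration, not the list used by the project.
SUSPICIOUS_CHARACTERS = {
    "\u00b4": "acute accent",
    "\u02c6": "modifier letter circumflex accent",
    "\u02dc": "small tilde",
    "\u00a8": "diaeresis",
}


def find_suspicious_characters(value):
    """Return (index, character) pairs for each suspicious character in value."""
    return [(i, ch) for i, ch in enumerate(value) if ch in SUSPICIOUS_CHARACTERS]


def check_field(field_name, value):
    """Print one warning per suspicious character and return value unchanged."""
    # Multi-value cells in the test data are separated by "||".
    for part in value.split("||"):
        for index, ch in find_suspicious_characters(part):
            print(
                f"Suspicious character ({field_name}): "
                f"{part!r} contains U+{ord(ch):04X} at index {index}"
            )
    return value
```

Calling `check_field("dc.contributor.author", "Suspicious Character||fore\u02c6t")` would print a single warning for the second value and hand the cell back untouched, leaving the correction to the owner of the data.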

425 B

dc.contributor.author,birthdate,dc.identifier.issn,dc.identifier.isbn
Alan|| Alan||Alan Orth||Alan ||Alan Orth |Alan,1984,0378-5955,978-0-306-40615-6||99921-58-10-7
Stella|| Stella ||Stella Orth||Stella ,1984-11-27,2321-2302,99921-58-10-7
Sophia,2019-06-15,,
Test,2019-06-15,0,
"Doe, J.",2019-06-15||2019-01-10,,
Someone,,0378-5955|0378-5955,
Unnecessary Unicode,2019-07-29,,
Suspicious Character||foreˆt,2019-07-29,,