1
0
mirror of https://github.com/ilri/csv-metadata-quality.git synced 2024-11-16 11:07:03 +01:00
csv-metadata-quality/csv_metadata_quality
Alan Orth 8435ee242d
Experimental language detection using langid
Works decenty well assuming the title, abstract, and citation fields
are an accurate representation of the language as identified by the
language field. Handles ISO 639-1 (alpha 2) and ISO 639-3 (alpha 3)
values seamlessly.

This includes updated pipenv environment, test data, pytest tests
for both correct and incorrect ISO 639-1 and ISO 639-3 languages,
and a new command line option "-e".
2019-09-26 13:46:32 +03:00
..
__init__.py Refactor as package with subpackages 2019-07-26 22:11:10 +03:00
__main__.py Sort imports with isort 2019-08-29 01:15:04 +03:00
app.py Experimental language detection using langid 2019-09-26 13:46:32 +03:00
check.py More ISO 639-1 and ISO 639-3 fixes 2019-09-26 07:44:39 +03:00
experimental.py Experimental language detection using langid 2019-09-26 13:46:32 +03:00
fix.py Sort imports with isort 2019-08-29 01:15:04 +03:00
version.py Format with black 2019-08-29 01:10:39 +03:00