1
0
mirror of https://github.com/ilri/csv-metadata-quality.git synced 2024-12-22 20:22:18 +01:00

README.md: Add note about Pandas

This commit is contained in:
Alan Orth 2019-07-29 10:56:02 +03:00
parent cf6c01caaf
commit a8a41d60b6
Signed by: alanorth
GPG Key ID: 0FB860CC9C45B1B9

View File

@ -1,14 +1,14 @@
# CSV Metadata Quality [![builds.sr.ht status](https://builds.sr.ht/~alanorth/csv-metadata-quality.svg)](https://builds.sr.ht/~alanorth/csv-metadata-quality?) # CSV Metadata Quality [![builds.sr.ht status](https://builds.sr.ht/~alanorth/csv-metadata-quality.svg)](https://builds.sr.ht/~alanorth/csv-metadata-quality?)
A simple but opinionated metadata quality checker and fixer designed to work with CSVs in the DSpace ecosystem. Supports multi-value fields using the standard DSpace value separator ("||"). Despite the name it does support reading Excel files. A simple but opinionated metadata quality checker and fixer designed to work with CSVs in the DSpace ecosystem. Supports multi-value fields using the standard DSpace value separator ("||"). Despite the name it does support reading Excel files.
Written and tested using Python 3.7. Written and tested using Python 3.7. CSV and Excel support comes from the [Pandas](https://pandas.pydata.org/) library.
## Checks ## Checks
Supports checking the validity of the following metadata elements: Supports checking the validity of the following metadata elements:
- ISSN - ISSN
- ISBN - ISBN
- Multi-value separators - Multi-value separators ("||")
- Dates - Dates
## Fixes ## Fixes