From ad33195ba39262608f95bbfaf67ea2218ee86432 Mon Sep 17 00:00:00 2001
From: Alan Orth
Date: Wed, 8 Dec 2021 11:36:34 +0200
Subject: [PATCH] README.md: adjust intro

Makes the badges not wrap and looks better in my opinion.
---
 README.md | 9 ++++++++-
 1 file changed, 8 insertions(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 028969a..69adc2c 100644
--- a/README.md
+++ b/README.md
@@ -1,4 +1,11 @@
-# DSpace CSV Metadata Quality Checker ![GitHub Actions](https://github.com/ilri/csv-metadata-quality/workflows/Build%20and%20Test/badge.svg) [![Build Status](https://ci.mjanja.ch/api/badges/alanorth/csv-metadata-quality/status.svg)](https://ci.mjanja.ch/alanorth/csv-metadata-quality)
+

DSpace CSV Metadata Quality Checker

+
+
+ Build Status
+ Build and Test
+ Code style: black
+

+
 A simple, but opinionated metadata quality checker and fixer designed to work with CSVs in the DSpace ecosystem (though it could theoretically work on any CSV that uses Dublin Core fields as columns). The implementation is essentially a pipeline of checks and fixes that begins with splitting multi-value fields on the standard DSpace "||" separator, trimming leading/trailing whitespace, and then proceeding to more specialized cases like ISSNs, ISBNs, languages, unnecessary Unicode, AGROVOC terms, etc.
 
 Requires Python 3.7.1 or greater (3.8+ recommended). CSV and Excel support comes from the [Pandas](https://pandas.pydata.org/) library, though your mileage may vary with Excel because this is much less tested.
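
For readers unfamiliar with the first two pipeline steps the README intro describes (splitting multi-value fields on the DSpace "||" separator, then trimming leading/trailing whitespace), here is a minimal sketch in plain Python. It is not the project's actual implementation: the function names `split_and_trim` and `normalize_field` are hypothetical, and dropping values that are empty after trimming is an assumption made for the example.

```python
def split_and_trim(value, separator="||"):
    """Split a multi-value field on the separator and strip each value.

    Hypothetical helper for illustration; not taken from csv-metadata-quality.
    """
    return [part.strip() for part in value.split(separator)]


def normalize_field(value, separator="||"):
    """Rejoin the trimmed values into a single field.

    Assumption: values that are empty after trimming are dropped.
    """
    cleaned = [part for part in split_and_trim(value, separator) if part]
    return separator.join(cleaned)


if __name__ == "__main__":
    # A field with stray whitespace around two values.
    print(normalize_field(" Kenya ||Ethiopia  "))
```

Later, more specialized checks (ISSNs, ISBNs, languages, and so on) would presumably operate on each value after it has been split and trimmed this way.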