mirror of
https://github.com/ilri/csv-metadata-quality.git
synced 2024-11-24 14:50:17 +01:00
Strip filename descriptions before checking
Some checks failed
continuous-integration/drone/push Build is failing
Some checks failed
continuous-integration/drone/push Build is failing
When checking for uncommon file extensions in the filename field we should strip descriptions that are meant for SAF Bundler, for example: Annual_Report_2020.pdf__description:Report. This ends up as a false positive that spams the output with warnings.
This commit is contained in:
parent
bde38e9ed4
commit
8bc4cd419c
@ -15,6 +15,8 @@ because it is deprecated and outdated
|
|||||||
- Don't run `fix.separators()` on title or abstract fields
|
- Don't run `fix.separators()` on title or abstract fields
|
||||||
- Don't run whitespace or newline fixes on abstract fields
|
- Don't run whitespace or newline fixes on abstract fields
|
||||||
- Ignore some common non-SPDX licenses
|
- Ignore some common non-SPDX licenses
|
||||||
|
- Ignore `__description` suffix in filenames meant for SAFBuilder when checking
|
||||||
|
for uncommon file extensions
|
||||||
|
|
||||||
### Updated
|
### Updated
|
||||||
- Python dependencies
|
- Python dependencies
|
||||||
|
@ -286,6 +286,11 @@ def filename_extension(field):
|
|||||||
|
|
||||||
# Iterate over all values
|
# Iterate over all values
|
||||||
for value in values:
|
for value in values:
|
||||||
|
# Strip filename descriptions that are meant for SAF Bundler, for
|
||||||
|
# example: Annual_Report_2020.pdf__description:Report
|
||||||
|
if "__description" in value:
|
||||||
|
value = value.split("__")[0]
|
||||||
|
|
||||||
# Assume filename extension does not match
|
# Assume filename extension does not match
|
||||||
filename_extension_match = False
|
filename_extension_match = False
|
||||||
|
|
||||||
|
Loading…
Reference in New Issue
Block a user