mirror of
https://github.com/ilri/cgspace-java-helpers.git
synced 2024-11-17 04:17:06 +01:00
Curation tasks and helper scripts for the CGSpace institutional repository.
Alan Orth
6b648c2c85
This adds another script to detect and remove more low-quality thu- mbnails. For example: - If an item has an "IM Thumbnail" and a "Generated Thumbnail" in the THUMBNAIL bundle, remove the "Generated Thumbnail" - If an item has a PDF bitstream and a JPEG bitstream with a name or description "thumbnail" in the ORIGINAL bundle, remove the "thumbnail" bitstream in the ORIGINAL bundle and try to remove the "thumbnail.jpg" bitstream in the THUMBNAIL bundle The idea is that we should *always* prefer thumbnails generated by ImageMagick from PDFs in the ORIGINAL bundle and should remove any other manually uploaded thumbnails. |
||
---|---|---|
.github/workflows | ||
.idea | ||
src/main | ||
.gitignore | ||
COPYING | ||
pom.xml | ||
README.md |
CGSpace Java Helpers
DSpace curation tasks and other Java-based helpers used on the CGSpace institutional repository:
- CountryCodeTagger: add ISO 3166-1 Alpha2 country codes to items based on their existing country metadata
- FixJpgJpgThumbnails: Fix low-quality ".jpg.jpg" thumbnails by replacing them with their originals
Tested on DSpace 6.3. Read more about the DSpace curation system.
Build and Install
Integrate into DSpace Build
To use these curation tasks in a DSpace project add the following dependency to dspace/modules/additions/pom.xml
:
<dependency>
<groupId>io.github.ilri.cgspace</groupId>
<artifactId>cgspace-java-helpers</artifactId>
<version>6.1-SNAPSHOT</version>
</dependency>
The jar will be copied to all DSpace applications.
Manual Build and Install
To build the standalone jar:
$ mvn package
Copy the resulting jar to the DSpace lib
directory:
$ cp target/cgspace-java-helpers-6.1-SNAPSHOT.jar ~/dspace/lib/
Configuration
Please refer to the appropriate README.md file:
- Curation Tasks: src/main/java/io/github/ilri/cgspace/ctasks/README.md
- Scripts: src/main/java/io/github/ilri/cgspace/scripts/README.md
Todo
- Add a curation task to normalize DOIs to "https://doi.org" format
Notes
This project was initially created according to the Maven Getting Started Guide:
$ mvn -B archetype:generate -DgroupId=io.github.ilri.cgspace -DartifactId=cgspace-java-helpers -DarchetypeArtifactId=maven-archetype-quickstart -DarchetypeVersion=1.4
License
This work is licensed under the GPLv3.
This repository contains data from the Debian iso-codes project project, which is licensed under the GNU Lesser General Public License v2.1.