2019-10-28 12:42:02 +01:00
+++
title = "CGSpace CG Core v2 Migration"
2021-03-30 08:56:38 +02:00
date = 2021-02-21T13:27:35+02:00
2019-10-28 12:42:02 +01:00
description = "Possible changes to CGSpace metadata fields to align more with DC, QDC, and DCTERMS as well as CG Core v2."
categories = ["Notes"]
tags = ["Migration"]
url = "cgspace-cgcorev2-migration"
+++
2021-03-30 08:52:52 +02:00
Changes to CGSpace metadata fields to align more with DC, QDC, and DCTERMS as well as CG Core v2. Implemented on 2021-02-21.
2019-10-28 12:42:02 +01:00
With reference to [CG Core v2 draft standard ](https://agriculturalsemantics.github.io/cg-core/cgcore.html ) by Marie-Angélique as well as [DCMI DCTERMS ](http://www.dublincore.org/specifications/dublin-core/dcmi-terms/ ).
<!-- more -->
- [Proposed Changes ](#proposed-changes )
2021-09-21 11:46:34 +02:00
- [Out of Scope ](#out-of-scope )
2019-10-28 12:42:02 +01:00
- [Fields to Create ](#fields-to-create )
- [Fields to Delete ](#fields-to-delete )
- [Implementation Progress ](#implementation-progress )
## Proposed Changes
2021-01-18 15:08:40 +01:00
As of 2021-01-18 the scope of the changes includes the following fields:
2019-10-28 12:42:02 +01:00
- cg.creator.id→cg.creator.identifier
- ORCID identifiers
- dc.format.extent→dcterms.extent
- dc.date.issued→dcterms.issued
- dc.description.abstract→dcterms.abstract
- dc.description→dcterms.description
- dc.description.sponsorship→cg.contributor.donor
- values from CrossRef or Grid.ac if possible
2021-01-18 15:08:40 +01:00
- dc.description.version→cg.reviewStatus
- cg.fulltextstatus→cg.howPublished
2019-10-28 12:42:02 +01:00
- CGSpace uses values like "Formally Published" or "Grey Literature"
- dc.identifier.citation→dcterms.bibliographicCitation
- cg.identifier.status→dcterms.accessRights
- current values are "Open Access" and "Limited Access"
- future values are possibly "Open" and "Restricted"?
- dc.language.iso→dcterms.language
- current values are ISO 639-1 (aka Alpha 2)
- future values are possibly ISO 639-3 (aka Alpha 3)?
- cg.link.reference→dcterms.relation
- dc.publisher→dcterms.publisher
2021-01-24 16:40:08 +01:00
- dc.relation.ispartofseries will be split into:
- series name: dcterms.isPartOf
- series number: cg.number
2019-10-28 12:42:02 +01:00
- dc.rights→dcterms.license
- Using [SPDX license identifiers ](https://spdx.org/licenses/ ) if possible
- dc.source→cg.journal
- dc.subject→dcterms.subject
- dc.type→dcterms.type
- dc.identifier.isbn→cg.isbn
- dc.identifier.issn→cg.issn
2019-12-22 10:56:45 +01:00
- cg.targetaudience→dcterms.audience
2019-10-28 12:42:02 +01:00
2021-09-21 11:46:34 +02:00
### Out of Scope
2019-10-28 12:42:02 +01:00
The following fields are currently out of the scope of this migration because they are used internally by DSpace 5.x/6.x and would be difficult to change without significant modifications to the core of the code:
2019-10-29 15:23:43 +01:00
- dc.title (`IncludePageMeta.java` only considers DC when building pageMeta, which we rely on in XMLUI because of XSLT from DRI)
2019-10-28 15:54:05 +01:00
- dc.title.alternative
2019-10-28 12:42:02 +01:00
- dc.date.available
- dc.date.accessioned
2019-10-29 15:23:43 +01:00
- dc.identifier.uri (hard coded for Handle assignment upon item submission)
2019-10-28 12:42:02 +01:00
- dc.description.provenance
2019-10-29 15:23:43 +01:00
- dc.contributor.author (`IncludePageMeta.java` only considers DC when building pageMeta, which we rely on in XMLUI because of XSLT from DRI)
2019-10-28 12:42:02 +01:00
## Fields to Create
Make sure the following fields exist:
2021-01-28 09:48:04 +01:00
- [x] cg.creator.identifier (247)
- [x] cg.contributor.donor (248)
- [x] cg.reviewStatus (249)
- [x] cg.howPublished (250)
- [x] cg.journal (251)
- [x] cg.isbn (252)
- [x] cg.issn (253)
- [x] cg.volume (254)
- [x] cg.number (255)
2021-01-28 15:28:21 +01:00
- [x] cg.issue (256)
2019-10-28 12:42:02 +01:00
## Fields to delete
Fields to delete after migration:
2021-03-30 08:52:52 +02:00
- [x] cg.creator.id
- [x] cg.fulltextstatus
- [x] cg.identifier.status
- [x] cg.link.reference
- [x] cg.targetaudience
2019-10-28 12:42:02 +01:00
## Implementation Progress
2021-01-28 09:48:04 +01:00
Tally of the status of the implementation of the new fields in the CGSpace `6_x-cgcorev2` branch.
2019-10-28 12:42:02 +01:00
| Field Name | migrate-fields.sh | Input Forms | XMLUI Themes¹ | dspace.cfg | Discovery | Atmire Modules | Crosswalks |
| ---------- | :---------------: | :---------: | :-----------: | :--------: | :-------: | :------------: | :--------: |
cg.creator.identifier | ✓ | ✓ | ✓ | - | ✓ | ✓ | |
dcterms.extent | ✓ | ✓ | - | - | - | - | |
dcterms.issued | ✓ | ✓ | ? | ✓ | ✓ | ✓ | |
dcterms.abstract | ✓ | ✓ | ✓ | ✓ | ✓ | - | |
dcterms.description | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |
cg.contributor.donor | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |
2021-01-18 17:22:19 +01:00
cg.reviewStatus | ✓ | ✓ | ✓ | - | ✓ | - | |
cg.howPublished | ✓ | ✓ | - | - | - | - | |
2019-10-28 12:42:02 +01:00
dcterms.bibliographicCitation | ✓ | ✓ | ✓ | - | - | ✓ | |
dcterms.accessRights | ✓ | ✓ | ✓ | - | ✓ | ✓ | |
dcterms.language | ✓ | ✓ | ✓ | - | ✓ | ✓ | |
dcterms.relation | ✓ | ✓ | ✓ | - | - | - | |
dcterms.publisher | ✓ | ✓ | - | - | ✓ | ✓ | |
dcterms.isPartOf | ✓ | ✓ | - | ✓ | ✓ | ✓ | |
dcterms.license | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |
cg.journal | ✓ | ✓ | - | - | ✓ | ✓ | |
dcterms.subject | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |
dcterms.type | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |
cg.isbn | ✓ | ✓ | - | - | - | ✓ | |
cg.issn | ✓ | ✓ | - | - | - | ✓ | |
2019-12-22 10:56:45 +01:00
dcterms.audience | ✓ | ✓ | - | - | - | ✓ | |
2019-10-28 12:42:02 +01:00
There are a few things that I need to check once I get a deployment of this code up and running:
- Assess the XSL changes to see if things like `not(@qualifier)]` still make sense after we move fields from DC to DCTERMS, as some fields will no longer have qualifiers
2020-04-13 14:30:24 +02:00
- Do I need to edit crosswalks that we are not using, like [MODS ](https://wiki.lyrasis.org/display/DSDOC5x/DSpace+AIP+Format#DSpaceAIPFormat-MODSSchema )?
2019-10-28 12:42:02 +01:00
- There is potentially a lot of work in the OAI metadata formats like DIM, METS, and QDC (see `dspace/config/crosswalks/oai/*.xsl` )
------
¹ Not committed yet because I don't want to have to make minor adjustments in multiple commits. Re-apply the gauntlet of fixes with the sed script:
```
2019-11-17 14:39:10 +01:00
$ find dspace/modules/xmlui-mirage2/src/main/webapp/themes -iname "*.xsl" -exec sed -i -f ./cgcore-xsl-replacements.sed {} \;
2019-10-28 12:42:02 +01:00
```