mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2024-12-22 21:22:19 +01:00
559 lines
15 KiB
HTML
559 lines
15 KiB
HTML
<!DOCTYPE html>
|
|
<html lang="en" >
|
|
|
|
<head>
|
|
<meta charset="utf-8">
|
|
<meta name="viewport" content="width=device-width, initial-scale=1, shrink-to-fit=no">
|
|
|
|
<meta property="og:title" content="CGSpace CG Core v2 Migration" />
|
|
<meta property="og:description" content="Possible changes to CGSpace metadata fields to align more with DC, QDC, and DCTERMS as well as CG Core v2." />
|
|
<meta property="og:type" content="article" />
|
|
<meta property="og:url" content="https://alanorth.github.io/cgspace-notes/cgspace-cgcorev2-migration/" />
|
|
<meta property="article:published_time" content="2019-10-28T13:27:35+02:00" />
|
|
<meta property="article:modified_time" content="2019-10-28T16:54:05+02:00" />
|
|
|
|
<meta name="twitter:card" content="summary"/>
|
|
<meta name="twitter:title" content="CGSpace CG Core v2 Migration"/>
|
|
<meta name="twitter:description" content="Possible changes to CGSpace metadata fields to align more with DC, QDC, and DCTERMS as well as CG Core v2."/>
|
|
<meta name="generator" content="Hugo 0.59.0" />
|
|
|
|
|
|
|
|
<script type="application/ld+json">
|
|
{
|
|
"@context": "http://schema.org",
|
|
"@type": "BlogPosting",
|
|
"headline": "CGSpace CG Core v2 Migration",
|
|
"url": "https:\/\/alanorth.github.io\/cgspace-notes\/cgspace-cgcorev2-migration\/",
|
|
"wordCount": "522",
|
|
"datePublished": "2019-10-28T13:27:35+02:00",
|
|
"dateModified": "2019-10-28T16:54:05+02:00",
|
|
"author": {
|
|
"@type": "Person",
|
|
"name": "Alan Orth"
|
|
},
|
|
"keywords": "Notes, Migration",
|
|
"description": "Possible changes to CGSpace metadata fields to align more with DC, QDC, and DCTERMS as well as CG Core v2."
|
|
}
|
|
</script>
|
|
|
|
|
|
|
|
<link rel="canonical" href="https://alanorth.github.io/cgspace-notes/cgspace-cgcorev2-migration/">
|
|
|
|
<title>CGSpace CG Core v2 Migration | CGSpace Notes</title>
|
|
|
|
|
|
<!-- combined, minified CSS -->
|
|
<link href="https://alanorth.github.io/cgspace-notes/css/style.css" rel="stylesheet" integrity="sha384-G5B34w7DFTumWTswxYzTX7NWfbvQEg1HbFFEg6ItN03uTAAoS2qkPS/fu3LhuuSA" crossorigin="anonymous">
|
|
|
|
|
|
<!-- RSS 2.0 feed -->
|
|
|
|
|
|
|
|
|
|
|
|
|
|
</head>
|
|
|
|
<body>
|
|
|
|
|
|
<div class="blog-masthead">
|
|
<div class="container">
|
|
<nav class="nav blog-nav">
|
|
<a class="nav-link " href="https://alanorth.github.io/cgspace-notes/">Home</a>
|
|
</nav>
|
|
</div>
|
|
</div>
|
|
|
|
|
|
|
|
|
|
<header class="blog-header">
|
|
<div class="container">
|
|
<h1 class="blog-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/" rel="home">CGSpace Notes</a></h1>
|
|
<p class="lead blog-description" dir="auto">Documenting day-to-day work on the <a href="https://cgspace.cgiar.org">CGSpace</a> repository.</p>
|
|
</div>
|
|
</header>
|
|
|
|
|
|
|
|
|
|
<div class="container">
|
|
<div class="row">
|
|
<div class="col-sm-8 blog-main">
|
|
|
|
|
|
|
|
|
|
<article class="blog-post">
|
|
<header>
|
|
<h2 class="blog-post-title" dir="auto"><a href="https://alanorth.github.io/cgspace-notes/cgspace-cgcorev2-migration/">CGSpace CG Core v2 Migration</a></h2>
|
|
<p class="blog-post-meta"><time datetime="2019-10-28T13:27:35+02:00">Mon Oct 28, 2019</time> by Alan Orth in
|
|
<i class="fa fa-folder" aria-hidden="true"></i> <a href="/cgspace-notes/categories/notes" rel="category tag">Notes</a>
|
|
|
|
|
|
<i class="fa fa-tag" aria-hidden="true"></i> <a href="/cgspace-notes/tags/migration" rel="tag">Migration</a>
|
|
|
|
</p>
|
|
</header>
|
|
<p>Possible changes to CGSpace metadata fields to align more with DC, QDC, and DCTERMS as well as CG Core v2.</p>
|
|
|
|
<p>With reference to <a href="https://agriculturalsemantics.github.io/cg-core/cgcore.html">CG Core v2 draft standard</a> by Marie-Angélique as well as <a href="http://www.dublincore.org/specifications/dublin-core/dcmi-terms/">DCMI DCTERMS</a>.</p>
|
|
|
|
<ul>
|
|
<li><a href="#proposed-changes">Proposed Changes</a></li>
|
|
<li><a href="#fields-to-create">Fields to Create</a></li>
|
|
<li><a href="#fields-to-delete">Fields to Delete</a></li>
|
|
<li><a href="#implementation-progress">Implementation Progress</a></li>
|
|
</ul>
|
|
|
|
<h2 id="proposed-changes">Proposed Changes</h2>
|
|
|
|
<p>As of 2019-09-11 the scope of the changes includes the following fields:</p>
|
|
|
|
<ul>
|
|
<li>dc.contributor.author→dcterms.creator
|
|
|
|
<ul>
|
|
<li>for people and institutional authors</li>
|
|
</ul></li>
|
|
<li>cg.creator.id→cg.creator.identifier
|
|
|
|
<ul>
|
|
<li>ORCID identifiers</li>
|
|
</ul></li>
|
|
<li>dc.format.extent→dcterms.extent</li>
|
|
<li>dc.date.issued→dcterms.issued</li>
|
|
<li>dc.description.abstract→dcterms.abstract</li>
|
|
<li>dc.description→dcterms.description</li>
|
|
<li>dc.description.sponsorship→cg.contributor.donor
|
|
|
|
<ul>
|
|
<li>values from CrossRef or Grid.ac if possible</li>
|
|
</ul></li>
|
|
<li>dc.description.version→cg.peer-reviewed</li>
|
|
<li>cg.fulltextstatus→cg.howpublished
|
|
|
|
<ul>
|
|
<li>CGSpace uses values like “Formally Published” or “Grey Literature”</li>
|
|
</ul></li>
|
|
<li>dc.identifier.citation→dcterms.bibliographicCitation</li>
|
|
<li>cg.identifier.status→dcterms.accessRights
|
|
|
|
<ul>
|
|
<li>current values are “Open Access” and “Limited Access”</li>
|
|
<li>future values are possibly “Open” and “Restricted”?</li>
|
|
</ul></li>
|
|
<li>dc.language.iso→dcterms.language
|
|
|
|
<ul>
|
|
<li>current values are ISO 639-1 (aka Alpha 2)</li>
|
|
<li>future values are possibly ISO 639-3 (aka Alpha 3)?</li>
|
|
</ul></li>
|
|
<li>cg.link.reference→dcterms.relation</li>
|
|
<li>dc.publisher→dcterms.publisher</li>
|
|
<li>dc.relation.ispartofseries→dcterms.isPartOf</li>
|
|
<li>dc.rights→dcterms.license
|
|
|
|
<ul>
|
|
<li>Using <a href="https://spdx.org/licenses/">SPDX license identifiers</a> if possible</li>
|
|
</ul></li>
|
|
<li>dc.source→cg.journal</li>
|
|
<li>dc.subject→dcterms.subject</li>
|
|
<li>dc.type→dcterms.type</li>
|
|
<li>dc.identifier.isbn→cg.isbn</li>
|
|
<li>dc.identifier.issn→cg.issn</li>
|
|
<li>cg.identifier.dataurl→cg.hasMetadata</li>
|
|
</ul>
|
|
|
|
<p>The following fields are currently out of the scope of this migration because they are used internally by DSpace 5.x/6.x and would be difficult to change without significant modifications to the core of the code:</p>
|
|
|
|
<ul>
|
|
<li>dc.title</li>
|
|
<li>dc.title.alternative</li>
|
|
<li>dc.date.available</li>
|
|
<li>dc.date.accessioned</li>
|
|
<li>dc.identifier.uri</li>
|
|
<li>dc.description.provenance</li>
|
|
</ul>
|
|
|
|
<h2 id="fields-to-create">Fields to Create</h2>
|
|
|
|
<p>Make sure the following fields exist:</p>
|
|
|
|
<ul class="task-list">
|
|
<li><label><input type="checkbox" checked disabled class="task-list-item"> cg.creator.identifier (242)</label></li>
|
|
<li><label><input type="checkbox" checked disabled class="task-list-item"> cg.contributor.donor (243)</label></li>
|
|
<li><label><input type="checkbox" checked disabled class="task-list-item"> cg.peer-reviewed (244)</label></li>
|
|
<li><label><input type="checkbox" checked disabled class="task-list-item"> cg.howpublished (245)</label></li>
|
|
<li><label><input type="checkbox" checked disabled class="task-list-item"> cg.journal (246)</label></li>
|
|
<li><label><input type="checkbox" checked disabled class="task-list-item"> cg.isbn (247)</label></li>
|
|
<li><label><input type="checkbox" checked disabled class="task-list-item"> cg.issn (248)</label></li>
|
|
<li><label><input type="checkbox" checked disabled class="task-list-item"> cg.hasMetadata (249)</label></li>
|
|
</ul>
|
|
|
|
<h2 id="fields-to-delete">Fields to delete</h2>
|
|
|
|
<p>Fields to delete after migration:</p>
|
|
|
|
<ul class="task-list">
|
|
<li><label><input type="checkbox" disabled class="task-list-item"> cg.creator.id</label></li>
|
|
<li><label><input type="checkbox" disabled class="task-list-item"> cg.fulltextstatus</label></li>
|
|
<li><label><input type="checkbox" disabled class="task-list-item"> cg.identifier.status</label></li>
|
|
<li><label><input type="checkbox" disabled class="task-list-item"> cg.link.reference</label></li>
|
|
<li><label><input type="checkbox" disabled class="task-list-item"> cg.identifier.dataurl</label></li>
|
|
</ul>
|
|
|
|
<h2 id="implementation-progress">Implementation Progress</h2>
|
|
|
|
<p>Tally of the status of the implementation of the new fields in the CGSpace <code>5_x-cgcorev2</code> branch.</p>
|
|
|
|
<table>
|
|
<thead>
|
|
<tr>
|
|
<th>Field Name</th>
|
|
<th align="center">migrate-fields.sh</th>
|
|
<th align="center">Input Forms</th>
|
|
<th align="center">XMLUI Themes¹</th>
|
|
<th align="center">dspace.cfg</th>
|
|
<th align="center">Discovery</th>
|
|
<th align="center">Atmire Modules</th>
|
|
<th align="center">Crosswalks</th>
|
|
</tr>
|
|
</thead>
|
|
|
|
<tbody>
|
|
<tr>
|
|
<td>dcterms.creator</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">?</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center"></td>
|
|
</tr>
|
|
|
|
<tr>
|
|
<td>cg.creator.identifier</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">-</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center"></td>
|
|
</tr>
|
|
|
|
<tr>
|
|
<td>dcterms.extent</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">-</td>
|
|
<td align="center">-</td>
|
|
<td align="center">-</td>
|
|
<td align="center">-</td>
|
|
<td align="center"></td>
|
|
</tr>
|
|
|
|
<tr>
|
|
<td>dcterms.issued</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">?</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center"></td>
|
|
</tr>
|
|
|
|
<tr>
|
|
<td>dcterms.abstract</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">-</td>
|
|
<td align="center"></td>
|
|
</tr>
|
|
|
|
<tr>
|
|
<td>dcterms.description</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center"></td>
|
|
</tr>
|
|
|
|
<tr>
|
|
<td>cg.contributor.donor</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center"></td>
|
|
</tr>
|
|
|
|
<tr>
|
|
<td>cg.peer-reviewed</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">-</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">-</td>
|
|
<td align="center"></td>
|
|
</tr>
|
|
|
|
<tr>
|
|
<td>cg.howpublished</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">-</td>
|
|
<td align="center">-</td>
|
|
<td align="center">-</td>
|
|
<td align="center">-</td>
|
|
<td align="center"></td>
|
|
</tr>
|
|
|
|
<tr>
|
|
<td>dcterms.bibliographicCitation</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">-</td>
|
|
<td align="center">-</td>
|
|
<td align="center">✓</td>
|
|
<td align="center"></td>
|
|
</tr>
|
|
|
|
<tr>
|
|
<td>dcterms.accessRights</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">-</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center"></td>
|
|
</tr>
|
|
|
|
<tr>
|
|
<td>dcterms.language</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">-</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center"></td>
|
|
</tr>
|
|
|
|
<tr>
|
|
<td>dcterms.relation</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">-</td>
|
|
<td align="center">-</td>
|
|
<td align="center">-</td>
|
|
<td align="center"></td>
|
|
</tr>
|
|
|
|
<tr>
|
|
<td>dcterms.publisher</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">-</td>
|
|
<td align="center">-</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center"></td>
|
|
</tr>
|
|
|
|
<tr>
|
|
<td>dcterms.isPartOf</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">-</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center"></td>
|
|
</tr>
|
|
|
|
<tr>
|
|
<td>dcterms.license</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center"></td>
|
|
</tr>
|
|
|
|
<tr>
|
|
<td>cg.journal</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">-</td>
|
|
<td align="center">-</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center"></td>
|
|
</tr>
|
|
|
|
<tr>
|
|
<td>dcterms.subject</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center"></td>
|
|
</tr>
|
|
|
|
<tr>
|
|
<td>dcterms.type</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center"></td>
|
|
</tr>
|
|
|
|
<tr>
|
|
<td>cg.isbn</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">-</td>
|
|
<td align="center">-</td>
|
|
<td align="center">-</td>
|
|
<td align="center">✓</td>
|
|
<td align="center"></td>
|
|
</tr>
|
|
|
|
<tr>
|
|
<td>cg.issn</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">-</td>
|
|
<td align="center">-</td>
|
|
<td align="center">-</td>
|
|
<td align="center">✓</td>
|
|
<td align="center"></td>
|
|
</tr>
|
|
|
|
<tr>
|
|
<td>cg.hasMetadata</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">-</td>
|
|
<td align="center">✓</td>
|
|
<td align="center">✓</td>
|
|
<td align="center"></td>
|
|
</tr>
|
|
</tbody>
|
|
</table>
|
|
|
|
<p>There are a few things that I need to check once I get a deployment of this code up and running:</p>
|
|
|
|
<ul>
|
|
<li>Assess the XSL changes to see if things like <code>not(@qualifier)]</code> still make sense after we move fields from DC to DCTERMS, as some fields will no longer have qualifiers</li>
|
|
<li>Do I need to edit crosswalks that we are not using, like <a href="https://wiki.duraspace.org/display/DSDOC5x/DSpace+AIP+Format#DSpaceAIPFormat-MODSSchema">MODS</a>?</li>
|
|
<li>There is potentially a lot of work in the OAI metadata formats like DIM, METS, and QDC (see <code>dspace/config/crosswalks/oai/*.xsl</code>)</li>
|
|
</ul>
|
|
|
|
<hr />
|
|
|
|
<p>¹ Not committed yet because I don’t want to have to make minor adjustments in multiple commits. Re-apply the gauntlet of fixes with the sed script:</p>
|
|
|
|
<pre><code>$ find dspace/modules/xmlui-mirage2/src/main/webapp/themes -iname "*.xsl" -exec ./cgcore-xsl-replacements.sed {} \;
|
|
</code></pre>
|
|
|
|
|
|
|
|
|
|
|
|
</article>
|
|
|
|
|
|
|
|
</div> <!-- /.blog-main -->
|
|
|
|
<aside class="col-sm-3 ml-auto blog-sidebar">
|
|
|
|
|
|
|
|
<section class="sidebar-module">
|
|
<h4>Recent Posts</h4>
|
|
<ol class="list-unstyled">
|
|
|
|
|
|
<li><a href="/cgspace-notes/cgspace-cgcorev2-migration/">CGSpace CG Core v2 Migration</a></li>
|
|
|
|
<li><a href="/cgspace-notes/2019-10/">October, 2019</a></li>
|
|
|
|
<li><a href="/cgspace-notes/2019-09/">September, 2019</a></li>
|
|
|
|
<li><a href="/cgspace-notes/2019-08/">August, 2019</a></li>
|
|
|
|
<li><a href="/cgspace-notes/2019-07/">July, 2019</a></li>
|
|
|
|
</ol>
|
|
</section>
|
|
|
|
|
|
|
|
|
|
<section class="sidebar-module">
|
|
<h4>Links</h4>
|
|
<ol class="list-unstyled">
|
|
|
|
<li><a href="https://cgspace.cgiar.org">CGSpace</a></li>
|
|
|
|
<li><a href="https://dspacetest.cgiar.org">DSpace Test</a></li>
|
|
|
|
<li><a href="https://github.com/ilri/DSpace">CGSpace @ GitHub</a></li>
|
|
|
|
</ol>
|
|
</section>
|
|
|
|
</aside>
|
|
|
|
|
|
</div> <!-- /.row -->
|
|
</div> <!-- /.container -->
|
|
|
|
|
|
|
|
<footer class="blog-footer">
|
|
<p dir="auto">
|
|
|
|
Blog template created by <a href="https://twitter.com/mdo">@mdo</a>, ported to Hugo by <a href='https://twitter.com/mralanorth'>@mralanorth</a>.
|
|
|
|
</p>
|
|
<p>
|
|
<a href="#">Back to top</a>
|
|
</p>
|
|
</footer>
|
|
|
|
|
|
</body>
|
|
|
|
</html>
|