<!DOCTYPE html>
< html lang = "en" >
< head >
< meta charset = "utf-8" >
< meta http-equiv = "X-UA-Compatible" content = "IE=edge" >
< meta name = "viewport" content = "width=device-width, initial-scale=1, shrink-to-fit=no" >
<!-- The above 3 meta tags *must* come first in the head; any other head content must come *after* these tags -->
< meta name = "description" content = "" >
< meta name = "author" content = "Alan Orth" >
<!-- OpenGraph Metadata: http://ogp.me/ -->
< meta property = "og:title" content = "October, 2016" >
< meta property = "og:description" content = "" >
< meta property = "og:type" content = "article" >
< meta property = "article:published_time" content = "2016-10-03T15:53:00+03:00" >
< meta property = "article:author" content = "Alan Orth" >
< meta property = "og:url" content = "https://alanorth.github.io/cgspace-notes/2016-10/" >
<!-- Metadata for Twitter: https://dev.twitter.com/cards/markup -->
< meta property = "twitter:card" content = "summary" >
< meta property = "twitter:title" content = "October, 2016" >
< meta property = "twitter:description" content = "" >
< meta name = "generator" content = "Hugo 0.17-DEV" / >
< base href = "https://alanorth.github.io/cgspace-notes/" >
< link rel = "canonical" href = "https://alanorth.github.io/cgspace-notes/2016-10/" >
< title > October, 2016 | CGSpace Notes< / title >
<!-- combined, minified CSS -->
< link href = "https://alanorth.github.io/cgspace-notes/css/style.css" rel = "stylesheet" >
<!-- RSS 2.0 feed of posts -->
< link href = "https://alanorth.github.io/cgspace-notes/post/index.xml" type = "application/rss+xml" rel = "alternate" >
< / head >
< body >
< div class = "blog-masthead" >
< div class = "container" >
< nav class = "nav blog-nav" >
< a class = "nav-link " href = "https://alanorth.github.io/cgspace-notes/" > Home< / a >
< / nav >
< / div >
< / div >
< header class = "blog-header" >
< div class = "container" >
< h1 class = "blog-title" > < a href = "https://alanorth.github.io/cgspace-notes/" rel = "home" > CGSpace Notes< / a > < / h1 >
< / div >
< / header >
< div class = "container" >
< div class = "row" >
< div class = "col-sm-8 blog-main" >
< article class = "blog-post" >
< header >
< h2 class = "blog-post-title" > < a href = "https://alanorth.github.io/cgspace-notes/2016-10/" title = "October, 2016" > October, 2016< / a > < / h2 >
< p class = "blog-post-meta" > < time datetime = "2016-10-03T15:53:00+03:00" > Mon Oct 03, 2016< / time > by Alan Orth in
< i class = "fa fa-tag" aria-hidden = "true" > < / i > < a href = "/cgspace-notes/tags/notes" rel = "tag" > Notes< / a >
< / p >
< / header >
< h2 id = "2016-10-03" > 2016-10-03< / h2 >
< ul >
< li > Testing adding < a href = "https://wiki.duraspace.org/display/DSDOC5x/ORCID+Integration#ORCIDIntegration-EditingexistingitemsusingBatchCSVEditing" > ORCIDs to a CSV< / a > file for a single item to see if the author order gets messed up< / li >
< li > Need to test the following scenarios to see how author order is affected:
< ul >
< li > ORCIDs only< / li >
< li > ORCIDs plus normal authors< / li >
< / ul > < / li >
< li > I exported a random item’s metadata as CSV, deleted < em > all columns< / em > except id and collection, and made a new column called < code > ORCID:dc.contributor.author< / code > with the following random ORCIDs from the ORCID registry:< / li >
< / ul >
< pre > < code > 0000-0002-6115-0956||0000-0002-3812-8793||0000-0001-7462-405X
< / code > < / pre >
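< p > A minimal Python sketch of how such a test CSV could be generated, optionally with a blank < code > dc.contributor.author< / code > column. The function name, item id, and collection handle are hypothetical placeholders, not values from the exported item; only the column names and the < code > ||< / code > separator come from these notes.< / p >

```python
import csv
import io

# The three ORCIDs quoted in the notes above
ORCIDS = ["0000-0002-6115-0956", "0000-0002-3812-8793", "0000-0001-7462-405X"]


def build_orcid_csv(item_id, collection, orcids, include_blank_authors=False):
    """Return CSV text with id, collection, and an ORCID author column.

    item_id and collection are placeholders chosen for illustration.
    """
    fieldnames = ["id", "collection"]
    row = {"id": item_id, "collection": collection}

    # Optionally keep an empty dc.contributor.author column in the file
    if include_blank_authors:
        fieldnames.append("dc.contributor.author")
        row["dc.contributor.author"] = ""

    # Multiple values in one cell are joined with a double pipe
    fieldnames.append("ORCID:dc.contributor.author")
    row["ORCID:dc.contributor.author"] = "||".join(orcids)

    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerow(row)
    return buf.getvalue()


# Hypothetical item id and collection handle
print(build_orcid_csv("1234", "10568/1", ORCIDS))
```

< p > Passing < code > include_blank_authors=True< / code > produces the variant with an empty < code > dc.contributor.author< / code > column, so both files can be fed to the batch metadata import and compared.< / p >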
< ul >
< li > Hmm, with the < code > dc.contributor.author< / code > column removed, DSpace doesn’t detect any changes< / li >
< li > With a blank < code > dc.contributor.author< / code > column, DSpace wants to remove all non-ORCID authors and add the new ORCID authors< / li >
< li > I added the < a href = "https://github.com/ilri/DSpace/issues/234" > disclaimer text< / a > to the About page, then added a footer link to the disclaimer’s ID, but there is a Bootstrap issue that causes the page content to disappear when using in-page anchors: < a href = "https://github.com/twbs/bootstrap/issues/1768" > https://github.com/twbs/bootstrap/issues/1768< / a > < / li >
< / ul >
< p > < img src = "2016/10/bootstrap-issue.png" alt = "Bootstrap issue with in-page anchors" / > < / p >
< ul >
< li > Looks like we’ll just have to add the text to the About page (without a link) or add a separate page< / li >
< / ul >
< h2 id = "2016-10-04" > 2016-10-04< / h2 >
< ul >
< li > Start testing cleanups of authors that Peter sent last week< / li >
< li > Out of 40,000+ rows, Peter had indicated corrections for ~3,200 of them—too many to look through carefully, so I did some basic quality checking:
< ul >
< li > Trim leading/trailing whitespace< / li >
< li > Find invalid characters< / li >
< li > Cluster values to merge obvious authors< / li >
< / ul > < / li >
< li > That left us with 3,180 valid corrections and 3 deletions:< / li >
< / ul >
< pre > < code > $ ./fix-metadata-values.py -i authors-fix-3180.csv -f dc.contributor.author -t correct -m 3 -d dspacetest -u dspacetest -p fuuu
$ ./delete-metadata-values.py -i authors-delete-3.csv -f dc.contributor.author -m 3 -d dspacetest -u dspacetest -p fuuu
< / code > < / pre >
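< p > The three quality checks listed above (trimming whitespace, finding invalid characters, clustering similar values) could be sketched in Python roughly as follows. The notes do not say which tool or method was actually used, so the fingerprint-style clustering key and the choice of which Unicode categories count as "invalid" are assumptions, and all function names are hypothetical.< / p >

```python
import re
import unicodedata
from collections import defaultdict

# Assumption: control, format, private-use, and unassigned code points,
# plus the U+FFFD replacement character, are treated as "invalid".
INVALID_CATEGORIES = {"Cc", "Cf", "Co", "Cn"}


def trim(value):
    """Remove leading and trailing whitespace."""
    return value.strip()


def invalid_characters(value):
    """Return the suspicious characters found in value, in order."""
    return [
        ch for ch in value
        if unicodedata.category(ch) in INVALID_CATEGORIES or ch == "\ufffd"
    ]


def fingerprint(value):
    """Build a clustering key: lowercase, strip accents and punctuation,
    then sort the unique tokens so word order does not matter."""
    normalized = unicodedata.normalize("NFKD", value.strip().lower())
    no_accents = "".join(ch for ch in normalized if not unicodedata.combining(ch))
    tokens = re.sub(r"[^\w\s]", "", no_accents).split()
    return " ".join(sorted(set(tokens)))


def cluster(values):
    """Group values that share a fingerprint; keep groups with variants."""
    groups = defaultdict(set)
    for value in values:
        groups[fingerprint(value)].add(value.strip())
    return [sorted(group) for group in groups.values() if len(group) > 1]


authors = ["Orth, Alan ", "Orth, Alan", "Alan Orth", "Orth, A."]
print(cluster(authors))
```

< p > A key like this only merges obvious variants such as differences in word order, case, accents, or punctuation. Initials versus full names still end up in separate groups and need a manual look.< / p >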
< ul >
< li > Remove old about page (< a href = "https://github.com/ilri/DSpace/pull/284" > #284< / a > )< / li >
< / ul >
< / article >
< / div > <!-- /.blog-main -->
< aside class = "col-sm-3 offset-sm-1 blog-sidebar" >
< section class = "sidebar-module" >
< h4 > Recent Posts< / h4 >
< ol class = "list-unstyled" >
< li > < a href = "/cgspace-notes/2016-10/" > October, 2016< / a > < / li >
< li > < a href = "/cgspace-notes/2016-09/" > September, 2016< / a > < / li >
< li > < a href = "/cgspace-notes/2016-08/" > August, 2016< / a > < / li >
< li > < a href = "/cgspace-notes/2016-07/" > July, 2016< / a > < / li >
< li > < a href = "/cgspace-notes/2016-06/" > June, 2016< / a > < / li >
< / ol >
< / section >
< section class = "sidebar-module" >
< h4 > Links< / h4 >
< ol class = "list-unstyled" >
< li > < a href = "https://cgspace.cgiar.org" > CGSpace< / a > < / li >
< li > < a href = "https://dspacetest.cgiar.org" > DSpace Test< / a > < / li >
< li > < a href = "https://github.com/ilri/DSpace" > CGSpace @ GitHub< / a > < / li >
< / ol >
< / section >
< / aside >
< / div > <!-- /.row -->
< / div > <!-- /.container -->
< footer class = "blog-footer" >
< p >
Blog template built by < a href = 'https://twitter.com/mralanorth' > @mralanorth< / a > .
< / p >
< / footer >
< / body >
< / html >