<!DOCTYPE html>
< html lang = "en" >
< head >
< meta charset = "utf-8" >
< meta name = "viewport" content = "width=device-width, initial-scale=1, shrink-to-fit=no" >
< meta property = "og:title" content = "September, 2023" / >
< meta property = "og:description" content = "2023-09-02
Export CGSpace to check for missing Initiative collection mappings
Start a harvest on AReS
" />
< meta property = "og:type" content = "article" / >
< meta property = "og:url" content = "https://alanorth.github.io/cgspace-notes/2023-09/" / >
< meta property = "article:published_time" content = "2023-09-02T17:29:36+03:00" / >
< meta property = "article:modified_time" content = "2023-09-09T00:25:12+03:00" / >
< meta name = "twitter:card" content = "summary" / >
< meta name = "twitter:title" content = "September, 2023" / >
< meta name = "twitter:description" content = "2023-09-02
Export CGSpace to check for missing Initiative collection mappings
Start a harvest on AReS
"/>
< meta name = "generator" content = "Hugo 0.118.2" >
< script type = "application/ld+json" >
{
"@context": "http://schema.org",
"@type": "BlogPosting",
"headline": "September, 2023",
"url": "https://alanorth.github.io/cgspace-notes/2023-09/",
"wordCount": "368",
"datePublished": "2023-09-02T17:29:36+03:00",
"dateModified": "2023-09-09T00:25:12+03:00",
"author": {
"@type": "Person",
"name": "Alan Orth"
},
"keywords": "Notes"
}
< / script >
< link rel = "canonical" href = "https://alanorth.github.io/cgspace-notes/2023-09/" >
< title > September, 2023 | CGSpace Notes< / title >
<!-- combined, minified CSS -->
< link href = "https://alanorth.github.io/cgspace-notes/css/style.c6ba80bc50669557645abe05f86b73cc5af84408ed20f1551a267bc19ece8228.css" rel = "stylesheet" integrity = "sha256-xrqAvFBmlVdkWr4F+GtzzFr4RAjtIPFVGiZ7wZ7Ogig=" crossorigin = "anonymous" >
<!-- minified Font Awesome for SVG icons -->
< script defer src = "https://alanorth.github.io/cgspace-notes/js/fontawesome.min.f5072c55a0721857184db93a50561d7dc13975b4de2e19db7f81eb5f3fa57270.js" integrity = "sha256-9QcsVaByGFcYTbk6UFYdfcE5dbTeLhnbf4HrXz+lcnA=" crossorigin = "anonymous" > < / script >
<!-- RSS 2.0 feed -->
< / head >
< body >
< div class = "blog-masthead" >
< div class = "container" >
< nav class = "nav blog-nav" >
< a class = "nav-link " href = "https://alanorth.github.io/cgspace-notes/" > Home< / a >
< / nav >
< / div >
< / div >
< header class = "blog-header" >
< div class = "container" >
< h1 class = "blog-title" dir = "auto" > < a href = "https://alanorth.github.io/cgspace-notes/" rel = "home" > CGSpace Notes< / a > < / h1 >
< p class = "lead blog-description" dir = "auto" > Documenting day-to-day work on the < a href = "https://cgspace.cgiar.org" > CGSpace< / a > repository.< / p >
< / div >
< / header >
< div class = "container" >
< div class = "row" >
< div class = "col-sm-8 blog-main" >
< article class = "blog-post" >
< header >
< h2 class = "blog-post-title" dir = "auto" > < a href = "https://alanorth.github.io/cgspace-notes/2023-09/" > September, 2023< / a > < / h2 >
< p class = "blog-post-meta" >
< time datetime = "2023-09-02T17:29:36+03:00" > Sat Sep 02, 2023< / time >
in
< span class = "fas fa-folder" aria-hidden = "true" > < / span > < a href = "/categories/notes/" rel = "category tag" > Notes< / a >
< / p >
< / header >
< h2 id = "2023-09-02" > 2023-09-02< / h2 >
< ul >
< li > Export CGSpace to check for missing Initiative collection mappings< / li >
< li > Start a harvest on AReS< / li >
< / ul >
< h2 id = "2023-09-03" > 2023-09-03< / h2 >
< ul >
< li > I figured out how to use Altmetric and Dimensions badges in the DSpace Angular frontend
< ul >
< li > It still feels hacky, but using < a href = "https://stackoverflow.com/questions/41936631/how-to-trigger-the-function-after-dom-markup-is-loaded-in-angular-style-applicat" > AfterViewInit< / a > and importing the Altmetric < code > embed.js< / code > in the component works< / li >
< li > The style on mobile also needs work… < / li >
< / ul >
< / li >
< / ul >
< h2 id = "2023-09-06" > 2023-09-06< / h2 >
< ul >
< li > Discussion with Marie about finalizing the output types list on GitHub
< ul >
< li > I did some review and cleanup in preparation for publishing the new list< / li >
< / ul >
< / li >
< / ul >
< h2 id = "2023-09-07" > 2023-09-07< / h2 >
< ul >
< li > Export CGSpace to start doing a review of the metadata< / li >
< li > First I will start by extracting all items with DOIs, along with some fields I can compare against Crossref:< / li >
< / ul >
< div class = "highlight" > < pre tabindex = "0" style = "color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;" > < code class = "language-console" data-lang = "console" > < span style = "display:flex;" > < span > $ csvgrep -c < span style = "color:#e6db74" > 'cg.identifier.doi[en_US]' < / span > -r < span style = "color:#e6db74" > 'doi.org' < / span > ~/Downloads/2023-09-07-cgspace.csv < span style = "color:#ae81ff" > \
< / span > < / span > < / span > < span style = "display:flex;" > < span > < span style = "color:#ae81ff" > < / span > | csvcut -c 'id,dc.title[en_US],dcterms.issued[en_US],dcterms.available[en_US],cg.issn[en_US],cg.isbn[en_US],cg.volume[en_US],cg.issue[en_US],cg.number[en_US],dcterms.extent[en_US],cg.identifier.doi[en_US],cg.reviewStatus[en_US],cg.isijournal[en_US],dcterms.license[en_US],dcterms.accessRights[en_US],dcterms.type[en_US],dc.identifier.uri[en_US]' \
< / span > < / span > < span style = "display:flex;" > < span > > /tmp/2023-09-07-cgspace-dois.csv
< / span > < / span > < span style = "display:flex;" > < span > $ csvgrep -c < span style = "color:#e6db74" > 'cg.identifier.doi[en_US]' < / span > -r < span style = "color:#e6db74" > 'doi.org' < / span > ~/Downloads/2023-09-07-cgspace.csv | csvcut -c < span style = "color:#e6db74" > 'cg.identifier.doi[en_US]' < / span > | sed 1d > /tmp/2023-09-07-cgspace-dois.txt
< / span > < / span > < / code > < / pre > < / div > < ul >
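< li > For reference, the same filtering can be sketched in Python with only the standard library. This is a rough equivalent of the csvkit pipeline above, not something I actually ran; the function name and the sample rows are made up for illustration, and the pattern escapes the dot, so it is slightly stricter than the regex passed to < code > csvgrep< / code > :< / li >
< / ul >

```python
import csv
import io
import re

DOI_COLUMN = "cg.identifier.doi[en_US]"


def extract_doi_rows(rows, keep_columns):
    """Yield rows whose DOI column mentions doi.org, trimmed to keep_columns."""
    pattern = re.compile(r"doi\.org")
    for row in rows:
        if pattern.search(row.get(DOI_COLUMN) or ""):
            yield {col: row.get(col, "") for col in keep_columns}


# Small in-memory sample standing in for the real export (values are invented)
sample = io.StringIO(
    "id,dc.title[en_US],cg.identifier.doi[en_US]\n"
    "1,First item,https://doi.org/10.1234/example.1\n"
    "2,Second item,\n"
)

rows = list(extract_doi_rows(csv.DictReader(sample), ["id", DOI_COLUMN]))
# One DOI per entry, like the text file produced by the second command
dois = [row[DOI_COLUMN] for row in rows]
print(dois)
```

< ul >
< li > To run this against a real export, the < code > io.StringIO< / code > sample would be replaced with an open file handle on the CSV< / li >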
< li > Then I resolved the DOIs from Crossref:< / li >
< / ul >
< div class = "highlight" > < pre tabindex = "0" style = "color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;" > < code class = "language-console" data-lang = "console" > < span style = "display:flex;" > < span > $ ./ilri/crossref_doi_lookup.py -i /tmp/2023-09-07-cgspace-dois.txt -o /tmp/2023-09-07-cgspace-dois-results.csv -e a.orth@cgiar.org
< / span > < / span > < / code > < / pre > < / div > < ul >
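< li > A minimal sketch of what a lookup like this involves, assuming the public Crossref REST API at < code > api.crossref.org/works/< / code > with a < code > mailto< / code > parameter for polite use. This is not the code in < code > crossref_doi_lookup.py< / code > ; the function names and the sample response are hypothetical, and only a few response fields are read:< / li >
< / ul >

```python
from urllib.parse import quote, urlencode

CROSSREF_WORKS = "https://api.crossref.org/works/"
RESOLVER_PREFIXES = (
    "https://doi.org/",
    "http://doi.org/",
    "https://dx.doi.org/",
    "http://dx.doi.org/",
)


def crossref_url(doi, email):
    """Build a Crossref works URL for one DOI, identifying ourselves via mailto."""
    bare = doi.strip()
    # Strip the resolver prefix so only the bare DOI remains
    for prefix in RESOLVER_PREFIXES:
        if bare.lower().startswith(prefix):
            bare = bare[len(prefix):]
            break
    return CROSSREF_WORKS + quote(bare, safe="/") + "?" + urlencode({"mailto": email})


def summarize(message):
    """Pick out a few fields from the "message" object of a works response."""
    date_parts = (message.get("issued", {}).get("date-parts") or [[]])[0]
    licenses = message.get("license") or []
    return {
        "title": (message.get("title") or [""])[0],
        "issn": ", ".join(message.get("ISSN") or []),
        "volume": message.get("volume", ""),
        "issue": message.get("issue", ""),
        "page": message.get("page", ""),
        # Crossref may give only a year, or a null part when the date is unknown
        "issued": "-".join(f"{part:02d}" for part in date_parts if part is not None),
        "license": licenses[0].get("URL", "") if licenses else "",
    }


# Invented response fragment, shaped like the "message" part of a works response
sample_message = {
    "title": ["An example article"],
    "ISSN": ["1234-5678"],
    "volume": "12",
    "issue": "3",
    "page": "45-67",
    "issued": {"date-parts": [[2023, 9, 1]]},
    "license": [{"URL": "https://creativecommons.org/licenses/by/4.0/"}],
}

url = crossref_url("https://doi.org/10.1234/example.1", "me@example.org")
summary = summarize(sample_message)
```

< ul >
< li > Fetching < code > url< / code > and passing the < code > message< / code > key of the decoded JSON to < code > summarize< / code > would give one row of results per DOI< / li >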
< li > A user emailed to ask about uploading a 180MB PDF to CGSpace
< ul >
< li > I used Ghostscript to try reducing its size using the < code > screen< / code > , < code > ebook< / code > and < code > prepress< / code > presets:< / li >
< / ul >
< / li >
< / ul >
< div class = "highlight" > < pre tabindex = "0" style = "color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;" > < code class = "language-console" data-lang = "console" > < span style = "display:flex;" > < span > $ gs -sDEVICE< span style = "color:#f92672" > =< / span > pdfwrite -dCompatibilityLevel< span style = "color:#f92672" > =< / span > 1.4 -dPDFSETTINGS< span style = "color:#f92672" > =< / span > /screen -dNOPAUSE -dQUIET -dBATCH -sOutputFile< span style = "color:#f92672" > =< / span > primer-screen.pdf Primer< span style = "color:#ae81ff" > \ \(< / span > digital< span style = "color:#ae81ff" > \)< / span > _Climate-< span style = "color:#ae81ff" > \ < / span > smart< span style = "color:#ae81ff" > \ < / span > and< span style = "color:#ae81ff" > \ < / span > regenerative< span style = "color:#ae81ff" > \ < / span > agriculture< span style = "color:#ae81ff" > \ < / span > in< span style = "color:#ae81ff" > \ < / span > climate< span style = "color:#ae81ff" > \ < / span > change< span style = "color:#ae81ff" > \ < / span > adaptation.pdf
< / span > < / span > < span style = "display:flex;" > < span > $ gs -sDEVICE< span style = "color:#f92672" > =< / span > pdfwrite -dCompatibilityLevel< span style = "color:#f92672" > =< / span > 1.4 -dPDFSETTINGS< span style = "color:#f92672" > =< / span > /ebook -dNOPAUSE -dQUIET -dBATCH -sOutputFile< span style = "color:#f92672" > =< / span > primer-ebook.pdf Primer< span style = "color:#ae81ff" > \ \(< / span > digital< span style = "color:#ae81ff" > \)< / span > _Climate-< span style = "color:#ae81ff" > \ < / span > smart< span style = "color:#ae81ff" > \ < / span > and< span style = "color:#ae81ff" > \ < / span > regenerative< span style = "color:#ae81ff" > \ < / span > agriculture< span style = "color:#ae81ff" > \ < / span > in< span style = "color:#ae81ff" > \ < / span > climate< span style = "color:#ae81ff" > \ < / span > change< span style = "color:#ae81ff" > \ < / span > adaptation.pdf
< / span > < / span > < span style = "display:flex;" > < span > $ gs -sDEVICE< span style = "color:#f92672" > =< / span > pdfwrite -dCompatibilityLevel< span style = "color:#f92672" > =< / span > 1.4 -dPDFSETTINGS< span style = "color:#f92672" > =< / span > /prepress -dNOPAUSE -dQUIET -dBATCH -sOutputFile< span style = "color:#f92672" > =< / span > primer-prepress.pdf Primer< span style = "color:#ae81ff" > \ \(< / span > digital< span style = "color:#ae81ff" > \)< / span > _Climate-< span style = "color:#ae81ff" > \ < / span > smart< span style = "color:#ae81ff" > \ < / span > and< span style = "color:#ae81ff" > \ < / span > regenerative< span style = "color:#ae81ff" > \ < / span > agriculture< span style = "color:#ae81ff" > \ < / span > in< span style = "color:#ae81ff" > \ < / span > climate< span style = "color:#ae81ff" > \ < / span > change< span style = "color:#ae81ff" > \ < / span > adaptation.pdf
< / span > < / span > < / code > < / pre > < / div > < ul >
< li > The < code > prepress< / code > version is 300 DPI and looks visually identical to the original, so I proposed that we use that one< / li >
< / ul >
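< ul >
< li > The three Ghostscript invocations differ only in the preset and the output name, so they could be generated in a loop. A small sketch, assuming < code > gs< / code > is on the path; the helper name, the < code > input.pdf< / code > placeholder and the < code > prefix< / code > parameter are made up for illustration:< / li >
< / ul >

```python
import shlex


def gs_command(source, preset, prefix="primer"):
    """Return the Ghostscript argument list for one -dPDFSETTINGS preset."""
    return [
        "gs",
        "-sDEVICE=pdfwrite",
        "-dCompatibilityLevel=1.4",
        f"-dPDFSETTINGS=/{preset}",
        "-dNOPAUSE",
        "-dQUIET",
        "-dBATCH",
        f"-sOutputFile={prefix}-{preset}.pdf",
        source,
    ]


# Build one command per preset; shlex.join quotes awkward file names for display
commands = [gs_command("input.pdf", preset) for preset in ("screen", "ebook", "prepress")]
for command in commands:
    print(shlex.join(command))
```

< ul >
< li > Passing each list to < code > subprocess.run(command, check=True)< / code > would run it without any shell quoting of the spaces and parentheses in the original file name< / li >
< / ul >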
< h2 id = "2023-09-08" > 2023-09-08< / h2 >
< ul >
< li > I did a review of the metadata for our items with DOIs, comparing with data from Crossref
< ul >
< li > I spot checked a handful of issue / online dates and licenses, and saw that Crossref’s dates are always more accurate than ours when they differ< / li >
< li > I also filled in some missing volumes, issues, ISSNs, and extents< / li >
< li > This results in 14,000 changes to existing items, which unfortunately will take several days to import< / li >
< li > After eight hours the first file is only about 2/3 finished… sigh< / li >
< / ul >
< / li >
< li > Meet with Peter to discuss changes to the DSpace 7 test
< ul >
< li > Minor updates to submission forms and some new ideas for the home page and item page< / li >
< / ul >
< / li >
< / ul >
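< ul >
< li > The comparison rules from the Crossref review could be sketched roughly like this: fill in values we are missing, and prefer Crossref where a field such as the issue date differs. This is only an illustration of the logic, not the process I actually used; the function name, the choice of fields in each group, and the sample values are assumptions:< / li >
< / ul >

```python
def propose_changes(local, remote, fill_fields, override_fields):
    """Return the field updates suggested by comparing our record with Crossref's."""
    changes = {}
    # Only fill these fields when our own value is empty
    for field in fill_fields:
        if not local.get(field) and remote.get(field):
            changes[field] = remote[field]
    # For these fields the remote value wins whenever it differs from ours
    for field in override_fields:
        if remote.get(field) and remote[field] != local.get(field):
            changes[field] = remote[field]
    return changes


# Invented records, both already mapped to our column names
ours = {
    "dcterms.issued[en_US]": "2023",
    "cg.volume[en_US]": "",
    "cg.issue[en_US]": "3",
}
theirs = {
    "dcterms.issued[en_US]": "2023-09-01",
    "cg.volume[en_US]": "12",
    "cg.issue[en_US]": "4",
}

changes = propose_changes(
    ours,
    theirs,
    fill_fields=["cg.volume[en_US]", "cg.issue[en_US]"],
    override_fields=["dcterms.issued[en_US]"],
)
print(changes)
```

< ul >
< li > Keeping the fill and override groups separate means an existing value like the issue number above is left alone even when Crossref disagrees< / li >
< / ul >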
< / article >
< / div > <!-- /.blog-main -->
< aside class = "col-sm-3 ml-auto blog-sidebar" >
< section class = "sidebar-module" >
< h4 > Recent Posts< / h4 >
< ol class = "list-unstyled" >
< li > < a href = "/cgspace-notes/2023-09/" > September, 2023< / a > < / li >
< li > < a href = "/cgspace-notes/2023-08/" > August, 2023< / a > < / li >
< li > < a href = "/cgspace-notes/2023-07/" > July, 2023< / a > < / li >
< li > < a href = "/cgspace-notes/2023-06/" > June, 2023< / a > < / li >
< li > < a href = "/cgspace-notes/2023-05/" > May, 2023< / a > < / li >
< / ol >
< / section >
< section class = "sidebar-module" >
< h4 > Links< / h4 >
< ol class = "list-unstyled" >
< li > < a href = "https://cgspace.cgiar.org" > CGSpace< / a > < / li >
< li > < a href = "https://dspacetest.cgiar.org" > DSpace Test< / a > < / li >
< li > < a href = "https://github.com/ilri/DSpace" > CGSpace @ GitHub< / a > < / li >
< / ol >
< / section >
< / aside >
< / div > <!-- /.row -->
< / div > <!-- /.container -->
< footer class = "blog-footer" >
< p dir = "auto" >
Blog template created by < a href = "https://twitter.com/mdo" > @mdo< / a > , ported to Hugo by < a href = 'https://twitter.com/mralanorth' > @mralanorth< / a > .
< / p >
< p >
< a href = "#" > Back to top< / a >
< / p >
< / footer >
< / body >
< / html >