2016-03-02 15:52:14 +01:00
<!DOCTYPE html>
< html lang = "en-us" >
< head prefix = "og: http://ogp.me/ns#" >
< meta charset = "utf-8" / >
< meta name = "viewport" content = "width=device-width, initial-scale=1.0, maximum-scale=1" / >
< meta property = "og:title" content = " March, 2016 · CGSpace Notes" / >
< meta property = "og:site_name" content = "CGSpace Notes" / >
< meta property = "og:url" content = "/cgspace-notes/2016-03/" / >
< meta property = "og:type" content = "article" / >
< meta property = "og:article:published_time" content = "2016-03-02T16:50:00+03:00" / >
< meta property = "og:article:tag" content = "notes" / >
< title >
March, 2016 · CGSpace Notes
< / title >
< link rel = "stylesheet" href = "/cgspace-notes/css/bootstrap.min.css" / >
< link rel = "stylesheet" href = "/cgspace-notes/css/main.css" / >
< link rel = "stylesheet" href = "/cgspace-notes/css/font-awesome.min.css" / >
< link rel = "stylesheet" href = "/cgspace-notes/css/github.css" / >
< link rel = "stylesheet" href = "//fonts.googleapis.com/css?family=Source+Sans+Pro:200,300,400" type = "text/css" >
< link rel = "shortcut icon" href = "/cgspace-notes/images/favicon.ico" / >
< link rel = "apple-touch-icon" href = "/cgspace-notes/images/apple-touch-icon.png" / >
< / head >
< body >
< header class = "global-header" style = "background-image:url(../images/bg.jpg )" >
< section class = "header-text" >
< h1 > < a href = "/cgspace-notes/" > CGSpace Notes< / a > < / h1 >
< div class = "sns-links hidden-print" >
< / div >
< a href = "/cgspace-notes/" class = "btn-header btn-back hidden-xs" >
< i class = "fa fa-angle-left" aria-hidden = "true" > < / i >
Home
< / a >
< / section >
< / header >
< main class = "container" >
< article >
< header >
< h1 class = "text-primary" > March, 2016< / h1 >
< div class = "post-meta clearfix" >
< div class = "post-date pull-left" >
Posted on
< time datetime = "2016-03-02T16:50:00+03:00" >
Mar 2, 2016
< / time >
< / div >
< div class = "pull-right" >
< span class = "post-tag small" > < a href = "/cgspace-notes//tags/notes" > #notes< / a > < / span >
< / div >
< / div >
< / header >
< section >
< h2 id = "2016-03-02:5a28ddf3ee658c043c064ccddb151717" > 2016-03-02< / h2 >
< ul >
< li > Looking at issues with author authorities on CGSpace< / li >
< li > For some reason we still have the < code > index-lucene-update< / code > cron job active on CGSpace, but I’ m pretty sure we don’ t need it as of the latest few versions of Atmire’ s Listings and Reports module< / li >
2016-03-02 18:31:16 +01:00
< li > Reinstall my local (Mac OS X) DSpace stack with Tomcat 7, PostgreSQL 9.3, and Java JDK 1.7 to match environment on CGSpace server< / li >
2016-03-07 18:23:16 +01:00
< / ul >
< h2 id = "2016-03-07:5a28ddf3ee658c043c064ccddb151717" > 2016-03-07< / h2 >
< ul >
< li > Troubleshooting the issues with the slew of commits for Atmire modules in < a href = "https://github.com/ilri/DSpace/pull/182" > #182< / a > < / li >
< li > Their changes on < code > 5_x-dev< / code > branch work, but it is messy as hell with merge commits and old branch base< / li >
< li > When I rebase their branch on the latest < code > 5_x-prod< / code > I get blank white pages< / li >
< li > I identified one commit that causes the issue and let them know< / li >
2016-03-07 20:22:38 +01:00
< li > Restart DSpace Test, as it seems to have crashed after Sisay tried to import some CSV or zip or something:< / li >
2016-03-02 15:52:14 +01:00
< / ul >
2016-03-07 20:22:38 +01:00
< pre > < code > Exception in thread " Lucene Merge Thread #19" org.apache.lucene.index.MergePolicy$MergeException: java.io.IOException: No space left on device
< / code > < / pre >
2016-03-08 17:57:02 +01:00
< h2 id = "2016-03-08:5a28ddf3ee658c043c064ccddb151717" > 2016-03-08< / h2 >
< ul >
< li > Add a few new filters to Atmire’ s Listings and Reports module (< a href = "https://github.com/ilri/DSpace/issues/180" > #180< / a > )< / li >
< li > We had also wanted to add a few to the Content and Usage module but I have to ask the editors which ones they were< / li >
2016-03-10 17:00:43 +01:00
< / ul >
< h2 id = "2016-03-10:5a28ddf3ee658c043c064ccddb151717" > 2016-03-10< / h2 >
< ul >
< li > Disable the lucene cron job on CGSpace as it shouldn’ t be needed anymore< / li >
< li > Discuss ORCiD and duplicate authors on Yammer< / li >
< li > Request new documentation for Atmire CUA and L& R modules, as ours are from 2013< / li >
< li > Walk Sisay through some data cleaning workflows in OpenRefine< / li >
< li > Start cleaning up the configuration for Atmire’ s CUA module (< a href = "https://github.com/ilri/DSpace/issues/185" > #184< / a > )< / li >
< li > It is very messed up because some labels are incorrect, fields are missing, etc< / li >
2016-03-08 17:57:02 +01:00
< / ul >
2016-03-10 17:04:27 +01:00
< p > < img src = "../images/2016/03/cua-label-mixup.png" alt = "Mixed up label in Atmire CUA" / > < / p >
2016-03-10 17:07:54 +01:00
< ul >
< li > Update documentation for Atmire modules< / li >
2016-03-11 18:16:33 +01:00
< / ul >
< h2 id = "2016-03-11:5a28ddf3ee658c043c064ccddb151717" > 2016-03-11< / h2 >
< ul >
< li > As I was looking at the CUA config I realized our Discovery config is all messed up and confusing< / li >
< li > I’ ve opened an issue to track some of that work (< a href = "https://github.com/ilri/DSpace/issues/186" > #186< / a > )< / li >
< li > I did some major cleanup work on Discovery and XMLUI stuff related to the < code > dc.type< / code > indexes (< a href = "https://github.com/ilri/DSpace/pull/187" > #187< / a > )< / li >
< li > We had been confusing < code > dc.type< / code > (a Dublin Core value) with < code > dc.type.output< / code > (a value we invented) for a few years and it had permeated all aspects of our data, indexes, item displays, etc.< / li >
< li > There is still some more work to be done to remove references to old < code > outputtype< / code > and < code > output< / code > < / li >
2016-03-14 19:34:54 +01:00
< / ul >
< h2 id = "2016-03-14:5a28ddf3ee658c043c064ccddb151717" > 2016-03-14< / h2 >
< ul >
< li > Fix some items that had invalid dates (I noticed them in the log during a re-indexing)< / li >
< li > Reset < code > search.index.*< / code > to the default, as it is only used by Lucene (deprecated by Discovery in DSpace 5.x): < a href = "https://github.com/ilri/DSpace/pull/188" > #188< / a > < / li >
2016-03-16 07:53:44 +01:00
< li > Make titles in Discovery and Browse by more consistent (singular, sentence case, etc) (< a href = "https://github.com/ilri/DSpace/issues/186" > #186< / a > )< / li >
< li > Also four or so center-specific subject strings were missing for Discovery< / li >
< / ul >
< p > < img src = "../images/2016/03/missing-xmlui-string.png" alt = "Missing XMLUI string" / > < / p >
2016-03-16 07:54:23 +01:00
< h2 id = "2016-03-15:5a28ddf3ee658c043c064ccddb151717" > 2016-03-15< / h2 >
2016-03-16 07:53:44 +01:00
< ul >
< li > Create simple theme for new AVCD community just for a unique Google Tracking ID (< a href = "https://github.com/ilri/DSpace/pull/191" > #191< / a > )< / li >
2016-03-17 07:45:51 +01:00
< / ul >
< h2 id = "2016-03-16:5a28ddf3ee658c043c064ccddb151717" > 2016-03-16< / h2 >
< ul >
< li > Still having problems deploying Atmire’ s CUA updates and fixes from January!< / li >
< li > More discussion on the GitHub issue here: < a href = "https://github.com/ilri/DSpace/pull/182" > https://github.com/ilri/DSpace/pull/182< / a > < / li >
< li > Clean up Atmire CUA config (< a href = "https://github.com/ilri/DSpace/pull/193" > #193< / a > )< / li >
< li > Help Sisay with some PostgreSQL queries to clean up the incorrect < code > dc.contributor.corporateauthor< / code > field< / li >
< li > I noticed that we have some weird values in < code > dc.language< / code > :< / li >
< / ul >
< pre > < code > # select * from metadatavalue where metadata_field_id=37;
metadata_value_id | resource_id | metadata_field_id | text_value | text_lang | place | authority | confidence | resource_type_id
-------------------+-------------+-------------------+------------+-----------+-------+-----------+------------+------------------
1942571 | 35342 | 37 | hi | | 1 | | -1 | 2
1942468 | 35345 | 37 | hi | | 1 | | -1 | 2
1942479 | 35337 | 37 | hi | | 1 | | -1 | 2
1942505 | 35336 | 37 | hi | | 1 | | -1 | 2
1942519 | 35338 | 37 | hi | | 1 | | -1 | 2
1942535 | 35340 | 37 | hi | | 1 | | -1 | 2
1942555 | 35341 | 37 | hi | | 1 | | -1 | 2
1942588 | 35343 | 37 | hi | | 1 | | -1 | 2
1942610 | 35346 | 37 | hi | | 1 | | -1 | 2
1942624 | 35347 | 37 | hi | | 1 | | -1 | 2
1942639 | 35339 | 37 | hi | | 1 | | -1 | 2
< / code > < / pre >
< ul >
< li > It seems this < code > dc.language< / code > field isn’ t really used, but we should delete these values< / li >
< li > Also, < code > dc.language.iso< / code > has some weird values, like “ En” and “ English” < / li >
2016-03-17 14:56:24 +01:00
< / ul >
< h2 id = "2016-03-17:5a28ddf3ee658c043c064ccddb151717" > 2016-03-17< / h2 >
< ul >
< li > It turns out < code > hi< / code > is the ISO 639 language code for Hindi, but these should be in < code > dc.language.iso< / code > instead of < code > dc.language< / code > < / li >
< li > I fixed the eleven items with < code > hi< / code > as well as some using the incorrect < code > vn< / code > for Vietnamese< / li >
< li > Start discussing CG core with Abenet and Sisay< / li >
< li > Re-sync CGSpace database to DSpace Test for Atmire to do some tests about the problematic CUA patches< / li >
2016-03-18 15:20:24 +01:00
< li > The patches work fine with a clean database, so the error was caused by some mismatch in CUA versions and the database during my testing< / li >
< / ul >
< h2 id = "2016-03-18:5a28ddf3ee658c043c064ccddb151717" > 2016-03-18< / h2 >
< ul >
< li > Merge Atmire fixes into < code > 5_x-prod< / code > < / li >
< li > Discuss thumbnails with Francesca from Bioversity< / li >
< li > Some of their items end up with thumbnails that have a big white border around them:< / li >
< / ul >
< p > < img src = "../images/2016/03/bioversity-thumbnail-bad.jpg" alt = "Excessive whitespace in thumbnail" / > < / p >
< ul >
< li > Turns out we can add < code > -trim< / code > to the GraphicsMagick options to trim the whitespace< / li >
< / ul >
< p > < img src = "../images/2016/03/bioversity-thumbnail-good.jpg" alt = "Trimmed thumbnail" / > < / p >
< ul >
< li > Command used:< / li >
< / ul >
< pre > < code > $ gm convert -trim -quality 82 -thumbnail x300 -flatten Descriptor\ for\ Butia_EN-2015_2021.pdf\[0\] cover.jpg
< / code > < / pre >
< ul >
< li > Also, it looks like adding < code > -sharpen 0x1.0< / code > really improves the quality of the image for only a few KB< / li >
2016-03-10 17:07:54 +01:00
< / ul >
2016-03-02 15:52:14 +01:00
< / section >
< footer >
< section class = "author-info row" >
< div class = "author-avatar col-md-2" >
< / div >
< div class = "author-meta col-md-6" >
< h1 class = "author-name text-primary" > Alan Orth< / h1 >
< / div >
< / section >
< ul class = "pager" >
< li class = "previous" > < a href = "/cgspace-notes/2016-02/" > < span aria-hidden = "true" > ← < / span > Older< / a > < / li >
< li class = "next disabled" > < a href = "#" > Newer < span aria-hidden = "true" > → < / span > < / a > < / li >
< / ul >
< / footer >
< / article >
< / main >
< footer class = "container global-footer" >
< div class = "copyright-note pull-left" >
< / div >
< div class = "sns-links hidden-print" >
< / div >
< / footer >
< script src = "/cgspace-notes/js/highlight.pack.js" > < / script >
< script >
hljs.initHighlightingOnLoad();
< / script >
< / body >
< / html >