March, 2018
+ +2018-03-02
+ +-
+
- Export a CSV of the IITA community metadata for Martin Mueller +
diff --git a/content/posts/2018-03.md b/content/posts/2018-03.md index b23da319c..7b395cec6 100644 --- a/content/posts/2018-03.md +++ b/content/posts/2018-03.md @@ -139,3 +139,8 @@ dc.contributor.author,cg.creator.id - I didn't integrate the ORCID API lookup for author names in this script for now because I was only interested in "tagging" old items for a few given authors - I added ORCID identifers for 187 items by CIAT's Hernan Ceballos, because that is what Elizabeth was trying to do manually! - Also, I decided to add ORCID identifiers for all records from Peter, Abenet, and Sisay as well + +## 2018-03-09 + +- Give James Stapleton input on Sisay's KRAs +- Create a pull request to disable ORCID authority integration for `dc.contributor.author` in the submission forms and XMLUI display ([#363](https://github.com/ilri/DSpace/pull/363)) diff --git a/docs/2015-11/index.html b/docs/2015-11/index.html index 2b5acd728..4b4498325 100644 --- a/docs/2015-11/index.html +++ b/docs/2015-11/index.html @@ -26,7 +26,7 @@ $ psql -c 'SELECT * from pg_stat_activity;' | grep idle | grep -c cgspac - + @@ -65,7 +65,7 @@ $ psql -c 'SELECT * from pg_stat_activity;' | grep idle | grep -c cgspac "url": "https://alanorth.github.io/cgspace-notes/2015-11/", "wordCount": "798", "datePublished": "2015-11-23T17:00:57+03:00", - "dateModified": "2016-09-28T17:02:30+03:00", + "dateModified": "2018-03-09T22:10:33+02:00", "author": { "@type": "Person", "name": "Alan Orth" @@ -100,8 +100,6 @@ $ psql -c 'SELECT * from pg_stat_activity;' | grep idle | grep -c cgspac
dc.contributor.author
in the submission forms and XMLUI display (#363)dc.rights
proof of concept that I had sent him last weekjmx_tomcat_dbpools
provided by Ubuntu’s munin-plugins-java
package and used the stuff I discovered about JMX in 2018-0102/Jan/2018:11:27:17 +0000
when Uptime Robot got an HTTP 500Caused by: org.apache.tomcat.jdbc.pool.PoolExhaustedException: [http-bio-127.0.0.1-8443-exec-980] Timeout: Pool empty. Unable to fetch a connection in 5 seconds, none available[size:50; busy:50; idle:0; lastwait:5000].
+
+
+2018-01-02 01:21:19,137 ERROR org.dspace.app.xmlui.aspect.discovery.SidebarFacetsTransformer @ Error while searching for sidebar facets
+org.dspace.discovery.SearchServiceException: org.apache.solr.search.SyntaxError: Cannot parse 'dateIssued_keyword:[1976+TO+1979]': Encountered " "]" "] "" at line 1, column 32.
+
+
+$ grep -c "Error while searching for sidebar facets" dspace.log.*
+dspace.log.2017-11-21:4
+dspace.log.2017-11-22:1
+dspace.log.2017-11-23:4
+dspace.log.2017-11-24:11
+dspace.log.2017-11-25:0
+dspace.log.2017-11-26:1
+dspace.log.2017-11-27:7
+dspace.log.2017-11-28:21
+dspace.log.2017-11-29:31
+dspace.log.2017-11-30:15
+dspace.log.2017-12-01:15
+dspace.log.2017-12-02:20
+dspace.log.2017-12-03:38
+dspace.log.2017-12-04:65
+dspace.log.2017-12-05:43
+dspace.log.2017-12-06:72
+dspace.log.2017-12-07:27
+dspace.log.2017-12-08:15
+dspace.log.2017-12-09:29
+dspace.log.2017-12-10:35
+dspace.log.2017-12-11:20
+dspace.log.2017-12-12:44
+dspace.log.2017-12-13:36
+dspace.log.2017-12-14:59
+dspace.log.2017-12-15:104
+dspace.log.2017-12-16:53
+dspace.log.2017-12-17:66
+dspace.log.2017-12-18:83
+dspace.log.2017-12-19:101
+dspace.log.2017-12-20:74
+dspace.log.2017-12-21:55
+dspace.log.2017-12-22:66
+dspace.log.2017-12-23:50
+dspace.log.2017-12-24:85
+dspace.log.2017-12-25:62
+dspace.log.2017-12-26:49
+dspace.log.2017-12-27:30
+dspace.log.2017-12-28:54
+dspace.log.2017-12-29:68
+dspace.log.2017-12-30:89
+dspace.log.2017-12-31:53
+dspace.log.2018-01-01:45
+dspace.log.2018-01-02:34
+
+
+# grep -c "CORE" /var/log/nginx/access.log
+0
+
+
+dspace=# \copy (select distinct text_value, count(*) as count from metadatavalue where metadata_field_id = (select metadata_field_id from metadatafieldregistry where element = 'contributor' and qualifier = 'author') AND resource_type_id = 2 group by text_value order by count desc) to /tmp/authors.csv with csv;
+COPY 54701
+
+
+
+ Read more →
+http://hdl.handle.net/10568/78495||http://hdl.handle.net/10568/79336
+
+
+Rough notes for importing the CGIAR Library content. It was decided that this content would go to a new top-level community called CGIAR System Organization.
+ + + Read more → +dspace.log.2017-08-01
, they are all using the same Tomcat sessionrobots.txt
only blocks the top-level /discover
and /browse
URLs… we will need to find a way to forbid them from accessing these!X-Robots-Tag "none"
HTTP header, but this only forbids the search engine from indexing the page, not crawling it!dc.description.abstract
column, which caused OpenRefine to choke when exporting the CSVg/^$/d
-x
) plus sed
to format the output into quasi XML:dc.rights
proof of concept that I had sent him last weekjmx_tomcat_dbpools
provided by Ubuntu’s munin-plugins-java
package and used the stuff I discovered about JMX in 2018-0102/Jan/2018:11:27:17 +0000
when Uptime Robot got an HTTP 500Caused by: org.apache.tomcat.jdbc.pool.PoolExhaustedException: [http-bio-127.0.0.1-8443-exec-980] Timeout: Pool empty. Unable to fetch a connection in 5 seconds, none available[size:50; busy:50; idle:0; lastwait:5000].
+
+
+2018-01-02 01:21:19,137 ERROR org.dspace.app.xmlui.aspect.discovery.SidebarFacetsTransformer @ Error while searching for sidebar facets
+org.dspace.discovery.SearchServiceException: org.apache.solr.search.SyntaxError: Cannot parse 'dateIssued_keyword:[1976+TO+1979]': Encountered " "]" "] "" at line 1, column 32.
+
+
+$ grep -c "Error while searching for sidebar facets" dspace.log.*
+dspace.log.2017-11-21:4
+dspace.log.2017-11-22:1
+dspace.log.2017-11-23:4
+dspace.log.2017-11-24:11
+dspace.log.2017-11-25:0
+dspace.log.2017-11-26:1
+dspace.log.2017-11-27:7
+dspace.log.2017-11-28:21
+dspace.log.2017-11-29:31
+dspace.log.2017-11-30:15
+dspace.log.2017-12-01:15
+dspace.log.2017-12-02:20
+dspace.log.2017-12-03:38
+dspace.log.2017-12-04:65
+dspace.log.2017-12-05:43
+dspace.log.2017-12-06:72
+dspace.log.2017-12-07:27
+dspace.log.2017-12-08:15
+dspace.log.2017-12-09:29
+dspace.log.2017-12-10:35
+dspace.log.2017-12-11:20
+dspace.log.2017-12-12:44
+dspace.log.2017-12-13:36
+dspace.log.2017-12-14:59
+dspace.log.2017-12-15:104
+dspace.log.2017-12-16:53
+dspace.log.2017-12-17:66
+dspace.log.2017-12-18:83
+dspace.log.2017-12-19:101
+dspace.log.2017-12-20:74
+dspace.log.2017-12-21:55
+dspace.log.2017-12-22:66
+dspace.log.2017-12-23:50
+dspace.log.2017-12-24:85
+dspace.log.2017-12-25:62
+dspace.log.2017-12-26:49
+dspace.log.2017-12-27:30
+dspace.log.2017-12-28:54
+dspace.log.2017-12-29:68
+dspace.log.2017-12-30:89
+dspace.log.2017-12-31:53
+dspace.log.2018-01-01:45
+dspace.log.2018-01-02:34
+
+
+# grep -c "CORE" /var/log/nginx/access.log
+0
+
+
+dspace=# \copy (select distinct text_value, count(*) as count from metadatavalue where metadata_field_id = (select metadata_field_id from metadatafieldregistry where element = 'contributor' and qualifier = 'author') AND resource_type_id = 2 group by text_value order by count desc) to /tmp/authors.csv with csv;
+COPY 54701
+
+
+
+ Read more →
+http://hdl.handle.net/10568/78495||http://hdl.handle.net/10568/79336
+
+
+dspace.log.2017-08-01
, they are all using the same Tomcat sessionrobots.txt
only blocks the top-level /discover
and /browse
URLs… we will need to find a way to forbid them from accessing these!X-Robots-Tag "none"
HTTP header, but this only forbids the search engine from indexing the page, not crawling it!dc.description.abstract
column, which caused OpenRefine to choke when exporting the CSVg/^$/d
-x
) plus sed
to format the output into quasi XML:Documenting day-to-day work on the CGSpace repository.
+dc.rights
to the input form, including some inline instructions/hints:$ [dspace]/bin/dspace filter-media -f -i 10568/16498 -p "ImageMagick PDF Thumbnail" -v >& /tmp/filter-media-cmyk.txt
+
+
+
+ Read more →
+filter-media
bug that causes it to process JPGs even when limiting to the PDF thumbnail plugin: DS-3516filter-media
plugin creates JPG thumbnails with the CMYK colorspace when the source PDF is using CMYK$ identify ~/Desktop/alc_contrastes_desafios.jpg
+/Users/aorth/Desktop/alc_contrastes_desafios.jpg JPEG 464x600 464x600+0+0 8-bit CMYK 168KB 0.000u 0:00.000
+
+
+
+ Read more →
+dspace=# select * from collection2item where item_id = '80278';
+ id | collection_id | item_id
+-------+---------------+---------
+ 92551 | 313 | 80278
+ 92550 | 313 | 80278
+ 90774 | 1051 | 80278
+(3 rows)
+dspace=# delete from collection2item where id = 92551 and item_id = 80278;
+DELETE 1
+
+
+cg.identifier.ccafsprojectpii
as the field name2016-12-02 03:00:32,352 WARN com.atmire.metadataquality.batchedit.BatchEditConsumer @ BatchEditConsumer should not have been given this kind of Subject in an event, skipping: org.dspace.event.Event(eventType=CREATE, SubjectType=BUNDLE, SubjectID=70316, ObjectType=(Unknown), ObjectID=-1, TimeStamp=1480647632305, dispatcher=1544803905, detail=[null], transactionID="TX157907838689377964651674089851855413607")
+2016-12-02 03:00:32,353 WARN com.atmire.metadataquality.batchedit.BatchEditConsumer @ BatchEditConsumer should not have been given this kind of Subject in an event, skipping: org.dspace.event.Event(eventType=MODIFY_METADATA, SubjectType=BUNDLE, SubjectID =70316, ObjectType=(Unknown), ObjectID=-1, TimeStamp=1480647632309, dispatcher=1544803905, detail="dc.title", transactionID="TX157907838689377964651674089851855413607")
+2016-12-02 03:00:32,353 WARN com.atmire.metadataquality.batchedit.BatchEditConsumer @ BatchEditConsumer should not have been given this kind of Subject in an event, skipping: org.dspace.event.Event(eventType=ADD, SubjectType=ITEM, SubjectID=80044, Object Type=BUNDLE, ObjectID=70316, TimeStamp=1480647632311, dispatcher=1544803905, detail="THUMBNAIL", transactionID="TX157907838689377964651674089851855413607")
+2016-12-02 03:00:32,353 WARN com.atmire.metadataquality.batchedit.BatchEditConsumer @ BatchEditConsumer should not have been given this kind of Subject in an event, skipping: org.dspace.event.Event(eventType=ADD, SubjectType=BUNDLE, SubjectID=70316, Obje ctType=BITSTREAM, ObjectID=86715, TimeStamp=1480647632318, dispatcher=1544803905, detail="-1", transactionID="TX157907838689377964651674089851855413607")
+2016-12-02 03:00:32,353 WARN com.atmire.metadataquality.batchedit.BatchEditConsumer @ BatchEditConsumer should not have been given this kind of Subject in an event, skipping: org.dspace.event.Event(eventType=MODIFY, SubjectType=ITEM, SubjectID=80044, ObjectType=(Unknown), ObjectID=-1, TimeStamp=1480647632351, dispatcher=1544803905, detail=[null], transactionID="TX157907838689377964651674089851855413607")
+
+
+dc.type
to the output options for Atmire’s Listings and Reports module (#286)ORCID:dc.contributor.author
with the following random ORCIDs from the ORCID registry:0000-0002-6115-0956||0000-0002-3812-8793||0000-0001-7462-405X
+
+
+
+ Read more →
+DC=ILRI
to determine whether a user was ILRI or not$ ldapsearch -x -H ldaps://svcgroot2.cgiarad.org:3269/ -b "dc=cgiarad,dc=org" -D "admigration1@cgiarad.org" -W "(sAMAccountName=admigration1)"
+
+
+
+ Read more →
+Documenting day-to-day work on the CGSpace repository.
+bower.json
because most are several versions of out datefonts
)$ git checkout -b 55new 5_x-prod
+$ git reset --hard ilri/5_x-prod
+$ git rebase -i dspace-5.5
+
+
+
+ Read more →
+dc.description.sponsorship
to Discovery sidebar facets and make investors clickable in item view (#232)dspacetest=# update metadatavalue set text_value = regexp_replace(text_value, '(^.+?),$', '\1') where metadata_field_id=3 and resource_type_id=2 and text_value ~ '^.+?,$';
+UPDATE 95
+dspacetest=# select text_value from metadatavalue where metadata_field_id=3 and resource_type_id=2 and text_value ~ '^.+?,$';
+ text_value
+------------
+(0 rows)
+
+
+ListSets
verb: http://ebrary.ifpri.org/oai/oai.php?verb=ListSetsdc.identifier.fund
to cg.identifier.cpwfproject
and then the rest to dc.description.sponsorship
# awk '{print $1}' /var/log/nginx/rest.log | uniq | wc -l
+3168
+
+
+
+ Read more →
+checker
log has some errors we should pay attention to:index-lucene-update
cron job active on CGSpace, but I’m pretty sure we don’t need it as of the latest few versions of Atmire’s Listings and Reports module10568/12503
from 10568/27869
to 10568/27629
using the move_collections.sh script I wrote last year.lzop
with xz
in log compression cron jobs on DSpace Test—it uses less space:# cd /home/dspacetest.cgiar.org/log
+# ls -lh dspace.log.2015-11-18*
+-rw-rw-r-- 1 tomcat7 tomcat7 2.0M Nov 18 23:59 dspace.log.2015-11-18
+-rw-rw-r-- 1 tomcat7 tomcat7 387K Nov 18 23:59 dspace.log.2015-11-18.lzo
+-rw-rw-r-- 1 tomcat7 tomcat7 169K Nov 18 23:59 dspace.log.2015-11-18.xz
+
+
+
+ Read more →
+$ psql -c 'SELECT * from pg_stat_activity;' | grep idle | grep -c cgspace
+78
+
+
+
+ Read more →
+Documenting day-to-day work on the CGSpace repository.
+Documenting day-to-day work on the CGSpace repository.
+dc.rights
to the input form, including some inline instructions/hints:$ [dspace]/bin/dspace filter-media -f -i 10568/16498 -p "ImageMagick PDF Thumbnail" -v >& /tmp/filter-media-cmyk.txt
+
+
+
+ Read more →
+filter-media
bug that causes it to process JPGs even when limiting to the PDF thumbnail plugin: DS-3516filter-media
plugin creates JPG thumbnails with the CMYK colorspace when the source PDF is using CMYK$ identify ~/Desktop/alc_contrastes_desafios.jpg
+/Users/aorth/Desktop/alc_contrastes_desafios.jpg JPEG 464x600 464x600+0+0 8-bit CMYK 168KB 0.000u 0:00.000
+
+
+
+ Read more →
+dspace=# select * from collection2item where item_id = '80278';
+ id | collection_id | item_id
+-------+---------------+---------
+ 92551 | 313 | 80278
+ 92550 | 313 | 80278
+ 90774 | 1051 | 80278
+(3 rows)
+dspace=# delete from collection2item where id = 92551 and item_id = 80278;
+DELETE 1
+
+
+cg.identifier.ccafsprojectpii
as the field name2016-12-02 03:00:32,352 WARN com.atmire.metadataquality.batchedit.BatchEditConsumer @ BatchEditConsumer should not have been given this kind of Subject in an event, skipping: org.dspace.event.Event(eventType=CREATE, SubjectType=BUNDLE, SubjectID=70316, ObjectType=(Unknown), ObjectID=-1, TimeStamp=1480647632305, dispatcher=1544803905, detail=[null], transactionID="TX157907838689377964651674089851855413607")
+2016-12-02 03:00:32,353 WARN com.atmire.metadataquality.batchedit.BatchEditConsumer @ BatchEditConsumer should not have been given this kind of Subject in an event, skipping: org.dspace.event.Event(eventType=MODIFY_METADATA, SubjectType=BUNDLE, SubjectID =70316, ObjectType=(Unknown), ObjectID=-1, TimeStamp=1480647632309, dispatcher=1544803905, detail="dc.title", transactionID="TX157907838689377964651674089851855413607")
+2016-12-02 03:00:32,353 WARN com.atmire.metadataquality.batchedit.BatchEditConsumer @ BatchEditConsumer should not have been given this kind of Subject in an event, skipping: org.dspace.event.Event(eventType=ADD, SubjectType=ITEM, SubjectID=80044, Object Type=BUNDLE, ObjectID=70316, TimeStamp=1480647632311, dispatcher=1544803905, detail="THUMBNAIL", transactionID="TX157907838689377964651674089851855413607")
+2016-12-02 03:00:32,353 WARN com.atmire.metadataquality.batchedit.BatchEditConsumer @ BatchEditConsumer should not have been given this kind of Subject in an event, skipping: org.dspace.event.Event(eventType=ADD, SubjectType=BUNDLE, SubjectID=70316, Obje ctType=BITSTREAM, ObjectID=86715, TimeStamp=1480647632318, dispatcher=1544803905, detail="-1", transactionID="TX157907838689377964651674089851855413607")
+2016-12-02 03:00:32,353 WARN com.atmire.metadataquality.batchedit.BatchEditConsumer @ BatchEditConsumer should not have been given this kind of Subject in an event, skipping: org.dspace.event.Event(eventType=MODIFY, SubjectType=ITEM, SubjectID=80044, ObjectType=(Unknown), ObjectID=-1, TimeStamp=1480647632351, dispatcher=1544803905, detail=[null], transactionID="TX157907838689377964651674089851855413607")
+
+
+dc.type
to the output options for Atmire’s Listings and Reports module (#286)ORCID:dc.contributor.author
with the following random ORCIDs from the ORCID registry:0000-0002-6115-0956||0000-0002-3812-8793||0000-0001-7462-405X
+
+
+
+ Read more →
+DC=ILRI
to determine whether a user was ILRI or not$ ldapsearch -x -H ldaps://svcgroot2.cgiarad.org:3269/ -b "dc=cgiarad,dc=org" -D "admigration1@cgiarad.org" -W "(sAMAccountName=admigration1)"
+
+
+
+ Read more →
+Documenting day-to-day work on the CGSpace repository.
+bower.json
because most are several versions of out datefonts
)$ git checkout -b 55new 5_x-prod
+$ git reset --hard ilri/5_x-prod
+$ git rebase -i dspace-5.5
+
+
+
+ Read more →
+dc.description.sponsorship
to Discovery sidebar facets and make investors clickable in item view (#232)dspacetest=# update metadatavalue set text_value = regexp_replace(text_value, '(^.+?),$', '\1') where metadata_field_id=3 and resource_type_id=2 and text_value ~ '^.+?,$';
+UPDATE 95
+dspacetest=# select text_value from metadatavalue where metadata_field_id=3 and resource_type_id=2 and text_value ~ '^.+?,$';
+ text_value
+------------
+(0 rows)
+
+
+ListSets
verb: http://ebrary.ifpri.org/oai/oai.php?verb=ListSetsdc.identifier.fund
to cg.identifier.cpwfproject
and then the rest to dc.description.sponsorship
# awk '{print $1}' /var/log/nginx/rest.log | uniq | wc -l
+3168
+
+
+
+ Read more →
+checker
log has some errors we should pay attention to:index-lucene-update
cron job active on CGSpace, but I’m pretty sure we don’t need it as of the latest few versions of Atmire’s Listings and Reports module10568/12503
from 10568/27869
to 10568/27629
using the move_collections.sh script I wrote last year.lzop
with xz
in log compression cron jobs on DSpace Test—it uses less space:# cd /home/dspacetest.cgiar.org/log
+# ls -lh dspace.log.2015-11-18*
+-rw-rw-r-- 1 tomcat7 tomcat7 2.0M Nov 18 23:59 dspace.log.2015-11-18
+-rw-rw-r-- 1 tomcat7 tomcat7 387K Nov 18 23:59 dspace.log.2015-11-18.lzo
+-rw-rw-r-- 1 tomcat7 tomcat7 169K Nov 18 23:59 dspace.log.2015-11-18.xz
+
+
+
+ Read more →
+$ psql -c 'SELECT * from pg_stat_activity;' | grep idle | grep -c cgspace
+78
+
+
+
+ Read more →
+Documenting day-to-day work on the CGSpace repository.
+Documenting day-to-day work on the CGSpace repository.
+Documenting day-to-day work on the CGSpace repository.
+dc.rights
to the input form, including some inline instructions/hints:$ [dspace]/bin/dspace filter-media -f -i 10568/16498 -p "ImageMagick PDF Thumbnail" -v >& /tmp/filter-media-cmyk.txt
+
+
+
+ Read more →
+filter-media
bug that causes it to process JPGs even when limiting to the PDF thumbnail plugin: DS-3516filter-media
plugin creates JPG thumbnails with the CMYK colorspace when the source PDF is using CMYK$ identify ~/Desktop/alc_contrastes_desafios.jpg
+/Users/aorth/Desktop/alc_contrastes_desafios.jpg JPEG 464x600 464x600+0+0 8-bit CMYK 168KB 0.000u 0:00.000
+
+
+
+ Read more →
+dspace=# select * from collection2item where item_id = '80278';
+ id | collection_id | item_id
+-------+---------------+---------
+ 92551 | 313 | 80278
+ 92550 | 313 | 80278
+ 90774 | 1051 | 80278
+(3 rows)
+dspace=# delete from collection2item where id = 92551 and item_id = 80278;
+DELETE 1
+
+
+cg.identifier.ccafsprojectpii
as the field name2016-12-02 03:00:32,352 WARN com.atmire.metadataquality.batchedit.BatchEditConsumer @ BatchEditConsumer should not have been given this kind of Subject in an event, skipping: org.dspace.event.Event(eventType=CREATE, SubjectType=BUNDLE, SubjectID=70316, ObjectType=(Unknown), ObjectID=-1, TimeStamp=1480647632305, dispatcher=1544803905, detail=[null], transactionID="TX157907838689377964651674089851855413607")
+2016-12-02 03:00:32,353 WARN com.atmire.metadataquality.batchedit.BatchEditConsumer @ BatchEditConsumer should not have been given this kind of Subject in an event, skipping: org.dspace.event.Event(eventType=MODIFY_METADATA, SubjectType=BUNDLE, SubjectID =70316, ObjectType=(Unknown), ObjectID=-1, TimeStamp=1480647632309, dispatcher=1544803905, detail="dc.title", transactionID="TX157907838689377964651674089851855413607")
+2016-12-02 03:00:32,353 WARN com.atmire.metadataquality.batchedit.BatchEditConsumer @ BatchEditConsumer should not have been given this kind of Subject in an event, skipping: org.dspace.event.Event(eventType=ADD, SubjectType=ITEM, SubjectID=80044, Object Type=BUNDLE, ObjectID=70316, TimeStamp=1480647632311, dispatcher=1544803905, detail="THUMBNAIL", transactionID="TX157907838689377964651674089851855413607")
+2016-12-02 03:00:32,353 WARN com.atmire.metadataquality.batchedit.BatchEditConsumer @ BatchEditConsumer should not have been given this kind of Subject in an event, skipping: org.dspace.event.Event(eventType=ADD, SubjectType=BUNDLE, SubjectID=70316, Obje ctType=BITSTREAM, ObjectID=86715, TimeStamp=1480647632318, dispatcher=1544803905, detail="-1", transactionID="TX157907838689377964651674089851855413607")
+2016-12-02 03:00:32,353 WARN com.atmire.metadataquality.batchedit.BatchEditConsumer @ BatchEditConsumer should not have been given this kind of Subject in an event, skipping: org.dspace.event.Event(eventType=MODIFY, SubjectType=ITEM, SubjectID=80044, ObjectType=(Unknown), ObjectID=-1, TimeStamp=1480647632351, dispatcher=1544803905, detail=[null], transactionID="TX157907838689377964651674089851855413607")
+
+
+dc.type
to the output options for Atmire’s Listings and Reports module (#286)ORCID:dc.contributor.author
with the following random ORCIDs from the ORCID registry:0000-0002-6115-0956||0000-0002-3812-8793||0000-0001-7462-405X
+
+
+
+ Read more →
+DC=ILRI
to determine whether a user was ILRI or not$ ldapsearch -x -H ldaps://svcgroot2.cgiarad.org:3269/ -b "dc=cgiarad,dc=org" -D "admigration1@cgiarad.org" -W "(sAMAccountName=admigration1)"
+
+
+
+ Read more →
+Documenting day-to-day work on the CGSpace repository.
+bower.json
because most are several versions of out datefonts
)$ git checkout -b 55new 5_x-prod
+$ git reset --hard ilri/5_x-prod
+$ git rebase -i dspace-5.5
+
+
+
+ Read more →
+dc.description.sponsorship
to Discovery sidebar facets and make investors clickable in item view (#232)dspacetest=# update metadatavalue set text_value = regexp_replace(text_value, '(^.+?),$', '\1') where metadata_field_id=3 and resource_type_id=2 and text_value ~ '^.+?,$';
+UPDATE 95
+dspacetest=# select text_value from metadatavalue where metadata_field_id=3 and resource_type_id=2 and text_value ~ '^.+?,$';
+ text_value
+------------
+(0 rows)
+
+
+ListSets
verb: http://ebrary.ifpri.org/oai/oai.php?verb=ListSetsdc.identifier.fund
to cg.identifier.cpwfproject
and then the rest to dc.description.sponsorship
# awk '{print $1}' /var/log/nginx/rest.log | uniq | wc -l
+3168
+
+
+
+ Read more →
+checker
log has some errors we should pay attention to:index-lucene-update
cron job active on CGSpace, but I’m pretty sure we don’t need it as of the latest few versions of Atmire’s Listings and Reports module10568/12503
from 10568/27869
to 10568/27629
using the move_collections.sh script I wrote last year.lzop
with xz
in log compression cron jobs on DSpace Test—it uses less space:# cd /home/dspacetest.cgiar.org/log
+# ls -lh dspace.log.2015-11-18*
+-rw-rw-r-- 1 tomcat7 tomcat7 2.0M Nov 18 23:59 dspace.log.2015-11-18
+-rw-rw-r-- 1 tomcat7 tomcat7 387K Nov 18 23:59 dspace.log.2015-11-18.lzo
+-rw-rw-r-- 1 tomcat7 tomcat7 169K Nov 18 23:59 dspace.log.2015-11-18.xz
+
+
+
+ Read more →
+$ psql -c 'SELECT * from pg_stat_activity;' | grep idle | grep -c cgspace
+78
+
+
+
+ Read more →
+Documenting day-to-day work on the CGSpace repository.
+dc.rights
proof of concept that I had sent him last weekjmx_tomcat_dbpools
provided by Ubuntu’s munin-plugins-java
package and used the stuff I discovered about JMX in 2018-0102/Jan/2018:11:27:17 +0000
when Uptime Robot got an HTTP 500Caused by: org.apache.tomcat.jdbc.pool.PoolExhaustedException: [http-bio-127.0.0.1-8443-exec-980] Timeout: Pool empty. Unable to fetch a connection in 5 seconds, none available[size:50; busy:50; idle:0; lastwait:5000].
+
+
+2018-01-02 01:21:19,137 ERROR org.dspace.app.xmlui.aspect.discovery.SidebarFacetsTransformer @ Error while searching for sidebar facets
+org.dspace.discovery.SearchServiceException: org.apache.solr.search.SyntaxError: Cannot parse 'dateIssued_keyword:[1976+TO+1979]': Encountered " "]" "] "" at line 1, column 32.
+
+
+$ grep -c "Error while searching for sidebar facets" dspace.log.*
+dspace.log.2017-11-21:4
+dspace.log.2017-11-22:1
+dspace.log.2017-11-23:4
+dspace.log.2017-11-24:11
+dspace.log.2017-11-25:0
+dspace.log.2017-11-26:1
+dspace.log.2017-11-27:7
+dspace.log.2017-11-28:21
+dspace.log.2017-11-29:31
+dspace.log.2017-11-30:15
+dspace.log.2017-12-01:15
+dspace.log.2017-12-02:20
+dspace.log.2017-12-03:38
+dspace.log.2017-12-04:65
+dspace.log.2017-12-05:43
+dspace.log.2017-12-06:72
+dspace.log.2017-12-07:27
+dspace.log.2017-12-08:15
+dspace.log.2017-12-09:29
+dspace.log.2017-12-10:35
+dspace.log.2017-12-11:20
+dspace.log.2017-12-12:44
+dspace.log.2017-12-13:36
+dspace.log.2017-12-14:59
+dspace.log.2017-12-15:104
+dspace.log.2017-12-16:53
+dspace.log.2017-12-17:66
+dspace.log.2017-12-18:83
+dspace.log.2017-12-19:101
+dspace.log.2017-12-20:74
+dspace.log.2017-12-21:55
+dspace.log.2017-12-22:66
+dspace.log.2017-12-23:50
+dspace.log.2017-12-24:85
+dspace.log.2017-12-25:62
+dspace.log.2017-12-26:49
+dspace.log.2017-12-27:30
+dspace.log.2017-12-28:54
+dspace.log.2017-12-29:68
+dspace.log.2017-12-30:89
+dspace.log.2017-12-31:53
+dspace.log.2018-01-01:45
+dspace.log.2018-01-02:34
+
+
+# grep -c "CORE" /var/log/nginx/access.log
+0
+
+
+dspace=# \copy (select distinct text_value, count(*) as count from metadatavalue where metadata_field_id = (select metadata_field_id from metadatafieldregistry where element = 'contributor' and qualifier = 'author') AND resource_type_id = 2 group by text_value order by count desc) to /tmp/authors.csv with csv;
+COPY 54701
+
+
+
+ Read more →
+http://hdl.handle.net/10568/78495||http://hdl.handle.net/10568/79336
+
+
+Rough notes for importing the CGIAR Library content. It was decided that this content would go to a new top-level community called CGIAR System Organization.
+ + + Read more → +dspace.log.2017-08-01
, they are all using the same Tomcat sessionrobots.txt
only blocks the top-level /discover
and /browse
URLs… we will need to find a way to forbid them from accessing these!X-Robots-Tag "none"
HTTP header, but this only forbids the search engine from indexing the page, not crawling it!dc.description.abstract
column, which caused OpenRefine to choke when exporting the CSVg/^$/d
-x
) plus sed
to format the output into quasi XML:Rough notes for importing the CGIAR Library content. It was decided that this content would go to a new top-level community called CGIAR System Organization.
+ + + Read more → +