mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-27 05:49:12 +01:00
Update notes for 2019-06-10
This commit is contained in:
@ -56,5 +56,23 @@ statistics-2018: org.apache.solr.common.SolrException:org.apache.solr.common.Sol
|
||||
|
||||
- Rename the AReS repository on GitHub to OpenRXV: https://github.com/ilri/OpenRXV
|
||||
- Create a new AReS repository: https://github.com/ilri/AReS
|
||||
- Start looking at the 203 IITA records on DSpace Test from last month ([IITA_May_16](https://dspacetest.cgiar.org/handle/10568/102032) aka "20194th.xls") using OpenRefine
|
||||
- Trim leading, trailing, and consecutive whitespace on all columns, but I didn't notice very many issues
|
||||
- Validate affiliations against latest list of top 1500 terms using reconcile-csv, correcting and standardizing about twenty-seven
|
||||
- Validate countries against latest list of countries using reconcile-csv, correcting three
|
||||
- Convert all DOIs to "https://dx.doi.org" format
|
||||
- Normalize all `cg.identifier.url` Google book fields to "books.google.com"
|
||||
- Correct some inconsistencies in IITA subjects
|
||||
- Correct two incorrect "Peer Review" in `dc.description.version`
|
||||
- About fifteen items have incorrect ISBNs (looks like an Excel error because the values look like scientific numbers)
|
||||
- I managed to get to subjects, so I'll continue from there when I start working next
|
||||
- Generate a new list of countries from the database for use with reconcile-csv
|
||||
- After dumping, use csvcut to add line numbers, then change the csv header to match those you use in reconcile-csv, for example `id` and `name`:
|
||||
|
||||
```
|
||||
dspace=# \COPY (SELECT DISTINCT text_value, count(*) FROM metadatavalue WHERE metadata_field_id = 228 AND resource_type_id = 2 GROUP BY text_value ORDER BY count DESC) to /tmp/countries.csv WITH CSV HEADER
|
||||
COPY 192
|
||||
$ csvcut -l -c 0 /tmp/countries.csv > 2019-06-10-countries.csv
|
||||
```
|
||||
|
||||
<!-- vim: set sw=2 ts=2: -->
|
||||
|
Reference in New Issue
Block a user