Update notes for 2016-11-08

2016-11-08 12:44:29 +02:00
parent 42ca377f12
commit cfe5796b3a
5 changed files with 46 additions and 0 deletions

@@ -85,3 +85,13 @@ $ ./fix-metadata-values.py -i /tmp/CRPs.csv -f cg.contributor.crp -t correct -m
![Listings and Reports broken in DSpace 5.5](2016/11/listings-and-reports-55.png)
- I've filed a ticket with Atmire
- Thinking about batch updates for ORCIDs and authors
- Playing with [SolrClient](https://github.com/moonlitesolutions/SolrClient) in Python to query Solr
- All records in the authority core are either `authority_type:orcid` or `authority_type:person`
- There is a `deleted` field and all items seem to be `false`, but it might be an important sanity check to remember
- The way to go is probably to have a CSV of author names and authority IDs, then to batch update them in PostgreSQL
- Dump of the top ~200 authors in CGSpace:
```
dspace=# \copy (select distinct text_value, count(*) from metadatavalue where metadata_field_id=3 group by text_value order by count desc limit 210) to /tmp/210-authors.csv with csv;
```
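
A minimal sketch of the CSV-driven batch update idea above, not a tested migration. It assumes a hypothetical CSV with `text_value` and `authority` columns, and assumes that `metadatavalue` has an `authority` column to write to. It only builds parameterized statements and prints them, so nothing touches the database:

```python
import csv
import io

# Assumption: metadatavalue has an "authority" column; verify against the real schema.
UPDATE_SQL = (
    "UPDATE metadatavalue SET authority = %s "
    "WHERE metadata_field_id = %s AND text_value = %s"
)
AUTHOR_FIELD_ID = 3  # same metadata_field_id as in the dump query above


def build_updates(csv_file):
    """Return a list of (sql, params) tuples, one per usable CSV row."""
    updates = []
    for row in csv.DictReader(csv_file):
        name = (row.get("text_value") or "").strip()
        authority = (row.get("authority") or "").strip()
        if not name or not authority:
            continue  # skip rows missing a name or an authority ID
        updates.append((UPDATE_SQL, (authority, AUTHOR_FIELD_ID, name)))
    return updates


# Hypothetical sample data; the names and IDs are placeholders, not real records.
sample = io.StringIO(
    "text_value,authority\n"
    '"Doe, Jane",hypothetical-authority-id-1\n'
    '"Roe, Richard",\n'
)
updates = build_updates(sample)
for sql, params in updates:
    print(sql, params)
```

With a PostgreSQL driver that uses `%s` placeholders (psycopg2, for example), each tuple could be passed to `cursor.execute(sql, params)` inside a single transaction, so a bad row can be rolled back before anything is committed.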