mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-24 04:23:22 +01:00
Update notes for 2016-11-08
This commit is contained in:
parent
42ca377f12
commit
cfe5796b3a
@ -85,3 +85,13 @@ $ ./fix-metadata-values.py -i /tmp/CRPs.csv -f cg.contributor.crp -t correct -m
|
||||
![Listings and Reports broken in DSpace 5.5](2016/11/listings-and-reports-55.png)
|
||||
|
||||
- I've filed a ticket with Atmire
|
||||
- Thinking about batch updates for ORCIDs and authors
|
||||
- Playing with [SolrClient](https://github.com/moonlitesolutions/SolrClient) in Python to query Solr
|
||||
- All records in the authority core are either `authority_type:orcid` or `authority_type:person`
|
||||
- There is a `deleted` field and all items seem to be `false`, but might be important sanity check to remember
|
||||
- The way to go is probably to have a CSV of author names and authority IDs, then to batch update them in PostgreSQL
|
||||
- Dump of the top ~200 authors in CGSpace:
|
||||
|
||||
```
|
||||
dspace=# \copy (select distinct text_value, count(*) from metadatavalue where metadata_field_id=3 group by text_value order by count desc limit 210) to /tmp/210-authors.csv with csv;
|
||||
```
|
||||
|
@ -186,8 +186,17 @@ COPY 22
|
||||
|
||||
<ul>
|
||||
<li>I’ve filed a ticket with Atmire</li>
|
||||
<li>Thinking about batch updates for ORCIDs and authors</li>
|
||||
<li>Playing with <a href="https://github.com/moonlitesolutions/SolrClient">SolrClient</a> in Python to query Solr</li>
|
||||
<li>All records in the authority core are either <code>authority_type:orcid</code> or <code>authority_type:person</code></li>
|
||||
<li>There is a <code>deleted</code> field and all items seem to be <code>false</code>, but might be important sanity check to remember</li>
|
||||
<li>The way to go is probably to have a CSV of author names and authority IDs, then to batch update them in PostgreSQL</li>
|
||||
<li>Dump of the top ~200 authors in CGSpace:</li>
|
||||
</ul>
|
||||
|
||||
<pre><code>dspace=# \copy (select distinct text_value, count(*) from metadatavalue where metadata_field_id=3 group by text_value order by count desc limit 210) to /tmp/210-authors.csv with csv;
|
||||
</code></pre>
|
||||
|
||||
|
||||
|
||||
|
||||
|
@ -113,7 +113,16 @@ COPY 22
|
||||
|
||||
<ul>
|
||||
<li>I&rsquo;ve filed a ticket with Atmire</li>
|
||||
<li>Thinking about batch updates for ORCIDs and authors</li>
|
||||
<li>Playing with <a href="https://github.com/moonlitesolutions/SolrClient">SolrClient</a> in Python to query Solr</li>
|
||||
<li>All records in the authority core are either <code>authority_type:orcid</code> or <code>authority_type:person</code></li>
|
||||
<li>There is a <code>deleted</code> field and all items seem to be <code>false</code>, but might be important sanity check to remember</li>
|
||||
<li>The way to go is probably to have a CSV of author names and authority IDs, then to batch update them in PostgreSQL</li>
|
||||
<li>Dump of the top ~200 authors in CGSpace:</li>
|
||||
</ul>
|
||||
|
||||
<pre><code>dspace=# \copy (select distinct text_value, count(*) from metadatavalue where metadata_field_id=3 group by text_value order by count desc limit 210) to /tmp/210-authors.csv with csv;
|
||||
</code></pre>
|
||||
</description>
|
||||
</item>
|
||||
|
||||
|
@ -113,7 +113,16 @@ COPY 22
|
||||
|
||||
<ul>
|
||||
<li>I&rsquo;ve filed a ticket with Atmire</li>
|
||||
<li>Thinking about batch updates for ORCIDs and authors</li>
|
||||
<li>Playing with <a href="https://github.com/moonlitesolutions/SolrClient">SolrClient</a> in Python to query Solr</li>
|
||||
<li>All records in the authority core are either <code>authority_type:orcid</code> or <code>authority_type:person</code></li>
|
||||
<li>There is a <code>deleted</code> field and all items seem to be <code>false</code>, but might be important sanity check to remember</li>
|
||||
<li>The way to go is probably to have a CSV of author names and authority IDs, then to batch update them in PostgreSQL</li>
|
||||
<li>Dump of the top ~200 authors in CGSpace:</li>
|
||||
</ul>
|
||||
|
||||
<pre><code>dspace=# \copy (select distinct text_value, count(*) from metadatavalue where metadata_field_id=3 group by text_value order by count desc limit 210) to /tmp/210-authors.csv with csv;
|
||||
</code></pre>
|
||||
</description>
|
||||
</item>
|
||||
|
||||
|
@ -112,7 +112,16 @@ COPY 22
|
||||
|
||||
<ul>
|
||||
<li>I&rsquo;ve filed a ticket with Atmire</li>
|
||||
<li>Thinking about batch updates for ORCIDs and authors</li>
|
||||
<li>Playing with <a href="https://github.com/moonlitesolutions/SolrClient">SolrClient</a> in Python to query Solr</li>
|
||||
<li>All records in the authority core are either <code>authority_type:orcid</code> or <code>authority_type:person</code></li>
|
||||
<li>There is a <code>deleted</code> field and all items seem to be <code>false</code>, but might be important sanity check to remember</li>
|
||||
<li>The way to go is probably to have a CSV of author names and authority IDs, then to batch update them in PostgreSQL</li>
|
||||
<li>Dump of the top ~200 authors in CGSpace:</li>
|
||||
</ul>
|
||||
|
||||
<pre><code>dspace=# \copy (select distinct text_value, count(*) from metadatavalue where metadata_field_id=3 group by text_value order by count desc limit 210) to /tmp/210-authors.csv with csv;
|
||||
</code></pre>
|
||||
</description>
|
||||
</item>
|
||||
|
||||
|
Loading…
x
Reference in New Issue
Block a user