mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-27 05:49:12 +01:00
Add notes for 2023-04-27
@@ -58,7 +58,7 @@ There seem to be many more of these:
 "@type": "BlogPosting",
 "headline": "June, 2022",
 "url": "https://alanorth.github.io/cgspace-notes/2022-06/",
-"wordCount": "1786",
+"wordCount": "1788",
 "datePublished": "2022-06-06T09:01:36+03:00",
 "dateModified": "2022-08-03T21:01:39+03:00",
 "author": {
@@ -349,7 +349,7 @@ There seem to be many more of these:
 </ul>
 <h2 id="2022-06-28">2022-06-28</h2>
 <ul>
-<li>Start working on the CGSpace subject export for FAO</li>
+<li>Start working on the CGSpace subject export for FAO / AGROVOC</li>
 <li>First I exported a list of all metadata in our <code>dcterms.subject</code> and other center-specific subject fields with their counts:</li>
 </ul>
 <div class="highlight"><pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;"><code class="language-console" data-lang="console"><span style="display:flex;"><span>localhost/dspacetest= ☘ \COPY (SELECT DISTINCT text_value AS "subject", count(*) FROM metadatavalue WHERE dspace_object_id in (SELECT dspace_object_id FROM item) AND metadata_field_id IN (187, 120, 210, 122, 215, 127, 208, 124, 128, 123, 125, 135, 203, 236, 238, 119) GROUP BY "subject" ORDER BY count DESC) to /tmp/2022-06-28-cgspace-subjects.csv WITH CSV HEADER;
@@ -366,7 +366,7 @@ There seem to be many more of these:
 <li>Using rdflib to open the 1.2GB <code>agrovoc_lod.rdf</code> file takes several minutes and doesn’t seem very efficient</li>
 </ul>
 </li>
-<li>I tried using <a href="https://github.com/ozekik/lightrdf">lightrdf</a> and it’s much quicker, but the documentation is limiting and I’m not sure how to search yet
+<li>I tried using <a href="https://github.com/ozekik/lightrdf">lightrdf</a> and it’s much quicker, but the documentation is limited and I’m not sure how to search yet
 <ul>
 <li>I had to try in different Python versions because 3.10.x is apparently too new</li>
 </ul>
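The notes in this diff export subjects with their counts to a CSV (the `\COPY ... WITH CSV HEADER` query yields `subject` and `count` columns) and then look for a way to check them against AGROVOC. A minimal sketch of that checking step, assuming the vocabulary labels have already been extracted into a plain collection of strings by some other means; the function name `split_subjects`, the case-insensitive comparison, and the sample rows are illustrative assumptions, not the approach recorded in the notes:

```python
import csv
import io


def split_subjects(csv_file, known_labels):
    """Split exported subject rows into (matched, unmatched) lists.

    csv_file: an open text file with "subject" and "count" columns.
    known_labels: an iterable of vocabulary labels (hypothetical here).
    """
    # Normalize labels once so the comparison ignores case and outer whitespace
    known = {label.strip().lower() for label in known_labels}
    matched, unmatched = [], []
    for row in csv.DictReader(csv_file):
        entry = (row["subject"], int(row["count"]))
        if row["subject"].strip().lower() in known:
            matched.append(entry)
        else:
            unmatched.append(entry)
    return matched, unmatched


# Tiny inline sample standing in for the real export (made-up values)
sample = io.StringIO("subject,count\nMAIZE,12\nclimate change,7\nnot a term,1\n")
matched, unmatched = split_subjects(sample, ["maize", "climate change"])
print(matched)
print(unmatched)
```

Holding only the labels in a set keeps lookups constant-time and avoids reloading the full RDF graph for every subject, which may matter given the file size mentioned above.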