Add notes for 2023-04-27

2023-04-27 13:10:13 -07:00
parent 0ca3cadbef
commit ad8516bbb3
33 changed files with 129 additions and 41 deletions

@@ -58,7 +58,7 @@ There seem to be many more of these:
"@type": "BlogPosting",
"headline": "June, 2022",
"url": "https://alanorth.github.io/cgspace-notes/2022-06/",
-"wordCount": "1786",
+"wordCount": "1788",
"datePublished": "2022-06-06T09:01:36+03:00",
"dateModified": "2022-08-03T21:01:39+03:00",
"author": {
@@ -349,7 +349,7 @@ There seem to be many more of these:
</ul>
<h2 id="2022-06-28">2022-06-28</h2>
<ul>
-<li>Start working on the CGSpace subject export for FAO</li>
+<li>Start working on the CGSpace subject export for FAO / AGROVOC</li>
<li>First I exported a list of all metadata in our <code>dcterms.subject</code> and other center-specific subject fields with their counts:</li>
</ul>
<div class="highlight"><pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;"><code class="language-console" data-lang="console"><span style="display:flex;"><span>localhost/dspacetest= ☘ \COPY (SELECT DISTINCT text_value AS &#34;subject&#34;, count(*) FROM metadatavalue WHERE dspace_object_id in (SELECT dspace_object_id FROM item) AND metadata_field_id IN (187, 120, 210, 122, 215, 127, 208, 124, 128, 123, 125, 135, 203, 236, 238, 119) GROUP BY &#34;subject&#34; ORDER BY count DESC) to /tmp/2022-06-28-cgspace-subjects.csv WITH CSV HEADER;
@@ -366,7 +366,7 @@ There seem to be many more of these:
<li>Using rdflib to open the 1.2GB <code>agrovoc_lod.rdf</code> file takes several minutes and doesn&rsquo;t seem very efficient</li>
</ul>
</li>
-<li>I tried using <a href="https://github.com/ozekik/lightrdf">lightrdf</a> and it&rsquo;s much quicker, but the documentation is limiting and I&rsquo;m not sure how to search yet
+<li>I tried using <a href="https://github.com/ozekik/lightrdf">lightrdf</a> and it&rsquo;s much quicker, but the documentation is limited and I&rsquo;m not sure how to search yet
<ul>
<li>I had to try in different Python versions because 3.10.x is apparently too new</li>
</ul>