Update notes for 2019-02-21

2019-02-21 17:21:37 -08:00
parent bcdf2a1e26
commit 1c64769a65
3 changed files with 12 additions and 10 deletions


@@ -1020,11 +1020,12 @@ $ ./agrovoc-lookup.py -l fr -i /tmp/top-1500-subjects.txt -om /tmp/matched-subje
$ cat /tmp/matched-subjects-* | sort | uniq > /tmp/2019-02-21-matched-subjects.txt
```
- And then a list of all the unique *unmatched* terms using some utility I've never heard of before called `comm`:
- And then a list of all the unique *unmatched* terms using some utility I've never heard of before called `comm` or with `diff`:
```
$ sort /tmp/top-1500-subjects.txt > /tmp/subjects-sorted.txt
$ comm -13 /tmp/2019-02-21-matched-subjects.txt /tmp/subjects-sorted.txt > /tmp/2019-02-21-unmatched-subjects.txt
$ diff --new-line-format="" --unchanged-line-format="" /tmp/subjects-sorted.txt /tmp/2019-02-21-matched-subjects.txt > /tmp/2019-02-21-unmatched-subjects.txt
```
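- A toy illustration of the two approaches above, using made-up terms in a scratch directory from `mktemp` (the file names and terms are hypothetical, not the real subject lists). It assumes GNU diffutils for the `--new-line-format` and `--unchanged-line-format` options, and both commands should leave only the lines that are in the full list but missing from the matched list:

```shell
# Work in a scratch directory so nothing in /tmp is overwritten
workdir=$(mktemp -d)

# Hypothetical sample data; comm needs both inputs sorted in the same collation
printf '%s\n' 'WHEAT' 'CASSAVA' 'MAIZE' 'LIVESTOCK' | LC_ALL=C sort > "$workdir/all.txt"
printf '%s\n' 'MAIZE' 'CASSAVA' | LC_ALL=C sort > "$workdir/matched.txt"

# comm -13 hides column 1 (lines only in matched.txt) and column 3 (lines in both),
# leaving the lines that appear only in all.txt
LC_ALL=C comm -13 "$workdir/matched.txt" "$workdir/all.txt" > "$workdir/unmatched-comm.txt"

# With the new and unchanged line formats emptied, diff prints only the lines
# unique to the first file; it exits with status 1 when the files differ, so
# that status is not treated as a failure here
diff --new-line-format="" --unchanged-line-format="" \
  "$workdir/all.txt" "$workdir/matched.txt" > "$workdir/unmatched-diff.txt" || true

cat "$workdir/unmatched-comm.txt"
```

With this sample data both output files contain `LIVESTOCK` and `WHEAT`. Note the argument order: `comm -13` takes the matched list first, while the `diff` form takes the full list first.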
- Generate a list of countries and regions from CGSpace for Sisay to look through: