mirror of
https://github.com/alanorth/cgspace-notes.git
synced 2025-01-27 05:49:12 +01:00
Update notes for 2017-02-28
@ -310,4 +310,14 @@ dspace=# \copy (select resource_id, metadata_value_id from metadatavalue where r
COPY 1968
```
- And then using awk or uniq to either remove or print the lines that have a duplicate `resource_id` (meaning they belong to the same item in DSpace and are therefore duplicates), and then using the `metadata_value_id` to delete them
- And then use awk to print the duplicate lines to a separate file:
```
$ awk -F',' 'seen[$1]++' /tmp/ciat.csv > /tmp/ciat-dupes.csv
```
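
To make the `seen[$1]++` idiom concrete, here is a minimal sketch on made-up data. The sample rows and the temporary file are hypothetical; only the awk program itself comes from the notes. The expression is 0 (false) the first time a `resource_id` appears and positive on every later row, so awk prints only the repeats and keeps the first occurrence out of the output.

```shell
# Hypothetical sample in the same resource_id,metadata_value_id layout
sample=$(mktemp)
printf '%s\n' '10,100' '10,101' '11,102' '10,103' > "$sample"

# Same awk program as above: print a row only when its first column
# (resource_id) has already been seen on an earlier row
dupes=$(awk -F',' 'seen[$1]++' "$sample")
echo "$dupes"

rm -f "$sample"
```

With this input the first row for each `resource_id` (`10,100` and `11,102`) is left out, which is why deleting the printed rows should leave one value per item.
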
- From that file I can create a list of 279 deletes and put them in a batch script like:
```
delete from metadatavalue where resource_type_id=2 and metadata_field_id=3 and metadata_value_id=2742061;
```
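
The notes do not say how the batch script was built, so the following is only one possible sketch, assuming the second CSV column is the `metadata_value_id` (as in the `\copy` query) and reusing the `resource_type_id` and `metadata_field_id` conditions from the statement above. The input rows are hypothetical stand-ins for `/tmp/ciat-dupes.csv`.

```shell
# Hypothetical stand-in for /tmp/ciat-dupes.csv (resource_id,metadata_value_id)
dupes_file=$(mktemp)
printf '%s\n' '5001,2742061' '5002,2742075' > "$dupes_file"

# Emit one delete per row, skipping any row whose second column is not a
# plain integer so that stray or malformed lines never reach the SQL
deletes=$(awk -F',' '$2 ~ /^[0-9]+$/ {
  printf "delete from metadatavalue where resource_type_id=2 and metadata_field_id=3 and metadata_value_id=%s;\n", $2
}' "$dupes_file")
echo "$deletes"

rm -f "$dupes_file"
```

Redirecting the output to a file and counting its lines with `wc -l` is a cheap check that the number of statements matches the expected number of deletes before anything is run against the database.
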