Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

ScholarWorks provides batch metadata editing using CSV (Excel) files. Each campus can download the latest metadata (updated each night) from the URL provided by the Chancellor’s Office. Contact David if you don’t have that URL‘Metadata’ link in the sidebar of your dashboard.

The metadata will come in a .zip file, containing multiple CSV files – one for each data model / form. You can edit the spreadsheets as you like and email the changes to the Chancellor’s Office. At that point, we will create a report showing what changes will be made. Once you approve those, we’ll run the changes for real in ScholarWorks.

...

When editing a CSV export, here's a couple of basic tips to keep in mind:

  1. The "id" column must remain intact. This column also must always have a value in it.

  2. To simplify the CSV, you can simply remove any columns you do NOT wish to edit (except for the "id" column, see #1). Don't worry, removing the entire column won't delete metadata (see #3).

  3. When importing a CSV file, the importer will overlay the metadata onto what is already in the repository to determine the differences. It only acts on the contents of the CSV file, rather than on the complete item metadata. This means that the CSV file that is exported can be manipulated quite substantially before being re-imported. Rows (works) or columns (metadata elements) can be removed and will be ignored.

    For example, if you only want to edit "subject," you can remove all columns EXCEPT for "id" and "subject" so that you can just manipulate the "subject" field. On import, ScholarWorks will see that you've only included the "subject" field in your CSV and therefore will only update the "subject" metadata field for any items listed in that CSV.

  4. Because removing an entire column does NOT delete metadata value(s), if you actually wish to delete a metadata value you should leave the column intact, and simply clear out the appropriate row's value (in that column).