Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  1. Download the .zip file for your campus. Contact David if you don’t have that URL.

  2. There is a separate CSV file within the .zip file for each data model/work form, of which we currently have four: dataset, educational resource, publication, and thesis. If your campus will be cleaning up Title metadata for more than just ETDs, you will need to complete the following clean up steps for each CSV separately.

  3. For each CSV file, download it and save it locally. Then delete all fields (columns) except for id (column A) and title (column I).

  4. There are a couple of ways to automate the clean up process.

    1. Using the Title Case Converter: This tool allows you to convert text based on the capitalization guidelines of various formatting styles. To facilitate batch editing, your campus may want to decide to use one style in all cases, regardless of disciplinary variations. Please consult the Which Title Case Style Should You Use? page for additional information.

      1. Copy all the metadata in the title column and paste it into Notepad or a similar program of your choice to strip out unwanted formatting.

      2. Then paste that text into the text box on the homepage of the Title Case Converter. Note: The converter does not have an explicit character limit, but it seems to work best with approximately 500 or fewer words, so you may need to do this in batches.

      3. Select the title case style your campus has decided to use, and click the “Convert” button.

      4. When taken to the list of converted titles, click the “Copy All” button.

      5. Back in the CSV file, add a new column (you can name it title_revised for now) and paste the converted titles into that column.

      6. You may want to do some spot checking, as issues can arise with acronyms, book/film titles, etc. Depending on the original formatting of your Title metadata, it may be helpful to refer to that column as you spot check.

      7. Once the spot checking is completed, delete the original title column, and rename the “title_revised” column “title”.

      8. That’s it! You Be sure to save the CSV again, and you can send the CSV file back to David, and he . He will take it from there and give you the opportunity for a “sanity check” before finalizing the changes.