Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  1. Download the .zip file for your campus. Contact David if you don’t have that URL.

  2. There is a separate CSV file within the .zip file for each data model/work form, of which we currently have four: dataset, educational resource, publication, and thesis. If your campus will be cleaning up Title metadata for more than just ETDs, you will need to complete the following clean up steps for each CSV separately.

  3. For each CSV file, download it and save it locally. Then delete all fields (columns) except for id (column A) and title (column I).

  4. Add a new column C (you can name it “title_revised” for now), and in cell C2, type =PROPER(B2) and press enter/return on your keyboard. This will convert the all caps text in B2 to title case.

  5. Then fill the formula down the entire column. Please consult Excel’s documentation for additional guidance with that.

  6. Then copy the output of that formula and paste it as values in a new column (you can name that one “title_revised2” for now).

  7. As noted above, this formula will capitalize the first letter of all words and lower case the rest, so spot checking will likely be required to identify those words that should be all caps (e.g., acronyms) and identify words that should not be capitalized (e.g., a, an, of, at).

    Screenshot of Replace example in Excel, with Match case selected
    1. You may want to consider using find and replace to automate this process a bit, working from the title-revised2 column. For example, you could replace all instances of “Of “ with “of “ as shown in the screenshot to the right. Just be sure to include the subsequent a space before and after the word in order to avoid replacing longer words that simply begin with the same letters, and also be sure to check the “Match case” option.

  8. Once the spot checking is completed, delete the original title column , and the “title_revised” column. Then rename the “title_revised” revised2” column “title”.

  9. That’s it! Be sure to save the CSV again, and you can send the file back to David. He will take it from there and give you the opportunity for a “sanity check” before finalizing the changes.

...