Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Note
titleIf you are editing with Microsoft Excel, be sure to open the CSV in Unicode/UTF-8 encoding

By default, Microsoft Excel may not correctly open the CSV in Unicode/UTF-8 encoding. This means that special characters may be improperly displayed and also can be "corrupted" during re-import of the CSV.

You need to tell Excel this CSV is Unicode, by importing it as follows. (Please note these instructions are valid for MS Office 2007 and 2013. Other Office versions may vary)

  • First, open Excel (with an empty sheet/workbook open)
  • Select "Data" tab
  • Click "From Text" button (in the "External Data" section)
  • Select your CSV file
  • Wizard Step 1
    • Choose "Delimited" option
    • Start import at row: 1
    • In the "File origin" selectbox, select "65001 : Unicode (UTF-8)"
      • NOTE: these encoding options are sorted alphabetically, so "Unicode (UTF-8)" appears near the bottom of the list.
    • Click Next
  • Wizard Step 2
    • Select "Comma" as the only delimiter
    • Click Next
  • Wizard Step 3
    • Select "Text" as the "Column data format" (Unfortunately, this must be done for each column individually in Excel)
      • At a minimum, you MUST ensure all date columns (e.g. dc.date.issued) are treated as "Text" so that Excel doesn't autoconvert DSpace's YYYY-MM-DD format into MM/DD/YYYY
      • To avoid such autoconversion, it is safest to ensure each column is treated as "Text".  Unfortunately, this means selecting each column one-by-one and choosing "Text" as the "Column data format".
    • Click Finish
  • Choose whether to open CSV in the existing sheet or a new one

...