Attendees

  • Alan Stanley
  • Amy Blau
  • Dana Bronson
  • Dara Virks
  • David Wilcox
  • Kun Lin
  • Paige Morfitt 

Agenda

Topic

Stand-up

  1. What has everyone been working on?
  2. What will you be working on over the next couple weeks?
  3. Are there any blockers that are preventing you from getting work done?
Sample Data issues and questions

Priorities and areas of focus

  1. Timeline for remaining work
  2. Possible cut-over date
Wrap-up and next steps

Notes

  1. Dara
    1. Some issues with configs being overwritten on staging
      1. BD doing an update so this won't happen anymore
      2. Planning to freeze staging this afternoon for the update
    2. Don will be working on the Access Control work next week
  2. Alan
    1. Ingesting newspapers from laptop is very slow (about an hour per issue vs 3-4 minutes on staging)
      1. Ingesting on the server fails to create OCR. Alan is investigating.
        1. Hypercube grabs the image and sends to Tesseract, seems to be timing out here
          1. Nginx may be timing out at 64 seconds, Alan will try doubling or tripling this number
          2. Could be due to server load 
  3. Amy
    1. Created a spreadsheet of sample data issues
      1. Title length is an issue - there is a 255 character limit
        1. Whitman team will discuss
    2. Checklist for spreadsheets
    3. parent_id vs member_of
      1. parent_id is used for ingest, member_of is used for relationships in the repository
      2. one of the fields has a space in the spreadsheet, which workbench fails on
  4. Timeline
    1. Dara to investigate when prod site will be ready for production ingests
    2. Should be able to start production ingest within the next 2 weeks
  5. To discuss
    1. VTT transcripts

Actions