Attendees

  • Alan Stanley
  • Amy Blau
  • Dana Bronson
  • Dara Virks
  • David Wilcox
  • Kun Lin
  • Paige Morfitt 

Agenda

Topic

Stand-up

  1. What has everyone been working on?
  2. What will you be working on over the next couple weeks?
  3. Are there any blockers that are preventing you from getting work done?
Production Migration Schedule

Priorities and areas of focus

  1. Timeline for remaining work
  2. Possible cut-over date
  3. Documentation tasks
Wrap-up and next steps

Notes

  1. PDF ingest
    1. Some PDFs are very large (600MB+)
      1. May need to downsample PDFs so they will load faster
    2. Thumbnail generation service pulls the entire PDF down to generate the thumbnail
      1. Crayfish has a hardwired timeout of 1 minute
      2. Danny Lamb has mostly resolved this issue
      3. We can pull the thumbnails over rather than generating them, but that doesn't help future ingests
      4. We can setup the server for VBOs; this will allow us to fix any missing derivatives
    3. Difficult to tell which resources are missing data streams 
      1. Alan can write a small utility to check for missing data
  2. Newspapers
    1. Showing up as individual PDFs but not in a newspaper issue browse view
    2. Need to have "Newspaper" model selected in order to get browse view
    3. Amy will verify search works
  3. Staging limited access files
    1. There is a good Python3 library that can read files from S3 with the right credentials
      1. Alan can extend this to work for this use case
    2. Easier option - put the limited access files on a drive on the server where the script can access them
  4. VTT files
    1. This should be currently supported
    2. Alan will look into this

Actions

  • Amy: Develop requirements for license files and share with Alan
  • Whitman team: Start putting limited access files on the server for Workbench access