The Capstone Experience for the BSU Computer Science BS First (solo) Project





Project 1 Sprint 3


Summary: In this sprint3 of your first project, You will read data out of an excel file to add to your database


Tools:

We will continue to use the same version control, continuous integration tools and programming language requirements that we put together for parts one & two.


Sprint 3: Getting some additional data: 

Due: Wednesday  Feb 24th  at 11:59pm However as per Slack submissions will be accepted as on time until 11:59pm Sun Feb 28



  1. Use the same project that you used last time. I will just do a git pull on the project and get the updates.

  2. Get the excel file for wage and job data by state. Either get the local copy that I downloaded here. Or get the original online, you want the state by state data. The most recent data is from May 2019, it seems to come out each May and I'm guessing they didn't post it last year because the Covid effects would be too much of an issue - either that or it take a really long time to get the data out.
  3. Create at least one more table in your database with state by state employment data (be able to store the data from step 4 below)
  4. Get the following data from the excel sheet and store it in your database
    1. for each state (I don't care if you include territories or not)  get every major employment category and record (as a DB row)
      1. the state
      2. the occupation major title
      3. the total employment in that field in that state
      4. the 25th percentile salary (lets assume that most college grads earn in the lower 25%) for that field both hourly and annual
      5. the total employment in that state for that field oops typo, It should be the Occupation code instead
  5. Write automated tests
    1. write tests to assure your method to read from the xlsx file works properly
      1. eg, make sure you get data from all 50 states from the original data
      2. maybe create a test xlsx with a limited number of major occupational groups along with some other stuff. make sure you get the right number of major occupational groups
    2. write a test to make sure the new table is there
    3. write a test to make sure the old table is still there if test is still not there from previous sprint
    4. write a test to make sure that your new write to table works (unless you found a way to make the old write to table work and the old test covers it all.
  6. make sure your DevOps stuff all works (tests are run on github and formatting checks still work)
  7. update your readme and requirements (or equivalent go.mod or gradle or whatever you need for your language.)
  8. commit and push often so you lose nothing if your computer dies.





Some hints for working with xcel files:

)