Today we are going to try and automate pandas (https://github.com/bmaillou/BigDataPython/tree/master/11-morePandas). Lets see how we can do.
VIDEO
Homework Due: Due Next Class
(Monday October 12, 2025)
- use the well_data.xlsx file.
- Make a pdf file that contains a plot of As versus every other parameter. Make sure it is labeled nicely and in one pdf file. Arsenic should be on the x-axis and the other parameters should be on the y-axis.
- Each parameter should be on one page.
- make sure your axes go in the correct direction.
- Don’t leave in leftover code that messes you up.
- You will need to hand in your pdf file of results.
- It can be all one color!! Choose a your own symbol and color.
- The column “Drink” is an object and not a float. It plots okay when on an x-axis but not a y-axis. If your code crashes you need to skip this column.
- Make a plot with subplots that is As versus at least 3 other parameters each on their own graph but all on one page. You can decide if it is better to share an x or share a y axis. But Arsenic is the independent variable.
- Save this graph to a jpeg and turn that in .
- For this your graphs should be square. Adjust the figure size to make them square.
- They should have letters denoting them like in the packet.
- Final 10%
- Redo #3 the plot with 3 boxes but color the points by your parameter from the last homework (pump, platform, aquifer, shared). Then choose the color scheme from your favorite team, company etc and state which one you are using. Make sure to add a legend.
HINTS
The Drink column is going to cause scatter to crash. So you will need to avoid it. See notes from class.