Using a program called "Processing", I will be using a Java environment to create data plots and explore the visualization. Here is how the IDE looks like below.
This is a nice working interface that allows visualizations to be drawn using setup() and draw(). Everything else I will be implementing outside of these functions to create the desired graphs.
Following the tutorial in Ben Fry's book "Visualizing Data", I recreated Chapter 4 to create the following graph.
There are some very cool things in this graph to visualize. To improve the visualization, I followed the requirements the class curriculum I'm in desired with numerical labels to Georgia font, and all the text titles to the Verdana font. I also made the "Year" title larger and the "Milk" title larger. Finally I rotated the "Gallons consumed per capita" 90 degrees and took out the extra lines.
To add to some changes, I made the ticks on the vertical bar go all the way across and I made it's weight lighter so it didn't take away from the graph. This will allow better visualization in where these data points close to the line actually lie. Finally, I took Edward Tufte's principle of less ink by taking the background color away. Below are the changes I implemented.
I continued the tutorial in the chapter and came up with the following graphs. The graph that is most appropriate would be the filled chart because these are the number of gallons consumed. The semantics would be much more understood with meaning if the area was filled beneath. Below are the images produced.
Continuing the tutorial in the book, I was able to create tabs and also an integrator method which is super cool because it animates the changes between the different graphs as you select one or another by click or by your keyboard. Below is the shown results of the tabs getting changed for one of the datatypes.
Showing the tabs is super useful so you know what datasets exist and know that there is an option to view different datasets. The integrator is even more useful since it animates the data moving within 1 second. My favorite part of the integrator is that the speed of the data changes more drastically if it is farther apart and you see small movements if the data is super close in a given year. The thing that is missing is hovering your mouse over one of the tabs. I would have made it known to the user that you can click on something by changing the background color if you hover over a section that is clickable. I would change the tool if showing tons of data points (more than 1000 points) by allowing you to zoom into certain regions and filter the data accordingly to what you want to look at.
To choose a view to display all datasets, I have combined all of the data into one plot. I also placed a key as the other tabs that shows the color coordination between the data points. The reason I chose this method is so that the user doesn't have to learn a new graph. They are already used to looking at this scale in data so it would make it easier to compare all the data points together. I wasn't a fan of connecting points so I used curved vertex lines to portray my data. I also allowed the functionality of scrolling over a piece of data and finding the actual value. Below is the output of my graph.
Since I'm a graduate student, I went a step further than the assignment and calculated the linear regression of each plot I was able to find the effectiveness of least squares estimator. I really liked the interpolator so I made the linear regression animate when changing between data values. The linear regression is more effective when the data is spread out on the entire sheet. For the example of "Tea", it is hard to really tell the linear regression and it looks as though the least squares remains constant though that may not be the case. There are cases where linear regression would be ineffective. The cases would be if the data followed a polynomial pattern. This would make the straight line inconsistent with the noise in the data. In those cases, you would use a curved regression to find the least squares. Below I have an image of the result.

















