On three datasets related to diabetes, obesity, and inactivity, I conducted a correlation study. The analysis showed a significant link among all three datasets, with the FIPS code acting as the tying element. I combined these three datasets into a single Excel spreadsheet in order to conduct a more thorough analysis after realizing the necessity for one.
I created a piece of code to integrate the three datasets, and it revealed 356 shared data points. The Excel file was then cleaned up, which entailed dealing with duplicate columns including data on the county, state, and year. I eliminated these columns to make the data more readable. I also increased the readability of the dataset by renaming particular columns and changing column widths to aid with data visualization.
Additionally, I am eager to defend the facts and ensure that the information is accurate. I am examining many tests, like the T-test and Bruesch-Pagan Test, to achieve this. Even though I have a preliminary T-test code, I have not yet tested it on the dataset.