Datasets
Some datasets may be found in the datsets folder of the Google Drive folder for this course. Links to other possibilities are below. If you find other good resources, please consider adding them to this list. You can choose any dataset you find for your group project, although your final choice must be approved by the instructor.
- Other datasets are available from the Vanderbilt data website or UCI Machine Learning Repository.
- The Add Health website has some public use datasets from the National Longitudinal study of Adolescent to Adult Health.
- Public CDC data on influenza via the FluView site or (more usefully in my experience) the cdcfluview R package.
Following links taken from the ASA Public Health Data Challenge
- The CDC WONDER Multiple Cause of Death (Detailed Mortality) dataset
- Youth Risk Behaviors Surveillance System (CDC)
- Medicare Part D Opioid Prescribing Mapping Tool (CMS)
- Opioid-Related Hospital Use (HCUP)
- Uniform Crime Reporting Program Data Series (ICPSR)
- National Survey on Drug Use and Health (SAMHDA)
- Treatment Episode Data Set-Admissions (SAMHDA)
- Treatment Episode Data Set-Discharges (SAMHDA)
- Drug Abuse Warning Network (SAMHDA)
Let me know if you have other suggestions for datasets or repositories to add to this list.