
Datasets
High-quality, engaging, relatable datasets are essential for data science education, and early research shows that a student’s choice of dataset has a substantial impact on their engagement. Below you will find an array of datasets that are well suited to K-12 education as well as college and university settings. We encourage you to incorporate these ready-to-use datasets into your data science lessons.
Datasets Specification
Coalition members Emmanuel Schanzer (Bootstrap) and Dan Schneider (Code.org) have collaborated with DS4E staff to develop an in-depth specification guide that offers a pathway for individuals to find, clean, document, and upload datasets that can be used in K-12 data science tools.
If you would like to submit datasets, please complete the Google Form linked below and ensure your datasets align with the datasets specification guide. Download PDF here.
(Have a dataset to add?)
14 resources found
Tuva
Tuva creates a K-12 data science ecosystem for math and science that includes its own proprietary data science software, lesson plans, and datasets. The Tuva team also offers both live and recorded workshops to enable teacher training in data literacy education.
Grade(s)
K-5, 6-8, 9-12
Software and Tools
None required
K-12 Datasheet Verified?
Not Yet
What's Going On in This Graph? (NYTimes)
"What's Going On in This Graph?" is a compilation of visual data representations previously published in The New York Times. Each graph is linked to an article with more context on the data's origins and collection methods.
Grade(s)
All
Software and Tools
None required