NEW State Assessment Data Released from the Education Data Center- Version 2.1
2024 state assessment data released for all states, D.C., Puerto Rico + updated files
Hello everyone!
To everyone who subscribes here for updates on state assessment data, you may know that the dataset we use to produce our test score briefs comes from our State Assessment Data Repository at the Education Data Center. Today, we are excited to announce the release of the newest version of the data repository, Version 2.1!
New Version 2.1 Data Highlights
A few highlights of the newly-released data are included below. Please visit our Data Changelog for a full list of state-specific updates.
NEW to Version 2.1
New 2024 data. The repository now includes 2024 assessment data for ALL states. Newly-added 2024 datasets include Hawaii, Maine, Montana, New Mexico, Vermont, Virginia.
New subgroup data. New subgroup data has been added across several states, including Georgia, Iowa, Missouri, and Pennsylvania.
Additional data disaggregation by grade level. New data added for New Mexico disaggregated by grade x subgroup for the "regular" assessment only for the first time (2017-2024). New data disaggregated by grade level has been incorporated at the state level for West Virginia (2018-2024).
Greater scope. Puerto Rico is now included in the repository. New data has been added for the Colorado Grades 3-4 Spanish Language Arts assessment from 2021 onward.
And many more updates! The repository includes many other improvements, such as updated data for Virginia, Illinois, Kentucky, Wyoming, and more.
In Pursuit of Data
We continue to pursue data requests for areas where we have identified gaps in state reporting. For example, DC did not disaggregate results by grade level for the 2024 DC Science Assessment. For these results and others, we hope that such data will be available in the future for improved understanding of student outcomes over time.
Access the Data
There are two primary ways to access the data files:
You may access the raw data files here: https://www.zelma.ai/data. This page also documents data availability by state, subject, and year (including what years are missing from state data); new V2.1 technical documentation; and the new V2.1 codebook. You may choose to download the full dataset, or select individual states and years.
You may access the data via our API, using R, Stata, or Python. This allows you to easily load the full V2.1 dataset into your software program.
We value hearing from users about public projects or research using these data – we appreciate seeing the data in use and this also helps us as we seek ongoing grant support and funding.
Use our AI Assistant, Zelma, to Explore the Data
Zelma is the EDC’s AI-powered research assistant that can help answer your assessment-related questions as part of the State Assessment Data Repository. Version 2.1 data are now used for all Zelma queries. Try it out! If there are data you do not see, let us know at info@eddatacenter.org and we will do our best to explain why or try to include the data in future data releases.
Below are some examples of the new data available to query.
Display 1: Zelma can help you explore subject-area trends by state
ELA and Math Proficiency Rates in Tennessee between 2016-17 and 2023-24
Display 2: Zelma can help you explore results by race and subject.
ELA Proficiency Rates in Minnesota by Race in 2023-24
Background: Education Data Center
At the EDC, we are committed to improving timely education data access and transparency. In December 2023, we launched the State Assessment Data Repository in an effort to make state assessment data more widely accessible and engaging for the general public. This project grew from the need across organizations, education leaders, policymakers, researchers, and parents to be able to more easily find and use annual state summative assessment data. This data repository has grown to be the most comprehensive database of state assessment data in the United States. We believe that timely data access and transparency are critical for stakeholders to be able to make evidence-based decisions on how to support our nation's students and address their academic needs.
What is the State Assessment Data Repository?
The State Assessment Data Repository is a comprehensive U.S. state assessment database that includes publicly-available assessment data from all 50 states and D.C. for students in Grades 3-8. This database integrates data across state-, district-, and school- levels, disaggregated by subject, grade level and student subgroups. The SADR includes data received from State Education Agencies (SEAs) via public data request for components that states have not posted publicly, such as data disaggregated by grade or demographic characteristics. The data do not include any student-level data. We also integrate important school and district identifiers from the National Center for Education Statistics (NCES) to support researchers seeking to understand student outcome data across a range of external datasets.
Please help us spread the word - consider emailing this email to a friend, or subscribe for data updates!
As always, you may send your comments, questions or feedback to info@eddatacenter.org. We hope to hear from you!



