CAASPP Test Results

California Assessment of Student Performance and Progress

cde.ca.gov-caaspp-2015e-2.1.1. Modified 2022-08-09T20:25:49

Resources | Packages | Documentation| Contacts| References| Data Dictionary

Resources

Documentation

This dataset is a simple combination of multiple years of California Assessment of Student Performance and Progress (CAASPP) results, for the years 2015 on. The CAASPP was conducted in 2014, but the 2014 file has a different structure from other years ( it’s a “pp” file, see below) so it is excluded from this package.

Download Structure

The Base Download documentation link points to an FTP-like web directory of files, which contain the downloadable files. The other download pages have links to the files in these directories. These directories, as well as the dataset webpages are a bit complex. There seems to be a few different sorts of test result files, but there isn’t a clear distinction between them. These fiels include:

  • “Paper Based” results , for 2014 and 2015
  • Tests in Spanish, marked “STS” for 2016 and 2017
  • Files with code ‘pp’. Some ‘pp’ files have additional codes ‘p2’ or ‘p3’
  • Files with ‘sb’, often also with ‘p2’ or ‘p3’

I think ‘sb’ means ‘Smarter-Balanced’.

The main page for accessing test results, https://caaspp.cde.ca.gov/, has an inconsistent set of links to result pages for each year. For some years, the links are to “STS” results, others are to “Paper-Based” and for 2018, the links are to the “STS” and “Smarter Balanced” results.

Versions

  1. Initial version
  2. total_tested_with_scores datatype changed from ‘unknown’ to ‘integer’ with ‘robust_int’ transform

License

The source data, from the California Department of Education, does not explicitly reference a license, but since the top-level website for California indicates that data published on the State of California website is generally in the public domain, we assume it is in the public domain. Derived data included in this package is also in the public domain. Metadata in this package ( contained in the metadata.csv file ) is derived from metadata for the source package, and is also in the public domain.

Programs used for data transformation (contained in the notebooks directory), are Copyright 2021 Civic Knowledge, and are relased under the Creative Commons Attribution-ShareAlike 4.0 International License.

Contacts

Data Dictionary

subgroup_ids | test_ids | entities | subgroups

subgroup_ids

Column NameData TypeDescription
codeinteger
labelstring
groupstring

test_ids

Column NameData TypeDescription
test_idinteger
test_numinteger
test_namestring

entities

Column NameData TypeDescription
county_codeinteger
district_codeinteger
school_codeinteger
fillerunknown
test_yearinteger
type_idinteger
county_namestring
district_namestring
school_namestring
zip_codestring
type_idinteger

subgroups

Column NameData TypeDescription
county_codeinteger
district_codeinteger
school_codeinteger
fillerunknown
test_yearinteger
subgroup_idinteger
test_typestring
total_caaspp_enrollmentinteger
total_tested_at_entity_levelinteger
total_tested_at_subgroup_levelinteger
gradeinteger
test_idinteger
caaspp_reported_enrollmentinteger
students_testedinteger
mean_scale_scorestring
percentage_standard_exceededstring
percentage_standard_metstring
percentage_standard_met_and_abovestring
percentage_standard_nearly_metstring
percentage_standard_not_metstring
students_with_scoresinteger
area_1_percentage_above_standardstring
area_1_percentage_at_or_near_standardstring
area_1_percentage_below_standardstring
area_2_percentage_above_standardstring
area_2_percentage_at_or_near_standardstring
area_2_percentage_below_standardstring
area_3_percentage_above_standardstring
area_3_percentage_at_or_near_standardstring
area_3_percentage_below_standardstring
area_4_percentage_above_standardstring
area_4_percentage_at_or_near_standardstring
area_4_percentage_below_standardstring
total_tested_with_scoresunknown
area_1_percentage_near_standardunknown
area_2_percentage_near_standardunknown
area_3_percentage_near_standardunknown
area_4_percentage_near_standardunknown
total_tested_at_reporting_levelunknown
total_tested_with_scores_at_reporting_levelunknown
students_enrolledunknown
type_idunknown

References

Urls used in the creation of this data package.

Packages

Accessing Data in Vanilla Pandas

import pandas as pd


subgroup_ids_df =  pd.read_csv('http://library.metatab.org/cde.ca.gov-caaspp-2015e-2.1.1/data/subgroup_ids.csv')
test_ids_df =  pd.read_csv('http://library.metatab.org/cde.ca.gov-caaspp-2015e-2.1.1/data/test_ids.csv')
subgroups_df =  pd.read_csv('http://library.metatab.org/cde.ca.gov-caaspp-2015e-2.1.1/data/subgroups.csv')
entities_df =  pd.read_csv('http://library.metatab.org/cde.ca.gov-caaspp-2015e-2.1.1/data/entities.csv')

Accessing Package in Metapack

import metapack as mp
pkg = mp.open_package('http://library.metatab.org/cde.ca.gov-caaspp-2015e-2.1.1.csv')

# Create Dataframes
subgroup_ids_df = pkg.resource('subgroup_ids').dataframe()
test_ids_df = pkg.resource('test_ids').dataframe()
subgroups_df = pkg.resource('subgroups').dataframe()
entities_df = pkg.resource('entities').dataframe()