I made a few updates to the percent public schools with outbreak cases map I posted earlier today. I added information in the tooltips about the number of public schools and the number of outbreak-associated cases. For notes on the data and the process please see the earlier post.

Continuing Learning How to Code

I still have not learned how to make a timeline map to show outbreaks over time, although I think I am on the way to figuring it out. I watched the beginning of an 11 hour YouTube tutorial with that end in mind. Since I really want to learn the first principles of coding data visualization I want to take my time. I want to really understand how to build a website using HTML, CSS, and javascript rather than just educated guesses and trial and error (although I am pretty good at that type of coding when I need to make something work). I believe that the tutorial uses the D3 library for visualizations, but I have not gotten there yet. I am interested in learning D3 as I have read that it is very customizable. Although I do not think I have gotten close to the edge of the interesting visualizations I can make with Datawrapper and Amcharts.

Percent Public Schools with Active OUtBreak Map

I really like the outbreak maps I have made showing the locations of schools both public and private with a COVID-19 outbreak, as defined by the Maryland Health Department. However, these maps quickly get crowded and are kind of hard to read unless you zoom in closely. It is also hard to compare counties because counties are not all the same density. Also, the outbreak map by school takes a while to set up because I need to find the street address for each school. I have plans to use the vlookup feature in Excel to make match the names in the Maryland Health Department list, but most of the names do not match exactly so I expect that might take a while.

So, I decided to make a map showing the percentage of public schools with an active outbreak. I got the number of schools from the 2021 NCES School Directory list on the Maryland Report Card website. I had previously coded schools from the Maryland Health Department list from September 1, 2021, as public or private. I coded the list based on googling, I should probably check the list against the NCES School Directory, but I have not done that yet. Then I divided the total number of public schools by the number of schools with an outbreak listed. This was fairly quick, the most time-consuming part was coding the schools as public or private.

It should be noted that many public school systems have not opened yet as of September 1, 2021.

I like many things about this map. The size and specific location of outbreaks are lost, but it gives a general idea of the outbreaks in public schools in the State. I think for my own personal use I would want to see both the location-specific map showing cases per school and this map.

Notes about the Data

These notes are copied from the Maryland Department of Health website.

Note: This dataset reflects public and non-public K-12 schools in Maryland that have COVID-19 outbreaks. Data are based on local health department reports to MDH, which may be revised if additional information becomes available. This list does not include child care facilities or institutes of higher education. Schools listed meet 1 or more of the following criteria:

Classroom/cohort outbreak definition:    1) At least two confirmed COVID-19 cases among students/teachers/staff within a 14-day period and who are epidemiologically linked, but not household contacts; or

School-wide outbreak definition:    

2) Three or more classrooms or cohorts with cases from separate households that meet the classroom/cohort outbreak definition that occurs within 14 days; or
 3) Five percent or more unrelated students/teachers/staff have confirmed COVID-19 within a 14 day period (minimum of 10 unrelated students/teachers/staff).

Cases reported reflect the current total number of cases. Schools are removed from the list when health officials determine 14 days have passed with no new cases and no tests pending. Archival data is available through the COVID-19 open data catalogue. These data are updated weekly on Wednesdays during the 10 a.m. hour. MDH is continuously evaluating its data and reporting systems and will make updates as more data becomes available.

I pulled data on September 1, 2021, to make the above map. I provided the information on whether a school is public or private. All errors are my own. This map is not official, it is primarily for the purpose of me exploring the visualization features and keeping track of what I learned.

I have not yet figured out how to make a map with a timeline, so here is a map showing the outbreak-associated cases in schools as reported by the Maryland Department of Health on September 1, 2021. Please refer to my first outbreak map for notes on the data. As with the other map the goal is for me to learn about the mapping software and data visualization using current data. The data and locations have not been checked.

In line with my interest in masking policies in public schools, I decided to map outbreak-associated cases in schools as reported by the Maryland Department of Health to check out the geolocation mapping feature of Datawrapper. As far as using the geolocation, it was super easy. I googled the address and pasted it into the correct box. I assume that the locations are generally accurate. It seems less precise than ArcView, but it seems to work well for when you are just trying to get a sense of the data. It did not that too long to enter the data for this number of locations. It would take a while if there were more locations. It would be much quicker if I had a spreadsheet with all the addresses of schools in the State pre-made and ready to go. I believe that there might be an official list of the public schools available. If I were to do this in the future I would explore using that existing spreadsheet.

I wish that I could add a choropleth map behind the location map to add another level of data details. I know it can be done with Arcview. But what you gain in usability you lose in features.

If you are wondering like I was about the outbreak at Quince Orchard when the school is not even open yet. The answer was found in this news article. Apparently, there was an outbreak in the football team which I assume has started practicing for the fall season.

Notes About the Data

These notes are copied from the Maryland Department of Health website.

Note: This dataset reflects public and non-public K-12 schools in Maryland that have COVID-19 outbreaks. Data are based on local health department reports to MDH, which may be revised if additional information becomes available. This list does not include child care facilities or institutes of higher education. Schools listed meet 1 or more of the following criteria:

Classroom/cohort outbreak definition:    1) At least two confirmed COVID-19 cases among students/teachers/staff within a 14-day period and who are epidemiologically linked, but not household contacts; or

School-wide outbreak definition:    

2) Three or more classrooms or cohorts with cases from separate households that meet the classroom/cohort outbreak definition that occurs within 14 days; or
 3) Five percent or more unrelated students/teachers/staff have confirmed COVID-19 within a 14 day period (minimum of 10 unrelated students/teachers/staff).

Cases reported reflect the current total number of cases. Schools are removed from the list when health officials determine 14 days have passed with no new cases and no tests pending. Archival data is available through the COVID-19 open data catalogue. These data are updated weekly on Wednesdays during the 10 a.m. hour. MDH is continuously evaluating its data and reporting systems and will make updates as more data becomes available.

I pulled data at noon on August 25, 2021, to make the above map. As I discuss locations are approximate. I provided the information on whether a school is public or private. All errors are my own. This map is not official, it is primarily for the purpose of me exploring the visualization features and keeping track of what I learned.

As the masking requirements for public schools changed so does my map. This time I figured out how to link sources in the tooltips. Tooltips are apparently what you call the hover-over information. I like having clear sources for my visualizations it is important that I can retrace my steps and I would like others to be able to retrace my steps. This is especially important because I work with so many sources of data and my products often have a quick turnaround time.

Sometimes I get to use my new skills to build something that is actually useful for real people. I made this map for the Maryland General Assembly to help keep track of the State’s public schools masking requirements. As a nonpartisan analyst my job is to provide clear nonpartisan information. I love how easy this map was to make on the technical side. I also like that I could add tooltips easily.

Update: Here is proof that someone important found the map useful.

Remiation Rates of Recent Maryland Public High School Graduates Enrolled at Maryland Public Institutions

Today I take a look at the remediation rates of recent Maryland public high school graduates enrolled at Maryland public institutions as published by the Maryland Higher Education Commission in their 2021 Data Book. According to data published by the Maryland State Department of Education, 57,622 students graduated from Maryland public high schools during the 2017-2018 academic year. According to MHEC’s remediation data set, 25,575 students enrolled at a Maryland public institution in fall 2018. Thus, less than half, 44%, of the public school graduates are reflected in the map below. So, the data may tell us less about the quality of public schools than first thought. I will have to see if I can drill down further to this data. My first step will be to look at the number of high school graduates from each county; however, since the percentage of students who are low-income is different in each county, the college-going behavior is likely not consistent.

I was a little surprised to see that 25,575 students enrolled in a public institution immediately after high school because according to the college pipeline data published by the Maryland Longitudinal Data System Center, 17,410 students enrolled in a public institution immediately after high school. I wonder if MLDS is better able to separate students enrolled in multiple institutions. Further, the MLDS dataset only is capturing “degree-seeking” students.

Notes About the Data

  • Students may be enrolled at more than one institution. They are included in enrollment figures for each insituiton at which they are enrolled.
  • Data include all degree- and non-degree seeking students enrolled in credit courses.
  • Maryland residents are identified using their place of residence at the time of application to the insituion.
  • Maryland public instiutitions include community colleges and four-year public colleges and universities.
  • Recent Maryland public high school graduates are defined as those graduating from a Maryland public high school, identitifed using the College Board School Code, who graduated in the 2017-2018 academic year and first enrolled in higher education in Maryland in fall 2018. Analysis relies on high school graduation date and reporting of remedial assesment data; missing data for these variable may result in underreporting.
  • Maryland residents whose county of residence is unknown are included in insitutuional-level remedial data, but excluded from reporting by county of residence.
  • Salisbury University and St. Mary's College of Maryland do not offer remedial coursework.

More Than 80% of the Student BODY At Most COmmunity COlleges are Maryland Residents

According to the Maryland Higher Education Commission, 91.6% of Maryland community college students are Maryland residents. This is not surprising at all as community college students tend to attend their local community college. From looking at a data table I was able to see that more than 80% of the student body at most community colleges are Maryland residents. However, only 55.4% of students at Allegany College of Maryland are Maryland residents. With that information and a bit of curiosity about using Datawrapper for mapping, I decided to build a map showing the location of the main campus of each community college and the percent of students that are Maryland residents.

Building the map was relatively simple. I used google to look up the address for the main campus address for each of the 16 community colleges in the State. I then pasted that information into the program and added typed in the Maryland resident information for each college. From my basic sense of the locations of the colleges, the placements look accurate, but I have not checked them.

I struggled with how to best visualize the data since most of the colleges have more than 80% of the student body being Maryland residents. Reducing the number of color categories to three really helped with this issue. It highlights that Allegany College of Maryland is an outliner and Hagerstown Community College is almost an outliner. I probably should have either rounded the numbers to make Hagerstown Community College 80% and reduced the categories to two colors, but I think it is a little fun to see that Hagerstown Community College does not quite meet the 80% requirement. I could also say something such as all but one community college has 79% or more of its student body as Maryland residents. If I was highlighting Allegany College for some reason I would probably do one of those options. But since I am just playing around with the data to see what jumps out at me, I have not done that this time.

After I mapped the data I saw that Allegany College is very close, less than 3 miles, from the West Virginia border. It is also close to the Pennsylvania border. There are other community colleges that are not far from the State borders, but these colleges seem to draw a smaller percentage of non-Maryland residents. Perhaps if I put the color break at a higher percentage and treated Allegany College less like an outliner that story would become clearer, but I have not yet tried that.

WHat I learned from this Mapping Experience

  • Adding location markers in Datawrapper is easy, but can be time consumming
  • Datawrapper is significantly easier to use than other mapping software such as ArcView, but the data crunching functionality is less
  • I have not yet figured out how to show just one of Maryland's counties in a map
  • When most of the data is in the same range, but there are outliners, the data can be hard to visualize
  • I haven't decided how to handle colleges with mulitple campuses when mapping. Representing the "main" campus seems to be the best way for now.
  • Color breaks can change the data story

Ploting the Data

Yesterday I looked at SAT scores in Maryland by county and income and for the incoming freshman classes of the public four-year institutions. Today I take a brief look at SAT scores by county and race/ethnicity from the same dataset published by the Maryland State Department of Education. I made a dot plot using Datawrapper with all the race/ethnicity categories available in the data set as well as “all students”. To be honest, the chart looks very busy and is rather hard to read. I changed the color scheme to reds and oranges to aid with distinguishing the categories, but it only helped a little. I could choose custom colors for each group, and would if I was intending to show this data to a wider audience, but since this is primarily for my own exploration of the data I decided I did not have the energy to make those choices today. I did decide to highlight the “all student” category to help with readability a bit.

An alternative visualization, and the one I have seen used at State Board of Education meetings, is a grouped bar chart. While I think that would work for smaller numbers of counties or race/ethnicities, I think that it is worst than the dot plot for a large amount of data. However, I may explore this visualization in the future.

SAT SCores by county and Race/Ethnicity

It is hard to draw any conclusions from this data. For one thing, I am unsure if this data represents public school students or all students who took the test from that county. For another thing, not all students take the SAT and different local school systems have different policies about pushing students to take the SAT.

Howard County stands out as having very high scores, for students of all races. From the data, I do not know if Howard County encouraged only high-performing students to take the SAT. I would be surprised if that was the case, but it is a possibility. Since there is information about the number of students that took the test, I might be able to infer the policy from that data or I might look at their website to see if they have a SAT policy.

Note: I likely will not be posting for a while as I will be on vacation.

Context

I have been reading through the Maryland Higher Education Commission’s Annual Data Book 2021. I am interested to see if I can learn any new insights by exploring different visualizations of the data. I am also teaching myself how to use data visualization/graphing software. Today I am taking a look at SAT scores statistics published in the databook as well as additional SAT scores published by the Maryland State Department of Education. The populations for all of these datasets are slightly different.

SAT Scores of Entering for High School Seniors 2020

According to the databook, the mean combined evidence-based reading and writing score and math for Maryland high school seniors in 2020 was 1029, slightly less than the nationwide mean of 1051. I have read in the past this is because a higher percentage of students in Maryland take the SAT than nationally, but I will have to find a citation that is true.

SAT Scores By COunty

On the Maryland Report Card, the Maryland State Department of Education publishes data about students. As part of the "college readiness data", average SAT scores are published. In addition to an average for all students from the local school system, the scores are disaggregated by a number of subpopulations including by low-income students, that is those eligible for free or reduced price meals. Low-income students scored lower on the SAT than all students in every county. However, the average scores were nearly identical for students from Dorchester County. At the SEED school, where almost all students are low-income, low-income students scored higher than all students.

There is a large range of scores between counties and between low-income students and all students in most counties. Howard County has the highest average score for all students and the second-highest average score., after Carroll County, for low-income students. In fact, low-income students from Carroll and Howard counties had a higher average score than the average score for all students from 14 counties, Baltimore City, and the SEED school.

I plan to examine the data for more subpopulations in the future. Including the best way to visualize the data.

SAT SCores of Entering Freshmen

The MHEC databook publishes the average, 25th percentile, and 75th percentile SAT scores of entering freshmen at the public four-year institutions. Institutions submit aggregated data on average SAT scores for all incoming freshmen. Some institutions do not require SAT scores for admission. Institutional score ranges are based upon those scores that were used as a basis for admitting students to the institution.

This chart shows the SAT scores of entering freshmen for the University of Maryland, College Park from the 25th to the 75th percentile