I have made more treemaps. This time showing transfer students to the University of Maryland, College Park in 2020.

In the first treemap, I show the level 1 names, and the level 2 names are hidden. Clicking on the level 1 names will reveal level 2.

Source: University System of Maryland, IRIS

In the second treemap, I show the level 2 names at first open. Clicking on the level 1 names will focus just on that level.

Source: University System of Maryland, IRIS

I am not sure which treemap style I like the best. I think it will depend on what I want the treemap to show. It might depend on the story I want to tell with the data.

I have found it is a little difficult to read the smaller categories of the treemaps, which might make them less useful. I expanded the height from 300 px to 500 px, but I am thinking of expanding them further to 700 px or 800 px. I also find the smaller categories sort of difficult to click on.

I have not yet figured out how to make a map with a timeline, so here is a map showing the outbreak-associated cases in schools as reported by the Maryland Department of Health on September 1, 2021. Please refer to my first outbreak map for notes on the data. As with the other map the goal is for me to learn about the mapping software and data visualization using current data. The data and locations have not been checked.

Another way to show the information in a divergent bar chart, like the one I posted a few days ago, is in a split bar chart. Datawrapper staff wrote a blog post not recommending divergent bar charts, so they only offer a split bar chart.

I put together this split bar chart to get a better idea of what the Maryland Comprehensive Assessment Program (MCAP) results would look like displayed in this manner. In addition to the five score levels, I added a percent proficient column which is level 4 and level 5 added together. Percent proficient is reported in the official data chart and shown in the official column charts developed by the Maryland State Department of Education for the Maryland Report Card. The percent proficient is shown in green on the right of the centerline in my divergent bar chart.

I’m not sold that a split bar chart is always superior to a divergent bar chart. Especially when the data has categories that are distinctly good or bad and no awkward middle neutral category. However, I like the split bar chart. I especially like that it can be easily sorted by percent proficient, which includes levels 4 and 5. For this data, the percent proficient is more important than the percentage of students at each level.

I find it interesting that the percent proficient ranges from nearly 70% to less than 20%. With such a large gap, it is likely that there are methods that can be learned from the higher-performing counties to increase scores (and hopefully ultimately learning) in the lower-performing counties.

Notes About the Data

  • Values listed as 5.0% are acutally less than or equal to 5.0%. I am not sure if there is a way to show uncertain values in Datawrapper bar charts. In education data with small populations is often repressed. I could probably back into some of these numbers but I have not for this chart as the goal is to look at the data visualization.

In line with my interest in masking policies in public schools, I decided to map outbreak-associated cases in schools as reported by the Maryland Department of Health to check out the geolocation mapping feature of Datawrapper. As far as using the geolocation, it was super easy. I googled the address and pasted it into the correct box. I assume that the locations are generally accurate. It seems less precise than ArcView, but it seems to work well for when you are just trying to get a sense of the data. It did not that too long to enter the data for this number of locations. It would take a while if there were more locations. It would be much quicker if I had a spreadsheet with all the addresses of schools in the State pre-made and ready to go. I believe that there might be an official list of the public schools available. If I were to do this in the future I would explore using that existing spreadsheet.

I wish that I could add a choropleth map behind the location map to add another level of data details. I know it can be done with Arcview. But what you gain in usability you lose in features.

If you are wondering like I was about the outbreak at Quince Orchard when the school is not even open yet. The answer was found in this news article. Apparently, there was an outbreak in the football team which I assume has started practicing for the fall season.

Notes About the Data

These notes are copied from the Maryland Department of Health website.

Note: This dataset reflects public and non-public K-12 schools in Maryland that have COVID-19 outbreaks. Data are based on local health department reports to MDH, which may be revised if additional information becomes available. This list does not include child care facilities or institutes of higher education. Schools listed meet 1 or more of the following criteria:

Classroom/cohort outbreak definition:    1) At least two confirmed COVID-19 cases among students/teachers/staff within a 14-day period and who are epidemiologically linked, but not household contacts; or

School-wide outbreak definition:    

2) Three or more classrooms or cohorts with cases from separate households that meet the classroom/cohort outbreak definition that occurs within 14 days; or
 3) Five percent or more unrelated students/teachers/staff have confirmed COVID-19 within a 14 day period (minimum of 10 unrelated students/teachers/staff).

Cases reported reflect the current total number of cases. Schools are removed from the list when health officials determine 14 days have passed with no new cases and no tests pending. Archival data is available through the COVID-19 open data catalogue. These data are updated weekly on Wednesdays during the 10 a.m. hour. MDH is continuously evaluating its data and reporting systems and will make updates as more data becomes available.

I pulled data at noon on August 25, 2021, to make the above map. As I discuss locations are approximate. I provided the information on whether a school is public or private. All errors are my own. This map is not official, it is primarily for the purpose of me exploring the visualization features and keeping track of what I learned.

The Maryland Comprehensive Assessment Program (MCAP) are the assessments that are used to meet federal Early Student Succeeds Act (ESSA) assessment requirements. MCAP has five score levels, a student who scores at level 4 or 5 is considered proficient on the material. Because of the pandemic, the MCAP was last administered in 2019.

I used data from the Maryland Report Card website to look at the percentage of students who are certified eligible for free and reduced-price meals compared to all students who took the assessment. I choose Math 5 because I wanted a group of students that had been in the school system for some time and to only capture first-time test takers. Also generally in middle school, a subsection of students start taking the Algebra I assessment (because they take Algebra) and thus are not included in the grade-level assessment. The data reports percentages of students, not the number of students. Thus, I can not easily calculate the non-FARMS subgroup, which might be a better comparison group since FARMS students are included in the all students category.

To bring some type of order to the data I reverse sorted it by all students. From a data visualization point of view I think this is an effective way of looking at the data. There is a lot of data, but it is easy to read and all of the counties are on the same chart. I wish I could better highlight the State average data with vertical lines or bolded dots, but I do not think that is an option in this software.

As the masking requirements for public schools changed so does my map. This time I figured out how to link sources in the tooltips. Tooltips are apparently what you call the hover-over information. I like having clear sources for my visualizations it is important that I can retrace my steps and I would like others to be able to retrace my steps. This is especially important because I work with so many sources of data and my products often have a quick turnaround time.

Short answer. Yes, free-and-reduced price status likely impacts college segment of initial enrollment. See the pretty Sankey diagram below that shows the college segment of initial enrollment for high school graduates by free- and reduced-priced meal (FARMS) eligibility status. However, read my data notes below before making any conclusions.

Most notably 47% FARMS eligible high school graduates failed to enroll in any college segment compared to 24% of non-FARMS eligible high school graduates. Also, 10% of FARMS eligible high school graduates initially enrolled in an out-of-state college compared to 24% of non-FARMS eligible high school graduates.

Click here if you can not see the diagram.

Long answer. As an analyst, I can think of a long list of reasons why this data does not answer this question. The first being is that the data source does not list actual numbers, just percentages. So I had to back into the numbers I used in the diagram. However, due to rounding, the numbers do not add up to the proper totals. If this was for real analysis I would try to get the actual numbers. I need to use numbers, even if they are not quite correct to get the diagram to render properly.

But for this project, I am just attempting to see if I can code this type of diagram and getting a general sense of what the diagram would show. On that end, it is a success. I can do the coding, I am sincerely hoping that you can see the diagram above right now. I also think I really like the visual effect. I think it works well when you have a few categories for each node.

Notes about the Data

  • The number of students are estimates from the percentages published by the Maryland Longitdinal Data System Center. I have not fully checked my calculations, this is primarily for me to learn how to create the diagrams using the software. I am also attempting to learn which types of data visulizations I find useful and worth pursuing. I also am trying to better understand the data.
  • Due to rounding in the data source the numbers do not add up to the correct totals. This is just for a general idea of the data, not for specifics.
  • Some notes adapted from those provided by the Maryland Longitudinal Data System Center, all errors are my own.
  • The number number of years following high school graduation impacts the initial postsecondary enrollment numbers. Graduates who enrolled in Private Career Schools or Continuing Education and Training Certificate sequences are not included.
  • This dashboard uses “initial enrollment,” which counts only the first enrollment of a student after graduation. For example, a student graduates high school, enrolls in a summer community college course, and then enrolls in an out-of-state college in the fall. The initial enrollment count for that student is one in-state enrollment. Other methods of counting enrollment may count that student as one in-state and one out-of-state enrollment. Accordingly, when reviewing, and especially when comparing post-secondary enrollment reports, it is important to understand how the enrollments are being counted.
  • The number of high school graduates reported on this dashboard includes: (a) only the counts of 12th grade graduates; and (b) eliminates any duplicate graduation recordsSome numbers are rounded or estimated due to data suppresssion. For general illistrative purposes only.

College of Initial Enrollment

During the 2013-2014 academic year approximately 58,300 students graduated from a Maryland public high school. Of those graduates, approximately 39,900 students had enrolled in college by the following year. The data set captures the higher education segment (community college, public four-year, State-aid independent institution, or out-of-state institution) that a graduate initially enrolls.

As shown in the Sankey diagram, high school graduates enrolled initially as follows: 30% at community college; 17% at a public four-year institution; 3% at a State-aided independent institution; and 19% at an out-of-state institution. An additional 32% of high school graduates did not (yet) enroll in a postsecondary education captured in this dataset.

This data set only captures the college of initial enrollment. For example, if a student enrolls in a community college for a summer class and then enrolls in a public four-year institution in the fall, the student will be recorded as enrolling in community college.

I wonder how the college of initial enrollment for public high school graduates compares to the total enrollment of the various sectors.

Click here if you can not see the diagram.

Source: Maryland Longitudinal Data System Center, Maryland Public High School Graduates with Initial Postsecondary Enrollments, 2013-2014 data

Notes about the Data

  • Some notes adapted from those provided by the Maryland Longitudinal Data System Center, all errors are my own.
  • Due to rounding the numbers do not add up correctly, so this is just for proof of concept if I had the actual numbers.
  • The number number of years following high school graduation impacts the initial postsecondary enrollment numbers.
  • Graduates who enrolled in Private Career Schools or Continuing Education and Training Certificate sequences are not included.
  • This dashboard uses “initial enrollment,” which counts only the first enrollment of a student after graduation. For example, a student graduates high school, enrolls in a summer community college course, and then enrolls in an out-of-state college in the fall. The initial enrollment count for that student is one in-state enrollment. Other methods of counting enrollment may count that student as one in-state and one out-of-state enrollment. Accordingly, when reviewing, and especially when comparing post-secondary enrollment reports, it is important to understand how the enrollments are being counted.
  • The number of high school graduates reported on this dashboard includes: (a) only the counts of 12th grade graduates; and (b) eliminates any duplicate graduation records
  • Some numbers are rounded or estimated due to data suppresssion. For general illistrative purposes only.

AMCHARTS Venn Diagrams

Earlier I wrote about trying to make a Venn diagram to show that high school graduates that meet University System of Maryland (USM) requirements and Career Technology Education (CTE) requirements were a subset of all high school graduates. I could not get the Venn diagram to render correctly. Today I figured out the correct syntax. I had to define areas in the Venn diagram that I did not actually want to show. That is, I had to define “USM HS Grads” and “CTE HS Grads” and “Both Requirements Grads” even though I do not want them to render separately in the chart. I know that is not the best explanation, but I want to just quickly make this diagram live to see if how it works on a live webpage. I will revisit this Venn diagram again soon with further refinements.

Click here if you can not see the diagram.

Source: Maryland Department of Education, High School Completion Data, 2019

Remiation Rates of Recent Maryland Public High School Graduates Enrolled at Maryland Public Institutions

Today I take a look at the remediation rates of recent Maryland public high school graduates enrolled at Maryland public institutions as published by the Maryland Higher Education Commission in their 2021 Data Book. According to data published by the Maryland State Department of Education, 57,622 students graduated from Maryland public high schools during the 2017-2018 academic year. According to MHEC’s remediation data set, 25,575 students enrolled at a Maryland public institution in fall 2018. Thus, less than half, 44%, of the public school graduates are reflected in the map below. So, the data may tell us less about the quality of public schools than first thought. I will have to see if I can drill down further to this data. My first step will be to look at the number of high school graduates from each county; however, since the percentage of students who are low-income is different in each county, the college-going behavior is likely not consistent.

I was a little surprised to see that 25,575 students enrolled in a public institution immediately after high school because according to the college pipeline data published by the Maryland Longitudinal Data System Center, 17,410 students enrolled in a public institution immediately after high school. I wonder if MLDS is better able to separate students enrolled in multiple institutions. Further, the MLDS dataset only is capturing “degree-seeking” students.

Notes About the Data

  • Students may be enrolled at more than one institution. They are included in enrollment figures for each insituiton at which they are enrolled.
  • Data include all degree- and non-degree seeking students enrolled in credit courses.
  • Maryland residents are identified using their place of residence at the time of application to the insituion.
  • Maryland public instiutitions include community colleges and four-year public colleges and universities.
  • Recent Maryland public high school graduates are defined as those graduating from a Maryland public high school, identitifed using the College Board School Code, who graduated in the 2017-2018 academic year and first enrolled in higher education in Maryland in fall 2018. Analysis relies on high school graduation date and reporting of remedial assesment data; missing data for these variable may result in underreporting.
  • Maryland residents whose county of residence is unknown are included in insitutuional-level remedial data, but excluded from reporting by county of residence.
  • Salisbury University and St. Mary's College of Maryland do not offer remedial coursework.