As I noted in a previous post, APDA is finalizing data for a new report, a follow-up to the report released in August 2015. We hope to include data on graduates with no listed placements and on Carnegie Classifications, among other improvements. We aim to release the new report by April 15th so that it can be useful to those who have applied to graduate programs this year. (Until then, editing on the site has been turned off so that we can verify and analyze the data. We will turn editing back on in May, when we launch a new feature that allows for individual editing.)
In preparation for that report, I have been trying to determine the best way of displaying our data. I am attaching four DRAFT images that present data for 104 universities using pie charts, one each for gender, AOS, job type, and graduation year. The gender and AOS charts use data from APDA alone, whereas the job type and graduation year charts also use graduation information from outside APDA, discussed in this post. I used pie charts because they are visually intuitive and I want the data to be as accessible as possible. I used suggestions from this post to help avoid some common criticisms of pie charts. (Note: I tend to analyze data in R, using ggplot2 for graphs; I provide the R code below for anyone with expertise in this area.) At the top left of each image are the data for the full set of 104 universities. (Universities are included only if we have both an external source of graduation data and placement records for that university with recorded graduation years in this time period, 2012-2015.)
I am looking for feedback on these charts. Are they easy to understand? Are there alterations that would be beneficial? Two other options, with images below: 1) Replace pie charts with bar graphs (one sample version below). 2) Make university-specific sets of charts. (This is more time-intensive than option 1.)
Note also: We aim to release tables and regression analyses, as we did last time, and any images we release will be in addition to that work. Your input is welcome!
(Click the images for full-size versions.)
Gender:
AOS:
Job Type:
> library(ggplot2)
> # df4 is assumed to hold one row per university and graduation year, with a precomputed Percentage column.
> # A pie chart is a stacked bar drawn in polar coordinates; facet_wrap() gives one pie per university.
> p = ggplot(data=df4, aes(x=factor(1), y=Percentage, fill=Year)) + geom_bar(stat="identity", width=1) + facet_wrap(~ University) + coord_polar(theta="y") + xlab("") + ylab("") + theme(axis.ticks = element_blank(), axis.text.y = element_blank(), axis.text.x = element_blank()) + ggtitle("APDA Records and External Graduation Data: Graduation Year as Proportion of Graduates 2012-2015")
> png(file="year.png",width=2200,height=2200,res=125)
> print(p)
> dev.off()
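The plotting call above relies on a Percentage column that the code does not show being computed. As a minimal sketch of one way to derive it, assuming a hypothetical `records` table with one row per graduate and toy values rather than APDA data, base R is enough:

```r
# Hypothetical placement records: one row per graduate (toy values, not APDA data).
records <- data.frame(
  University = c("A", "A", "A", "A", "B", "B"),
  Year       = c(2012, 2012, 2013, 2015, 2014, 2014)
)

# Count graduates per university and year; absent combinations get a count of 0.
counts <- as.data.frame(
  table(University = records$University, Year = records$Year),
  responseName = "n"
)

# Convert counts to each university's share of its own graduates.
counts$Percentage <- 100 * counts$n / ave(counts$n, counts$University, FUN = sum)
```

The resulting `counts` table has the University, Year, and Percentage columns the pie-chart code expects, and each university's percentages sum to 100.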
> # Fill colors shared by the university-specific charts.
> pal <- c("#556B2F", "#006400", "#32CD32", "#DCDCDC", "#ADFF2F","#000080", "#DCDCDC", "#0000FF", "#8B0000", "#FF0000", "#DCDCDC", "#4B0082", "#8B008B", "#BA55D3", "#FF00FF")
> # Shared theme: no axis ticks or tick labels, which carry no meaning on a pie chart.
> pie_theme <- theme(axis.ticks = element_blank(), axis.text.y = element_blank(), axis.text.x = element_blank())
> # Assumption: test.csv is replaced with the next university's data before each read.csv() call.
> df3 <- read.csv("test.csv", header = TRUE)
> p3 = ggplot(data=df3, aes(x=factor(1), y=Percentage, fill=Category)) + geom_bar(stat="identity", width=1) + facet_wrap(~ Data.Type) + coord_polar(theta="y") + xlab("") + ylab("") + pie_theme + ggtitle("Baylor, n=12 (AOS, Gender), n=23 (Type, Year)") + scale_fill_manual(values=pal)
> df2 <- read.csv("test.csv", header = TRUE)
> p2 = ggplot(data=df2, aes(x=factor(1), y=Percentage, fill=Category)) + geom_bar(stat="identity", width=1) + facet_wrap(~ Data.Type) + coord_polar(theta="y") + xlab("") + ylab("") + pie_theme + ggtitle("Arizona, n=20 (AOS, Gender), n=26 (Type, Year)") + scale_fill_manual(values=pal)
> df1 <- read.csv("test.csv", header = TRUE)
> p1 = ggplot(data=df1, aes(x=factor(1), y=Percentage, fill=Category)) + geom_bar(stat="identity", width=1) + facet_wrap(~ Data.Type) + coord_polar(theta="y") + xlab("") + ylab("") + pie_theme + ggtitle("Arizona State, n=9 (AOS, Gender), n=10 (Type, Year)") + scale_fill_manual(values=pal)
> png(file="test.png",width=1000,height=2000,res=100)
> # multiplot() is not part of ggplot2; it is assumed here to be the helper from Cookbook for R (also available in the Rmisc package). p4 is not defined in this excerpt.
> multiplot(p1,p2, p3, p4)
> dev.off()
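For option 1 (bar graphs instead of pie charts), here is a rough sketch of how the same kind of data could be drawn with ggplot2. This is not the code behind the sample image. The data frame `df_bar`, its values, and the university names are hypothetical placeholders in the same long format used above.

```r
library(ggplot2)

# Hypothetical toy data: one row per university and graduation year,
# with percentages that sum to 100 within each university.
df_bar <- data.frame(
  University = rep(c("University A", "University B"), each = 4),
  Year       = factor(rep(c("2012", "2013", "2014", "2015"), times = 2)),
  Percentage = c(20, 30, 25, 25, 10, 40, 30, 20)
)

# One bar per graduation year, one panel per university.
# Unlike the pie version, there is no coord_polar() and the y axis stays visible.
p_bar <- ggplot(df_bar, aes(x = Year, y = Percentage, fill = Year)) +
  geom_bar(stat = "identity") +
  facet_wrap(~ University) +
  xlab("") +
  ylab("Percentage of graduates") +
  ggtitle("Graduation year as proportion of graduates (toy data)")
```

Writing `p_bar` to a file would follow the same png(), print(), dev.off() sequence as the pie charts. Bars keep a common baseline within each panel, which may make small differences between categories easier to compare than pie slices.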