Due by 06:00 PM on Monday, March 25, 2019
Choose from one of four datasets.
babynames: US baby name data for each year from 1880 to 2013, the number of children of each sex given each name. All names used 5 or more times are included. 1,792,091 rows, 5 columns (year, sex, name, n, prop). (Source: Social security administration).
fueleconomy: Fuel economy data for all cars sold in the US from 1984 to 2015. 33,442 rows, 12 variables. (Source: Environmental protection agency)
nasaweather: Data from the 2006 ASA data expo. Contains monthly atmospheric measurements from Jan 1995 to Dec 2000 on 24 x 24 grid over Central America. 41,472 observations, 11 variables. (Source: ASA data expo)
nycflights13: This package contains information about all flights that departed from NYC (i.e., EWR, JFK and LGA) in 2013: 336,776 flights with 16 variables. To help understand what causes delays, it also includes a number of other useful datasets: weather, planes, airports, airlines. (Source: Bureau of transportation statistics)
Create a scenario: Start by forming a primary research question. Then develop a scenario to answer that research question.
Read the paper titled “A Nested Model for Visualization Design and Validation”. It explains the design process in detail.
Design a visual analytics system using
shiny. You need to design a visual analytics system to help yourself conduct the tasks you proposed.
Answer the following questions in your design process:
What are the data and tasks?
What can/must be done using data analysis?
What needs to be done interactively?
How to best support the user in conducting the tasks?
What’s the analysis and interaction flow?
What needs to be visualized and how to best visualize it?
The resulting design will be a visualization dashboard. If you can’t figure out how to implement certain recommendations (e.g., certain types of interactions), then it’s okay to make notes. For this project, the goal is on design rather than implementation (though you’ll get a higher grade the more you can implement).
Presentation: Prepare a 7-8 minutes presentation to introduce your dataset, scenario, design process (e.g. your answers to the problems), and your dashboard.
The presentations will be given on March 25 in class. Upload your dashboard to your shinyapps.io account for deployment.
For your presentation, you will need to create slides for your presentation. You may use PowerPoint, Keynote, or try an R-based presentation tool (e.g., ioslides, slidy, or
xaringan). I would strongly recommend
xaringan. In the first page of your slides, write down the names of the team members.
Grading: The instructor grades the design after your presentation according to the following criteria:
- Novelty of your scenario (2 point)
- How well you follow the design process (2 points)
- The novelty of your design (2 points)
- Quality of your dashboard (2 points)
- Quality of the presentation (2 points)
Higher scores will include using a broader range of packages and adding in interactive or animated tools (e.g., enable
htmlwidgets like plotly) when appropriate. However, please note, don’t just add interaction to add more complexity. You will need to justify why interaction or animation was necessary if you use these items.