Contents (hide)
Assigned: Thursday, Oct 25, 2012
Due: Tuesday, Nov 20, 2012 (3 1/2 weeks)
You may work in groups (2 persons max) on this assignment. You may work with your final project partner or form a different group for this assignment.
The purpose of this assignment is to introduce you to some real-world problems and allow you to come up with an interactive visualization to address one of them.
Dataset: 2000 voter registration records, different dates, different amounts of added error (see me for the datafiles)
Our colleagues at UNC have gathered voter registration data (~2000 records) from the same zip code. Voter registration data is updated weekly, and we have datasets from two different dates (July 9, 2012 and Aug 29, 2012).
Develop an interactive visualization that would assist an analyst in working with this data. Specifically, your visualization should allow an analyst to complete the following tasks:
Your visualization should be able to help the analyst with these tasks for all versions of the datasets:
Note about the error: New (10/26 -MCW)
For 10% error (30% duplicate error), if there are 100 total records with 20 duplicates, then 10 errors introduced in any row (duplicate or not). Then 6 additional errors (30% of 20) were generated in the duplicate records.
Here's a bit more information about the voter data - HW5-voterdata
It is suggested that you use a tool such as the CDC's LinkPlus (Windows) to perform the record linkage and focus your efforts on visualizing the results of the linkage.
Dataset: 700k+ records in a Microsoft Access database (see me for the datafile)
The goal of the Navy Hearing Conservation Program is to protect hearing and prevent hearing loss.
Develop an interactive visualization that allows the doctors to analyze at least the following factors (add more if you find interesting relationships):
Here's a bit more background information about the program and how the data was collected.
Audiogram
Decibels
Noise Induced Hearing Loss (NIHL)
Develop an interactive visualization targeted towards the Apps4VA competition. You do not have to enter the competition, but you are welcome to do so (deadline is Nov 15).
Competition website - http://www.apps4va.org/apps4va-open-competition.html
Detailed instructions - http://www.weebly.com/uploads/1/1/1/0/11104538/overview_entry_instructions091012.pdf
Data sets - http://www.apps4va.org/data.html
If you choose this option, you must get approval from me (soon!) before starting.
Put an electronic version online at http://www.cs.odu.edu/~username/cs795f12/hw5.html. Include a link to your report on the web page and submit a hard-copy in class on the due date.
You must also write a report (posted on the webpage), detailing how you developed the visualization and how the visualization can be used. Describe what you did for all 7 steps of data visualization: acquire, parse, filter, mine, represent, refine, interact. (See Intro to Info Vis lecture and Visualizing Data text for more info on the steps.) If you chose the Record Linkage project, you must also include answers to the 4 tasks (provide screenshots that indicate the answer) and discuss how you can use your visualization to get the answer.
I will grade the assignment based on the quality of the developed visualization and how well it would help an analyst complete the stated tasks. I will also grade the quality of your report.