Explainers Supplementary Material

Explainers: Example Data

Here I provide some of the data used for the experiments described in the paper. This data is provided so that you can understand what I did (and try to replicate it) - not necessarily for you to do new analysis. Please do not rely on it. If you want to use it for something, please contact me.

City Data

For several examples in the paper (e.g. fig1, fig2, fig3), I used a data set about city livability data taken from the "buzzdata.com" website. This data set was provided for a public contest, and I assumed that the web page that described it would be long-lived, and the provenence of the data would be well explained - bad assumption.

The website seems to be gone. And I did not copy down enough information about where they said the data came from.

I include the data file (as I used it), here. Note: we added columns B,C and D (country, continent, region) ourselves, so we may have made mistakes. Of course, I have no real idea where the data came from - for all I know, they just made these numbers up. I also don't know if they had any copyright restrictions on the data. If they do, I will take it down.

cities: cities.csv

If you know anything about this data set, please let me know!

Docuscope Data

Most of the examples in the paper (all of the literature examples in the paper) use data that comes from Docuscope, a text tagging system from CMU. You can learn about Docuscope from their website. My literature collaborators (Mike Witmore, Jonathan Hope, Robin Valenza) have been using Docuscope in their work. We had already done tools designed to help support their existing workflow (see our EuroVis 2011 paper).

For the purposes of explainers, you can think of Docuscope as something that turns texts into vectors.

Technically, the data files were not created with the Docuscope program. They were created with "DocuscopeJr", an implementation of Docuscope that I created that uses the actual tagging dictionaries that were created by Docuscope's creators. (DocuscopeJr was done with their help and support). We will provide the successor to DocuscopeJr (called Ubiquity) some time in the future.

Here are the DSJ output files for some of the examples in the paper.

Explainer Outputs

Mainly for students who are trying to re-create (or re-design) the pictures in my classes, here are the explainers figures and the numbers used to make them: