Difference between revisions of "Data Visualization"

From California Technology Festival Wiki
Jump to navigation Jump to search
Line 2: Line 2:
  
 
Facilitated by Amanda Hickman - [https://github.com/amandabee github.com/amandabee]
 
Facilitated by Amanda Hickman - [https://github.com/amandabee github.com/amandabee]
 
  
 
Syllabus, Cheat Sheets, etc. [http://amandabee.github.io/CUNY-data-skills/ amandabee.github.io/CUNY-data-skills]
 
Syllabus, Cheat Sheets, etc. [http://amandabee.github.io/CUNY-data-skills/ amandabee.github.io/CUNY-data-skills]
  
 
+
#### Reasons to want data
* Reasons to want data
+
* Focus on a population
** Focus on a population
+
* Evidence for a claim you're making is actually true
** Evidence for a claim you're making is actually true
+
* Reduce harm
** Reduce harm
+
#### When we're talking about data, we're talking about something in a spreadsheet that you can chop up and analyze and such
* When we're talking about data, we're talking about something in a spreadsheet that you can chop up and analyze and such
+
* A PDF is not data yet, because it's basically a picture.
** A PDF is not data yet, because it's basically a picture.
+
** Tabula from Nerd Powerful is a great tool for pulling data from PDF tables
*** Tabula from Nerd Powerful is a great tool for pulling data from PDF tables
+
* Charts and maps are not data because you can't really reverse engineer them
** Charts and maps are not data because you can't really reverse engineer them
+
#### Where to find it
* Where to find it
+
* Librarians. They live for this
** Librarians. They live for this
+
* Open data portals
** Open data portals
+
** Open data is a set of laws that governments are required to put out public data
*** Open data is a set of laws that governments are required to put out public data
+
* Ask for sources on reports and charts that you see
** Ask for sources on reports and charts that you see
+
** If you see a great visualization, then you should call up the authors to try and get their data
*** If you see a great visualization, then you should call up the authors to try and get their data
+
* Academics
** Academics
+
** PhD's have great data that will never be seen because they're academics and thus write unintelligibly
*** PhD's have great data that will never be seen because they're academics and thus write unintelligibly
+
* SF Indicators Project
** SF Indicators Project
+
* Census
** Census
+
* American Community Survey
** American Community Survey
+
* CensusReporter.org
** CensusReporter.org
+
* Bureau of Labor Statistics
** Bureau of Labor Statistics
+
* Community Expenditure Survey
** Community Expenditure Survey
+
* Various public health departments
** Various public health departments
+
* Think tanks
** Think tanks
+
* <span style="font: 14.0px 'Helvetica Neue'">Freedom of Information Act</span> (FOIA) / Freedom of Information Law (FOIL)
** <span style="font: 14.0px 'Helvetica Neue'">Freedom of Information Act</span> (FOIA) / Freedom of Information Law (FOIL)
+
** Muck Rock will help you out on doing a FOIA request.
*** Muck Rock will help you out on doing a FOIA request.
+
*** Check out examples of letters they've sent, and their boiler plate letters
**** Check out examples of letters they've sent, and their boiler plate letters
+
*** They'll alert you about timeline stuff
**** They'll alert you about timeline stuff
+
** There may be costs involved
*** There may be costs involved
+
** This is kinda your last resort
*** This is kinda your last resort
+
** When making a request, ask for the person who's responsible, not just a receptionist. You need someone to hold accountable
*** When making a request, ask for the person who's responsible, not just a receptionist. You need someone to hold accountable
+
* Geo Commons
** Geo Commons
+
** Very open set of geographical data sets
*** Very open set of geographical data sets
+
** Great for polygons like districts
*** Great for polygons like districts
+
* Planning Departments
** Planning Departments
+
* Cicero
** Cicero
+
** Legislation
*** Legislation
+
* Sunlight Foundation
** Sunlight Foundation
+
#### Strategies for getting data and analyzing
* Strategies for getting data and analyzing
+
* If someone tells you they don't have the data, that's not the end. They can help you find it.
** If someone tells you they don't have the data, that's not the end. They can help you find it.
+
* Ask for the name of their database
** Ask for the name of their database
+
* Ask for the specs of the database they're using
** Ask for the specs of the database they're using
+
* Get a lawyer if they claim that giving you data is a threat to homeland security
** Get a lawyer if they claim that giving you data is a threat to homeland security
+
* Ask for possible costs up front
** Ask for possible costs up front
+
* Ask StackExchange
** Ask StackExchange
+
* NICAR
** NICAR
+
#### Working with Data
* Working with Data
+
* Provenance matters
** Provenance matters
+
** Especially if you're using open data from places like Geo Commons. You need to understand where the data came from
*** Especially if you're using open data from places like Geo Commons. You need to understand where the data came from
+
* Not everything should be data
** Not everything should be data
+
#### Tools
* Tools
+
* Hard
** Hard
+
** R + RStudio
*** R + RStudio
+
** QGIS
*** QGIS
+
* Medium
** Medium
+
** [https://cartodb.com/ Carto DB]
*** [https://cartodb.com/ Carto DB]
+
** [http://www.highcharts.com/ High Charts]
*** [http://www.highcharts.com/ High Charts]
+
** Mapbox
*** Mapbox
+
** D3
*** D3
+
* Easy
** Easy
+
** [https://quartz.github.io/Chartbuilder/ Chart Builder]
*** [https://quartz.github.io/Chartbuilder/ Chart Builder]
+
* Geo Coders
** Geo Coders
+
** Texas A&amp;M has a good tool
*** Texas A&amp;M has a good tool
+
** [https://github.com/amandabee/CUNY-data-storytelling/wiki/Tip-Sheet:-Geocoding#promising-rumors Check her site]
*** [https://github.com/amandabee/CUNY-data-storytelling/wiki/Tip-Sheet:-Geocoding#promising-rumors Check her site]
 

Revision as of 20:22, 3 August 2015

Data Visualization

Facilitated by Amanda Hickman - github.com/amandabee

Syllabus, Cheat Sheets, etc. amandabee.github.io/CUNY-data-skills

        1. Reasons to want data
  • Focus on a population
  • Evidence for a claim you're making is actually true
  • Reduce harm
        1. When we're talking about data, we're talking about something in a spreadsheet that you can chop up and analyze and such
  • A PDF is not data yet, because it's basically a picture.
    • Tabula from Nerd Powerful is a great tool for pulling data from PDF tables
  • Charts and maps are not data because you can't really reverse engineer them
        1. Where to find it
  • Librarians. They live for this
  • Open data portals
    • Open data is a set of laws that governments are required to put out public data
  • Ask for sources on reports and charts that you see
    • If you see a great visualization, then you should call up the authors to try and get their data
  • Academics
    • PhD's have great data that will never be seen because they're academics and thus write unintelligibly
  • SF Indicators Project
  • Census
  • American Community Survey
  • CensusReporter.org
  • Bureau of Labor Statistics
  • Community Expenditure Survey
  • Various public health departments
  • Think tanks
  • Freedom of Information Act (FOIA) / Freedom of Information Law (FOIL)
    • Muck Rock will help you out on doing a FOIA request.
      • Check out examples of letters they've sent, and their boiler plate letters
      • They'll alert you about timeline stuff
    • There may be costs involved
    • This is kinda your last resort
    • When making a request, ask for the person who's responsible, not just a receptionist. You need someone to hold accountable
  • Geo Commons
    • Very open set of geographical data sets
    • Great for polygons like districts
  • Planning Departments
  • Cicero
    • Legislation
  • Sunlight Foundation
        1. Strategies for getting data and analyzing
  • If someone tells you they don't have the data, that's not the end. They can help you find it.
  • Ask for the name of their database
  • Ask for the specs of the database they're using
  • Get a lawyer if they claim that giving you data is a threat to homeland security
  • Ask for possible costs up front
  • Ask StackExchange
  • NICAR
        1. Working with Data
  • Provenance matters
    • Especially if you're using open data from places like Geo Commons. You need to understand where the data came from
  • Not everything should be data
        1. Tools