Difference between revisions of "Data Visualization"

From California Technology Festival Wiki
Jump to navigation Jump to search
Line 1: Line 1:
=== Data Visualization ===
 
 
 
Facilitated by Amanda Hickman - [https://github.com/amandabee github.com/amandabee]
 
Facilitated by Amanda Hickman - [https://github.com/amandabee github.com/amandabee]
 
 
Syllabus, Cheat Sheets, etc. [http://amandabee.github.io/CUNY-data-skills/ amandabee.github.io/CUNY-data-skills]
 
Syllabus, Cheat Sheets, etc. [http://amandabee.github.io/CUNY-data-skills/ amandabee.github.io/CUNY-data-skills]
  
#### Reasons to want data
+
* Reasons to want data
* Focus on a population
+
** Focus on a population
* Evidence for a claim you're making is actually true
+
** Evidence for a claim you're making is actually true
* Reduce harm
+
** Reduce harm
#### When we're talking about data, we're talking about something in a spreadsheet that you can chop up and analyze and such
+
* When we're talking about data, we're talking about something in a spreadsheet that you can chop up and analyze and such
* A PDF is not data yet, because it's basically a picture.
+
** A PDF is not data yet, because it's basically a picture.
** Tabula from Nerd Powerful is a great tool for pulling data from PDF tables
+
*** Tabula from Nerd Powerful is a great tool for pulling data from PDF tables
* Charts and maps are not data because you can't really reverse engineer them
+
** Charts and maps are not data because you can't really reverse engineer them
#### Where to find it
+
* Where to find it
* Librarians. They live for this
+
** Librarians. They live for this
* Open data portals
+
** Open data portals
** Open data is a set of laws that governments are required to put out public data
+
*** Open data is a set of laws that governments are required to put out public data
* Ask for sources on reports and charts that you see
+
** Ask for sources on reports and charts that you see
** If you see a great visualization, then you should call up the authors to try and get their data
+
*** If you see a great visualization, then you should call up the authors to try and get their data
* Academics
+
** Academics
** PhD's have great data that will never be seen because they're academics and thus write unintelligibly
+
*** PhD's have great data that will never be seen because they're academics and thus write unintelligibly
* SF Indicators Project
+
** SF Indicators Project
* Census
+
** Census
* American Community Survey
+
** American Community Survey
* CensusReporter.org
+
** CensusReporter.org
* Bureau of Labor Statistics
+
** Bureau of Labor Statistics
* Community Expenditure Survey
+
** Community Expenditure Survey
* Various public health departments
+
** Various public health departments
* Think tanks
+
** Think tanks
* <span style="font: 14.0px 'Helvetica Neue'">Freedom of Information Act</span> (FOIA) / Freedom of Information Law (FOIL)
+
** <span style="font: 14.0px 'Helvetica Neue'">Freedom of Information Act</span> (FOIA) / Freedom of Information Law (FOIL)
** Muck Rock will help you out on doing a FOIA request.
+
*** Muck Rock will help you out on doing a FOIA request.
*** Check out examples of letters they've sent, and their boiler plate letters
+
**** Check out examples of letters they've sent, and their boiler plate letters
*** They'll alert you about timeline stuff
+
**** They'll alert you about timeline stuff
** There may be costs involved
+
*** There may be costs involved
** This is kinda your last resort
+
*** This is kinda your last resort
** When making a request, ask for the person who's responsible, not just a receptionist. You need someone to hold accountable
+
*** When making a request, ask for the person who's responsible, not just a receptionist. You need someone to hold accountable
* Geo Commons
+
** Geo Commons
** Very open set of geographical data sets
+
*** Very open set of geographical data sets
** Great for polygons like districts
+
*** Great for polygons like districts
* Planning Departments
+
** Planning Departments
* Cicero
+
** Cicero
** Legislation
+
*** Legislation
* Sunlight Foundation
+
** Sunlight Foundation
#### Strategies for getting data and analyzing
+
* Strategies for getting data and analyzing
* If someone tells you they don't have the data, that's not the end. They can help you find it.
+
** If someone tells you they don't have the data, that's not the end. They can help you find it.
* Ask for the name of their database
+
** Ask for the name of their database
* Ask for the specs of the database they're using
+
** Ask for the specs of the database they're using
* Get a lawyer if they claim that giving you data is a threat to homeland security
+
** Get a lawyer if they claim that giving you data is a threat to homeland security
* Ask for possible costs up front
+
** Ask for possible costs up front
* Ask StackExchange
+
** Ask StackExchange
* NICAR
+
** NICAR
#### Working with Data
+
* Working with Data
* Provenance matters
+
** Provenance matters
** Especially if you're using open data from places like Geo Commons. You need to understand where the data came from
+
*** Especially if you're using open data from places like Geo Commons. You need to understand where the data came from
* Not everything should be data
+
** Not everything should be data
#### Tools
+
* Tools
* Hard
+
** Hard
** R + RStudio
+
*** R + RStudio
** QGIS
+
*** QGIS
* Medium
+
** Medium
** [https://cartodb.com/ Carto DB]
+
*** [https://cartodb.com/ Carto DB]
** [http://www.highcharts.com/ High Charts]
+
*** [http://www.highcharts.com/ High Charts]
** Mapbox
+
*** Mapbox
** D3
+
*** D3
* Easy
+
** Easy
** [https://quartz.github.io/Chartbuilder/ Chart Builder]
+
*** [https://quartz.github.io/Chartbuilder/ Chart Builder]
* Geo Coders
+
** Geo Coders
** Texas A&amp;M has a good tool
+
*** Texas A&amp;M has a good tool
** [https://github.com/amandabee/CUNY-data-storytelling/wiki/Tip-Sheet:-Geocoding#promising-rumors Check her site]
+
*** [https://github.com/amandabee/CUNY-data-storytelling/wiki/Tip-Sheet:-Geocoding#promising-rumors Check her site]

Revision as of 20:27, 3 August 2015

Facilitated by Amanda Hickman - github.com/amandabee Syllabus, Cheat Sheets, etc. amandabee.github.io/CUNY-data-skills

  • Reasons to want data
    • Focus on a population
    • Evidence for a claim you're making is actually true
    • Reduce harm
  • When we're talking about data, we're talking about something in a spreadsheet that you can chop up and analyze and such
    • A PDF is not data yet, because it's basically a picture.
      • Tabula from Nerd Powerful is a great tool for pulling data from PDF tables
    • Charts and maps are not data because you can't really reverse engineer them
  • Where to find it
    • Librarians. They live for this
    • Open data portals
      • Open data is a set of laws that governments are required to put out public data
    • Ask for sources on reports and charts that you see
      • If you see a great visualization, then you should call up the authors to try and get their data
    • Academics
      • PhD's have great data that will never be seen because they're academics and thus write unintelligibly
    • SF Indicators Project
    • Census
    • American Community Survey
    • CensusReporter.org
    • Bureau of Labor Statistics
    • Community Expenditure Survey
    • Various public health departments
    • Think tanks
    • Freedom of Information Act (FOIA) / Freedom of Information Law (FOIL)
      • Muck Rock will help you out on doing a FOIA request.
        • Check out examples of letters they've sent, and their boiler plate letters
        • They'll alert you about timeline stuff
      • There may be costs involved
      • This is kinda your last resort
      • When making a request, ask for the person who's responsible, not just a receptionist. You need someone to hold accountable
    • Geo Commons
      • Very open set of geographical data sets
      • Great for polygons like districts
    • Planning Departments
    • Cicero
      • Legislation
    • Sunlight Foundation
  • Strategies for getting data and analyzing
    • If someone tells you they don't have the data, that's not the end. They can help you find it.
    • Ask for the name of their database
    • Ask for the specs of the database they're using
    • Get a lawyer if they claim that giving you data is a threat to homeland security
    • Ask for possible costs up front
    • Ask StackExchange
    • NICAR
  • Working with Data
    • Provenance matters
      • Especially if you're using open data from places like Geo Commons. You need to understand where the data came from
    • Not everything should be data
  • Tools