Difference between revisions of "Data Visualization"

From California Technology Festival Wiki
Jump to navigation Jump to search
(Created page with " <h3>Data Visualization</h3> <div>Facilitated by Amanda Hickman - [https://github.com/amandabee github.com/amandabee]</div> <div>Syllabus, Cheat Sheets, etc. [http://amandabe...")
 
Line 1: Line 1:
 +
=== Data Visualization ===
  
<h3>Data Visualization</h3>
+
Facilitated by Amanda Hickman - [https://github.com/amandabee github.com/amandabee]
<div>Facilitated by Amanda Hickman - [https://github.com/amandabee github.com/amandabee]</div>
+
 
<div>Syllabus, Cheat Sheets, etc. [http://amandabee.github.io/CUNY-data-skills/ amandabee.github.io/CUNY-data-skills]</div>
+
 
<div><br/></div>
+
Syllabus, Cheat Sheets, etc. [http://amandabee.github.io/CUNY-data-skills/ amandabee.github.io/CUNY-data-skills]
<ul>
+
 
<li>Reasons to want data
+
 
<ul>
+
* Reasons to want data
<li>Focus on a population</li>
+
** Focus on a population
<li>Evidence for a claim you're making is actually true</li>
+
** Evidence for a claim you're making is actually true
<li>Reduce harm</li>
+
** Reduce harm
</ul>
+
* When we're talking about data, we're talking about something in a spreadsheet that you can chop up and analyze and such
</li>
+
** A PDF is not data yet, because it's basically a picture.
<li>When we're talking about data, we're talking about something in a spreadsheet that you can chop up and analyze and such
+
*** Tabula from Nerd Powerful is a great tool for pulling data from PDF tables
<ul>
+
** Charts and maps are not data because you can't really reverse engineer them
<li>A PDF is not data yet, because it's basically a picture.
+
* Where to find it
<ul>
+
** Librarians. They live for this
<li>Tabula from Nerd Powerful is a great tool for pulling data from PDF tables</li>
+
** Open data portals
</ul>
+
*** Open data is a set of laws that governments are required to put out public data
</li>
+
** Ask for sources on reports and charts that you see
<li>Charts and maps are not data because you can't really reverse engineer them</li>
+
*** If you see a great visualization, then you should call up the authors to try and get their data
</ul>
+
** Academics
</li>
+
*** PhD's have great data that will never be seen because they're academics and thus write unintelligibly
<li>Where to find it
+
** SF Indicators Project
<ul>
+
** Census
<li>Librarians. They live for this</li>
+
** American Community Survey
<li>Open data portals
+
** CensusReporter.org
<ul>
+
** Bureau of Labor Statistics
<li>Open data is a set of laws that governments are required to put out public data</li>
+
** Community Expenditure Survey
</ul>
+
** Various public health departments
</li>
+
** Think tanks
<li>Ask for sources on reports and charts that you see
+
** <span style="font: 14.0px 'Helvetica Neue'">Freedom of Information Act</span> (FOIA) / Freedom of Information Law (FOIL)
<ul>
+
*** Muck Rock will help you out on doing a FOIA request.
<li>If you see a great visualization, then you should call up the authors to try and get their data</li>
+
**** Check out examples of letters they've sent, and their boiler plate letters
</ul>
+
**** They'll alert you about timeline stuff
</li>
+
*** There may be costs involved
<li>Academics
+
*** This is kinda your last resort
<ul>
+
*** When making a request, ask for the person who's responsible, not just a receptionist. You need someone to hold accountable
<li>PhD's have great data that will never be seen because they're academics and thus write unintelligibly</li>
+
** Geo Commons
</ul>
+
*** Very open set of geographical data sets
</li>
+
*** Great for polygons like districts
<li>SF Indicators Project</li>
+
** Planning Departments
<li>Census</li>
+
** Cicero
<li>American Community Survey</li>
+
*** Legislation
<li>CensusReporter.org</li>
+
** Sunlight Foundation
<li>Bureau of Labor Statistics</li>
+
* Strategies for getting data and analyzing
<li>Community Expenditure Survey</li>
+
** If someone tells you they don't have the data, that's not the end. They can help you find it.
<li>Various public health departments</li>
+
** Ask for the name of their database
<li>Think tanks</li>
+
** Ask for the specs of the database they're using
<li><span style="font: 14.0px 'Helvetica Neue'">Freedom of Information Act</span> (FOIA) / Freedom of Information Law (FOIL)
+
** Get a lawyer if they claim that giving you data is a threat to homeland security
<ul>
+
** Ask for possible costs up front
<li>Muck Rock will help you out on doing a FOIA request.
+
** Ask StackExchange
<ul>
+
** NICAR
<li>Check out examples of letters they've sent, and their boiler plate letters</li>
+
* Working with Data
<li>They'll alert you about timeline stuff</li>
+
** Provenance matters
</ul>
+
*** Especially if you're using open data from places like Geo Commons. You need to understand where the data came from
</li>
+
** Not everything should be data
<li>There may be costs involved</li>
+
* Tools
<li>This is kinda your last resort</li>
+
** Hard
<li>When making a request, ask for the person who's responsible, not just a receptionist. You need someone to hold accountable</li>
+
*** R + RStudio
</ul>
+
*** QGIS
</li>
+
** Medium
<li>Geo Commons
+
*** [https://cartodb.com/ Carto DB]
<ul>
+
*** [http://www.highcharts.com/ High Charts]
<li>Very open set of geographical data sets</li>
+
*** Mapbox
<li>Great for polygons like districts</li>
+
*** D3
</ul>
+
** Easy
</li>
+
*** [https://quartz.github.io/Chartbuilder/ Chart Builder]
<li>Planning Departments</li>
+
** Geo Coders
<li>Cicero
+
*** Texas A&amp;M has a good tool
<ul>
+
*** [https://github.com/amandabee/CUNY-data-storytelling/wiki/Tip-Sheet:-Geocoding#promising-rumors Check her site]
<li>Legislation</li>
 
</ul>
 
</li>
 
<li>Sunlight Foundation</li>
 
</ul>
 
</li>
 
<li>Strategies for getting data and analyzing
 
<ul>
 
<li>If someone tells you they don't have the data, that's not the end. They can help you find it.</li>
 
<li>Ask for the name of their database</li>
 
<li>Ask for the specs of the database they're using</li>
 
<li>Get a lawyer if they claim that giving you data is a threat to homeland security</li>
 
<li>Ask for possible costs up front</li>
 
<li>Ask StackExchange</li>
 
<li>NICAR</li>
 
</ul>
 
</li>
 
<li>Working with Data
 
<ul>
 
<li>Provenance matters
 
<ul>
 
<li>Especially if you're using open data from places like Geo Commons. You need to understand where the data came from</li>
 
</ul>
 
</li>
 
<li>Not everything should be data</li>
 
</ul>
 
</li>
 
<li>Tools
 
<ul>
 
<li>Hard
 
<ul>
 
<li>R + RStudio</li>
 
<li>QGIS</li>
 
</ul>
 
</li>
 
<li>Medium
 
<ul>
 
<li>[https://cartodb.com/ Carto DB]</li>
 
<li>[http://www.highcharts.com/ High Charts]</li>
 
<li>Mapbox</li>
 
<li>D3</li>
 
</ul>
 
</li>
 
<li>Easy
 
<ul>
 
<li>[https://quartz.github.io/Chartbuilder/ Chart Builder]</li>
 
</ul>
 
</li>
 
<li>Geo Coders
 
<ul>
 
<li>Texas A&amp;M has a good tool</li>
 
<li>[https://github.com/amandabee/CUNY-data-storytelling/wiki/Tip-Sheet:-Geocoding#promising-rumors Check her site]</li>
 
</ul>
 
</li>
 
</ul>
 
</li>
 
</ul>
 

Revision as of 20:20, 3 August 2015

Data Visualization

Facilitated by Amanda Hickman - github.com/amandabee


Syllabus, Cheat Sheets, etc. amandabee.github.io/CUNY-data-skills


  • Reasons to want data
    • Focus on a population
    • Evidence for a claim you're making is actually true
    • Reduce harm
  • When we're talking about data, we're talking about something in a spreadsheet that you can chop up and analyze and such
    • A PDF is not data yet, because it's basically a picture.
      • Tabula from Nerd Powerful is a great tool for pulling data from PDF tables
    • Charts and maps are not data because you can't really reverse engineer them
  • Where to find it
    • Librarians. They live for this
    • Open data portals
      • Open data is a set of laws that governments are required to put out public data
    • Ask for sources on reports and charts that you see
      • If you see a great visualization, then you should call up the authors to try and get their data
    • Academics
      • PhD's have great data that will never be seen because they're academics and thus write unintelligibly
    • SF Indicators Project
    • Census
    • American Community Survey
    • CensusReporter.org
    • Bureau of Labor Statistics
    • Community Expenditure Survey
    • Various public health departments
    • Think tanks
    • Freedom of Information Act (FOIA) / Freedom of Information Law (FOIL)
      • Muck Rock will help you out on doing a FOIA request.
        • Check out examples of letters they've sent, and their boiler plate letters
        • They'll alert you about timeline stuff
      • There may be costs involved
      • This is kinda your last resort
      • When making a request, ask for the person who's responsible, not just a receptionist. You need someone to hold accountable
    • Geo Commons
      • Very open set of geographical data sets
      • Great for polygons like districts
    • Planning Departments
    • Cicero
      • Legislation
    • Sunlight Foundation
  • Strategies for getting data and analyzing
    • If someone tells you they don't have the data, that's not the end. They can help you find it.
    • Ask for the name of their database
    • Ask for the specs of the database they're using
    • Get a lawyer if they claim that giving you data is a threat to homeland security
    • Ask for possible costs up front
    • Ask StackExchange
    • NICAR
  • Working with Data
    • Provenance matters
      • Especially if you're using open data from places like Geo Commons. You need to understand where the data came from
    • Not everything should be data
  • Tools