How to Work With Data

The other day, Bitly’s data chief Hilary Mason explained how to get started with data.

Today, she discusses how to work with data, from getting it, to exploring it to interpreting it.

A while back, Hilary and Columbia mathematician Chris Wiggins wrote about this process, called it a taxonomy of data science, and gave a roughly chronological account of what one does with data: Obtain, Scrub, Explore, Model and iNterpret.

No, that’s not a typo, it’s part of an acronym: OSEMN, which rhymes with possum, which means you pronounce it “awesome”.

To get more details than Hilary offers here, check their article. It offers code examples and tools and tricks to work through each of the steps above.

  1. christycorrell reblogged this from jeremywaite
  2. jeremywaite reblogged this from futurejournalismproject
  3. purplesource reblogged this from dailydatascience and added:
    Nicely structured thinking. Sadly, I’ve been working 6 months and am still on step 2.
  4. opensandiego reblogged this from dailydatascience
  5. dailydatascience reblogged this from futurejournalismproject
  6. alexainslie reblogged this from emergentfutures and added:
    “Obtain, Scrub, Explore, Model and iNterpret. No, that’s not a typo, it’s part of an acronym: OSEMN, which rhymes with...
  7. strayblossoms reblogged this from emergentfutures
  8. mediagirl reblogged this from futurejournalismproject
  9. jvonneumann reblogged this from emergentfutures
  10. relexed reblogged this from emergentfutures
  11. storiesweshared reblogged this from emergentfutures
  12. uniacid reblogged this from futurejournalismproject
  13. eudamon reblogged this from emergentfutures
  14. emergentfutures reblogged this from futurejournalismproject
  15. livelifealittlelovely reblogged this from futurejournalismproject
  16. mgpettit reblogged this from futurejournalismproject and added:
    Seems very inspired by Ben Fry’s thesis, but...concrete strategies for how
  17. x1alejandro3x reblogged this from logicianmagician
  18. babydatajournalism reblogged this from futurejournalismproject
  19. logicianmagician reblogged this from futurejournalismproject
  20. circlesoffire reblogged this from futurejournalismproject
  21. newsandcode reblogged this from futurejournalismproject and added:
    Awesome.
  22. krochmal reblogged this from futurejournalismproject