How to Work With Data

The other day, Bitly’s data chief Hilary Mason explained how to get started with data.

Today, she discusses how to work with data, from getting it, to exploring it to interpreting it.

A while back, Hilary and Columbia mathematician Chris Wiggins wrote about this process, called it a taxonomy of data science, and gave a roughly chronological account of what one does with data: Obtain, Scrub, Explore, Model and iNterpret.

No, that’s not a typo, it’s part of an acronym: OSEMN, which rhymes with possum, which means you pronounce it “awesome”.

To get more details than Hilary offers here, check their article. It offers code examples and tools and tricks to work through each of the steps above.

58 notes

Show

  1. purplesource reblogged this from dailydatascience and added:
    Nicely structured thinking. Sadly, I’ve been working 6 months and am still on step 2.
  2. opensandiego reblogged this from dailydatascience
  3. dailydatascience reblogged this from futurejournalismproject
  4. alexainslie reblogged this from emergentfutures and added:
    “Obtain, Scrub, Explore, Model and iNterpret. No, that’s not a typo, it’s part of an acronym: OSEMN, which rhymes with...
  5. strayblossoms reblogged this from emergentfutures
  6. mediagirl reblogged this from futurejournalismproject
  7. jvonneumann reblogged this from emergentfutures
  8. relexed reblogged this from emergentfutures
  9. storiesweshared reblogged this from emergentfutures
  10. uniacid reblogged this from futurejournalismproject
  11. eudamon reblogged this from emergentfutures
  12. emergentfutures reblogged this from futurejournalismproject
  13. livelifealittlelovely reblogged this from futurejournalismproject
  14. mgpettit reblogged this from futurejournalismproject and added:
    Seems very inspired by Ben Fry’s thesis, but...concrete strategies for how
  15. x1alejandro3x reblogged this from logicianmagician
  16. babydatajournalism reblogged this from futurejournalismproject
  17. logicianmagician reblogged this from futurejournalismproject
  18. circlesoffire reblogged this from futurejournalismproject
  19. newsandcode reblogged this from futurejournalismproject and added:
    Awesome.
  20. krochmal reblogged this from futurejournalismproject
  21. daniloamfreire reblogged this from futurejournalismproject
  22. This was featured in #Tech

Blog comments powered by Disqus