0%

Book Description

Digging for answers to your pressing business questions probably won’t resemble those tidy case studies that lead you step-by-step from data collection to cool insights. Data science is not so clear-cut in the real world. Instead of high-quality data with the right velocity, variety, and volume, many data scientists have to work with missing or sketchy information extracted from people in the organization.

In this O’Reilly report, Jerry Overton—Distinguished Engineer at global IT leader DXC—introduces practices for making good decisions in a messy and complicated world. What he simply calls “data science that works” is a trial-and-error process of creating and testing hypotheses, gathering evidence, and drawing conclusions. These skills are far more useful for practicing data scientists than, say, mastering the details of a machine-learning algorithm.

Adapted and expanded from a series of articles Overton published on O’Reilly Radar and on the CSC Blog, each chapter is ideal for current and aspiring data scientists who want to go pro, as well as IT execs and managers looking to hire in this field. The report covers:

  • Using the scientific method to gain a competitive advantage
  • The skill set you need to look for when choosing a data scientist
  • Why practical induction is a key part of thinking like a data scientist
  • Best practices for writing solid code in your data science gig
  • How agile experimentation lets you find answers (or dead ends) much faster
  • Advice for surviving (and even thriving) as a data scientist in your organization

Table of Contents

  1. 1. Introduction
    1. Finding Signals in the Noise
    2. Data Science that Works
  2. 2. How to Get a Competitive Advantage Using Data Science
    1. The Standard Story Line for Getting Value from Data Science
    2. An Alternative Story Line for Getting Value from Data Science
    3. The Importance of the Scientific Method
  3. 3. What to Look for in a Data Scientist
    1. A Realistic Skill Set
    2. Realistic Expectations
  4. 4. How to Think Like a Data Scientist
    1. Practical Induction
    2. The Logic of Data Science
    3. Treating Data as Evidence
  5. 5. How to Write Code
    1. The Professional Data Science Programmer
    2. Think Like a Pro
    3. Design Like a Pro
    4. Build Like a Pro
    5. Learn Like a Pro
  6. 6. How to Be Agile
    1. An Example Using the StackOverflow Data Explorer
    2. Putting the Results into Action
    3. Lessons Learned from a Minimum Viable Experiment
    4. Don’t Worry, Be Crappy
  7. 7. How to Survive in Your Organization
    1. You Need a Network
    2. You Need A Patron
    3. You Need Partners
    4. It’s a Jungle Out There
  8. 8. The Road Ahead
    1. Data Science Today
    2. Data Science Tomorrow
  9. Index