Doing Data Science: Straight Talk from the Frontline by Rachel Schutt, Cathy O'Neil

By Rachel Schutt, Cathy O'Neil

Now that individuals are conscious that facts could make the variation in an election or a company version, facts technological know-how as an career is gaining floor. yet how will you start operating in a wide-ranging, interdisciplinary box that’s so clouded in hype? This insightful publication, according to Columbia University’s advent to facts technological know-how type, tells you what you must know.

in lots of of those chapter-long lectures, info scientists from businesses akin to Google, Microsoft, and eBay proportion new algorithms, tools, and versions by way of featuring case reviews and the code they use. If you’re accustomed to linear algebra, likelihood, and statistics, and feature programming event, this publication is a perfect creation to information science.

issues include:
• Statistical inference, exploratory information research, and the information technology method
• Algorithms
• unsolicited mail filters, Naive Bayes, and information wrangling
• Logistic regression
• monetary modeling
• suggestion engines and causality
• info visualization
• Social networks and information journalism
• information engineering, MapReduce, Pregel, and Hadoop

Doing info Science is collaboration among direction teacher Rachel Schutt, Senior vice president of information technological know-how at information Corp, and information technological know-how advisor Cathy O’Neil, a senior info scientist at Johnson study Labs, who attended and blogged concerning the course.

Show description

Read Online or Download Doing Data Science: Straight Talk from the Frontline PDF

Best nonfiction books

Wayne Goddard's $50 Knife Shop, Revised

Don't spend mammoth dollars in your knife shop!

You don't have to lay our a fortune to begin making impressive knives. famous knifemaker Wayne Goddard offers impressive step by step directions for making your individual instruments, discovering the precise metal and forging, grinding and heat-treating knives on a budget.

Wonderfully illustrated with full-color images, Goddard's booklet publications you thru the knifemaking technique from begin to end or even features a finances breakdown exhibiting every little thing you would like is offered to discount costs. Goddard even explains and demonstrates the making of twine Damascus blades with the best of tools.

Wayne Goddard's $50 Knife store is a locate better half quantity to Goddard's e-book the sweetness of Knifemaking and offers the entire info you must begin making knives on the cheap.

How to Become a Straight-A Student: The Unconventional Strategies Real College Students Use to Score High While Studying Less

Seeking to jumpstart your GPA? most school scholars think that directly A’s can be accomplished simply via cramming and painful all-nighters on the library. yet Cal Newport is familiar with that genuine straight-A scholars don’t examine harder—they examine smarter. A step forward method of acing educational assignments, from quizzes and assessments to essays and papers, tips to develop into a Straight-A scholar finds for the 1st time the confirmed research secrets and techniques of actual straight-A scholars around the state and weaves them right into a basic, functional method that any one can grasp.

Mindfulness, Bliss, and Beyond: A Meditator's Handbook

"Better than intercourse " That's how Ajahn Brahm describes meditation, and his enthusiasm is contagious. A self-described meditation junkie, Brahm, the writer of the preferred Who Ordered This Truckload of Dung? , stocks his recipe for bliss during this useful, energizing new publication. The Meditator's instruction manual is an entire, stem-to-stern consultant to the topic, with designated step by step directions for traversing the levels of perform and overcoming hindrances.

In Search of Hobart

Exploring town of Hobart, Tasmania, from a private view, resident Peter Timms builds a full of life, evocative and occasionally hugely severe photograph of Hobart this day. The narrative deftly interweaves the perspectives and critiques of citizens and visitors—from the early convicts to the present mayor—with press experiences and literary references to discover how the town has constructed during the last two hundred years, how its occasionally brutal previous has assisted in shaping its specific personality, and what it truly is wish to dwell in a small urban on an island to this point from the remainder of Australia and the area.

Additional info for Doing Data Science: Straight Talk from the Frontline

Example text

But if you took them and counted how many email messages each person sent, and used that to estimate the underlying distribution of emails sent by all indiviuals at BigCorp, you might get entirely different answers. So if even getting a basic thing down like counting can get distorted when you’re using a reasonable-sounding sampling method, imagine what can happen to more complicated algorithms and models if you haven’t taken into account the process that got the data into your hands. 20 | Chapter 2: Statistical Inference, Exploratory Data Analysis, and the Data Science Process Populations and Samples of Big Data But, wait!

It’s traditionally presented as a bunch of histograms and stem-and-leaf plots. They teach that stuff to kids in fifth grade so it seems trivial, right? No wonder no one thinks much of it. But EDA is a critical part of the data science process, and also repre‐ sents a philosophy or way of doing statistics practiced by a strain of statisticians coming from the Bell Labs tradition. John Tukey, a mathematician at Bell Labs, developed exploratory data analysis in contrast to confirmatory data analysis, which concerns it‐ self with modeling and hypotheses as described in the previous section.

As we mentioned earlier, a data science team works best when different skills (profiles) are represented across different people, because no‐ body is good at everything. It makes us wonder if it might be more worthwhile to define a “data science team”—as shown in Figure 1-3— than to define a data scientist. A Data Science Profile | 11 Figure 1-3. Data science team profiles can be constructed from data scientist profiles; there should be alignment between the data science team profile and the profile of the data problems they try to solve 12 | Chapter 1: Introduction: What Is Data Science?

Download PDF sample

Rated 4.74 of 5 – based on 39 votes