# Data Science: Linear Regression

## Rafael Irizarry, HarvardX

Learn how to use R to implement linear regression, one of the most common statistical modeling approaches in data science.

Linear regression is commonly used to quantify the relationship between two or more variables. It is also used to adjust for confounding. This course, part ofourProfessional Certificate Program in Data Science, covers how to implement linear regression and adjust for confounding in practice using R.

In data science applications, it is very common to be interested in the relationship between two or more variables. The motivating case study we examine in this course relates to the data-driven approach used to construct baseball teams described in Moneyball. We will try to determine which measured outcomes best predict baseball runs by using linear regression.

We will also examine confounding, where extraneous variables affect the relationship between two or more other variables, leading to spurious associations. Linear regression is a powerful technique for removing confounders, but it is not a magical process. It is essential to understand when it is appropriate to use, and this course will teach you when to apply this technique.

### What will you learn

• How linear regression was originally developed by Galton
• What is confounding and how to detect it
• How to examine the relationships between variables by implementing linear regression in R

Dates:
• 15 July 2020
Course properties:
• Free:
• Paid:
• Certificate:
• MOOC:
• Video:
• Audio:
• Email-course:
• Language: English

### Reviews

No reviews yet. Want to be the first?

Register to leave a review

More on this topic:
Fundamentals of Statistics
Develop a deep understanding of the principles that underpin statistical inference...
Introduction to Data Science
Join the data revolution. Companies are searching for data scientists. This...
Model Thinking
In this class, you will learn how to think with models and use them to make...
High Performance Scientific Computing
Programming-oriented course on effectively using modern computers to solve scientific...
PH207x: Health in Numbers: Quantitative Methods in Clinical & Public Health Research
PH207x is the online adaptation of material from the Harvard School of Public...
More from 'Mathematics, Statistics and Data Analysis':
Foundations of Modern Finance II
Learn fundamental principles of modern finance, including valuation models,...
SP21: Introduction to Analytics Modeling
Learn essential analytics models and methods and how to appropriately apply...
SP21: Data Analytics for Business
This course prepares students to understand business analytics and become leaders...
UX Evaluation
Master UX evaluation using a variety of skill sets and methods. Uncover the...
L'évaluation UX
Maîtrisez l'évaluation UX en utilisant une variété de compétences et de méthodes...
More from 'edX':
Instructional Design Course Evaluation & Capstone Project
Develop your Instructional Design &amp; Technology MicroMasters capstone project...
Cloud Computing Security
Learn how to identify security issues in the cloud and industry-standard techniques...
Long-term Financial Management
Learn what it takes to hold a company’s financial future in your hands, as you...
Instructional Design Models
Explore traditional and current instructional design models as you develop your...
Statistical Analysis in Bioinformatics
Learn basic R programming to analyze biological big data to locate genes, perform...