Loading...

What are some strategies you can use to find a linear association inside a non-linear association?

In the video, Shawn recognized that the data on his scatterplot was non-linear. It will be difficult for Shawn to use a linear model to analyze the non-linear data.

Data with a linear association has a constant rate of change throughout the graph. A scatterplot that shows a linear association between the variables is modeled with a straight line. But a scatterplot that shows a non-linear association between the variables cannot be modeled with a straight line because the slope along a curved line changes as you move from left to right.

Although you cannot use a linear model to directly represent non-linear data, there are strategies that allow you to use a linear model to analyze non-linear data.

To do so, you must find a linear association inside the non-linear association by 1) focusing in on a specific part of the data that is linear or 2) ignoring any points that are outliers. Read the tabs to learn about each strategy.

Focusing in on a linear portion of a non-linear association allows you to use a linear model to represent part of the data. You can think of the process as "zooming in" on the linear portion.

You can describe the part of the data you are focusing on using a domain. A domain describes the \( x \)-values. The portion of data being focused on is never described using the \( y \)-values.

When focusing on the linear part of a non-linear set of data, it is important to include as much of the data as possible. Generally, it is acceptable to exclude less than half of the data set.

A detailed description of this image follows in the next paragraph.

A non-linear scatter plot.

This scatterplot shows a non-linear association.

What portion of the data can be modeled with a straight line? State the domain that has a linear association.

The steps for focusing on a portion of the data that can be modeled with a straight line are shown in the table below. Click each step to learn how to identify and state the domain of the data that has a linear association.

The data in the domain from 1.5 to 5 has a linear association, which means that this portion of the data can be modeled by a straight line.

Outliers are data points that do not fit the general data pattern. Sometimes outliers are not clearly separate from the data pattern, which makes the pattern look non-linear.

Sometimes it is possible to make a non-linear association appear more linear by ignoring one or two outliers. The outliers must be identified by their coordinates.

A detailed description of this image follows in the next paragraph.

A non-linear scatter plot.

This scatterplot shows a non-linear association.

Which points could be ignored as outliers so that the rest of the data can be modeled with a straight line?

The steps for identifying outliers that can be ignored so that the data can be modeled with a straight line are shown in the table below. Click each step to see it applied to this example.

The outliers, (3, 1) and (9.5, 7), can be ignored while creating a linear model for the data.

Question

How are the processes of identifying a linear domain or identifying outliers similar? How are they different?