Loading...

How well can you use the strategies of focusing in and ignoring outliers to analyze a non-linear data set using a linear model?

In this lesson, you have learned strategies for analyzing non-linear data using a linear model. Look at this example before doing some practice problems on your own.

In the video at the beginning of this lesson, Shawn was trying to answer the following question.

How does the amount of sleep a person gets each night change as they get older?

Shawn gathered data comparing age with the amount of sleep people get on average.

Age (years)

2

5

8

13

15

18

22

26

30

50

65

Average Sleep Time (hours per night)

14

12

10

9

8.5

8.25

7

6.5

6

6.5

7.5

Then, he created a scatterplot of the data. The data in the scatterplot shows a non-linear association between the variables.

A detailed description of this image follows in the next paragraph.

A non-linear scatter plot with the following attributes:

  • The x-axis is labeled Age (years).
  • The y-axis is labeled Average Sleep Time (hours per night).
  • The following ordered pairs are plotted: (2, 14), (5, 12), (8, 10), (13, 9), (15, 8.5), (18, 8.25), (22, 7), (26, 6.5), (30, 6), (50, 6.5), and (65, 7.5).

How can Shawn use a linear model to analyze this non-linear data and answer his question?

Study the slideshow to see how to first create a linear model for the data by stating a domain or identifying outliers to ignore and then to use the linear model to answer Shawn's question.

To use a linear model to analyze non-linear data, you must find a linear association inside the non-linear association. One strategy is to focus on a specific part of the data that is linear. Another strategy is to ignore any points that are outliers.

Which strategy should Shawn use?

The steps for focusing on a portion of the data that can be modeled with a straight line are shown in the table below. Click each step to see how to identify the domain of Shawn's scatterplot that can be analyzed using a linear model.

Shawn can create a linear model for the data that represents the average sleep times for people between the ages of 8 and 30.

To make a linear model, Shawn will need to find the line of best fit for the linear data and calculate the equation in slope-intercept form. Then, he will be able to interpret the equation to answer his question.

The line of best fit for the data in the domain from age 8 to 30 is shown on the scatterplot below:

A detailed description of this image follows in the next paragraph.

A non-linear scatter plot with the following attributes:

  • The x-axis is labeled Age (years).
  • The y-axis is labeled Average Sleep Time (hours per night).
  • The following ordered pairs are plotted: (2, 14), (5, 12), (8, 10), (13, 9), (15, 8.5), (18, 8.25), (22, 7), (26, 6.5), (30, 6), (50, 6.5), and (65, 7.5).
  • A line of best fit is drawn through the ordered pairs (8, 10), (13, 9), (15, 8.5), (18, 8.25), (22, 7), (26, 6.5), and (30, 6).

The steps for calculating the equation in slope-intercept form of the line of best fit on a scatterplot are shown in the table below. Click each step to see how Shawn used it to create a linear model from the scatterplot.

The linear model for the relationship between age and amount of sleep for people of ages 8 to 30 is:

\( y = - 0.19x + 11.52 \)

Use this linear model to answer the following question:

How does the amount of sleep a person gets each night change as they get older?

Now it's your turn. Practice using a linear model to work with non-linear data by completing the activity below. Answer the question on each tab, then check your answer.

If you need graph paper, click below to download printable graph paper in Word or PDF format.

Identify the domain on the following scatterplot that can be analyzed using a linear model.

A detailed description of this image follows in the next paragraph.

A non-linear scatter plot with the ordered pairs (0.25, 8.25), (1, 8.5), (2, 8), (3.5, 7.25), (4, 6.5), (4, 5.75), (4.5, 4.5), (4.5, 5.5), (5, 4), (5, 5), (5.5, 3.25), (7, 3), and (9, 2).

On the following scatterplot, identify outliers that can be ignored to allow analysis with a linear model.

A detailed description of this image follows in the next paragraph.

A non-linear scatter plot with the ordered pairs (2, 3.5), (3.5, 3), (4, 3.75), (4.5, 3.5), (5, 4), (5.75, 4.75), (6.25, 4.5), (6.5, 5), (7.25, 5.25), (7.75, 6), (8.25, 6.25), and (9.5, 5).

The data in the table shows the percent of a slice of bread that is covered in mold compared to how long the bread had been sitting out.

Time the Bread Sat Out (days)

5

9

15

21

19

17

21

24

28

Percent of Bread Covered in Mold

10

18

20

37

29

21

43

50

85

Create a scatterplot to visualize the data. If the association is non-linear, identify either the domain that can be used or the outliers that can be ignored. Then, find the equation of a linear model for the data.