How do you calculate R-squared in R?

IONOS editorial team16/07/20244 mins

Contents

R-squared (R2) is a statistical error metric used to measure the quality of linear regressions. In R programming, it can be calculated by calling up a simple function.

Why is R-squared in R important?

R-squared is a statistical measure that measures how well a linear regression line approximates the data. It assumes values between 0 and 1 and is a key measure for regression model quality.

An R-squared interpretation provides information about how close the data is to a calculated regression line. The higher the R-squared value, the better the model explains the data. A low R-squared value indicates poor model fitting.

Tip

R lets you program a whole range of different applications. And getting your own webspace lets you host them. Discover different IONOS webspace plans and find one that meets your individual needs.

R-squared in R with linear regression

R-squared in R is often used in the context of linear regression. Since R is a programming language often used in statistics, it’s not surprising that there are various R functions to help you calculate:

x <- c(1, 2, 3, 4, 5)
y <- c(2, 4, 5, 4, 5)
# Linear regression
model <- lm(y ~ x)

In the code example above, two R vectors named x and y were created. These vectors contain the datasets on which the linear regression will be performed. The dependent variable in this case is the variable y. The regression model is then calculated using the R-function lm() and stored in the variable model.

How to calculate R-squared in R

The R2 value in R can be calculated using a function. You don’t need in-depth mathematical knowledge to do this, you just need to know how to use the correct function. It’s a simple function, even if you’re just starting out with coding.

The function to calculate this is called summary(). As the name suggests, it provides a summary of the regression analysis, including the R-squared value. The code example below, which builds on the linear regression that has already been calculated, shows the summary() function in action:

# R-squared-value
summary(model)$r.squared

You can use this code to extract the R-squared value from the linear regression model lm_model. The R-squared value indicates how well the model approximates the variation in the dependent variable y, based on the independent variable x.

In the code example above, the summary() function is applied to the regression model that has already been calculated. At the same time, the R operator $ is used to display the R-squared value from the values returned by the function call. In our example, the value is 0.6.

Tip

Looking to dive deeper into the world of R programming? Our how-to guides will help you get started:

How to interpret R-squared

Once the R-squared value has been determined, you have to interpret the result. Here, it‘s a good idea to look at certain intervals that the value can take. As mentioned earlier, the range of R2 values is between 0 and 1.

0 (no adjustment): an R-squared value of 0 means that the model does not match the data at all. In this case, there is no linear relationship between the variables.
1 (perfect fit): an R-squared value of 1 indicates that all observations lie perfectly on the regression line. This is extremely rare and may indicate overfitting.
0.7 to 0.9 (good fit): an R-squared value in this interval indicates that the model describes the data sufficiently well.
0.5 to 0.7 (acceptable adjustment): an R-squared value in the range of 0.5 to 0.7 is acceptable but indicates that there’s still room for improvement.
Less than 0.5 (poor fit): an R-squared value below 0.5 indicates that the calculated model doesn’t describe the data with sufficient accuracy. In this case, the model should be adapted to obtain meaningful results.

Note

A high R-squared value alone isn’t enough to judge the quality of your model. That’s why you should also consider factors like model validation, analysis of residuals, and adaptation to specific requirements when determining the goodness of fit of a regression model. The summary() function shown earlier provides additional key figures that you can use for the assessment.

Was this article helpful?

ESB ProfessionalShutterstock

What are R operators?

Operators are used in programming languages such as R to assign values, perform arithmetic calculations, and to check logical conditions. Here, we’ll take a closer look at what logical operators in R are. With code examples, you’ll get a clear idea of the different types of R…

Tutorials

REDPIXEL.PLShutterstock

How to create and use strings in R

Strings exist in almost every programming language. R has them as well. In this article, learn about how strings work in R and familiarise yourself with the most useful R string functions. Using simple examples, we’ll show you how to create and manipulate strings in R as well as…

Tutorials

How to use arrays in R

Most programming languages have arrays as a data structure. R is no different. The popular language also provides programmers with an efficient way to store data of the same data type in a single structure. In this article, we demonstrate how to create arrays in R and how to use…

Tutorials

kentohShutterstock

What is predict() in R?

The predict() function in R is a versatile tool that can be used on a variety of models, including linear models and decision trees. You can also customise predictions using different parameters. For example, you can set confidence intervals or add additional data to simulate…

Tutorials

whiteMoccashutterstock

What are R’s gsub() and sub() functions?

R’s gsub() and sub() functions can be used to find patterns or strings within larger strings and replace them with other expressions. The two methods are helpful for deleting unnecessary characters from large data sets or modifying expressions. In this article, we explain the…

Tutorials

ra2 studioShutterstock

What is the substring() function in R?

R’s substring() function is helpful for extracting substrings from larger strings. It can be used to restructure data and retrieve information. In this article, we explain how the function works, what its syntax looks like and how you can use it. Keep reading to find out how to…

Tutorials