A Q-Q plot, short for “quantile-quantile” plot, is often used to assess whether or not a set of data potentially came from some theoretical distribution. In most cases, this type of plot is used to determine whether or not a set of data follows a normal distribution.
This tutorial explains how to create a Q-Q plot for a set of data in Excel.
Example: Q-Q Plot in Excel
Perform the follow steps to create a Q-Q plot for a set of data.
Step 1: Enter and sort the data.
Enter the following data into one column:
Note that this data is already sorted from smallest to largest. If your data is not already sorted, go to the Data tab along the top ribbon in Excel, then go to the Sort & Filter group, then click the Sort A to Z icon.
Step 2: Find the rank of each data value.
Next, use the following formula to calculate the rank of the first value:
=RANK(A2, $A$2:$A$11, 1)
Copy this formula down to all of the other cells in the column:
Step 3: Find the percentile of each data value.
Next, use the following formula to calculate the percentile of the first value:
=(B2-0.5)/COUNT($B$2:$B$11)
Copy this formula down to all of the other cells in the column:
Step 4: Calculate the z-score for each data value.
Use the following formula to calculate the z-score for the first data value:
=NORM.S.INV(C2)
Copy this formula down to all of the other cells in the column:
Step 5: Create the Q-Q plot.
Copy the original data from column A into column E, then highlight the data in columns D and E.
Along the top ribbon, go to Insert. Within the Charts group, choose Insert Scatter (X, Y) and click the option that says Scatter. This will produce the follow Q-Q plot:
Click the plus sign on the top right-hand corner of the graph and check the box next to Trendline. This will add the following line to the chart:
Feel free to add labels for the title and axes of the graph to make it more aesthetically pleasing:
The way to interpret a Q-Q plot is simple: if the data values fall along a roughly straight line at a 45-degree angle, then the data is normally distributed. We can see in our Q-Q plot above that the data values tend to deviate from the 45-degree line quite a bit, especially on the tail ends, which could be an indication that the data set is not normally distributed.
Although a Q-Q plot isn’t a formal statistical test, it offers an easy way to visually check whether or not a data set is normally distributed.