Normal Distribution curve in R?

vishrut-singhal · 21 May 2021 16:05

R Normal Distribution

In random collections of data from independent sources, it is commonly seen that the distribution of data is normal. It means that if we plot a graph with the value of the variable in the horizontal axis and counting the values in the vertical axis, then we get a bell shape curve. The curve center represents the mean of the data set. In the graph, fifty percent of the value is located to the left of the mean. And the other fifty percent to the right of the graph. This is referred to as the normal distribution.

R allows us to generate normal distribution by providing the following functions:

R Normal Distribution

These function can have the following parameters:

S.No	Parameter	Description
1.	x	It is a vector of numbers.
2.	p	It is a vector of probabilities.
3.	n	It is a vector of observations.
4.	mean	It is the mean value of the sample data whose default value is zero.
5.	sd	It is the standard deviation whose default value is 1.

Let’s start understanding how these functions are used with the help of the examples.

dnorm():Density

The dnorm() function of R calculates the height of the probability distribution at each point for a given mean and standard deviation. The probability density of the normal distribution is:

R Normal Distribution

Example

# Creating a sequence of numbers between -1 and 20 incrementing by 0.2.  
x <- seq(-1, 20, by = .2)  
# Choosing the mean as 2.0 and standard deviation as 0.5.  
y <- dnorm(x, mean = 2.0, sd = 0.5)  
# Giving a name to the chart file.  
png(file = "dnorm.png")  
#Plotting the graph  
plot(x,y)  
# Saving the file.  
dev.off()

Output:

pnorm():Direct Look-Up

The dnorm() function is also known as “Cumulative Distribution Function”. This function calculates the probability of a normally distributed random numbers, which is less than the value of a given number. The cumulative distribution is as follows:

f(x)=P(X≤x)

Example

# Creating a sequence of numbers between -1 and 20 incrementing by 0.2.  
x <- seq(-1, 20, by = .1)  
# Choosing the mean as 2.0 and standard deviation as 0.5.  
y <- pnorm(x, mean = 2.0, sd = 0.5)  
# Giving a name to the chart file.  
png(file = "pnorm.png")  
#Plotting the graph  
plot(x,y)  
# Saving the file.  
dev.off()

Output:

qnorm():Inverse Look-Up

The qnorm() function takes the probability value as an input and calculates a number whose cumulative value matches with the probability value. The cumulative distribution function and the inverse cumulative distribution function are related by

p=f(x)
x=f-1 (p)

Example

# Creating a sequence of numbers between -1 and 20 incrementing by 0.2.  
x <- seq(0, 1, by = .01)  
# Choosing the mean as 2.0 and standard deviation as 0.5.  
y <- qnorm(x, mean = 2.0, sd = 0.5)  
# Giving a name to the chart file.  
png(file = "qnorm.png")  
#Plotting the graph  
plot(y,x)  
# Saving the file.  
dev.off()

Output:

rnorm():Random variates

The rnorm() function is used for generating normally distributed random numbers. This function generates random numbers by taking the sample size as an input. Let’s see an example in which we draw a histogram for showing the distribution of the generated numbers.

Example

# Creating a sequence of numbers between -1 and 20 incrementing by 0.2.  
x <- rnorm(1500, mean=80,  sd=15 )  
# Giving a name to the chart file.  
png(file = "rnorm.png")  
#Creating histogram  
hist(x,probability =TRUE,col="red",border="black")  
# Saving the file.  
dev.off()

Output: