Normal Distribution ~ Definition & Formula

Normal Distribution – Definition & Formula

2022-09-12 Distributions Time to read: 5min

How do you like this article?

The normal distribution is a fundamental concept in statistics, defined by a symmetrical, bell-shaped curve that represents data clustering around a central mean. It describes numerous natural phenomena and underpins many statistical methodologies, making it indispensable for inferential statistics. This concept assists in understanding data variability, predicting outcomes, and testing hypotheses in diverse research fields.

Index

Inhaltsverzeichnis

1 Normal Distribution – In a Nutshell
2 Definition: Normal distribution
3 The matter of normal distribution
4 The properties of normal distribution
5 Normal distribution: Empirical rule
6 Normal distribution: Central limit theorem
7 Formula of the normal curve
8 Standard normal distribution
9 FAQs

Normal Distribution – In a Nutshell

Normal distributions are assumed in various fields of statistical research, including social and health sciences.
The mode, median, and mean are identical in a normal distribution.
The main parameters of a normal distribution are the mean and the standard deviation.

Definition: Normal distribution

A normal distribution (Gaussian distribution) refers to a symmetrical probability distribution near the mean. Most observations, group around the peak and the probabilities of other values in the set lessen gradually in both directions.

Normal-distribution - Definition - Bell-curve

Conduct a final format revision for a print of your thesis

Before submitting your thesis for print, check on your formatting with our 3D preview function for a final time. It provides an exact virtual visualization of what the printed version will resemble, making sure the physical version meets your expectations.

The matter of normal distribution

Most variables used in scientific research display normal or near-normal distribution. Such variables include height, income, weight, literacy levels, and exam test scores. Since most of these variables display a normal distribution, many scientists use tests designed for normal distributions.

Knowing the characteristics of this distribution enables students and researchers to make verifiable conclusions and predictions from data samples representing larger populations. Such samples may be selected randomly or picked using the most representative elements of the population in inferential statistics.

The properties of normal distribution

The standard deviation and the mean are the two main parameters in a normally distributed data set:

Mean – The mean is used as one of the main measures of central tendency in quantitative research. It is used in a distribution to define the peak, and most points tend to cluster around the mean.
Standard deviation – It measures the variation of the distribution’s data points from the mean. The standard deviation illustrates how spread the data points are from the mean and is calculated by determining the square root of the variance.

Normal distributions exhibit the following characteristics:

Symmetry – It assumes a symmetrical shape. This implies that the curve can be divided into two equal halves. The symmetrical shape of the distribution is because half of each observation falls on either side of the bell curve.
The mean, mode and median are equal – The mode refers to the most frequently occurring data point in a data distribution. The median is the value that separates the upper from the lower half in an ordered data set. These measures of distribution are equal.

Properties of normal distribution - Standard-normal-distribution

Normal distribution: Empirical rule

The empirical rule, also known as the three-sigma rule or the 68-95-99.7 rule, shows where most of the values autumn in a normal distribution:

Approximately 68% of the values autumn between the mean and 1 standard deviation.
Approximately 95% of the values lie between the mean and 2 standard deviations.
Around 99.7% of the values autumn between the mean and 3 standard deviations.

Example

You collect the ages of a group of students. The data set exhibits the properties of a normal distribution with a mean age of 15 and a standard deviation of 3. Using the empirical rule:

About 68% of the ages autumn between 12 and 18, which is 1 standard deviation over and under the mean.
About 95% of the ages lie between 9 and 21, which is 2 standard deviations over and under the mean.
About 99.7% of the ages autumn between 6 and 24, which is 3 standard deviations over and under the mean.

The empirical rule can be used as a measure of “normality”. If too many data points are outside the three boundaries, then a distribution may not be normal. It can also show the outliers in your data range, i.e. values that are too small or too large, which may affect the shape of the curve.

Normal distribution: Central limit theorem

The central limit theorem postulates that if you have sizeable samples from a given population, the means will be normally distributed even if the population is not necessarily normally distributed. The central limit theorem highlights the following:

The law of large numbers, which states, as the sample size grows larger, the sample mean moves towards the population mean.
For several large samples, the mean of the sampling distribution is distributed normally.

The central limit theorem asserts that the assumption of “normality” is unnecessary when conducting parametric tests if the researcher uses a sizeable sample. Parametric tests can be used in large samples of any distribution type as long as the groups have comparable variance and the data in the set is independent.

Formula of the normal curve

A probability density function is used to plot a normal curve after determining the mean and standard deviation. The area under the curve shows the probability, and the total area covered by the curve is equal to 100% or 1, as provided.

Normal probability density formula:

f(x) = 1 / (σ √(2π)) · e^{-(x – μ)² / (2 σ²)}

f(x) – Probability density at value x
x – Value of the variable
μ – Mean
σ – Standard deviation
σ² – Variance

Standard normal distribution

The standard normal distribution is a normal distribution with a mean of zero and a standard deviation of 1. The standard normal distribution is also known as the z- distribution, as its observations are denoted with z rather than x. Z-scores in a standard normal distribution show where each value falls away from the mean using the number of standard deviations.

Calculating probability in a z-distribution

Every z-score is assigned a probability (p-value) which shows the probability of the occurrence of some values falling below the z-score.

Print Your Thesis Now

BachelorPrint as an online printing service offers
numerous advantages for Canadian students:

✓ 3D live preview of your configuration
✓ Free express delivery for every order
✓ High-quality bindings with individual embossing

to printing services

Category

Normal Distribution – Definition & Formula

Normal Distribution – In a Nutshell

Definition: Normal distribution

The matter of normal distribution

The properties of normal distribution

Normal distribution: Empirical rule

Normal distribution: Central limit theorem

Formula of the normal curve

Standard normal distribution

Calculating probability in a z-distribution

FAQs

What defines normal distribution?

What are the properties of a normal distribution?

What is a z-distribution?

What are parametric tests?