A probability distribution explains how the values of a random variable are distributed based on their likelihood of occurring. It shows all possible outcomes of an event and assigns each a probability between 0 and 1.
In statistics, probability distributions are essential for analysing data, predicting outcomes, and understanding uncertainty. They form the foundation of many statistical techniques used in research, such as hypothesis testing, regression, and risk analysis.
Real-life examples of probability distributions include the number of heads in a series of coin tosses, the number of customer arrivals at a shop in an hour, and the spread of people's heights around an average.
A distribution represents how data points or outcomes are spread out, along with the likelihood of each outcome occurring. In a probability distribution, these likelihoods are expressed as probabilities that always sum up to 1.
The concept is closely tied to random variables, which are the numerical outcomes of random processes. For example, the number of times a coin lands on heads in 10 tosses is a random variable.
Probability distributions can be visualised using graphs, curves, or histograms, where the x-axis shows possible outcomes and the y-axis represents their corresponding probabilities.
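As a minimal sketch of such a visualisation (assuming a Python environment with NumPy, SciPy, and Matplotlib, none of which are required by the text above), the coin-toss example can be plotted as a bar chart with outcomes on the x-axis and probabilities on the y-axis:

```python
import numpy as np
import matplotlib.pyplot as plt
from scipy.stats import binom

# Number of heads in 10 fair coin tosses: a discrete random variable
n, p = 10, 0.5
outcomes = np.arange(0, n + 1)             # possible outcomes (x-axis)
probabilities = binom.pmf(outcomes, n, p)  # their probabilities (y-axis)

plt.bar(outcomes, probabilities)
plt.xlabel("Number of heads in 10 tosses")
plt.ylabel("Probability")
plt.title("Binomial(n=10, p=0.5) probability distribution")
plt.show()
```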
Every probability distribution is built upon key components that define its structure and interpretation:
| Component | Description |
|---|---|
| Random Variable | Represents the numerical outcome of a random experiment. It can be discrete (countable outcomes like dice rolls) or continuous (uncountable outcomes like height or weight). |
| Sample Space | The complete set of all possible outcomes. |
| Events | Specific subsets of the sample space that have a certain probability of occurring. For example, “rolling an even number” on a die. |
| Probability Values | Each outcome or event is assigned a probability between 0 and 1, where 0 means impossible and 1 means certain. The sum of all probabilities in the sample space must equal 1. |
| Expected Value and Variance | The expected value shows the average outcome over many trials, while variance measures the dispersion of the outcomes around that average. |
Probability distributions are broadly classified into two main types based on the nature of the random variable: discrete and continuous.
A discrete probability distribution deals with outcomes that can be counted and take on specific values. In this case, the probabilities are assigned to individual, distinct events. The sum of all possible probabilities equals 1.
Now, let us look at some examples of discrete probability distributions.
The Poisson distribution represents the number of events occurring within a fixed time or space interval (e.g., the number of customer arrivals in an hour).
The geometric distribution describes the number of trials required for the first success in a series of Bernoulli trials (e.g., rolling a six for the first time) and is often applied in reliability studies and risk analysis.
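As a brief sketch of these two discrete distributions (assuming Python with SciPy; the arrival rate of 5 per hour is an illustrative assumption, not a value from the text), the PMFs can be evaluated directly:

```python
from scipy.stats import poisson, geom

# Poisson: probability of exactly 3 customer arrivals in one hour,
# assuming an average of 5 arrivals per hour (illustrative value)
print(poisson.pmf(3, mu=5))   # ≈ 0.1404

# Geometric: probability that the first six appears on the 4th roll
# of a fair die (success probability 1/6 on each trial)
print(geom.pmf(4, p=1/6))     # ≈ 0.0965
```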
A continuous probability distribution is used when the random variable can take any value within a given range. Unlike discrete data, continuous data are not countable but measurable. The probabilities are represented by a probability density function (PDF) instead of a PMF.
The normal distribution is the most widely used distribution in statistics. Data are symmetrically distributed around the mean, with most values clustering near the centre. It is used in the natural and social sciences for variables like IQ scores, height, and measurement errors.
The exponential distribution models the time between occurrences of events in a Poisson process (e.g., the time between system failures).
In the uniform distribution, every value in the range is equally likely. It is applied in simulations, gaming, and random sampling methods.
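The sketch below (again assuming SciPy; the means and bounds are illustrative assumptions) shows how densities and probabilities for these three continuous distributions might be evaluated:

```python
from scipy.stats import norm, expon, uniform

# Normal: density of an IQ score of 115, assuming mean 100 and SD 15
print(norm.pdf(115, loc=100, scale=15))   # ≈ 0.0161

# Exponential: probability that the gap between system failures
# exceeds 2 hours, assuming a mean of 1 hour between failures
print(expon.sf(2, scale=1))               # 1 - CDF ≈ 0.1353

# Uniform on [0, 1]: every value has the same density
print(uniform.pdf(0.3, loc=0, scale=1))   # 1.0
```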
While both types of probability distributions describe how outcomes are spread, they differ in the type of data, formulas used, and applications.
| Aspect | Discrete Distribution | Continuous Distribution |
|---|---|---|
| Definition | Probability of distinct, countable outcomes | Probability of values within a continuous range |
| Random Variable Type | Discrete (countable) | Continuous (measurable) |
| Function Used | Probability Mass Function (PMF) | Probability Density Function (PDF) |
| Probability Assignment | Each specific value has a defined probability, $P(X=x)$ | Probability is measured over intervals, $P(a \le X \le b)$ |
| Example | Binomial, Poisson, Geometric | Normal, Exponential, Uniform |
In research, the choice between discrete and continuous distributions depends on the nature of the variable: countable outcomes (such as the number of successes or events) call for a discrete distribution, while measurable quantities (such as height, weight, or time) call for a continuous one.
To understand how probability distributions work mathematically, it is essential to know the core functions and formulas used to describe them.
The Probability Mass Function (PMF) is used for discrete probability distributions. It provides the probability that a discrete random variable takes on a specific value.
Formula: $P(X = x) = f(x)$
Where $X$ is the discrete random variable, $x$ is a specific value it can take, and $f(x)$ is the probability assigned to that value.
The PMF satisfies two important conditions: each probability is non-negative, $f(x) \ge 0$, and the probabilities of all possible values sum to 1, $\sum_x f(x) = 1$.
Example: In a binomial distribution with n = 3 and p = 0.5, the PMF gives the probability of getting 0, 1, 2, or 3 successes.
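A short sketch of this example (assuming SciPy is available) evaluates the binomial PMF at each possible number of successes:

```python
from scipy.stats import binom

# PMF of a Binomial(n=3, p=0.5) random variable
for k in range(4):
    print(k, binom.pmf(k, n=3, p=0.5))
# 0 -> 0.125, 1 -> 0.375, 2 -> 0.375, 3 -> 0.125; the values sum to 1
```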
The Probability Density Function (PDF) applies to continuous probability distributions. Instead of assigning a probability to individual values, it defines a curve where the area under the curve within an interval represents the probability.
Formula: $P(a \le X \le b) = \int_a^b f(x)\,dx$
Where $f(x)$ is the probability density function and $a$ and $b$ are the lower and upper bounds of the interval.
Example: For a normal distribution, the PDF produces the well-known bell-shaped curve, showing how data cluster around the mean.
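As a sketch (assuming SciPy), the probability over an interval equals the area under the PDF, which can be found either by numerical integration or by differencing the CDF; the standard normal interval from -1 to 1 is used here purely as an illustration:

```python
from scipy.integrate import quad
from scipy.stats import norm

# P(-1 <= X <= 1) for a standard normal variable (mean 0, SD 1)
area, _ = quad(norm.pdf, -1, 1)     # integrate the PDF over [-1, 1]
print(area)                         # ≈ 0.6827
print(norm.cdf(1) - norm.cdf(-1))   # the same probability via the CDF
```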
The Cumulative Distribution Function (CDF) gives the probability that a random variable takes a value less than or equal to a particular number. It applies to both discrete and continuous distributions.
Formula: $F(x) = P(X \le x)$
The CDF increases monotonically from 0 to 1 as x moves from the smallest to the largest possible value.
Example: In a uniform distribution between 0 and 1, F(0.4) = 0.4, meaning there is a 40% probability that X ≤ 0.4.
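A minimal sketch of this example (assuming SciPy), alongside a discrete counterpart, is shown below:

```python
from scipy.stats import uniform, binom

# Uniform(0, 1): F(0.4) = P(X <= 0.4)
print(uniform.cdf(0.4, loc=0, scale=1))   # 0.4

# The CDF also applies to discrete distributions, e.g. Binomial(3, 0.5):
print(binom.cdf(1, n=3, p=0.5))           # P(X <= 1) = 0.125 + 0.375 = 0.5
```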
The mean and variance summarise a probability distribution’s central tendency and spread.
The mean shows the long-run average outcome of a random variable.
Variance measures how much the outcomes deviate from the mean.
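As a sketch (assuming NumPy and SciPy; the Binomial(10, 0.5) case is chosen only for illustration), the mean and variance can be computed directly from the PMF using $E[X] = \sum_x x\,P(X = x)$ and $\mathrm{Var}(X) = E[(X - \mu)^2]$:

```python
import numpy as np
from scipy.stats import binom

n, p = 10, 0.5
x = np.arange(0, n + 1)
pmf = binom.pmf(x, n, p)

mean = np.sum(x * pmf)                    # E[X]: probability-weighted average
variance = np.sum((x - mean) ** 2 * pmf)  # Var(X): expected squared deviation
print(mean, variance)                     # 5.0, 2.5

# SciPy returns the same summaries directly
print(binom.mean(n, p), binom.var(n, p))
```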
Modern statistical tools like Microsoft Excel and IBM SPSS make it easy to calculate, visualise, and interpret probability distributions without complex manual formulas.
Excel provides built-in functions for different types of probability distributions. Here are some important functions.
NORM.DIST is used to calculate probabilities in the normal distribution. Setting cumulative = TRUE returns the cumulative probability, while setting it to FALSE returns the probability density.
BINOM.DIST calculates probabilities for the binomial distribution, such as the likelihood of a certain number of successes in a fixed number of trials.
POISSON.DIST computes probabilities for the Poisson distribution, useful for modelling rare events within a fixed time or space.
Probability Distribution Example:
If you want to find the probability of getting exactly 3 successes in 10 trials with a success probability of 0.5, the formula is:
=BINOM.DIST(3, 10, 0.5, FALSE)
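For readers working outside Excel, a rough Python equivalent of the same calculation (assuming SciPy, which is not part of the Excel workflow described here) would be:

```python
from scipy.stats import binom

# Same as =BINOM.DIST(3, 10, 0.5, FALSE): exactly 3 successes in 10 trials
print(binom.pmf(3, n=10, p=0.5))    # ≈ 0.1172

# =BINOM.DIST(3, 10, 0.5, TRUE) corresponds to the cumulative probability
print(binom.cdf(3, n=10, p=0.5))    # ≈ 0.1719
```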
SPSS provides a user-friendly interface for analysing probability distributions through its Descriptive Statistics and Graphs tools. Researchers can compute important statistics and visualise how data align with theoretical distributions.
A probability distribution shows how the possible outcomes of a random event are spread out and how likely each outcome is to occur. It assigns a probability (between 0 and 1) to every possible value of a random variable.
There are two main types: discrete and continuous probability distributions. Discrete distributions apply to countable outcomes, such as dice rolls, while continuous distributions apply to measurable outcomes, such as height or time.
The key difference is that discrete distributions handle countable outcomes and use the Probability Mass Function (PMF), whereas continuous distributions cover measurable data and use the Probability Density Function (PDF).
Probability distributions are used to model uncertainty, predict outcomes, and test hypotheses. They help researchers make informed decisions based on data patterns rather than assumptions.
The mean (expected value) indicates the long-term average outcome of a random variable, while the variance measures how much the data values spread from the mean. Together, they describe the shape and spread of the distribution.
You can use Excel functions such as NORM.DIST, BINOM.DIST, and POISSON.DIST to compute probabilities. These functions help perform quick statistical analysis and visualise data distributions using charts or histograms.
In SPSS, go to Analyse > Descriptive Statistics > Explore, then select Normality plots with tests. The software will produce histograms and Q-Q plots.
The normal distribution, also known as the bell curve, is essential because many natural and social phenomena follow it. It underpins key statistical methods such as hypothesis testing, regression, and confidence intervals.