What is true about the probability distribution function of a discrete random variable?

In this explainer, we will learn how to identify a discrete random variable and define its corresponding probability distribution.

In order to understand what a discrete random variable is, it is helpful to discuss what a random variable is first.

Definition: Random Variables

A random variable is a variable that can take on multiple different values (that are assigned at random), each with an associated probability.

As with probabilities of mutually exclusive events, the probabilities associated with all the values that the random variable can take must all add up to 1. Moreover, each probability must lie within the interval [0,1].

A random variable can be either discrete or continuous. For a discrete random variable, the values of the random value must be discrete. Typically, they will take integer values, but this is not necessarily the case.

Definition: Discrete Random Variable

A discrete random variable is a random variable that can take on multiple different discrete values, each with an associated probability.

One example of a discrete random variable are outcomes of rolling of a fair die. The values the discrete random variable can take are 1, 2, 3, 4, 5, and 6, with 16 being the associated probability for each value.

We can represent a discrete random variable using a probability distribution function. This is a function that maps the values of the discrete random variable to their associated probabilities.

Definition: Probability Distribution Function

A probability distribution function is a function that generates probabilities of value 𝑓(𝑥) given an outcome of value 𝑥 and must hold the following properties:

  • 𝑓 (𝑥)=1 for all values of 𝑥 in the domain of the probability distribution function,
  • each value of 𝑓(𝑥) must lie in the interval [0,1].

In the same way that we can represent inputs and outputs of a function in different ways, can we also represent a probability distribution function in different formats. In the first two examples, we will consider questions where the probability distribution function is represented in a table. The first demonstrates how to check if a function is a probability distribution function by using facts related to probability.

Example 1: Checking if a Function in a Table is a Probability Distribution Function

Can the function in the given table be a probability distribution function?

𝑥0 1 4 5
𝑓(𝑥)0.17 0.43 0.69 0.36

Answer

We know that for any probability distribution function,𝑓(𝑥)=1. Therefore, if the function in the table is a probability function, then all the probabilities in the table will sum to 1: 0.17+0.43+0.69+0.36=1.65.

Since this is not the case, as they summed to 1.65, then 𝑓(𝑥) is not a probability distribution function.

Note that even if all the probabilities summed to 1, we would also need to check that each probability was in the interval [0,1].

The second example also uses a table to represent a probability distribution function, but demonstrates how to find an unknown value in the table using facts associated with probability.

Example 2: Finding an Unknown Value of a Probability Distribution Function in a Table

The function in the given table is a probability distribution function of a discrete random variable 𝑋. Find the value of 𝑎.

𝑥1 2 3 4 5
𝑓(𝑥)15 110 310 110 𝑎

Answer

We know that for any probability distribution function, 𝑓(𝑥)=1, so we can use this in order to determine the unknown value of 𝑎: 15+110+310+1 10+𝑎=1710+𝑎=1𝑎=310.

Since the value of 𝑎 is in the interval [0,1], then we know that 310 is an appropriate value.

In addition to representing a probability distribution function in a table, we can represent each value and its associated probability using the notation 𝑃(𝑋=𝑥 )=𝑝, where 𝑥 represents the value of the discrete random variable 𝑋 and 𝑝 is the associated probability of that value. Using this notation, we can say that 𝑃(𝑋=𝑥)=1 and all values of 𝑝 must lie in the interval [0,1].

In the following example, we will consider how to find an unknown probability when the probability distribution function is presented in the format 𝑃(𝑋=𝑥)=𝑝.

Example 3: Finding an Unknown Value given Other Values of a Probability Distribution Function

Let 𝑋 denote a discrete random variable that can take the values 0, 1, 2, and 3. Given that 𝑃(𝑋=0)=19, 𝑃(𝑋=1)=49, 𝑃(𝑋=2)=𝑎, and 𝑃(𝑋=3)=3𝑎 , find the value of 𝑎.

Answer

As all probabilities in a probability distribution function add up to 1, we know that 𝑃(𝑋=0)+𝑃(𝑋=1 )+𝑃(𝑋=2)+𝑃(𝑋= 3)=1.

By substituting the associated probabilities, we get 19+49+ 𝑎+3𝑎=1.

Then, simplifying, rearranging, and solving for 𝑎 gives us 59+4𝑎=1 4𝑎=49𝑎=19.

As the value of 𝑎 is in the interval [0,1], then 19 is an appropriate answer.

As we saw in example 2, the probability distribution function can be represented as a function of 𝑥, where 𝑥 is the value the discrete random variable can take and 𝑓(𝑥) is its associated probability. We therefore know that 𝑓(𝑥)=1 for all values of 𝑥 the random variable can take. Further, each value of 𝑓( 𝑥) must lie in the interval [0,1].

The following example uses the properties of 𝑓(𝑥)=1 and all values of 𝑓(𝑥) lying in the interval [0,1] in order to determine whether a probability distribution function is valid or not for a given set of values of 𝑥 .

Example 4: Determining Which Probability Distribution Function Could Represent a Discrete Random Variable

Let 𝑋 denote a discrete random variable that can take the values 3, 5, and 6. Which of the following functions could represent the probability function of 𝑋?

  1. 𝑓(𝑥)=𝑥+310
  2. 𝑓(𝑥)=𝑥−28
  3. 𝑓(𝑥)=45𝑥+2
  4. 𝑓(𝑥)= 4𝑥+52

Answer

For a probability distribution function to be able to represent the discrete random variable 𝑋, which can take the values 3, 5, and 6, 𝑓(𝑥)=1 and each value of 𝑓(𝑥) must lie in the interval [0,1]. We will check each function in turn.

For 𝑓(𝑥)=𝑥+310, we will first calculate the associated probabilities for each value of the discrete random variable by substituting 3, 5, and 6 into 𝑓(𝑥 ).

When 𝑥=3, 𝑓(3)=3+310=1210.

As 1210 is outside of the interval [0,1], then we can already deduce that 𝑓(𝑥) =𝑥+310 cannot be a valid probability distribution function for the discrete random variable 𝑋.

Similarly, for 𝑓(𝑥)=𝑥−28, we will first calculate the associated probabilities for each value of the discrete random variable, 3, 5, and 6, by substituting into 𝑓(𝑥).

When 𝑥=3, 𝑓(3)=3−28=18.

When 𝑥=5 , 𝑓(5)=5−28=38.

When 𝑥=6, 𝑓(6)=6−28=48.

As each value lies in the interval [0,1], then so far 𝑓(𝑥)=𝑥−28 is a valid probability distribution function for the discrete random variable 𝑋.

We will now check if 𝑓(𝑥)=1: 𝑓(3)+𝑓(5)+𝑓(6)=18+38+48=1.

Since 𝑓(𝑥)=1, then 𝑓(𝑥)=𝑥−28 is a valid probability distribution function for the discrete random variable 𝑋.

Note:

We can check the other functions, but since we have already determined the correct answer, then this is not necessary.

In the next example, we can use the properties of 𝑓(𝑥)=1 and all values of 𝑓(𝑥) lying in the interval [0,1] to determine an unknown coefficient within the function of 𝑓(𝑥) given the values the random variable can take.

Example 5: Finding an Unknown Coefficient in a Probability Distribution Function given Values the Discrete Random Variable Can Take

Let 𝑋 denote a discrete random variable that can take the values 1, 2, 3, 4, 5, and 6. Given that 𝑋 has probability distribution function 𝑓(𝑥)=𝑎𝑥5, find the value of 𝑎.

Answer

For a probability distribution function to be able to represent the discrete random variable 𝑋, which can take the values 1, 2, 3, 4, 5, and 6, 𝑓(𝑥)=1 and each value of 𝑓(𝑥) must lie in the interval [0,1].

First, we need to substitute 1, 2, 3, 4, 5, and 6 into 𝑓(𝑥)=𝑎𝑥5.

For 𝑥=1, 𝑓(1)=𝑎×15=𝑎5.

For 𝑥=2, 𝑓(2)=𝑎×25=2𝑎5.

For 𝑥=3, 𝑓(3)=𝑎×35 =3𝑎5.

For 𝑥=4, 𝑓(4)=𝑎×45=4𝑎5.

For 𝑥=5, 𝑓(5)=𝑎×55=5𝑎5.

For 𝑥=6, 𝑓(6)=𝑎× 65=6𝑎5.

Next, we use the property 𝑓(𝑥)=1 to create an equation in terms of 𝑎: 𝑓(1)+𝑓(2)+𝑓(3)+𝑓(4)+𝑓(5)+𝑓(6)=1,𝑎5+2𝑎5+3𝑎5+4𝑎5+5𝑎5+6𝑎5=1.

Then, simplifying, rearranging, and solving for 𝑎 gives us 21𝑎5=1𝑎=521.

Although we have found the value of 𝑎, it is good practice to check that for each value of the random variable, the associated probability is in the interval [0,1]. We do this by substituting 𝑎=521 into 𝑓(1), 𝑓(2), and so on.

For 𝑥=1, 𝑓(1)=×15=121.

For 𝑥=2, 𝑓(2)=×25=221.

For 𝑥=3, 𝑓(3)=×35=321.

For 𝑥=4, 𝑓 (4)=×45=421.

For 𝑥=5, 𝑓(5)=×55=521.

For 𝑥=6, 𝑓 (6)=×65=621.

As each value of 𝑓(𝑥) for all values of the random variable lies in the interval [0,1], then we know that 𝑓(𝑥)=𝑎𝑥5, where 𝑎=521, is a valid probability distribution function.

So far, we have considered examples where no context is given. In the final part of the explainer, we will explore examples where a context is given.

In the following example, we will consider how to use information from a probability distribution in order to answer questions within a context.

Example 6: Solving Problems in Context about Discrete Random Variables given a Probability Distribution Function

Let 𝑋 be the random variable that represents the number of patients who visit a dental clinic per hour. The probability distribution of 𝑋 is shown in the table below.

𝑥10 11 12 13 14 15
𝑓(𝑥)0.2 0.1 0.15 0.05 0.3 0.2

Find the probability of the following.

  1. Exactly 13 patients visiting the clinic in a given hour
  2. At least 13 patients visiting the clinic in a given hour
  3. At most 13 patients visiting the clinic in a given hour

Answer

Part 1

To find the probability that exactly 13 patients visit the clinic in a given hour, we use the probability distribution table.

In this case, 𝑋 represents the number of patients who visit a dental clinic per hour, so 𝑥=13. Therefore, the probability of 𝑥=13 is given by the corresponding value for 𝑓(𝑥) in the table, which is 0.05.

Part 2

To find the probability that at least 13 patients visit the clinic in a given hour, we again use the probability distribution table.

In this case, 𝑥 can take multiple values, since we are finding the probability that at least 13 patients visited the clinic in a given hour. So, 𝑥 can be any integer greater than or equal to 13, or 𝑥≥ 13. As seen in the table, 𝑋 can only take integer values from 10 to 15, so 𝑥 can be 13, 14, or 15.

To find the probability of 𝑥 being 13, 14, or 15, we find the corresponding values for 𝑓(𝑥) in the table and sum these.

For 𝑥=13, 𝑓(13 )=0.05.

For 𝑥=14, 𝑓(14)= 0.3.

For 𝑥=15, 𝑓(15)=0.2 .

Therefore, 𝑓(13)+𝑓(14)+𝑓(15)=0.0 5+0.3+0.2=0.55 .

So, the probability that at least 13 patients visit the clinic in a given hour is 0.55.

Part 3

To find the probability that at most 13 patients visit the clinic in a given hour, we again use the probability distribution table.

As in the previous part, 𝑥 can take multiple values. Since we are finding the probability that at most 13 patients visited the clinic in a given hour, then 𝑥 can be any integer less than or equal to 13, or 𝑥≤13. As seen in the table, 𝑋 can only take integer values from 10 to 15, so 𝑥 can be 10, 11, 12, or 13.

To find the probability of 𝑥 being 10, 11, 12, or 13, we find the corresponding values for 𝑓( 𝑥) in the table and sum these.

For 𝑥=10, 𝑓(10)=0.2 .

For 𝑥=11, 𝑓(14)=0.1.

For 𝑥=12, 𝑓(15)=0.15.

For 𝑥=13, 𝑓(13)=0.05.

Therefore, 𝑓(10)+𝑓(11)+𝑓(12)+𝑓(13) =0.2+0.1+0.15+ 0.05=0.5.

So, the probability that at most 13 patients visit the clinic in a given hour is 0.5.

In the last example, we will consider which distribution is most appropriate for a given context.

Example 7: Choosing a Distribution of a Discrete Random Variable given a Context

In an experiment in which a fair coin is tossed five consecutive times, let 𝑋 be the discrete random variable expressing the number of heads minus the number of tails. Find the probability distribution of 𝑋.

  1. 𝑥−5 −3 −1 1 3 5
    𝑓(𝑥)132 532 1032 1032 532 132

  2. 𝑥−5 −3 −1 0 1 3 5
    𝑓(𝑥)132 532 932 232 932 532 132

  3. 𝑥−3 −1 0 1 3
    𝑓(𝑥)532 1032 232 1032 532

  4. 𝑥−4 −3 −1 1 3 4
    𝑓(𝑥)132 532 1032 1032 532 132

  5. 𝑥−5 −3 −1 1 3 5
    𝑓(𝑥)132 1032 532 532 1032 132

Answer

In order to determine the probability distribution function 𝑓(𝑥) for the discrete random variable 𝑋, we first determine all the possible values that 𝑋 can take, then find the probability, 𝑓(𝑥), for each value of 𝑋.

The questions states that a fair coin is tossed 5 times and 𝑋 is the discrete random variable that represents the number of heads minus the number of tails. Therefore, the possible values of 𝑋 are as follows:

  • 5 heads and 0 tails: 5−0=5,
  • 4 heads and 1 tails: 4−1=3,
  • 3 heads and 2 tails: 3−2=1,
  • 2 heads and 3 tails: 2−3=−1,
  • 1 heads and 4 tails: 1−4=−3,
  • 0 heads and 5 tails: 0−5=−5.

So, the values that 𝑋 can take are −5, −3, −1 , 1, 3, and 5.

Next, we will calculate the probability, 𝑓(𝑥), for each of the values 𝑋.

In order to calculate the probability, it is helpful to list the possible outcomes for each discrete random variable 𝑋 and then determine the probability once all the outcomes have been determined.

For 𝑥=5, there are 5 heads and 0 tails. HHHHH is the only outcome.

For 𝑥=3, there are 4 heads and 1 tails. HHHHT, HHHTH, HHTHH, HTHHH, and THHHH are the 5 outcomes.

For 𝑥=1, there are 3 heads and 2 tails. HHHTT, HHTHT, HTHHT, THHHT, HHTTH, HTHTH, THHTH, HTTHH, THTHH, and TTHHH are the 10 outcomes.

For 𝑥=−1, there are 2 heads and 3 tails. HHTTT, HTHTT, THHTT, HTTHT, THTHT, TTHHT, HTTTH, THTTH, TTHTH, and TTTHH are the 10 outcomes.

For 𝑥=−3, there are 1 heads and 4 tails. HTTTT, THTTT, TTHTT, TTTHT, and TTTTH are the 5 outcomes.

For 𝑥=−5, there are 0 heads and 5 tails. TTTTT is the only outcome.

By adding all of the outcomes together, we get 1+5+10+10+5+1=32, so there are 32 outcomes in total.

Therefore, to find the probability of each outcome, we take the number of outcomes for each discrete random variable and divide it by the total number of outcomes, as follows.

For 𝑥=5, 𝑓(5)=132.

For 𝑥=3, 𝑓(3)=532.

For 𝑥=1, 𝑓(1)=1032.

For 𝑥=−1, 𝑓(−1)=1032.

For 𝑥=−3, 𝑓(−3)=532.

For 𝑥=−5, 𝑓(−5)=132.

When displayed in a table, we have the following.

𝑥−5 −3 −1 1 3 5
𝑓(𝑥)132 532 1032 1032 532 132

In this explainer, we have learned about discrete random variables and their probability distribution functions. Let’s recap the key points.

Key Points

  • A discrete random variable is a variable that can take on a set of discrete values, each with an associated probability.
  • A probability distribution function maps each value of a discrete random variable to an associated probability.
  • The sum of the probabilities of a probability distribution function must equal 1, and each probability must lie in the interval [0,1].
  • Probability distribution functions can be presented in various ways, including:
    • in a table,
    • using the notation 𝑃(𝑋=𝑥)=𝑝, where 𝑥 represents the value of the discrete random variable 𝑋 and 𝑝 is the associated probability of that value,
    • using a function 𝑓(𝑥), where 𝑥 represents the value of the discrete random variable 𝑋 and 𝑓(𝑥) is its associated probability.

What is true about the probability distribution of a discrete random variable?

A discrete probability distribution lists each possible value that a random variable can take, along with its probability. It has the following properties: The probability of each value of the discrete random variable is between 0 and 1, so 0 ≤ P(x) ≤ 1. The sum of all the probabilities is 1, so ∑ P(x) = 1.

What is true about the probability distribution function pdf of a discrete random variable?

The characteristics of a probability distribution function (PDF) for a discrete random variable are as follows: Each probability is between zero and one, inclusive (inclusive means to include zero and one). The sum of the probabilities is one.

How to find probability distribution of function of a discrete random variable?

It is computed using the formula μ=Σx P(x). The variance σ2 and standard deviation σ of a discrete random variable X are numbers that indicate the variability of X over numerous trials of the experiment. They may be computed using the formula σ2=[Σx2 P(x) ]−μ2, taking the square root to obtain σ.

What is true about discrete random variable?

A discrete random variable is one which may take on only a countable number of distinct values such as 0,1,2,3,4,........ Discrete random variables are usually (but not necessarily) counts. If a random variable can take only a finite number of distinct values, then it must be discrete.