A computer system uses passwords that are exactly six characters and each character is one of the 26 letters (a–z) or 10 integers (0–9). Suppose that 10,000 users of the system have unique passwords. A hacker randomly selects (with replace- ment) one billion passwords from the potential set, and a match to a user’s password is called a hit. (a) What is the distribution of the number of hits? (b) What is the probability of no hits? (c) What are the mean and variance of the number of hits?

Respuesta :

Answer:

The number of hits would follow a binomial distribution with [tex]n =10,\!000[/tex] and [tex]p \approx 4.59 \times 10^{-6}[/tex].

The probability of finding [tex]0[/tex] hits is approximately [tex]0.955[/tex] (or equivalently, approximately [tex]95.5\%[/tex].)

The mean of the number of hits is approximately [tex]0.0459[/tex]. The variance of the number of hits is approximately [tex]0.0459\![/tex] (not the same number as the mean.)

Step-by-step explanation:

There are [tex](26 + 10)^{6} \approx 2.18 \times 10^{9}[/tex] possible passwords in this set. (Approximately two billion possible passwords.)

Each one of the [tex]10^{9}[/tex] randomly-selected passwords would have an approximately [tex]\displaystyle \frac{10,\!000}{2.18 \times 10^{9}}[/tex] chance of matching one of the users' password.

Denote that probability as [tex]p[/tex]:

[tex]p := \displaystyle \frac{10,\!000}{2.18 \times 10^{9}} \approx 4.59 \times 10^{-6}[/tex].

For any one of the [tex]10^{9}[/tex] randomly-selected passwords, let [tex]1[/tex] denote a hit and [tex]0[/tex] denote no hits. Using that notation, whether a selected password hits would follow a bernoulli distribution with [tex]p \approx 4.59 \times 10^{-6}[/tex] as the likelihood of success.

Sum these [tex]0[/tex]'s and [tex]1[/tex]'s over the set of the [tex]10^{9}[/tex] randomly-selected passwords, and the result would represent the total number of hits.

Assume that these [tex]10^{9}[/tex] randomly-selected passwords are sampled independently with repetition. Whether each selected password hits would be independent from one another.

Hence, the total number of hits would follow a binomial distribution with [tex]n = 10^{9}[/tex] trials (a billion trials) and [tex]p \approx 4.59 \times 10^{-6}[/tex] as the chance of success on any given trial.

The probability of getting no hit would be:

[tex](1 - p)^{n} \approx 7 \times 10^{-1996} \approx 0[/tex].

(Since [tex](1 - p)[/tex] is between [tex]0[/tex] and [tex]1[/tex], the value of [tex](1 - p)^{n}[/tex] would approach [tex]0\![/tex] as the value of [tex]n[/tex] approaches infinity.)

The mean of this binomial distribution would be:[tex]n\cdot p \approx (10^{9}) \times (4.59 \times 10^{-6}) \approx 0.0459[/tex].

The variance of this binomial distribution would be:

[tex]\begin{aligned}& n \cdot p \cdot (1 - p)\\ & \approx(10^{9}) \times (4.59 \times 10^{-6}) \times (1- 4.59 \times 10^{-6})\\ &\approx 4.59 \times 10^{-6}\end{aligned}[/tex].

ACCESS MORE
EDU ACCESS