Say, a person is a waiting for a cab by a street. We want to model the time she has to wait for the cab to arrive.
There are two cases to consider: (1) she has not called for a cab. (2) she called for a cab
In (1), there is no scheduled arrival of a cab. If a cab happen to go by her, she will take it. We can model this situation using an exponential random variable.
An exponential random variable is continuous random variable with probability density function . If the average time that the person has to wait is
minutes, we use
.
By the memoryless property of exponential distribution, we have . Giving
and
, we have
. This property might seem surprising. Say, the person begins to wait and determines the probabilities that he needs to wait for at least
minutes,
minutes and
minutes to be
,
and
respectively. Say, he waits and sees no cab for the first
minutes. The probability that he needs to wait for
additional minutes, namely
, is same as
. It might seem unfair that the probability that she sees no cab for
additional minutes is so high given that she already waited for
minutes. One might expect that the cab may come any minute then with a high probability. Can we explain this behavior?
The reason for this memoryless property is that in modeling the waiting time scenario by an exponential distribution, (like in the case of Poisson distribution) we divided time into tiny intervals and assumed that the intervals are independent (or weakly dependent) of each other. Cabs going in all directions, people getting on and off, traffic patterns etc., all cause the probability of a cab arriving in one interval unaffected by whether a cab had arrived or not in an earlier interval. Regardless of the fact that the person waited for minutes already, the outside traffic system has not changed, and thus we have the memoryless property.
Now, consider case (2) where the person called for a cab. Here, we assume there is a particular cab company that she called the cab from and there is a scheduled time of arrival. Although there is a scheduled time, the arrival time is still not deterministic and hence, can be modeled by a random variable. The question is if the random variable is exponential and in particular, if the waiting time still has the memoryless property? Say, the cab is scheduled to arrive minutes from when she called. And, she waits for
minutes and the cab hasn’t arrived still. Then, is the probability that the cab arrives in the next minute high, or is the probability, like in case (1), going to remain the same as what it was for the cab to arrive in the first minute when she started waiting? As can be expected, it is the former in this case. We do not have the memoryless property unlike case (1).
Where does the difference between (1) and (2) come from? In modelling case (2), if we divide the time into tiny intervals, we can see that the time intervals are not independent. For example, if the cab from the company arrived in a time interval, the probability that the cab from the company arrives in several following time intervals will diminish. Because of lack of independence between the time intervals, the scenario cannot be modeled as an exponential distribution, and hence, we do not have the memoryless property.