Conditional lifetime model (CLM) next up previous
Next: Using additional information about Up: Predicting queue times on Previous: Related work

Conditional lifetime model (CLM)

 

During 1995, 24906 jobs were submitted to the batch partition of the 400-node Intel Paragon at SDSC. Figure 1 shows the distribution of lifetimes for these jobs.

Except for the longest and shortest jobs, this distribution is nearly straight (on a logarithmic x-axis); in other words, the logs of the lifetimes are distributed uniformly. Although we do not know any theoretical reason why lifetimes should have this uniform-log distribution, we will take advantage of this observation by fitting a straight-line approximation to the observed distribution. Using this approximation (instead of the observed values) simplifies the calculation of the conditional distribution of lifetimes (Equation 3).

  figure56
Figure 1: The distribution of lifetimes for 24906 batch jobs submitted to the Intel Paragon at SDSC. The gray line shows the least-squares fit for the observed data (intercept tex2html_wrap_inline592 = -0.18, slope tex2html_wrap_inline594 = 0.10).

We calculated a straight-line approximation by least-squares regression (omitting the longest 10% and the shortest 10% of jobs). The tex2html_wrap_inline596 value of the fit is over 0.99, indicating that the linear approximation is very good. Thus our model of this distribution of lifetimes is


equation62

where tex2html_wrap_inline600 is the cumulative distibution function of L, a random variable representing the lifetime of a job, and tex2html_wrap_inline592 and tex2html_wrap_inline594 are the parameters of the model (intercept and slope). Also, t is bounded by tex2html_wrap_inline610 and tex2html_wrap_inline612. For the estimated parameters from Figure 1, tex2html_wrap_inline614 seconds and tex2html_wrap_inline616 hours.

For this model we can calculate the distribution of lifetimes conditioned on the current age of a job:


 eqnarray72

To find the median lifetime of a job given its current age, we set the conditional cdf to 0.5 and solve for t:


eqnarray81

Substituting tex2html_wrap_inline620, we express the median lifetime as tex2html_wrap_inline622. Thus, this distribution does not obey the past-repeats heuristic; rather, the median remaining lifetime increases slowly with age, until tex2html_wrap_inline624, after which the median remaining lifetime drops quickly.

  figure92
Figure 2: The median remaining lifetime of a job as a function of its current age. The gray line shows the values predicted by our model.

Figure 2 shows the median remaining lifetime as calculated according to the model and according to the observed data. The proposed model fits the data well for ages less than 6 - 12 hours. For longer lifetimes, the distribution of lifetimes diverges from the model. The remaining lifetimes of old jobs do not decline, as predicted, but appear to increase. It is difficult to generalize this behavior, because there are few jobs with lifetimes greater than 12 hoursgif and their distribution does not fit our model.

The average remaining lifetime of a job, tex2html_wrap_inline628, behaves similarly; it increases slowly with age and then drops sharply as the age approaches the limit. Since our model is not accurate for jobs with lifetimes greater than 12 hours, and since the distribution of these jobs is difficult to characterize, we will need to be careful about how we apply our model to old jobs.




next up previous
Next: Using additional information about Up: Predicting queue times on Previous: Related work

Allen Downey
Fri May 30 15:09:42 PDT 1997