by "Nathaniel Osgood" » Sun Dec 07, 2003 6:05 pm
Hi Tuomo,
A few comments:
The issue you have raised is a common one, but doesnt receive as
much discussion as it deserves. In general, the key question when
substituting mean values for a random variable (or mean realizations for
variable driven by a stochastic process) in an SD model is whether the
system (and the model that represents it) exhibits nonlinearity with respect
to the value. If it does, using the expected value (E[.]) rather than
drawing Monte Carlo samples from the distribution will give misleading
results. The reason for this is that for a nonlinear function f and random
variable X, E[f(X)] can be quite far from f(E[X]).
In your case, we are dealing with the issue of a system that exhibits
(presumably -- see below) stochastic transitions. As such we are dealing
with the difference in model output between a stochastic process and a
deterministic process. System evolution will in general be nonlinear with
respect to transition parameters. How big a difference this is will depend
on several things, notably the presence of positive feedbacks in the broader
system that are sensitive directly or indirectly to fluctuations in the
transition rates and details of your hazard function. I have done
experiments some systems (comparing SD and agent-based models of infectious
disease spread) in which the system (or at least the models that represent
it) is HIGHLY sensitive to deviations. Without knowing further details
about your system, it is hard to assess the level of difference to be
expected between the mean of the stochastic process and the deterministic
process you have described. Nonetheless, a few basic calculations suggest
that the variations of the stochastic process from the deterministic
trajectory may be significant.
Lacking details on hazard(t), consider the simple case of a stochastic
process associated with a hazard function that is uniform (value = lambda)
over time. Within this process, failures observe a Poisson process, with
the number of failures in a given timestep of length dt distributed
according to a Bernoulli distribution (risk of per-component failure lambda
* dt, # of trials equal to # of operating components) and with the
likelihood of survival of a given component over time exponentially
distributed (likelihood of survival of a given component after an interval
of time t = Exp[-lambda * t]). If a group of components of size n start
operating at time 0, the number of components remaining in operation after
time t would be a random variable with mean Exp[-lamda*dt]*n, and the
VARIANCE in this quantity would be n*Exp[-lambda*t]*(1-Exp[-lamda*t]). As
t rises, this variance eventually reaches a value as large as n*.25. This
suggests that the point deviations of the stochastic process from the
deterministic process could be rather high.
>From your description, it was not clear whether nonlinearity that would
boost the effect of deviations in transitions would be present elsewhere in
the system, but it seems plausible that it would be. For example, in terms
of the dynamics, the length of time required before a given item awaiting
repair is fixed would seem likely to depend on the number of items requiring
repair -- but how significant this effect is will depend on the specifics of
the system. Nonlinearities in the dynamics of a system such as this can
lead to magnifications of stochastic fluctuations. You may also have
nonlinearity in relationships in other areas of the system -- for example, a
nonlinear rise in the cost over the span of a components "downtime". Using
average transition times rather than representing the distribution may lead
to misleading results here as well.
Thinking more broadly, you consider whether the the seemingly
stochastic variations failures and repair times simulated in the traditional
discrete event models you mrefernece might reflect in part a underlying
heterogeneity in the component population. If so, this could significantly
challenge the results of those models. In particular, if due to
heterogeneity a given component is likely to be associated with a persistent
repair profile over a period of time, both dealing with the average
trajectory AND treating the failures and repair times as stochastic are
likely to miss an important component of the system, and may give misleading
results in simulations.
Finally, from your description, I wasnt sure I was interpreting the
form of the hazard function properly. In your description, you mention that
the hazard rate is a "function of time" -- which would imply a
non-stationary process (with the hazard uniform across all working
components). Perhaps what you meant was that the hazard rate specific to a
component changes as a function of the time that particular component has
been in circulation? If the latter is correct, this raises the question of
how you are keeping track of the time a given component has been in
circulation -- through stocks disaggregated (or arrayed) by time, or another
mechanism. In the presence of a nonlinear hazard, you will want to take
some care in representing this within the model, but given the timing this
probably isnt a first-order consideration.
Thanks for bringing up this interesting topic. I hope these comments
are helpful.
Best,
n
-----------------------------------------------
Nathaniel Osgood, PhD
Research Associate (TDP), Senior Lecturer (CEE)
Massachusetts Institute of Technology
77 Massachusetts Avenue Rm 1-175
Cambridge, MA 02139
(617) 253-9725
nosgood@mit.edu