Method of forecasting ambulance service demand6058370Abstract A method of forecasting the demand for ambulance services based upon a determination from the historical record of the number of currently active calls at the time each new call is received. The method may utilize records of specific calls history including the time each call was received and the time each call was completed, or it may use information on the number of calls received each hour to simulate specific calls history. The method may be extended to a "third dimension" to better account for the effect of service demands and staffing levels in preceding hours. The method has application to other services having the characteristics of limited available service resources, randomly occurring service requests but historically-repeating levels of demand, and relatively long service times. Claims What is claimed is: Description BACKGROUND OF THE INVENTION
TABLE 1
______________________________________
Calls received -- Monday, 2-3 p.m.
______________________________________
Week 1: 8 Week 11: 11
Week 2: 9 Week 12: 10
Week 3: 9 Week 13: 7
Week 4: 11 Week 14: 8
Week 5: 11 Week 15: 10
Week 6: 8 Week 16: 7
Week 7: 13 Week 17: 7
Week 8: 8 Week 18: 8
Week 9: 8 Week 19: 7
Week 10: 8 Week 20: 10
______________________________________
is collected for a statistically significant historical period, such as 20 to 28 weeks. The basic premise is that the number of units needed in a given hour can be estimated as the number of calls that is equaled or exceeded only a small percentage of the time. This is estimated as one of highest call volumes received in a given hour during some period of time. For example, based on twenty weeks of calls-received one might estimate that the needed number of equipment to be operating in a given hour (e.g. 2-3 p.m. on Mondays) is the second highest call volume received in that hour during a given time period (e.g. 20 weeks). In the example of Table 1, the highest value of 13 calls in week 7 would be discarded and the 2-3 p.m. Monday entry in the unit demand table would be set to the second highest value, 11. Different criteria could also be used, such as selecting the highest value, or an average of values over the study period (e.g. 20 weeks). Variations of this basic technique, or "modified" Stout 124, estimate the needed vehicles based upon the call volume of a given hour as well as that of the preceding hour (and sometimes the subsequent hour). Since the original Stout approach does not take into consideration how many units might already be busy during the early portion of that hour based upon calls that came in the previous hour, it may under-estimate or over-estimate true demand for that hour (e.g., if the "11" of the example is followed in the next hour by a "9", the first part of the "9" hour could potentially be understaffed due to the many units still out on calls). This effect of the previous hour's demand on the current hour is important and has generally been observed by planners. To compensate for such an effect, planners have averaged call rates in hours, added portions of previous call volumes to the current hour, etc. The idea is to compensate for the effect by adding in some of the demand from the previous hour, and subtracting some of the current demand, which affects the subsequent hour. Much of this is done based upon ad hoc rules, which appear to help improve the demand estimate. Numerous other refinements to the "classic" EMS allocation approach have been proposed; these can generally be characterized as other "spreadsheet" methods 126. In addition to utilizing empirical data directly in making such estimates, some have attempted to fit a regression model to the data and then use the model to make estimates. Trudeau, et al, give a good example of such an approach for the City of Montreal (Trudeau, P, Rosseau, J-M., Ferland, J. A., Choquette, J. [1988] "An operations research approach for the planning and operation of an ambulance service," INFOR, Vol.27(1), pp. 95-113). In that work, daily demand is estimated and then a model is used to estimate how the demand is distributed by each hour of the day. Their final estimate in terms of what is needed is still based, however, on the number of calls received per time period. The unit demand table that results from these methods consists of a single entry for each hour period, representing the number of units anticipated to be needed in that hour. A limitation of all methods which schedule resources on the basis of a unit demand table is that they provide little insight as to where additional resources should be placed for maximum benefit, or, conversely, if resources are limited, where cuts can be made with the least impact on the overall level of service. Such decisions are typically left to the "instincts" of the scheduler, which might be quite good, but which are likely to be overtaxed by larger EMS systems. The second major approach that has been used to estimate ambulance demand is the use of some form of queuing theory. Queuing theory deals with estimating the probabilities that waiting lines of specific lengths will happen, given assumptions about the distribution of calls for service and the distribution of service times. The earliest work that uses queuing theory applied to ambulance analysis is Bell and Allen (Bell, C. E., & Allen, D. [1969] "Optimal planning of an emergency ambulance service," Socio. -Econ. Plan. Sci, Vol. 3, pp. 95-101). Queuing theory has been a popular technique among academic researchers as the theory is associated with estimating precisely what planners need. The number of calls in a time period tends to follow a Poisson process with exponential inter-arrival times of calls. Also, it can be assumed that service times are either constant or exponentially distributed as well. Much of queuing theory is based upon such assumptions. Queuing theory can be used to help estimate average waiting time for service and the probability of various states occurring, i.e. the number of busy vehicles. The steady-state probability of n-users requiring service in a queuing system represents the probability of having exactly n users in the system at a given time. Thus, knowing the steady-state probabilities of a system, one will know exactly how many ambulances will be busy a certain percentage of the time. Given such steady-state probabilities, one can then estimate the needed number of servers to keep the probability of 1 or more waiting in a queue to be less than some acceptable level, alpha, where alpha is generally 5% or less. Such an estimate holds when a system is at steady-state. Unfortunately, an EMS system is not always at a steady state. The problem is further complicated by the fact that the service is delivered to users instead of the users coming to where the service is provided. To extend queuing theory to a spatial/geographical application in which servers travel to provide the service, Larson developed the hypercube queuing model (Larson, R. C. [1974] "A hypercube queuing model for facility location and redistricting in urban emergency services," Computers and Operations Research, Vol. 1(1), pp.-67-95). The hypercube queuing model attempts to analyze geographically distributed emergency servers and estimate the states of the system based upon various servers being busy. The hypercube model suffers from four principal problems. The first is that such estimates are still steady state probabilities, and the system may not be at steady state. The second is that the model grows in complexity as the number of servers increases, and as a result it is only possible to generate approximate solutions for realistically sized applications. The third problem is that the hypercube queuing model handles only one type of call and in practice there can be several call types. The last problem involves the assumption that vehicles always return to the same base, and this is often not the case. Simulation models, the third basic prior art approach, have also been used to estimate how many units are required to provide adequate service (Ignall, E. J., Kolesar, P., & Walker, W. E. [1978] "Using simulation to develop and validate analytic models: some case studies," Vol. 26(2), pp. 237-253). The fact remains that simulation models represent the best approach to capture many of the complexities of an EMS system in making an estimate of exactly what type of performance can be achieved for a given number of provided units. The idea is that all of the major components of the system response can be estimated and simulated so that the simulation model can produce a real estimate of exactly what would happen. The concept is to simulate a system operation by simulating calls and service. After simulating a sufficiently long period of time, then it is possible to estimate the levels of service provided. To do this in a manner which yields results which are useful to planners requires good data on practically every aspect of the system for a given application, including travel times between given points on the street network. Although such estimates are not impossible to make, much of the industry has not been in support of simulation because of three factors: (1) the costs to develop and validate a simulation model can be large, (2) the shortage of good data on specific aspects of the service process, and (3) the lack of a good travel time estimation model. A secondary problem is that simulation is a process which can only estimate the performance of a given crew allocation. To determine the best crew allocation may require many runs in order to single out the solution which provides the best balance between cost and service level. Unfortunately, simulation systems are not universally available for EMS planning. Further, significant data needs to be collected and prepared to perform a simulation analysis. Finally, simulation requires a sizable amount of computer time for running a reasonable set of allocation scenarios. The fourth basic prior art approach involves utilizing workload estimates. Such efforts are reflected in the work of ReVelle and Hogan (ReVelle, C. S., & Hogan, K. [1989] "The maximum availability location model," Transportation Science, Vol. 23(3), pp. 192); Baker et al (Baker, J. R., Clayton, E. R., Moore, L .J. [1989] "Redesign of primary response areas for county ambulance services," European Journal of Operations Research, Vol. 41, pp. 23-32; and Baker, J. R., Clayton, E. R., & Taylor, B. W. 111(1989) "A non-linear multi-criteria programming approach for determining county emergency medical service ambulance allocations," Journal of Operational Research Society, Vol. 40(5), pp. 423-432). The basic concept is that if service times take an average of y time, the number of calls expected equals c, and each unit is available to serve hours, then the number of units, x, needed to provide service can be calculated through the formula: ##EQU1## Where b is an upper limit on workload (e.g. 0.4) and x is the smallest value such that the expression is true. The idea is that workload of any given unit should not exceed a specified value. In the EMS industry an accepted workload value is 0.4, meaning that a typical unit will actively servicing a call 40% of the time. The needed number of units can be estimated on a global basis or sector by sector as well. Hogan and ReVelle devised a concept of busyness around a given demand area in the same fashion. The idea is that if the average workload of the units that can respond to a given demand is at a specific level b, then the number of needed units, x, can be calculated using the following formula: ##EQU2## where alpha is the performance level of the system and x is again the smallest integer number such that the expression is true. For this case c is the measure of demand of the surrounding area. While such estimates are easy to calculate, they are not necessarily accurate. Also such techniques are based upon a uniform distribution of calls and represents busyness for one time period alone. The effects of busyness in the previous time period on a subsequent time period are ignored. In view of the limitations of current methods of collecting and analyzing historical data for the purpose of predicting future demand for Emergency Medical Services, there is a need for methods which more accurately determine the level of demand for a given hour of the week, and which provide additional information allowing consideration of tradeoffs that may be made in the scheduling of resources. SUMMARY OF THE INVENTION In view of the shortcomings of current resource allocation and scheduling methods, the present invention is directed to methods of allocating and scheduling resources having the characteristics of limited available service resources, randomly occurring service requests but historically-repeating levels of demand, and relatively long service times, such as emergency medical service (EMS), with improved accuracy and improved ability to assess tradeoffs in scheduling decisions. The present invention provides an improved representation of true historical workloads through the use of an active calls table, based on a determination of the number of calls being serviced when a new request for service is received; The present invention, by utilizing information on active calls, more accurately reflects the effects of non-steady state demand than do methods based on calls received data; The present invention provides additional information for use is assessing resource allocation and scheduling tradeoffs, allowing for more informed decisions to be made, through the use of matrices reflecting percentages of resource availability and response at given demand levels, rather than the unit demand tables common in the prior art; The present invention further allows for the generation of an active calls table from data comprising records of calls per time period, allowing use of the method in situations where detailed specific calls history is not available; The present invention allows actual response time information to be incorporated into scheduling decisions through the use of augmented calls history information to generate an on-time response matrix; The present invention provides a variety of inputs to the resource allocation and scheduling process, allowing the process to be tailored to different scheduling situations and commercial needs; and The present invention avoids the costs and logistics problems of simulating emergency medical services systems by utilizing actual historical data about past calls. BRIEF DESCRIPTION OF THE DRAWINGS The invention is further described in connection with the accompanying drawings, in which: FIG. 1 is a block diagram of a common prior art method of forecasting service demand, based on the generation of a unit demand per time period table; FIG. 2 is a top-level block diagram of the method of the present invention; FIG. 3 is a more detailed block diagram illustrating the creation of an active calls table from either specific calls history or from calls per time period data; FIG. 4 is a graphical representation of the generation of an active calls table; FIG. 5 is an example active calls table, illustrating how active calls information from an historical period is combined to give a cumulative total of active calls and active calls percentages and cumulative percentages; FIG. 6 is a sample response availability profile; and FIG. 7 is a sample response availability profile "cube". DESCRIPTION OF THE PREFERRED EMBODIMENT As discussed above, the most common prior art emergency medical services resource allocation and scheduling methods utilize data accumulated over a statistically significant historical period, such as 20 or more weeks, to project future demand. Prior art methods most commonly utilize only data on the number of calls for service received in each hour. The present invention also utilizes data accumulated over a statistically significant historical period, but improves on the prior art estimates of demand by utilizing the concept of "active calls." "Active calls" represent the number of units currently servicing calls when a new call is received, if 9 units are currently busy and a new call is received, the number of active calls would become 10. The method of the present invention involves estimating demand based upon the distribution of active calls in each hour and not directly on the number of calls received. The total number of calls received represents one measure of work, but the number of actual active calls at any time tells more about the actual need for crews. As shown in FIG. 2, the present invention comprises active calls estimation and response availability profile generation 220, the essential feature of which is the generation and use of an active calls table 224. The active calls table 224 is then utilized in the generation of a table representation of demand ("response availability profile" 232) which in turn can be used as an input for resource allocation and scheduling 240. The active calls table 224 may be generated from specific calls history 214 contained in an historical database 210, or, if such information not available, specific calls history may be simulated 222 from a record of calls per time period 212. By simulating the number of calls that would have been active for different hypothetical staffing levels, based on the actual specific calls history 214 and historical profiles of response and service times 216, the method of the present invention may be extended to a "third dimension", resulting in a response availability "cube" 234 that provides greater insight into the effect of staffing levels during the preceding hours. This "cube" may also be used as an input for resource allocation and scheduling. Resource allocation and scheduling 240 can include many common scheduling techniques, such as ad hoc rules handed dozen from schedulers as a corporate approach, and heuristic and optimization modelling; often a "recipe" of methods will have been developed by a service provider as a trade secret. Number of Active Calls Data FIG. 3 more graphically illustrates the generation of active calls data 320. Data from a specific calls history database 312, which includes information about the time each call was received and the time at which each call was completed, is used to determine the number of active calls at each point in time. Alternatively, if no specific calls history is available, calls per time period data 314 may be used to estimate the beginning and end time for each call 316. The number of active calls at a particular time is the number of units previously dispatched which have not completed their calls. Thus, if at the beginning of an hour 9 units are currently in the field servicing calls, the measure of active calls would be 9; if a new call were received, the number of active calls would increase to 10. FIG. 4, with Table 2 below, illustrates the generation of active calls data for a sample set of data for the hour from 2 p.m. to 3 p.m. on Monday. At 2 p.m. [141] as an initial condition for the hour, previous calls history would indicate that 9 active calls were being serviced (9 units had been dispatched which have not yet completed their calls). At 2:02 p.m. [1412], unit 6 completes it's call, and the number of active calls drops to 8. At 2:03 [414] a call is received and unit 8 is dispatched, representing the 9th active call. At 2:09, 2:12, and 2:16 [416, 418, 420] units complete calls. At 2:20 [422], a call is received and unit 11 is dispatched, representing the 7th active call. At 2:30 [424] a call is completed, at 2:32, two units are dispatched [426, 428], representing the 7th and 8th active calls. At 2:33 a unit completes a call [430] and a new call is received [432], the 8th active call. at 2:40 [434] a 9th active call is dispatched, at 2:42, 2:47, and 2:53 calls are completed [436, 438, 440]. Finally, at 2:54 and 2:55 calls are dispatched [442, 444] as the 7th and 8th active calls, so that at the conclusion of the hour there are 8 units in the field on active calls [446].
TABLE 2
______________________________________
Time Activity Unit #
______________________________________
2:02 completed call unit 6
2:03 call rcv'd - unit dispatched
unit 8
2:09 completed call unit 5
2:12 completed call unit 3
2:16 completed call unit 11
2:20 call rcv'd - unit dispatched
unit 11
2:30 completed call unit 8
2:32 call rcv'd - unit dispatched
unit 6
2:32 call rcv'd - unit dispatched
unit 5
2:33 completed call unit 2
2:33 call rcv'd - unit dispatched
unit 8
2:40 call rcv'd - unit dispatched
unit 2
2:42 completed call unit 4
2:47 completed call unit 11
2:53 completed call unit 1
2:54 call rcv'd - unit dispatched
unit 9
2:55 call rcv'd - unit dispatched
unit 3
______________________________________
Specific Calls History Simulation As shown in FIG. 2, if specific calls history data is unavailable or incomplete, specific calls history can be simulated 222 from calls per time period information 212, and used to generate an active calls table as described above. To simulate specific calls history, either historical information is utilized to determine the distribution of call service times or a reasonable estimate is made regarding the distribution. The calls received in a time period are assigned randomly-distributed times of arrival, and a service time is assigned to each call based on the assumed distribution of service times. The call times and service completion times are then used to generate an active calls table 224, as described above. Active Calls Table FIG. 5 illustrates how accumulated active calls data is used to compute cumulative active calls information for the historical time period, as well as active call percentages and cumulative percentages. As can be seen in FIG. 4, twice during the sample hour of 2 p.m. to 3 p.m. a unit was dispatched as the ninth active call [412 and 434]; therefore the a "2" is entered into the "9th unit" slot of week "1" of the active calls tables [512]. Likewise, three times in the hour a unit was dispatched as the 8th active call [428, 432, and 444]; a "3" is entered in to the "8th unit" slot [514]. Two units were dispatched as the 7th active call [422, 426, and 442]; a "3" is therefore entered into the "7th unit" slot [515]. Data for each week of the historical period is likewise entered into the active calls table. A sum is then computed for each active unit for the historical period [520]; in this sample, there were four calls that were the 6 active call during the historic period; 48 that were the 7th active call; 73 that were the 8th active call; 40 that were the 9th active call; and 3 that were the 10th active call. The total number of calls is also computed for the historical period [530, 532]. For each level of unit activity, the number of active calls is divided by the total number of calls to give a percentage [522]. Stated mathematically: ##EQU3## Where: N.sub.ik =Number of times in all hours i over x weeks that busy calls jumped from k to k+1 And: N.sub.iT =Total number of calls in all hours i over x weeks A cumulative percentage 524 is also computed, which is the sum of the percentage as computed above for the given level of unit activity, and all the percentages for all fewer numbers of active units: ##EQU4## Response Availability Profile FIG. 6 illustrates how the cumulative percentages for each time period are assembled into a response availability profile. The active calls table procedure described above is repeated for each time interval; the cumulative percentages are then assembled into a response availability profile that may be utilized for resource allocation and scheduling. The example in FIG. 6 shows a portion of a sample profile representing Monday. The primary advantage of the response availability profile over prior art methods when used for resource allocation and scheduling is that the response availability profile provides much improved knowledge of the effects of scheduling different numbers of units in a given hour. Extension of the Method of the Present Invention to a Third Dimension The effects of activity in the previous hour may impact the availability of resources in the current hour. For example, there may be uncompleted calls at the end of an hour, or a backlog of calls which have not been responded to, which result in units being unavailable at the beginning of an hour. Since the staffing level in the previous hour influences resource allocation and scheduling decisions for the current hour, it is desirable to create an availability profile which reflects the influence of staffing during the previous hour. The response availability profile as generated from the specific calls history, as described above, reflects the demand for resources with the actual previous hour's staffing level that existed when the data was accumulated. To provide additional information for resource allocation and scheduling, the method of the present invention may be extended to a "third dimension", where the third dimension represents different levels of staffing in the preceding hour (see FIG. 7). Referring again to FIG. 2, creating a response availability "cube" is done by taking the actual call-received times from the specific calls history 214, and simulating an active calls table for various previous-hour staffing levels. Historical information 216 concerning out-of-chute times, travel times, and service times at different times of the day and day of the week are used in the simulations. When, in the simulation, a unit is not available to be sent on a call, the call is placed in a queue until a vehicle becomes available. The simulation is run for a range of possible previous-hour staffing levels. Each level results in a simulated response availability profile; the profiles are then grouped as a "cube". A portion of such a "cube" is shown in FIG. 7. Assuming that 5 units is an appropriate staffing level, a "plane" through the cube at the "5" level will resemble the response availability profile of FIG. 6. The response availability cube can then be utilized by resource allocation and scheduling software to better address the effects of fluctuating hourly demand and staffing levels. Other Applications The above is a detailed description of particular embodiments of the invention. It is recognized that departures from the disclosed embodiments may be within the scope of this invention and that obvious modifications will occur to a person skilled in the art. This specification should not be construed to unduly narrow the full scope of protection to which the invention is entitled. While the present invention is discussed with respect to the scheduling of ambulance services, the techniques are also applicable to other scheduling tasks where the service time is lengthy with respect to the measurement time period, such as scheduling nurses, police, and casino employees.
|
Same subclass Same class Consider this |
||||||||||
