 
  
  
   
Unfortunately, all contingencies  are not   so simple.  For   example,
considering  the   scatter  plot corresponding to    the line tripping
contingency (loss  of Circuit 1 of the   double circuit line  shown in
Fig. 5), we observe that its variability is much
higher although  its mean severity (97MW)  is  similar to that  of the
preceding contingency.  In  particular, the operating  states shown on
the  lower part  of the scatter    plot of Fig.   6b
correspond to very large severities (e.g.   MW).
Thus, in the present case a less trivial model  is required to predict
the severity.
MW).
Thus, in the present case a less trivial model  is required to predict
the severity.
To illustrate the regression trees, we will describe in detail their application. However, for the sake of simplicity, and noting that the lost line is an important one allowing to import power to the load region from Plant 2, we will use only two candidate attributes : the total reactive reserve in Plant 2 and the logical status (in or out) of Circuit 2.
We  consider the 4610 relevant  states  (i.e.  where  Circuit  1 is in
operation) and put aside 1835 test states to assess the reliability of
the tree.  The remaining  2775 states are  used  as a learning set  to
build a regression tree, yielding the 3-level tree represented in Fig.
4.  Each node of the tree is represented by a box
containing a graphical representation of the distribution of values of
 in the learning set at this node, together with its
sample mean  value and standard  deviation,  and  the  number  of  its
learning states (e.g.
 in the learning set at this node, together with its
sample mean  value and standard  deviation,  and  the  number  of  its
learning states (e.g.  at the  top-node).
  at the  top-node).
The tree is built  in a top-down  fashion,  starting at the  top-node,
where the reactive reserve in Plant 2 is automatically selected as the
best test  attribute,      together with  its   threshold   value   of
191Mvar. This test is determined by the tree building  method so as to
provide a maximum amount of information of the contingency severity in
the learning set.  Once the  test has been  selected, the learning set
is split into two subsets, corresponding respectively to 1219 and 1556
states.   This reduces the variance from  at
the top-node to a mean value of
 at
the top-node to a mean value of
 at   its
successors.
  at   its
successors.
Proceeding at both successors, we  see that the selected test consists
of checking whether Circuit 2 is in operation or  not, which allows us
to further reduce significantly the  overall variance to a mean  value
of
 .  Thus,  the
regression tree explains
.  Thus,  the
regression tree explains  of
the variance of the severity.
 of
the variance of the severity.
Once the tree has been constructed, it may be used to estimate the contingency severity of an unknown state : to this end, we direct the state from the top-node to the appropriate successor according to its reactive reserve and further to a terminal node according to the status of Circuit 2. There, the severity is estimated by the mean severity of the corresponding learning states.
Let us discuss the physical interpretation of the tree.
  
Figure 7: Computed vs estimated post-contingency LPMs 
Of course, the same error distribution is also obtained if we subtract the estimated severity from the pre-contingency LPM so as to estimate the post-contingency LPM, according to eqn. (1). This is illustrated in Fig. 7a showing the scatter plot of the test states in terms of their ``actual'' post-contingency load power margin and the estimated value of the latter. We notice that the estimated and actual LPMs are highly correlated. The correlation coefficient of 0.9923 indicates that overall the estimate is able to ``explain'' 98.5% of the variance of the post-contingency margin. This suggests that a very simple model may indeed provide valuable information about complex quantities such as post-contingency margins. To further fix ideas, we indicate that when using the estimated margin to classify the test states with respect to a threshold of 300MW (considering as unsecure the states for which the post-contingency margin is smaller than this threshold) leads to a classification error rate of 2.56%.
Giving a closer look at Fig. 7a, we observe that where the ``actual'' margin values are zero (i.e. if the contingency leads to a mid-term voltage instability without load increase), the estimated margin values become negative. Such negative margin values could for example be interpreted as the amount of emergency load shedding required to prevent a voltage instability.
To illustrate the possibility of exploiting the regression models to asses the impact of multiple contingencies, we use the tree to estimate the severity of the loss of two circuits.
We start with the hypothesis that the circuits  are both in operation,
and apply the tree to determine the severity of  the first outage.  If
a new mid-term  steady state is reached before  the second  circuit is
lost, yielding in particular a change in the reactive reserve in Plant
2, we may apply the regression tree a second time in order to evaluate
the  impact  of the loss  of the  second circuit.   Notice that if the
reactive reserve  was initially smaller  than 191Mvar, we might assume
that  it will remain  smaller than this threshold after  the loss of a
circuit.   Thus,   in this particular   case  it is not   necessary to
recompute the steady  state reached after the  first outage to predict
the effect   of the second  one;  the severity  of  the double circuit
outage  is  then estimated to  MW.  Remarkably, this is
actually very close to  the true mean  severity (327MW) of the  double
circuit outage, as obtained in  the same  conditions by direct  margin
computation.
MW.  Remarkably, this is
actually very close to  the true mean  severity (327MW) of the  double
circuit outage, as obtained in  the same  conditions by direct  margin
computation.
 
 
  
 