• 대한전기학회
Mobile QR Code QR CODE : The Transactions of the Korean Institute of Electrical Engineers
  • COPE
  • kcse
  • 한국과학기술단체총연합회
  • 한국학술지인용색인
  • Scopus
  • crossref
  • orcid

  1. (Dept. of Nuclear Power Plant Engineering, Kepco International Nuclear Graduate School(KINGS), Korea.)



Cascading Failure, Hanbit NPP, IEC 61850, Protection power system

1. 서 론

Cascading failure is the major cause of large blackouts for all kinds of generating stations, and it always occurs with initiating events [1]. Hidden failures occur in the electric power system on all equipment. This kind of failures in protection systems causes about 75% of the electrical protection power system blackout events [2]. The major failures in circuit breakers (CB) are due to faults in control signal and failure of insulation. This kind of failures occur without any indication and are difficult to be identified [3]. In this regard, this paper introduces a case study of CB operation failure in Hanbit nuclear power plant (NPP), this malfunction has been expanded to the sequential cascading failure and loss of offsite power due to short circuit (SC) rather than that against other abnormal operating conditions. Two protection systems are common practice: primary protection and backup protection. The first line of defense is secondary protection. The process of primary SC defense should be as quick as possible, it is preferred to be instantaneous, for power system stability reasons, and for equipment damage prevention. Via comprehensive literature review about cascading failure in Hanbit NPP unit 2, an appropriate prevention scheme was proposed in this study as follows; Section 2 introduces cascading failure study case in Hanbit nuclear power plant. Cascading failure prevention techniques are introduced with the solution methods in Section 3, Finally, Section 4 and 5 addressed the results, discussions and conclusions.

2. Challenges of Cascading Failure in Nuclear Power Plants

On 6th of March, 2018 during normal operation of Hanbit unit (2) the emergency diesel generator (EDG) was started automatically by the low voltage signal generated from the 4.16 kV medium voltage safety bus. As shown in Figure.1 the non-safety bus (NB-S02) is supplied with power from the unit auxiliary transformer (UAT#1) and from the start-up transformer (SUT#2) as alternative power source. In the process of replacing the central chiller pump, which is the load of the non-safety bus, the stop signal of the local control panel was initiated to open the central chiller CB so that the operator judged that the central chiller was stopped. In reality, the central chiller circuit breaker was not opened and the central chiller motor pump continued operation, only the central chiller lubrication pump was stopped. As the central chiller pump continued operation with the lubrication function stopped, the bearing temperature rose up, resulting in motor winding overheating (phase B) and ground fault (A, C phases).

Fig. 1. Schematic diagram of event propagation

../../Resources/kiee/KIEE.2020.69.1.203/fig1.png

2.1 Sequential Incident Propagation Process

In the process of replacement of the central chiller (GB-Z050), one breaker (NB- S02 -18) has been expanded to the loss of offsite power (LOOP). As shown in figures (1 and 2) ⓵ The CB (NB-S02-18) did not open due to insufficient lubrication on the CB’s open latch contacts and the central chiller pump was still running. Only, the central chiller lubrication oil pump stopped caused the bearing temperature of the chiller to gradually rise (the bearing high temperature alarm was activated) as, the bearing temperature was highly increased resulting in motor winding overheating. In this state, the central chiller instantaneous ground fault relay (50GS, setting 30A, and operating time 0.1 seconds) pick-up the fault and activated the CB (NB-S02-18) but it wasn’t operated. ⓶ The fault current generated due to motor winding overheating caused the upstream ground fault overcurrent relay (51N residual type) to trip the CB (NB-S02-01) after specified intentional time delay. ⓷ At that time, the operator thought that the cause of the failure has been cleared and tried switching to the alternative power source to restore the voltage loss by closing the CB (NB-S02-03) to continue supplying power from the start-up transformer (SUT#2). Due to the ground fault occurred in (A, C phases) caused by motor winding overheating the ground overcurrent relay (51N, residual type, 120A, operating time 3 seconds) pick-up the fault and the CB (NB-S02-03) was tripped, the operator attempted twice manually reclosing the CB (NB-S02-03) to recover the low voltage of the 4.16 kV medium voltage non-safety bus without causes of fault being resolved. ⓸ In the second attempt of reclosing the CB (NB-S02-03) the ground fault occurred under the same conditions as in the first reclosing of the CB but, the CB (NB-S02-03) wasn’t open due to current transformer (CT) saturation. At that time, the neutral overcurrent protection relay of the start-up transformer (51NB, 200A, set value 0.59 seconds of operation time) pick-up the fault and the switchyard breakers (7F00) and (7F71) were opened, also the safety bus (PB-S02) lost power. ⓹ Subsequently, the under voltage relay (27) of the (PB-S02) safety bus breaker checked the voltage loss and the emergency diesel generator (EDG) 'B' was automatically started to supply power to the safety buses. So, the malfunction of the central chiller (NB-S02-03) CB was extended in a chain leading to loss of offsite power (LOOP) event.

Fig. 2. Sequential incident propagation diagram.

../../Resources/kiee/KIEE.2020.69.1.203/fig2.png

2.2 Incident Main Causes

2.2.1 Causes of Unopened Chiller Circuit Breaker

The medium voltage 4.16 kV non-safety bus experienced low voltage due to the failure of shutting down the central chiller pump during replacement operation, it is found that the opening coil of the CB (NB-S02-18) was failed to rotate the opening latch due to stuck of open latch roller. The visual inspection of open latch rollers and open latch contacts figures out that the surface was not clean also insufficient lubrication of contact surfaces lead to a failure of the CB opening. Based on the report result analysis of the Korean Institute of Nuclear Safety (KINS) together with the manufacturer of the Breaker (GE), After checking the mechanical deformation of the opening mechanism and the internal control circuit, the investigation team declared that the insufficient lubrication for the breaker drive mechanism and the open rod and the open latch roller contact area were poorly lubricated, which resulted in the failure of the CB open signal.

2.2.2 Causes of Switchyard Circuit Breaker opening

The operator of the nuclear power plant attempted twice manually switching to the alternate power source by closing the CB (NB-S02-03) SUT side to recover the loss of voltage produced on the medium voltage 4.16 kV non-safety bus (NB-S02) without causes of the fault being resolved on the non-safety bus (NB-S02). During the first manual reclosing of the CB (NB-S02-03) SUT side, the breaker operates normally and closed to restore the loss of power in the non-safety bus (NB-S02-03) but as the causes of ground fault was not resolved, after about 4 seconds of manual reclosing, the ground overcurrent relay (51N, 120A, operating time 3 seconds) is activated and the CB (NB-S02-03) SUT side automatically opened. When the CB opened after the first manual reclosing, the operator tried the second manual reclosing of the same CB, with the same ground fault overcurrent at the 4.16 kV non-safety bus (NB-S02) but at that time the CB wasn’t open and the ground fault relay (51NB) of the start-up transformer (SUT#2) was operated and the switchyard breakers (7F00) and (7F71) were opened. According to the results of the Korean Institute of Nuclear Safety (KINS) investigation report, (see figures 3 and 4) the waveform of the fault current generated at the time of the first manual closing of the CB (NB-S02-03) is recorded as sine-wave and did not show any saturation form, but during the second reclosing of the CB (NB-S02-03) the fault current waveform was partially distorted and decreased due to the fault current generated during the first manual reclosing of the CB indicating the occurrence of CT saturation connected to the ground fault relay (51N).

Fig. 3. Fault current waveform at first closing

../../Resources/kiee/KIEE.2020.69.1.203/fig3.png

Fig. 4. Fault current at second reclosing

../../Resources/kiee/KIEE.2020.69.1.203/fig4.png

3. Prevention of Cascading Failure

The main challenge in the existing protection system is the long coordination time interval between the downstream and upstream relay and the lack of communications between the protective devices that results in cascading failure. the existing protection and control system was reviewed and upgraded to Make sure the interlocking between the chiller and lubrication pump motor. In addition the following countermeasures also can be considered to improve design vulnerability of the existing protection system;

⓵ Quick detection of the temperature rise before insulation failure using IEC 61850 with smart sensors.

⓶ Logical Interlocking by implementation of IEC61850 substation automation protocol.

⓷ Apply breaker failure protection scheme using digital relays based IEC61850.

Recently constructed NPPs applied digital relays and existing NPPs also getting retrofit the analog relay with digital relays so, peer to peer communication between relays can be applied without additional equipment installations and make it possible to apply above mentioned solutions.

3.1 Existing Electrical Protection and Control System

The current protection coordination for the central chiller motor pump is achieved using time over current relay (51) for incoming feeders, and instantaneous time overcurrent relay (50) and time overcurrent relay (51) for branch feeders. The coordination time interval (CTI) between the downstream and the upstream protection devices has to be considered and followed According to IEEE, the CTI between downstream and upstream protection devices, should be about 200~250 ms. This CTI cause late trip of the upstream circuit breaker when the circuit breaker of downstream relay fails to trip[4][5]. The existing fault clearing time by the circuit breaker of the upstream feeder relay in case of failure of the CB of the downstream relay can be calculated as shown in Table 1;

Table 1. Fault clearing time by circuit breaker.

Action

Clearing time

Downstream relay trip time

40 ms

Lockout relay operation

8 ms

Five cycle CB

83 ms

CTI between downstream and

upstream relay

250 ms

Total time

381 ms

As shown in Fig. 5 when a fault occurs in the motor feeder circuit it would take 131 ms for the CB of the downstream relay (50) up to 381 ms of the CB of the upstream relay to pick up and clear the fault, that means in case of downstream CB failure to trip, the fault occurred in the motor continued about 381 ms until the upstream CB pickup and remove the fault, resulted in release of substantial thermal energy during short circuit fault causing thermal overheating of the motor stator windings.

Fig. 5. Fault clearing time curve

../../Resources/kiee/KIEE.2020.69.1.203/fig5.png

The existing relay coordination curve between downstream and upstream circuit breakers for the central chiller water pump motor circuit of Hanbit NPP unit (2) was shown in Fig. 6.

In the following paragraphs, there is going to be much more explanation about improvement of the existing electrical protection system to prevent cascading failure in NPP’s electrical protection power system.

3.2 Breaker Failure in Conventional Architecture

Using a breaker-failure trip to trigger a breaker trip lockout relay is a common stance. This could be the 86B bus lock-out relay with suitable targeted breaker failure or a specialized 86BF lockout relay. The electro-mechanical lockout relay negatively affects the system reliability as it is a potential point of failure. In addition, the lockout relay contributes about one cycle to the clearing time of the breaker-failure.

Fig. 6. Central chiller water pump Relay curve.

../../Resources/kiee/KIEE.2020.69.1.203/fig6.png

3.3 Prevention of Cascading Failure Using IEC61850

IEC 61850 is a communication protocol that can give many benefits to the NPP electric power system, not only cost savings but also design improvement. However, in the MV system of NPPs, electrical system protection is relying on the time-current curve (TCC) coordination in the case of overcurrent fault. Using IEC61850 can significantly reduce communication time, which is critical when the fault occurs. IEC 61850 provides fast communication by using designated communication types; MMS, GOOSE and SV [6]. GOOSE is the fastest message among the three types of IEC 61850 communication types by using 2 OSI layers only. GOOSE message is especially for the urgent actions like trip, interlocking signals. On the other hand, Korean NPPs have applied digital relays since Shin-Kori 3&4 nuclear power plant. The digital relay is intelligent electronic device and it supports IEC 61850 communication functions. So, communication between IEDs with IEC 61850 is applicable on the MV system without additional device installation [7].

3.3.1 Communication-Based Breaker Failure Protection

The IEC61850 based GOOSE scheme allows continuous communication between IEDs, once the overcurrent relay detects a fault, it initiates a direct transfer trip (DTT) signal to the upstream circuit breaker to clear the fault [8]. DTT needs to be dependable during fault conditions to allow the trip signal to be received correctly. This type of peer to peer communications based breaker failure protection can be applied in different ways:

• As a function in IEDs that initiates the breaker failure protection when it receives the trip signal from the relay protecting the faulted power system equipment.

• As a built-in function in the protective IED that detected the fault and issued the trip signal.

3.3.2 Implementation of Circuit Breaker Failure in IEC 61850 environment

Fig. 7 shows the data communication between relays using IEC61850 environment, when a fault occurs in the downstream feeder. The downstream relay will detect a fault and will issue a tripping GOOSE message to clear the fault. The CBF function will respond to these GOOSE message and the breaker failure timer will start. In case of breaker failure, the breaker failure function will point out this failure and a GOOSE message will be sent over the LAN network to trip upstream breakers and initiate DTT to the upstream relay to clear the fault. When the IED initiates a trip signal, it starts a timer and monitoring the CB current. If the current does not go away in a predefined time, the IED issues a re-trip or trips the upstream breakers to isolate the faulted one. After the downstream relay gets a tripping signal but doesn’t received tripping signal from the breaker, the downstream relay shall send a breaker failure initiate signal (BFI) through a GOOSE message to the upstream relay to force it to trip the upstream circuit breaker, nevertheless the blocking timer has expired or not [9].

Fig. 7. Data communication between relays

../../Resources/kiee/KIEE.2020.69.1.203/fig7.png

In order to achieve this kind of communication, IEC61850 defines specific protective features as (LN), a specific category of specified attributes to each logical node are also defined. LN can also be currently located in different types of IEDs. Calculations of voltage, current and condition are transmitted to the relays via a process bus that is an ethernet channel of communication. The RBRF class is confined to circuit breaker failure (CBF) protection as per IEC61850-7-4, see Table 2. The IEC61850 standard documents define the attributes, (OpIn) corresponds to re-trip order from the RBRF to the broken CB (stage I or inner trip), (OpEx) corresponds to the corresponding breaker tripping command (stage II or external trip), And FailMod deals with the assessment of breaker malfunction by current status, This LN's specifications are described in IEC 61850 part 5.

Table 2. Class assigned to CBF

Attribute

Discription

class

Class

RBRF

OpCntRs

Operation re-set counter

Details about the condition

Str

ACD

Start, timer running

OpEx

ACT

BF trip (“External trip”)

OpIn

ACT

BF trip (“Internal trip”)

Settings

StrVal

ASG

Start Value

FailMod

ING

BF Mode detection.

FailTmms

ING

BF Time Delay for downstream trip.

ReTrMod

ING

Retrip Mode.

In case of fault, LN transfer a message to the RBRF (initiated by the CBF), and the RBRF triggers the timer. The RBRF gives a message via the network to the XCBRs (logical node for circuit breaker) to trip the upstream CB if the downstream breaker fails to trip / clear the fault. All upstream XCBR register to and receive this message at the same time to implement next required actions. The RBRF information communication with other logical nodes is shown in Figure 8. Blocks IHMI, CALH, ITCI, RBRF, XCBR and Pxyz are logical nodes ((Pxyz stands for the main protection (e.g., PIOC for instantaneous overcurrent)); Table 3 gives functional descriptions of each logical nodes.

Table 3. Functional descriptions of logical nodes

Logical Nodes

Functional Descriptions

IHMI

Human Machine Interface

CALH

Fault indication and Alarm signal

ITCI

Tele-controlling application

RBRF

Breaker failure

XCBR

Circuit Breaker

PIOC

Instantaneous overcurrent protection

PTOC

Time overcurrent protection

The RBRF receives from the protection LNs a trigger signal. The RBRF transmits four kinds of signals:

• Data about the fault is transmitted via IHMI and ITCI.

• trip indication transmitted through IHMI, ITCI and CALH.

• Settings are sent to IHMI and ITCI.

• All subscribed breakers (XCBR) receive Trip Commands through PIOC, PTOC and a coordination signal was published through CILO signal.

The first three classes of signals were transmitted to upstream logical nodes, which are indicators to monitor and record bay-level events. The last signals are defensive and the most important.as reliable communication links between the RBRF and the XCBRs subscribed are highly essential [10].

Fig. 8. Data Communication between RBRF and other logical nodes

../../Resources/kiee/KIEE.2020.69.1.203/fig8.png

3.3.3 IEC61850 based zone selective interlocking

ZSI is based on currents comparison between protective zones. It has a communication control logic system between downstream and upstream breakers. The IEC61850 GOOSE based ZSI scheme always concern about blocking time. It is used to improve the level of protection in the electrical protection power system, through communication between protective relays across the protected zones to reduce the fault clearing time. During ground fault or phase fault conditions the protective devices electronic interlocking allows the devices close to the fault to override its preset time delay automatically and clear the fault without intentional time delay [11]. As illustrated in Fig. 9, when a short circuit fault occurs at the central chiller motor, both the faulted downstream relay and the upstream relay detect the fault. In order to block tripping the main breaker (NB-S02-01) and avoid loss of power for the whole bus, the faulted downstream relay of the CB (NB-S02-18) sends a blocking signal to block the upstream relay from tripping the upstream main CB (NB-S02-01) and averting the whole bus power losses. After fault clearance, fault is not detected any more, but if a fault is not cleared and both the downstream feeder relay and upstream relay still detecting the fault, the main upstream relay will trip the main CB (NB-S02-01) as a backup protection.

Fig. 9. Protection coordination scheme using ZSI

../../Resources/kiee/KIEE.2020.69.1.203/fig9.png

The result is that other devices remain unaffected by the fault and the fault is cleared more rapidly than the existing protection system without using ZSI. The main advantage of applying IEC61850 based ZSI is to reduce blocking time between the downstream and upstream relay to its minimum without affecting relays operations. Accordingly, we need to consider the opening time and communication time between the downstream CB (NB-S02-18) and the upstream CB (NB-S02-01) that requires extended blocking time to give the feeder breakers a time to operate first and isolate the fault. For majority of the relays the internal rationale execution time from overcurrent relays to pick up the fault till the time to send a GOOSE messaging within 2 ms up to 10ms based on network traffics, it is also the same from receiving a GOOSE to trip the upstream breaker. The relay contact responding time is about 8 ms. If we consider medium voltage five-cycle circuit breaker that would add additional (83ms) for 60Hz system. Depending on network configuration and design, the overall travelling time among the downstream relay and the main upstream relay depending on the distance between the relays and the Ethernet network would be about 20 ms [12]. So, the fault clearing time between the downstream and upstream feeder relays using ZSI can be calculated as shown in Table 4;

Table 4. Fault clearing time using IEC61850 based ZSI

Action

Clearing time

Downstream relay trip time

40 ms

Lockout relay operation

8 ms

Five cycle CB

83 ms

Communication signal travelling time between downstream and upstream relay

20 ms

Total time

151 ms

As a result, by applying IEC61850 based ZSI, the fault clearing time of the upstream CB would be 151 ms for five cycle CB. This time is quite greatly faster compared to the existing protection time 381 ms which results in decreasing the motor thermal overheating caused by short circuit fault current.

4. Results and Discussions

This paper proposed practical solutions to prevent cascading failure in NPP’s protection power system. Introducing Hanbit NPP as a case study, the three major causes of the failure were defined and the proposed solutions to prevent this failures were described in the above sections and summarized as follows; ⓵ when a fault occurs within protective zone the CTI between downstream and upstream relay can be reduced from (381ms) to (151ms) using IEC61850 based ZSI with continuous communication between relays, If both the upstream relay and the downstream relay detect a fault current, it indicates that a fault has occurred in the branch circuit. If the fault current is detected only in the upstream relay, it means that a fault occurred in the bus, thereby preventing a motor overheating without time delay. Table 5. Shows the fault clearing time comparison achieved by using IEC61850 based ZSI interlocking compared to existing protection system.

Table 5. Fault clearing time comparison

Fault

Downstream

Relay

Detecting

Lockout relay

CB time

Total CTI

Existing System of Hanbit NPP

40 ms

250 ms

8ms

83

381ms

Improved system

40 ms

20ms

8ms

83

151ms

⓶ The circuit breaker failure (CBF) protection is used to operate when the primary protection devices (downstream relay) sends tripping signal to the CB to trip and clear the fault, if the circuit breaker fails to operate in a pre-set time, using IEC61850 send a trip signal to the upstream breaker to clear the fault.

⓷ Using smart sensors based IEC61850 provide a quicker monitoring of motor temperature rise before the insulation failure of the motor and generate alarm. By using IEC 61850 standard communication protocol and merging unit output signal from smart sensors can be transmitted to digital relay without interference of electrical noise.

5. Conclusion

The use of power equipment for a long time may degrade performance and cause unexpected malfunctions or a misfire. Unexpected means that there are no measures to anticipate such an accident or breakdown. The above-mentioned operation of the high-voltage power distribution system breaker section is a representative example. In spite of the motor stop signal, the circuit breaker was not opened and a ground fault occurred in the motor. One branch circuit accident occurred in a chain, leading to an accidental loss of power. In this paper, we proposed the ultimate solution to prevent similar accidents by using digital protection relay and IEC61850 standard technology.

Acknowledgements

This research was supported by 2019 Research Fund of the KEPCO International Nuclear Graduate School (KINGS), Ulsan, Republic of Korea.

References

1 
A. G. Phadkel, J. S. Thorpz, 1996, Expose hidden failures to prevent cascading outages, IEEE Comput. Appl. Power., Vol. 9, pp. 20-23DOI
2 
Y. Xiaohui, Z. Wuzhi, S. Xinli, W. Guoyang, L. Tao, S. Zhida, 2016, Review on power system cascading failure thoeries and studies, in Proc of 2016 Int. Conf. Probabilistic Methods Appl. to Power Syst. PMAPS 2016 - Proc., Beijing, ChinaDOI
3 
P. Dehghanian, T. Popovic, M. Kezunovic, 2014, Circuit breaker operational health assessment via condition monitoring data, in Proc. of 2014 North Am. Power Symp. NAPS 2014, Washington, USADOI
4 
C. koo Chang, 2018, Cascading failure in medium voltage network of NPP and counter measures, in Proc. Korean Inst. Electr. Eng. Conf., Seoul, South Korea, pp. 1-3Google Search
5 
IEEE Std, 2001, IEEE recommended practice for protection and coordination of industrial and commercial power systems, 242 nd ed., NYGoogle Search
6 
S. H. Hwang, Y. S. Im, H. C. Song, J. Do Park, 2018, Real time emulation of IEC 61850 SV, GOOSE and MMS using NS-3, J. Eng. Appl. Sci., Vol. 13, No. 3, pp. 634-638Google Search
7 
S. H. Hwang, Y. S. Im, H. C. Song, J. Do Park, 2018, Real time emulation of IEC 61850 SV, GOOSE and MMS using NS-3, J. Eng. Appl. Sci., Vol. 13, No. 3, pp. 634-638Google Search
8 
IEEE Std, 2016, IEEE Guide for Breaker Failure Protection of Power Circuit BreakersGoogle Search
9 
T. Zhao, L. Sevov, 2013, Practical considerations of applying IEC61850 GOOSE based zone selective interlocking scheme in industrial applications, in Proc. of 66th Annual Conference for Protective Relay Engineers CPRE 2013, pp. 263-270DOI
10 
Z. Darabi, B. Falahati, M. J. Mousavi, M. Ferdowsi, 2012, On circuit breaker failure protection in 61850-based substations, in Proc. of IEEE Power and Energy Society General Meeting, pp. 1-6DOI
11 
C. koo Chang, 2019, Mitigation of high energy arcing faults in nuclear power plant medium voltage switchgear, Nucl. Eng. Technol., Vol. 51, No. 1, pp. 317-324DOI
12 
M. Daboul, J. Orsagova, T. Bajanek, V. Wasserbauer, 2015, Testing protection relays based on IEC 61850 in substation automation systems, in Proc. 16th Int. Sci. Conf. Electr. Power Eng. EPE 2015, pp. 335-340DOI

저자소개

Moustafa Abdelrahman Moustafa
../../Resources/kiee/KIEE.2020.69.1.203/au1.png

He received his B.S. degree from Benha University in 2007, he has worked at the Ministry of Electricity and Renewable Energy in Egypt since 2010.

He received his master degree in Nuclear power plant Engineering; Department of KEPCO International Nuclear Graduate School (KINGS).

His research interest is protection and control of the electrical power system.

Choong-koo Chang
../../Resources/kiee/KIEE.2020.69.1.203/au2.png

He received M.S. in Electrical Engineering from Inha University in 1990, and Ph. D degree in Electrical Engineering from Myongji University in 2001.

He participated in Younggwang NPP 3&4 and Ulchin NPP 3&4 design project as an electrical system engineer from 1985 to 1993 at KOPEC.

From 1993 to 1998 he worked as a senior engineer for the plant control, and automation business team of Samsung Electronics. As an Executive Vice President and CTO at Sangjin Engineering from 2001 to 2012, he designed the electric power systems for nuclear power plants, thermal power plants, and combined cycle power plants.

Since 2013, he serves as a professor in the NPP Engineering Department of KEPCO International Nuclear Graduate School (KINGS).

His research interests are planning, design, and operation of the electric power systems for power plants.