THE DIAGNOSIS OF EVENT TIMING SYSTEM IN SuperKEKB LINAC

D. Wang\textsuperscript{1,2}, M. Satoh\textsuperscript{2}, K. Furukawa\textsuperscript{2}, T. Kudou\textsuperscript{3}, S. Kusano\textsuperscript{3}, Y. Iitsuka\textsuperscript{4}

\textsuperscript{1}SOKENDAI, Kanagawa 240-0193, Japan
\textsuperscript{2}KEK, Ibaraki 305-0801, Japan
\textsuperscript{3}Mitsubishi Electric System & Service Co., Ltd, Ibaraki, Japan
\textsuperscript{4}East Japan Institute of Technology Co., Ltd, Ibaraki, Japan

Abstract

We introduced MRF event timing system in injector linac in SuperKEKB to satisfy our demands. The event timing system is utilized to distribute high-level precise timing signals and accompanying control instructions to synchronize different subsystems and machines. EVG generates beam pulse pattern 50 Hz which contains several event codes while EVR receives them. The Event rate is 114.24 MHz thus the minimal event time interval is about 9 ns. To certain that events are consistent between EVG and EVR, recording them one by one is essentially needed. Owing to some hardware and network restrictions it is difficult to continuously send every event by EVR through EPICS Channel Access. An EVR based events diagnostic system is thus developed by modifying the device support of some records as well as EVR driver mrfioc2 to send the event codes thus comparing the received event codes with the beam-pattern control orders from beam operation and detecting the event timing interval fault as well as providing a logging system of persistent event data. Then, we are able to locate the fault, analyze the data, fix bugs or replace hardware and resume accelerator operation quickly.

INTRODUCTION

The SuperKEKB project is an electron/positron double ring collider upgraded from KEKB project since 2010, endeavoring to update the world highest luminosity record and discover new particle physics by Belle II experiment. The beam commissioning which divided as three Phases started from 2016. During Phase-2 commissioning in 2018, a new 1.1 GeV Damping Ring at the middle of injector linac aiming to lower positron emittance was deployed. The Phase-3 commissioning has started from spring 2019 and proceeds a maintenance during summer.

The electron/positron injector linac, with total length of 700 m, distributes beam to five rings (SuperKEKB HER/LER, two light source ring PF/PF-AR and one positron Damping Ring) with several parameters. The injector linac consists of 60 high-power accelerating units followed by a beam switch yard. The maximal beam repetition rate is 50 Hz and beam mode should switch every 20 ms \cite{1}.

TIMING SYSTEM ARCHITECTURE

The timing system in accelerator complex is to synchronize all relevant components and devices and precisely control some accelerator operations like injection and extraction, top-up and facility diagnostics with timestamp information. The precision of synchronization depends on the operation requirement as well as the size of the installation \cite{2}. Less than 30 ps jitter is required in SuperKEKB and 300/700 ps in PF/PF-AR.

Event Timing

The idea of event timing control system is to transmit synchronized timing fiducial by encoding them as event codes in an event clock rate. Event clock is usually synchronized to a RF reference as well as phase locking to the mains voltage to keep beam intensity and quality.

The event timing system usually consists of event generator and event receiver. The event generator is able to accept software and hardware input and generate event signals and distribute with fanout to a number of event receivers by a multi-mode fibre network in star topology. The event receiver receives the event streams, recover the signals and generator event clocks that is phase locked to the event generator event clock.

We introduced event timing system produced by MRF company to meet our requirements. MRF timing system consists of “Event Generator (VME-EVG-230)” and “Event Receiver (VME-EVR-230-RF)”. The event system uses optical fibre to transmit a 16-bit word at a frequency between 50 MHz and 142 MHz by 8b/10b encoding. In our case, event clock rate is 114.24 MHz in SuperKEKB linac, the bit rate of event link is about 2.3 GHz. Inside this 16-bit word, 8-bit distributed bus running in parallel and independent of the other 8-bit timing event allows distribution of eight signals updated with the event clock rate. All 256 event codes except for some special functions are user defined. Each EVR is able to generate pulse with an associated delay and width. MRF timing system is well integrated with EPICS control system by using mrfioc2 device support module \cite{3}. Figure 1 is the scheme of MRF timing system.
The selected event codes received in EVR will be decoded by 8b/10b and wrote into a 32-bit wide FIFO memory (8-bit event code and 24-bit timestamp) with attached timestamp information. This FIFO memory can hold up to 512 events. Write operation of an event to FIFO will trigger a VME interrupt. In mrfioc2 module, the EVR event is mapped to an EPICS event and make happen of several relevant EPICS records. Then, the IOC server sends the processed record information to all clients that subscribe to this record by Channel Access.

**Simultaneous Injection**

Based on EPICS and event-timing control, we use four virtual accelerators (VAs) for SuperKEKB project. A single injector linac would behave as four independent virtual accelerators with hundreds of independent parameters modulated pulse-by-pulse at 50 Hz (Fig. 2).

The equipment is operated via event-based, global, and synchronized controls to inject beams with different properties into four separate storage rings simultaneously. One of the crucial technologies used during the operation of linac is Pulse-to-Pulse Modulation (PPM) [4]. By means of PPM simultaneous top-up injection to several separated storage rings could be accomplished. Every pulse (20 ms) corresponds to a beam mode while every beam mode contains several event codes [5, 6].

**LOG SYSTEM**

During the operation, linac must operate periodically in 50 Hz, thus many devices like power supply, pulsed magnets need to be triggered every 20 ms to assure bunches are accelerated at appropriate location. And only 2 ms width of fluctuation is accepted [7]. However, we noticed that sometimes klystron trigger time interval would be less than 18 ms and that would cause unstable beam and if beam bunch is not correctly transferred, in the worst situation, beam would hit the Belle II detector and damage it. Besides this, lack of a log system makes it hard to evaluate whether correct event codes and sequences are received by EVR or not. And during the 21st KEKB Accelerator Review Committee in 2016, it was suggested in the meeting to have an operational diagnostic program of event system. The collection of the event codes and timestamp information received in EVR is beneficial to diagnose abnormal events and avoid erroneously trigger. In order to improve the stability and maintenance of injector linac, an EPICS based log system is developed to record all the events information and compare them with beam mode pattern.

**Data Acquisition**

The timing signal is generated and controlled at main timing station at injector linac and sent to EVRs all over SuperKEKB complex by fibre-optic cables. An EVR card used for log system is deployed in the VME crate placed in the control room of linac. Figure 3 is the picture of timing log system. MVME5500 on-board CPU controller and VxWorks-6.8.3 is used for developing EPICS IOC. A straightforward method to record all event codes used in injector linac is to create a number of EPICS PVs and monitor all of them. However, several mainly concerns are listed below:

- monitor a large number of PVs which updates very quickly might increase the network traffic in injector linac.
- camonitor call in Channel Access uses TCP to transmit the data packets. And owing to TCP’s retransmission mechanism, events sequence recorded in client might not be consistent with sequence recorded in EVR.
- too many CA callback functions in IOC may increase the VME CPU load.

Consequently, we modified the device support of EPICS waveform record to store the event code and the EVR timestamp. Event code and timestamp pairs are stored in the waveform record and IOC server periodically scan this record and send it to client. The client program monitors the waveform...
record and saves data to hard disk. Figure 4 is the scheme of log system.

**Log Performance**

Some abnormal phenomenon are noticed during the log system test operation. First problem is the event software overflow of the EVR device support. According to the document of mrfioc2 EPICS support module, occurrences of certain “interested” event code number together with individual arrival time are placed in the EVR’s FIFO buffer. A VxWorks task continuously fetch event one by one from the buffer and invoke a list of event-specified callback functions to notify some records to process. Since the invocation of callback function is processed in another thread, if the speed of event invocation is faster than callback function processing, then callback in other EPICS drivers may be lost. That means this event code can not be recorded in the waveform record [8].

Another problem is caused by processor issue. Some data would lose in the client during the periodically scanning of the waveform record. The CPU frequency of MVME5500 is 1 GHz which means 1 nanosecond per cycle, however, by contrast, the minimal interval between linac event code is 9 ns. According to the Fig. 5, sometimes the delay of scan-5 task is 4579 ms if we monitor all linac timing events. The scan-5 task scans a list of records every 5 second, in other words, if the delay of this task is large than 5 second, waveform record would not be scanned. From this perspective, we only registered some critical event codes to decrease the CPU load as a temporary solution.

This Log System ran for a month and about 17 GB events data are collected. As Fig. 6 shows, however, event packets from CA server still lost about 1000 times in a roughly month. Such data lost issue is intolerable for the requirement of event timing Log System. So, we utilize several methods to upgrade the system.

**Log System Upgrading**

To overcome the event overflow problem, we change and re-compile mrfioc2 module to satisfy our requirement by adding another high priority VxWorks task (parallel EPICS thread) to buffer the event code. When VME fetches an event code from EVR, this event code and timestamp will be put into a large-size ring-buffer and this ring-buffer put action wakes up a high priority thread. Resulting from this, all injector linac event codes can be recorded by the system without event software overflow error.

Several approaches have been adopted to solve the slow processor issue. A most straightforward method is to lower the CPU burden by reducing the number of EPICS records. Since EPICS performs initialization functions of many modules such as callbackInit, databaseInit and scanInit, and large amount of callback functions and records waiting for scanning would certainly increase the CPU load. Under these circumstances, only some EVR configuration records and event code I/O interrupt records are retained. Concerning this, a substitution data transmission approach, NFS protocol, is tested. Figure 7 is the scheme of Log System using NFS protocol.

Network File System (NFS) is a distributed file system protocol, allowing a user on a client computer to access files over a computer network much like local storage is accessed [9]. The source code of mrfioc2 module is modified so that an EPICS thread is created aiming for directly write event code and its timestamp information to local hard disk. Under this circumstance, we utilize the low-level system call and communication protocol by using NFS comparing...
to EPICS database record approach. Thus, the CPU load is dramatically decreased. Every day the log data will be processed and checked using Python script whether any abnormal beam sequences occur and the result of diagnosis will be sent by email to a list of email addresses automatically.

Accordingly, we want to monitor both parts events in the VME and this would definitely increase VME’s CPU load. Further experiments are undertaking now.

ACKNOWLEDGEMENTS

The author gratefully acknowledges the financial support from China Scholarship Council.

REFERENCES


