high context switch rate of ksoftirqd

This week one of our services, a Java application running on Tomcat on a Red Hat 6.3 server, had an interesting problem. The server suddenly appeared to be CPU-bound. According to top, the Java processes were responsible, but %sy (system) CPU utilization was higher than %us (user): the Java threads themselves were using only 10-15%, and the rest was consumed by the OS. The context-switch rate reported by nmon was far above that of the same service on the second node. Interestingly, the process doing the switching was ksoftirqd. A little investigation on the internet showed that the cause was the “leap second” issue.
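For anyone who wants to compare context-switch rates between two nodes without nmon, here is a minimal sketch that reads the cumulative `ctxt` counter from /proc/stat. The function names `read_ctxt` and `ctxt_rate` are made up for illustration; this is not what nmon does internally, just one rough way to get a comparable number.

```shell
#!/bin/sh
# Hypothetical helpers for illustration only.

# Print the cumulative context-switch counter from a /proc/stat-style file.
# Defaults to /proc/stat when no file is given.
read_ctxt() {
    awk '$1 == "ctxt" { print $2 }' "${1:-/proc/stat}"
}

# Print the average context switches per second between two samples.
# $1 = first sample, $2 = second sample, $3 = interval in seconds
ctxt_rate() {
    echo $(( ($2 - $1) / $3 ))
}
```

A typical use would be `a=$(read_ctxt); sleep 5; b=$(read_ctxt); ctxt_rate "$a" "$b" 5`, run on both nodes at roughly the same time. Per-process counters are also available as `voluntary_ctxt_switches` and `nonvoluntary_ctxt_switches` in /proc/<pid>/status.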

As soon as we ran the workaround commands below, CPU usage dropped drastically:

/etc/init.d/ntpd stop
date `date +"%m%d%H%M%C%y.%S"`
/etc/init.d/ntpd start
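
The three commands above stop ntpd, set the clock to its own current value (in the MMDDhhmmCCYY.ss form that date accepts), and start ntpd again. A small sketch that wraps them with a root check is shown below. The names `now_stamp` and `reset_clock` are hypothetical, and the init script path is the one from this server, so it may differ elsewhere.

```shell
#!/bin/sh
# Hypothetical wrapper for illustration only.

# Print the current time in the MMDDhhmmCCYY.ss form accepted by `date`.
now_stamp() {
    date +"%m%d%H%M%C%y.%S"
}

# Stop ntpd, set the clock to its current value, then start ntpd again.
# Setting the clock needs root, so refuse to run otherwise.
reset_clock() {
    if [ "$(id -u)" -ne 0 ]; then
        echo "reset_clock: must be run as root" >&2
        return 1
    fi
    /etc/init.d/ntpd stop
    date "$(now_stamp)"
    /etc/init.d/ntpd start
}
```

Nothing happens until `reset_clock` is called explicitly, so the file can be sourced safely to inspect `now_stamp` first.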

However, why the problem was triggered only after more than a month is still a mystery. For more information on the matter:

Leap Second Detector for RHEL
Resolving Leap Second Issues in RHEL
Leap Second and Java in RHEL

Leap Second Issues for SUSE
