[LEAPSECS] Coding this week, and a trick for timeouts over leap seconds.
Paul J. Ste. Marie
paul_j at ste-marie.org
Sat Nov 5 21:31:33 EDT 2011
> On 10/1/2011 5:16 AM, Paul Sheer wrote:
>> I am busy implementing some heartbeat monitoring code between two
>> machines. The spec calls for a 1 second recovery.
>>
>> Basically if I get no heartbeats for 1 full second then I should
>> consider the peer system to have failed.
Sounds like you're generating once per second heartbeats, and failing if
a second elapses between heartbeats. This is a recipe for failure.
Increase either the timeout or the heartbeat pulse rate.
--
--Paul
More information about the LEAPSECS
mailing list