Resolved -
This incident has been resolved.
May 22, 20:27 MDT
Monitoring -
ZooKeeper corruption has been rooted out. Things appear healthier and are catching up in all cases.
May 22, 15:05 MDT
Update -
We have not yet reached full resolution.
May 22, 11:43 MDT
Update -
Throughput has improved, although the behavior of individual partitions remains a problem and is still causing delays in some cases.
May 21, 23:42 MDT
Identified -
It has been a long day with Kafka. We continue to experience instability, causing lag and dropped payloads.
May 21, 20:13 MDT
Monitoring -
An alternate approach has been applied, and we are watching the results.
May 21, 10:13 MDT
Identified -
The initial fix was unsuccessful; certain accounts are now substantially delayed.
May 21, 09:08 MDT
Monitoring -
A fix has been implemented and we are monitoring the results.
May 21, 08:33 MDT
Investigating -
We are currently investigating this issue.
May 21, 07:55 MDT