TIL: Kafka and Streaming Tools Have Tail Latency Problems
So, I was going down a bunch of distributed computing rabbit holes instead of doing my homework when I spotted something neat: someone mentioned that Kafka has tail latency problems. I thought that was interesting, so I researched a bit more and found that this apparently applies to most (maybe all) streaming tools.

What I found

First, I found a case study from Allegro (basically the Amazon of Poland), where they found that their Kafka median response times were single-digit milliseconds, while their p99 latency was up to 1 second and their p999 was up to 3 seconds. Personally, I think it's hilarious to be that unlucky user waiting 3 seconds for what usually takes a fraction of that. ...
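To get a feel for how a median can look great while the tail looks awful, here is a rough Python sketch. The latency numbers are synthetic and made up by me (they are not Allegro's data), and the `percentile` and `synthetic_latencies_ms` helpers are hypothetical names for illustration. I just picked a mix where most requests are fast and a small slice is very slow, roughly in the spirit of the numbers above.

```python
import math
import random


def percentile(samples, p):
    """Nearest-rank percentile: the smallest sample such that at least
    p percent of all samples are less than or equal to it."""
    ordered = sorted(samples)
    rank = math.ceil(p * len(ordered) / 100)
    return ordered[max(rank, 1) - 1]


def synthetic_latencies_ms(n=100_000, seed=42):
    """Made-up latency mix (an assumption, not measured data):
    ~98% fast requests, ~1.8% slow ones, ~0.2% very slow ones."""
    rng = random.Random(seed)
    latencies = []
    for _ in range(n):
        roll = rng.random()
        if roll < 0.98:
            latencies.append(rng.uniform(1, 9))        # single-digit ms
        elif roll < 0.998:
            latencies.append(rng.uniform(100, 1000))   # up to ~1 second
        else:
            latencies.append(rng.uniform(1000, 3000))  # up to ~3 seconds
    return latencies


latencies = synthetic_latencies_ms()
for label, p in [("p50", 50), ("p99", 99), ("p999", 99.9)]:
    print(f"{label}: {percentile(latencies, p):.1f} ms")
```

Even though 98% of the fake requests finish in under 10 ms, the p99 and p999 values land in the slow bands, which is why looking only at the median (or the average) hides the problem.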