Real-time probablistic count (Durand-Flajolet) implementation using Apache Storm

Distinct count Apache Storm topology

Distributed stream processing of meetup.com streams for probabilistic counting by Durand-Flajolet algorithm using Apache Storm. Streams of data can be processed for the purpose of distinct or unique count of field enteries which can be of interest to us. This nature of problem is referred to as “Counting distinct elements in a stream”. The data processed for the project was from meetup.com RSVP and EVENT streams.

Md Imbesat Hassan Rizvi
Md Imbesat Hassan Rizvi
Technical (Research) Associate

My research interests include scientific machine learning, natural language processing, reinforcement learning, robotics and human-robot interaction.

Related