Real-time probablistic count (Durand-Flajolet) implementation using Apache Storm
Distributed stream processing of meetup.com streams for probabilistic counting by Durand-Flajolet algorithm using Apache Storm. Streams of data can be processed for the purpose of distinct or unique count of field enteries which can be of interest to us. This nature of problem is referred to as “Counting distinct elements in a stream”. The data processed for the project was from meetup.com RSVP and EVENT streams.