How Bloom filters work

dave · September 24, 2019, 12:12am

Hi Andreas,

Bloom filters are an interesting ‘probabilistic’ data structure that can be used to see if an item has ‘never been added’ to a set previously. The curious wording here is intentional. It is probabilistic in that it is possible to have false positives but not possible to have false negatives. Bloom filters provide a much more compact and faster way of checking to see if an item exists than storing all items in a set and calling SISMEMBER.

Bloom filters work by running an item through a quick hashing function and sampling bits from that hash and setting them from a 0 to 1 at particular interval in a bitfield. To check for existence in a Bloom filter, the same bits are sampled. Many items may have bits that overlap, but since a hashing function produce unique identifiers, if a single bit from the hash is still a 0, then we know it has NOT been previously added. If it hasn’t been added, then it’s not in the database and you don’t have to go fetch it, possibly saving a lot of time and increasing performance.

For more information, checkout

Dave

Topic		Replies	Views
Use Cases RedisBloom	2	784	February 12, 2020
Redis Bloom Use Cases RedisBloom	2	952	June 9, 2020
HOW USE REDISBLOOM AND IMPLEMENTING RedisBloom	1	908	November 28, 2019
About RedisBloom RedisBloom	0	875	April 23, 2020
Intended use cases? RedisBloom	0	759	October 8, 2019

How Bloom filters work

Related topics