HyperLogLog is an algorithm for the count-distinct problem, approximating the number of distinct elements in a multiset. [1]
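A minimal sketch of the idea, under stated assumptions: the function name `hll_estimate`, the choice of SHA-1 truncated to 64 bits as the hash, and the default precision `p=12` are illustrative choices, not part of the cited source. Each item is hashed; the first `p` bits pick a register, and the register keeps the largest "position of the first 1-bit" seen in the remaining bits. The estimate is a bias-corrected harmonic mean of the registers, with the usual linear-counting fallback for small cardinalities.

```python
import hashlib
import math


def hll_estimate(items, p=12):
    """Approximate the number of distinct items using 2**p registers.

    Illustrative sketch only: hash choice and parameter names are assumptions.
    """
    if p < 7:
        # The alpha formula below is the one stated for m >= 128 registers.
        raise ValueError("p must be at least 7")
    m = 1 << p
    registers = [0] * m
    for item in items:
        digest = hashlib.sha1(str(item).encode("utf-8")).digest()
        h = int.from_bytes(digest[:8], "big")      # 64-bit hash value
        idx = h >> (64 - p)                        # first p bits select a register
        rest = h & ((1 << (64 - p)) - 1)           # remaining 64 - p bits
        # 1-based position of the leftmost 1-bit within the remaining bits
        rank = (64 - p) - rest.bit_length() + 1
        if rank > registers[idx]:
            registers[idx] = rank
    alpha = 0.7213 / (1 + 1.079 / m)               # bias correction for m >= 128
    raw = alpha * m * m / sum(2.0 ** -r for r in registers)
    zeros = registers.count(0)
    if raw <= 2.5 * m and zeros:
        # Small-range correction: fall back to linear counting
        return m * math.log(m / zeros)
    return raw
```

With `m = 2**p` registers the expected relative error is roughly `1.04 / sqrt(m)`, so about 1.6% at `p=12`. Duplicates never change a register, so repeating items leaves the estimate unchanged.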
Transform Data by Example [DMX]
What part of X is in Y?
http://www.vldb.org/pvldb/vol11/p1165-he.pdf