Description

ReduceByKey runs on a dataset with (K, V) pairs, returns a dataset with (K, V) pairs, where values for the same key are aggreated by function f. Function f takes two arguments of type V and returns one type V. The function should be commutative and associative so that it can be computed correctly in parallel.