2013
2012
Probabilistic Algorithms to Process Massive Data
As a PhD student at Inria Rocquencourt, I gave this talk at Inria’s Junior Seminar. It is an overview of how data streaming algorithms like HyperLogLog work, aimed at a broader audience of engineering PhD students.
RECORDINALITY: les grands flux de données vus comme des permutations aléatoires (RECORDINALITY: Large Data Streams Viewed as Random Permutations)
Talk that introduces a novel, implementable data streaming algorithm for cardinality estimation, that uses
Dirichlet Random Samplers for Multiplicative Structures
Talk that introduces novel samplers for combinatorial objects whose size is multiplicative rather than additive (as with certain kinds of tilings, factorizations, and trees). With Olivier Bodini.
2011
How Philippe Flipped Coins to Count Data
This talk traces the history of data streaming algorithms for the cardinality estimation problem from the 1980s to the 2010s, through algorithms designed or studied by Philippe Flajolet, and recounts the origins of the world-famous HyperLogLog algorithm.