Improved decoder speed by using a lookup table (LUT) for Huffman decoding. Profiling and benchmarking are still needed to tune the table size (see the LUT_DEPTH parameter in huffman.h).
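
For reference, a minimal sketch of the general table-driven technique, not the code in this change. It assumes LUT_DEPTH is the number of bits used to index the table and that no code is longer than LUT_DEPTH; the names lut_entry, lut_add, peek_bits and lut_decode are hypothetical and may not match huffman.h.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Assumption: LUT_DEPTH is the number of bits used to index the table. */
#define LUT_DEPTH 4
#define LUT_SIZE (1u << LUT_DEPTH)

typedef struct {
    uint8_t symbol; /* decoded symbol */
    uint8_t length; /* code length in bits; 0 means no code has this prefix */
} lut_entry;

/* Register one code: fill every slot whose top `length` bits equal `code`. */
static void lut_add(lut_entry *lut, uint32_t code, uint8_t length, uint8_t symbol)
{
    assert(length >= 1 && length <= LUT_DEPTH);
    uint32_t shift = (uint32_t)LUT_DEPTH - length;
    uint32_t first = code << shift;
    uint32_t count = 1u << shift;
    for (uint32_t i = 0; i < count; i++) {
        lut[first + i].symbol = symbol;
        lut[first + i].length = length;
    }
}

/* Read LUT_DEPTH bits (MSB first) starting at bit `pos`; bits past the end read as 0. */
static uint32_t peek_bits(const uint8_t *buf, size_t nbits, size_t pos)
{
    uint32_t v = 0;
    for (unsigned i = 0; i < LUT_DEPTH; i++) {
        size_t p = pos + i;
        uint32_t bit = 0;
        if (p < nbits)
            bit = (uint32_t)(buf[p >> 3] >> (7 - (p & 7))) & 1u;
        v = (v << 1) | bit;
    }
    return v;
}

/* Decode up to max_out symbols with one table lookup per symbol.
   Returns the number of symbols written to out. */
static size_t lut_decode(const lut_entry *lut, const uint8_t *buf, size_t nbits,
                         uint8_t *out, size_t max_out)
{
    size_t pos = 0, n = 0;
    while (pos < nbits && n < max_out) {
        lut_entry e = lut[peek_bits(buf, nbits, pos)];
        if (e.length == 0 || pos + e.length > nbits)
            break; /* invalid prefix or truncated input */
        out[n++] = e.symbol;
        pos += e.length; /* consume only the bits the code actually used */
    }
    return n;
}
```

In this sketch a larger LUT_DEPTH resolves longer codes in a single lookup but grows the table as 2^LUT_DEPTH entries, which is the trade-off that profiling would need to settle.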