bash-3.2$ ./tst_ar4 -h pr_A1_50_16_64.nc
cs[0]  cs[1]  cs[2]  cache(MB)  deflate  shuffle  read_hor(us)  read_time_ser(us)
50     16     64     4          1        0        157932        155803
50     16     64     16         1        0        3473          155237
50     16     64     32         1        0        3479          146510
50     16     64     64         1        0        3487          120306
50     16     64     128        1        0        3499          64149
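Now the best time-series read performance comes from the largest cache: 64149 us at 128 MB, against 155803 us at 4 MB. Horizontal reads level off at about 3500 us once the cache reaches 16 MB, so a larger cache brings no further gain there.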
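
To make the comparison concrete, the timings in the output above can be turned into speedups relative to the 4 MB run. What follows is a minimal Python sketch and not part of tst_ar4: the names ROWS, speedups, and fastest_cache are made up for illustration, and the numbers are copied by hand from the table.

```python
# Rows copied from the tst_ar4 output above:
# (cache_mb, read_hor_us, read_time_ser_us)
ROWS = [
    (4, 157932, 155803),
    (16, 3473, 155237),
    (32, 3479, 146510),
    (64, 3487, 120306),
    (128, 3499, 64149),
]


def speedups(rows):
    """Return {cache_mb: (hor_speedup, ser_speedup)} relative to the first row."""
    _, base_hor, base_ser = rows[0]
    return {mb: (base_hor / hor, base_ser / ser) for mb, hor, ser in rows}


def fastest_cache(rows, column):
    """Return the cache size (MB) with the smallest time in the given column.

    column is 1 for read_hor and 2 for read_time_ser.
    """
    return min(rows, key=lambda row: row[column])[0]


if __name__ == "__main__":
    for mb, (hor, ser) in speedups(ROWS).items():
        print(f"{mb:>4} MB  read_hor x{hor:6.2f}  read_time_ser x{ser:5.2f}")
    print("fastest read_hor cache (MB):", fastest_cache(ROWS, 1))
    print("fastest read_time_ser cache (MB):", fastest_cache(ROWS, 2))
```

On these numbers the time-series read is about 2.4 times faster at 128 MB than at 4 MB, while the horizontal read gains a factor of about 45 at 16 MB and stays essentially flat after that.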