priority 1 ---------- - join non-deduplicated chunks - choose when and how to - detect Similar chunks - implement "N-Transform SuperFeature" hash from Shilane-2012 - use the hash for detection priority 2 ---------- - use more the `Reader` API (which is analoguous to the `IOStream` in Java)