priority 1
----------
- join non-deduplicated chunks
  - choose when and how to
- detect Similar chunks
  - implement "N-Transform SuperFeature" hash from Shilane-2012
  - use the hash for detection

priority 2
----------
- use more the `Reader` API (which is analoguous to the `IOStream` in Java)