From d9703a9daa05f30e77bb99393eab8a0d40e788e4 Mon Sep 17 00:00:00 2001 From: n-peugnet Date: Thu, 2 Sep 2021 17:48:23 +0200 Subject: hash and store chunks left by matchStream --- TODO.md | 18 +++++++++--------- 1 file changed, 9 insertions(+), 9 deletions(-) (limited to 'TODO.md') diff --git a/TODO.md b/TODO.md index d4873aa..e7e1976 100644 --- a/TODO.md +++ b/TODO.md @@ -1,7 +1,7 @@ priority 1 ---------- -- add deltaEncode chunks function - - do not merge consecutive smaller chunks as these could be stored as chunks if no similar chunk is found. Thus it will need to be of `chunkSize` or less. Otherwise it could not be possibly used for deduplication. +- [x] add deltaEncode chunks function + - [x] do not merge consecutive smaller chunks as these could be stored as chunks if no similar chunk is found. Thus it will need to be of `chunkSize` or less. Otherwise it could not be possibly used for deduplication. ``` for each new chunk: find similar in sketchMap @@ -12,13 +12,13 @@ priority 1 store in fingerprintMap store in sketchMap ``` -- read from repo - - store recipe - - load recipe - - read chunks in-order into a stream -- properly store informations to be DNA encoded +- [ ] read from repo + - [ ] store recipe + - [ ] load recipe + - [ ] read chunks in-order into a stream +- [ ] properly store informations to be DNA encoded priority 2 ---------- -- use more the `Reader` API (which is analoguous to the `IOStream` in Java) -- refactor matchStream as right now it is quite complex +- [ ] use more the `Reader` API (which is analoguous to the `IOStream` in Java) +- [ ] refactor matchStream as right now it is quite complex -- cgit v1.2.3