lz4.git - LZ4 is lossless compression algorithm, providing compression speed > 500 MB/s per core, scalable with multi-cores CPU. It features an extremely fast decoder, with speed in multiple GB/s per core, typically reaching RAM speed limits on multi-core systems.

	Commit message (Collapse)	Author	Age	Files	Lines
*	merge lz4opt.h into lz4hc.c	Yann Collet	2018-02-25	1	-356/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Having a dedicated file for optimal parser made sense during its creation, it allowed Przemyslaw to work more freely on lz4opt, with less dependency on lz4hc, moreover, the optimal parser was more complex, with its own search functions. Since the optimal was rewritten last year, it's now a lot lighter. It makes more sense now to integrate it directly inside lz4hc.c, making it easier to edit (editors are a bit "lost" inside a `*.h` dependent on its #include position), it also reduces the number of files in the project, which fits pretty well with lz4 objectives. (adding lz4hc requires "just" lz4hc.h and lz4hc.c).
*	edge case : compress up to end-mflimit (12 bytes)	Yann Collet	2018-02-24	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The LZ4 block format specification states that the last match must start at a minimum distance of 12 bytes from the end of the block. However, out of an abundance of caution, the reference implementation would actually stop searching matches at 13 bytes from the end of the block. This patch fixes this small detail. The new version is now able to properly compress a limit case such as `aaaaaaaabaaa\n` as reported by Gao Xiang (@hsiangkao). Obviously, it doesn't change a lot of things. This is just one additional match candidate per block, with a maximum match length of 7 (since last 5 bytes must remain literals). With default policy, blocks are 4 MB long, so it doesn't happen too often Compressing silesia.tar at default level 1 saves 5 bytes (100930101 -> 100930096). At max level 12, it saves a grand 16 bytes (77389871 -> 77389855). The impact is a bit more visible when blocks are smaller, hence more numerous. For example, compressing silesia with blocks of 64 KB (using -12 -B4D) saves 543 bytes (77304583 -> 77304040). So the smaller the packet size, the more visible the impact. And it happens we have a ton of scenarios with little blocks using LZ4 compression ... And a useless "hooray" sidenote : the patch improves the LZ4 compression record of silesia (using -12 -B7D --no-frame-crc) by 16 bytes (77270672 -> 77270656) and the record on enwik9 by 44 bytes (371680396 -> 371680352) (previously claimed by [smallz4](http://create.stephan-brumme.com/smallz4/) ).
*	Merge pull request #434 from lz4/pattern	Yann Collet	2018-01-06	1	-1/+3
\|\ \| \| \| \|	conditional pattern analysis
\| *	conditional pattern analysis	Yann Collet	2017-12-22	1	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Pattern analysis (currently limited to long ranges of identical bytes) is actually detrimental to performance when `nbSearches` is low. Reason is : `nbSearches` provides a built-in protection for these cases. The problem with patterns is that they dramatically increase the number of candidates to visit. But with a low nbSearches, the match finder just aborts early. In such cases, pattern analysis adds some complexity without reducing total nb of candidates. It actually increases compression ratio a little bit, by filtering only "good" candidates, but at a measurable speed cost, so it's not a good trade-off. This patch makes pattern analysis optional. It's enabled for levels 8+ only.
* \|	lz4opt supports _destSize	Yann Collet	2017-12-22	1	-18/+43
\|/ \| \| \|	no longer limited to level 9
*	added code comments	Yann Collet	2017-11-09	1	-1/+6
\|
*	added constant TRAILING_LITERALS	Yann Collet	2017-11-09	1	-5/+6
\| \| \| \| \|	which is more explicit than its value `3`. reported by @terrelln
*	lz4opt: simplified match finder invocation to LZ4HC_FindLongerMatch()	Yann Collet	2017-11-09	1	-20/+11
\|
*	removed the ip++ at the beginning of block	Yann Collet	2017-11-08	1	-1/+0
\| \| \| \| \| \| \| \| \| \| \| \| \|	The first byte used to be skipped to avoid a infinite self-comparison. This is no longer necessary, since init() ensures that index starts at 64K. The first byte is also useless to search when each block is independent, but it's no longer the case when blocks are linked. Removing the first-byte-skip saves about 10 bytes / MB on files compressed with -BD4 (linked blocks 64Kb), which feels correct as each MB has 16 blocks of 64KB.
*	minor comment edit	Yann Collet	2017-11-03	1	-7/+6
\|
*	moved ctx->end handling from parsers	Yann Collet	2017-11-03	1	-1/+0
\| \| \| \|	responsibility better handled one layer above (LZ4HC_compress_generic())
*	removed ctx->searchNum	Yann Collet	2017-11-03	1	-6/+8
\| \| \| \| \|	nbSearches now transmitted directly as function parameter easier to track and debug
*	LZ4_compress_HC_continue_destSize() now compatible with optimal parser	Yann Collet	2017-11-03	1	-5/+5
\| \| \| \|	levels 11+
*	removes matches[] table	Yann Collet	2017-11-03	1	-73/+67
\| \| \| \| \|	saves stack space clearer match finder interface (no more table to fill)
*	removed useless parameter from hash chain matchfinder	Yann Collet	2017-11-03	1	-5/+4
\| \| \| \|	used to be present for compatibility with binary tree matchfinder
*	removed code and reference to binary tree match finder	Yann Collet	2017-11-03	1	-122/+2
\| \| \| \|	reduced size of LZ4HC state
*	improved level 11 speed	Yann Collet	2017-11-03	1	-2/+4
\|
*	optimized skip strategy for level 12	Yann Collet	2017-11-03	1	-3/+6
\|
*	more generic skip formula	Yann Collet	2017-11-03	1	-13/+4
\| \| \| \|	improving speed
*	small adaptations for intermediate level 11	Yann Collet	2017-11-02	1	-6/+5
\|
*	partial search, while preserving compression ratio	Yann Collet	2017-11-02	1	-0/+14
\| \| \| \|	tag interesting places
*	searching match leading strictly farther does not work	Yann Collet	2017-11-02	1	-1/+1
\| \| \| \| \|	sometimes, it's better to re-use same match but start it later, in order to get shorter matchlength code
*	fixed last lost bytes in maximal mode	Yann Collet	2017-11-02	1	-3/+4
\| \| \| \| \|	even gained 2 bytes on calgary.tar... added conditional traces `g_debuglog_enable`
*	changed strategy : opt[] path is complete after each match	Yann Collet	2017-11-02	1	-33/+57
\| \| \| \| \| \| \|	previous strategy would leave a few "bad choices" on the ground they would be fixed later, but that requires passing through each position to make the fix and cannot give the end position of the last useful match.
*	fixed minor overflow mistake in optimal parser	Yann Collet	2017-10-31	1	-1/+5
\| \| \| \|	saving 20 bytes on calgary.tar
*	fixed minor initialization warning	Yann Collet	2017-10-30	1	-1/+1
\|
*	added hash chain with conditional length	Yann Collet	2017-10-25	1	-1/+2
\| \| \| \|	not a success yet
*	lz4opt: added hash chain search	Yann Collet	2017-10-21	1	-14/+44
\|
*	switched many types to int	Yann Collet	2017-10-20	1	-38/+37
\|
*	removed SET_PRICE macro	Yann Collet	2017-10-20	1	-17/+14
\|
*	removed one macro usage	Yann Collet	2017-10-20	1	-4/+11
\|
*	minor refactor	Yann Collet	2017-10-20	1	-28/+35
\| \| \| \| \|	reduce variable scope remove one macro usage
*	lz4opt: refactor sequence reverse traversal	Yann Collet	2017-10-20	1	-10/+20
\|
*	refactor variable matchnum	Yann Collet	2017-10-20	1	-14/+14
\| \| \| \| \|	separate initial and iterative search renamed nb_matches
*	simplified initial cost conditions	Yann Collet	2017-10-20	1	-10/+15
\| \| \| \|	llen integrated in opt[]
*	added assert	Yann Collet	2017-10-19	1	-1/+1
\|
*	renamed last_pos into last_match_pos	Yann Collet	2017-10-19	1	-15/+15
\|
*	simplified early exit when single solution	Yann Collet	2017-10-19	1	-5/+5
\|
*	FIX: added prefix to FORCE_INLINE to prevent redefinition error during ↵	tcpan	2017-08-24	1	-5/+5
\| \| \| \|	compilation when used with other libraries that define FORCE_INLINE
*	fix #369	Yann Collet	2017-06-26	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	The bug would make the bt search read one byte in an invalid memory region, and make a branch decision based on its value. Impact was small (missed compression opportunity). It only happens in -BD mode, with extDict-prefix overlapping matches. The bt match search is supposed to work also in extDict mode. In which case, the match ptr can point into Dict. When the match was overlapping Dict<->Prefix, match[matchLength] would end up outside of Dict, in an invalid memory area. The correction ensures that in such a case, match[matchLength] ends up at intended location, inside prefix.
*	changed macro HEAPMODE into LZ4_HEAPMODE	Yann Collet	2017-05-02	1	-6/+7
\| \| \| \| \| \| \|	This macro is susceptible to be triggered from user side typically through compiler flag (-DLZ4_HEAPMODE=1). In which case, it makes sense to prefix the macro since we want to reduce potential side-effect on namespace.
*	Merge branch 'optlz4opt' of github.com:Cyan4973/lz4 into optlz4opt	Yann Collet	2017-03-20	1	-1/+0
\|\
\| *	slight btopt speed improvement	Yann Collet	2017-03-18	1	-2/+2
\| \| \| \| \| \| \| \|	removing a useless test
* \|	minor refactor	Yann Collet	2017-03-20	1	-72/+71
\| \|
* \|	slight btopt speed improvement	Yann Collet	2017-03-20	1	-3/+4
\|/ \| \| \|	removing a useless test
*	made SET_PRICE macro more usable	Yann Collet	2017-03-18	1	-4/+4
\| \| \| \| \|	previous version would use argument to also change target member. Now, only values are transferred
*	improved lz4opt speed (~4%)	Yann Collet	2017-03-17	1	-12/+12
\|
*	minor price function optimization	Yann Collet	2017-03-17	1	-8/+6
\|
*	LZ4_compress_HC_destSize() uses LZ4HC_compress_generic() code path	Yann Collet	2017-03-16	1	-1/+1
\| \| \| \| \|	Limits compression level to 10, to remain compatible with Hash Chain.
*	removed nextToUpdateBT	Przemyslaw Skibinski	2016-12-28	1	-3/+3
\|