commit | 76ef6d0ab0ea0937e8bb92c3a11a6445504d720f | [log] [download] |
---|---|---|
author | Alex Deymo <deymo@google.com> | Fri Jul 28 16:33:43 2017 +0200 |
committer | Alex Deymo <deymo@google.com> | Mon Aug 07 10:34:33 2017 +0200 |
tree | bce8cddddfe691fb961c86185a662e76ab317c2f | |
parent | 9a967030d7f049042a64cd095bd54ebd34f53780 [diff] |
Allow to predefine FORCE_INLINE macro. FORCE_INLINE macro is defined based on the compiler used. When using gcc, it will include "__attribute__((always_inline))" forcing gcc to always inline all the functions marked as FORCE_INLINE. However, this can cause a performance degradation of about 15%. This patch allows to set the FORCE_INLINE macro from the compiler command line to either "static" or "static inline" giving allowing it to inline functions as needed when performing optimizations.
LZ4 is lossless compression algorithm, providing compression speed at 400 MB/s per core, scalable with multi-cores CPU. It features an extremely fast decoder, with speed in multiple GB/s per core, typically reaching RAM speed limits on multi-core systems.
Speed can be tuned dynamically, selecting an “acceleration” factor which trades compression ratio for more speed up. On the other end, a high compression derivative, LZ4_HC, is also provided, trading CPU time for improved compression ratio. All versions feature the same decompression speed.
LZ4 library is provided as open-source software using BSD 2-Clause license.
Branch | Status |
---|---|
master | |
dev |
Branch Policy:
- The “master” branch is considered stable, at all times.
- The “dev” branch is the one where all contributions must be merged before being promoted to master.
- If you plan to propose a patch, please commit into the “dev” branch, or its own feature branch. Direct commit to “master” are not permitted.
The benchmark uses lzbench, from @inikep compiled with GCC v6.2.0 on Linux 64-bits. The reference system uses a Core i7-3930K CPU @ 4.5GHz. Benchmark evaluates the compression of reference Silesia Corpus in single-thread mode.
Compressor | Ratio | Compression | Decompression |
---|---|---|---|
memcpy | 1.000 | 7300 MB/s | 7300 MB/s |
LZ4 fast 8 (v1.7.3) | 1.799 | 911 MB/s | 3360 MB/s |
LZ4 default (v1.7.3) | 2.101 | 625 MB/s | 3220 MB/s |
LZO 2.09 | 2.108 | 620 MB/s | 845 MB/s |
QuickLZ 1.5.0 | 2.238 | 510 MB/s | 600 MB/s |
Snappy 1.1.3 | 2.091 | 450 MB/s | 1550 MB/s |
LZF v3.6 | 2.073 | 365 MB/s | 820 MB/s |
Zstandard 1.1.1 -1 | 2.876 | 330 MB/s | 930 MB/s |
Zstandard 1.1.1 -3 | 3.164 | 200 MB/s | 810 MB/s |
zlib deflate 1.2.8 -1 | 2.730 | 100 MB/s | 370 MB/s |
LZ4 HC -9 (v1.7.3) | 2.720 | 34 MB/s | 3240 MB/s |
zlib deflate 1.2.8 -6 | 3.099 | 33 MB/s | 390 MB/s |
LZ4 is also compatible and well optimized for x32 mode, for which it provides +10% speed performance.
The raw LZ4 block compression format is detailed within lz4_Block_format.
To compress an arbitrarily long file or data stream, multiple blocks are required. Organizing these blocks and providing a common header format to handle their content is the purpose of the Frame format, defined into lz4_Frame_format. Interoperable versions of LZ4 must respect this frame format.
Beyond the C reference source, many contributors have created versions of lz4 in multiple languages (Java, C#, Python, Perl, Ruby, etc.). A list of known source ports is maintained on the LZ4 Homepage.