Austin Murphy on 4 Apr 2011 12:46:01 -0700


[Date Prev] [Date Next] [Thread Prev] [Thread Next] [Date Index] [Thread Index]

Re: [PLUG] gnu parallel and tar


On Mon, Apr 4, 2011 at 2:37 PM, Julien Vehent <julien@linuxwall.info> wrote:
> On Mon, 4 Apr 2011 13:21:59 -0400, Austin Murphy wrote:
>>
>> I've had a good experience with lbzip2, a multi-threaded
>> implementation of bzip.
...
> Initial file:
> $ ls -s jmeter-server-node1.log --block-size=1
> 689274880 jmeter-server-node1.log
>
>
> === with bzip2 ====
> $ time bzip2 -z -9 jmeter-server-node1.log
>
> real    8m33.220s
> user    8m31.444s
> sys     0m0.880s
>
> $ ls -s jmeter-server-node1.log.bz2 --block-size=1
> 1589248 jmeter-server-node1.log.bz2
...
> === with lbzip2 ====
> $ time lbzip2 -n 4 -z -9 -S jmeter-server-node1.log
>
> real    5m37.425s
> user    20m57.227s
> sys     0m5.016s
>
> $ ls -s jmeter-server-node1.log.bz2 --block-size=1
> 1601536 jmeter-server-node1.log.bz2
...
> Compression is of the same level, but I'm surprised to see that while lbzip2
> is 65% faster, it also uses 250% more user time than bzip2. The efficiency
> per-core is a lot lower, but I'm happy to be using all my cores.


My understanding is that bzip2 is highly optimized to avoid cache
misses.  If you have too many threads running at once you might be
blowing out a shared cache.   You might try running with -n 2 or
letting it decide how many threads to run.

Austin
___________________________________________________________________________
Philadelphia Linux Users Group         --        http://www.phillylinux.org
Announcements - http://lists.phillylinux.org/mailman/listinfo/plug-announce
General Discussion  --   http://lists.phillylinux.org/mailman/listinfo/plug