<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: How To Think About Compression, Part 2</title>
	<atom:link href="http://changelog.complete.org/archives/931-how-to-think-about-compression-part-2/feed" rel="self" type="application/rss+xml" />
	<link>http://changelog.complete.org/archives/931-how-to-think-about-compression-part-2</link>
	<description>Viewpoints on technology, society, and government</description>
	<lastBuildDate>Sun, 05 Feb 2012 03:11:59 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: Research on deduplicating disk-based and cloud backups &#124; The Changelog</title>
		<link>http://changelog.complete.org/archives/931-how-to-think-about-compression-part-2/comment-page-1#comment-8381</link>
		<dc:creator>Research on deduplicating disk-based and cloud backups &#124; The Changelog</dc:creator>
		<pubDate>Thu, 20 Jan 2011 17:55:07 +0000</pubDate>
		<guid isPermaLink="false">http://changelog.complete.org/?p=931#comment-8381</guid>
		<description>[...] at the moment. I set up a 2GB ZFS pool for this test, and set dedupe=on and compress=gzip-2. When I evaluated compression in the past, I hadn&#8217;t looked at lzjb. I found a blog post comparing lzjb to the gzip options [...]</description>
		<content:encoded><![CDATA[<p>[...] at the moment. I set up a 2GB ZFS pool for this test, and set dedupe=on and compress=gzip-2. When I evaluated compression in the past, I hadn&#8217;t looked at lzjb. I found a blog post comparing lzjb to the gzip options [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ole Tange</title>
		<link>http://changelog.complete.org/archives/931-how-to-think-about-compression-part-2/comment-page-1#comment-4215</link>
		<dc:creator>Ole Tange</dc:creator>
		<pubDate>Sat, 01 Aug 2009 18:21:10 +0000</pubDate>
		<guid isPermaLink="false">http://changelog.complete.org/?p=931#comment-4215</guid>
		<description>Did you have a chance to look at FreeArc.org?

I was impressed the time was comparable to Bzip2, but compressed 20-30% better.</description>
		<content:encoded><![CDATA[<p>Did you have a chance to look at FreeArc.org?</p>
<p>I was impressed the time was comparable to Bzip2, but compressed 20-30% better.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Charles Darke</title>
		<link>http://changelog.complete.org/archives/931-how-to-think-about-compression-part-2/comment-page-1#comment-3911</link>
		<dc:creator>Charles Darke</dc:creator>
		<pubDate>Sat, 13 Jun 2009 21:38:34 +0000</pubDate>
		<guid isPermaLink="false">http://changelog.complete.org/?p=931#comment-3911</guid>
		<description>You might be interested to google for pigz (parallel version of gzip). Personally, I use lzop (since it is free) or pigz as a good trade off. LZMA is good where time is not a factor (decompression speed is reasonable).</description>
		<content:encoded><![CDATA[<p>You might be interested to google for pigz (parallel version of gzip). Personally, I use lzop (since it is free) or pigz as a good trade off. LZMA is good where time is not a factor (decompression speed is reasonable).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John Goerzen</title>
		<link>http://changelog.complete.org/archives/931-how-to-think-about-compression-part-2/comment-page-1#comment-3314</link>
		<dc:creator>John Goerzen</dc:creator>
		<pubDate>Mon, 02 Mar 2009 13:45:49 +0000</pubDate>
		<guid isPermaLink="false">http://changelog.complete.org/?p=931#comment-3314</guid>
		<description>The scale is plainly visible.  Sometimes the difference between numbers is such that in a size to be able to fit in a blog post, starting it at zero would be useless itself.</description>
		<content:encoded><![CDATA[<p>The scale is plainly visible.  Sometimes the difference between numbers is such that in a size to be able to fit in a blog post, starting it at zero would be useless itself.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Marijn Schouten</title>
		<link>http://changelog.complete.org/archives/931-how-to-think-about-compression-part-2/comment-page-1#comment-3313</link>
		<dc:creator>Marijn Schouten</dc:creator>
		<pubDate>Mon, 02 Mar 2009 12:14:25 +0000</pubDate>
		<guid isPermaLink="false">http://changelog.complete.org/?p=931#comment-3313</guid>
		<description>Graphs that don&#039;t start at zero are misleading and thus worse than useless.</description>
		<content:encoded><![CDATA[<p>Graphs that don&#8217;t start at zero are misleading and thus worse than useless.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Palmax</title>
		<link>http://changelog.complete.org/archives/931-how-to-think-about-compression-part-2/comment-page-1#comment-3258</link>
		<dc:creator>Palmax</dc:creator>
		<pubDate>Wed, 18 Feb 2009 21:37:32 +0000</pubDate>
		<guid isPermaLink="false">http://changelog.complete.org/?p=931#comment-3258</guid>
		<description>Did you use threads?</description>
		<content:encoded><![CDATA[<p>Did you use threads?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Bill</title>
		<link>http://changelog.complete.org/archives/931-how-to-think-about-compression-part-2/comment-page-1#comment-3257</link>
		<dc:creator>Bill</dc:creator>
		<pubDate>Wed, 18 Feb 2009 21:28:45 +0000</pubDate>
		<guid isPermaLink="false">http://changelog.complete.org/?p=931#comment-3257</guid>
		<description>I&#039;ve been wondering how to describe my results using debootstrap. I got into it by wanting to use the --save-tarball as well as (unrelated to this post, directly anyway, learn how to --second-stage scripting from Neil Williams emdebian stuff)
I debootstraped all etch variants ( I wanted to get a copy before I couldn&#039;t easily anymore)  
When the minbase vairant of etch is tarballed it is 31386007 bytes - I thought to myself Nice...
Also wanting to know what the others looked like and in case I needed them too.
buildd is 52514446 and fakechroot is 45102635
I&#039;ve been intrigued by the notion of this thing called pristine tar since I first heard Joey Hess blog of it around the time he was moving from svn to git iirc..
&quot;minbase seems to me at this point&quot; to be a pristine debain install - although I have already removed all important priority packages as well using cat and diff to get the list from dpkg/info file.
After unpacking the tarball I got an urge to test tar on what I thought would be a decent test target.
So I 
first made a copy of the minbase system
cp -a /mnt/md6 /srv/pristine.minbase
du -b /srv/pristine.minbase 92913288
than I  made archives using tar, and tar switches for gzip, bzip2 and lzma
tar -xf pristine.tar /mnt/md6/*
tar -xzf pristine.tar.gz /mnt/md6/*
tar -xjf pristine.tar.bz2 /mnt/md6/*
tar -xf --lzma pristine.tar.lzma /mnt/md6/*
iirc
ls -l /srv
shows interestingly
drwxr-xr-x 20 root root     4096 2009-02-03 00:30 pristine.minbase
-rw-r--r--  1 root root 96634880 2009-02-03 11:56 pristine.tar
-rw-r--r--  1 root root 32455598 2009-02-03 11:58 pristine.tar.bz2
-rw-r--r--  1 root root 37310068 2009-02-03 11:57 pristine.tar.gz
-rw-r--r--  1 root root 22177523 2009-02-03 12:01 pristine.tar.lzma

Have I forgotten anything?</description>
		<content:encoded><![CDATA[<p>I&#8217;ve been wondering how to describe my results using debootstrap. I got into it by wanting to use the &#8211;save-tarball as well as (unrelated to this post, directly anyway, learn how to &#8211;second-stage scripting from Neil Williams emdebian stuff)<br />
I debootstraped all etch variants ( I wanted to get a copy before I couldn&#8217;t easily anymore)<br />
When the minbase vairant of etch is tarballed it is 31386007 bytes &#8211; I thought to myself Nice&#8230;<br />
Also wanting to know what the others looked like and in case I needed them too.<br />
buildd is 52514446 and fakechroot is 45102635<br />
I&#8217;ve been intrigued by the notion of this thing called pristine tar since I first heard Joey Hess blog of it around the time he was moving from svn to git iirc..<br />
&#8220;minbase seems to me at this point&#8221; to be a pristine debain install &#8211; although I have already removed all important priority packages as well using cat and diff to get the list from dpkg/info file.<br />
After unpacking the tarball I got an urge to test tar on what I thought would be a decent test target.<br />
So I<br />
first made a copy of the minbase system<br />
cp -a /mnt/md6 /srv/pristine.minbase<br />
du -b /srv/pristine.minbase 92913288<br />
than I  made archives using tar, and tar switches for gzip, bzip2 and lzma<br />
tar -xf pristine.tar /mnt/md6/*<br />
tar -xzf pristine.tar.gz /mnt/md6/*<br />
tar -xjf pristine.tar.bz2 /mnt/md6/*<br />
tar -xf &#8211;lzma pristine.tar.lzma /mnt/md6/*<br />
iirc<br />
ls -l /srv<br />
shows interestingly<br />
drwxr-xr-x 20 root root     4096 2009-02-03 00:30 pristine.minbase<br />
-rw-r&#8211;r&#8211;  1 root root 96634880 2009-02-03 11:56 pristine.tar<br />
-rw-r&#8211;r&#8211;  1 root root 32455598 2009-02-03 11:58 pristine.tar.bz2<br />
-rw-r&#8211;r&#8211;  1 root root 37310068 2009-02-03 11:57 pristine.tar.gz<br />
-rw-r&#8211;r&#8211;  1 root root 22177523 2009-02-03 12:01 pristine.tar.lzma</p>
<p>Have I forgotten anything?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Markus Hochholdinger</title>
		<link>http://changelog.complete.org/archives/931-how-to-think-about-compression-part-2/comment-page-1#comment-3256</link>
		<dc:creator>Markus Hochholdinger</dc:creator>
		<pubDate>Wed, 18 Feb 2009 19:48:17 +0000</pubDate>
		<guid isPermaLink="false">http://changelog.complete.org/?p=931#comment-3256</guid>
		<description>Great. The last time i made a gzip/bzip2 decission for my backups must be 2 years ago.
My reults are, the data is compressed with gzip because bzip2 was too slow fot the network and the tape. But i use bzip2 for my backup database to easily search for files in the backups because bzip2 saves me a lot of disk space compared to gzip.</description>
		<content:encoded><![CDATA[<p>Great. The last time i made a gzip/bzip2 decission for my backups must be 2 years ago.<br />
My reults are, the data is compressed with gzip because bzip2 was too slow fot the network and the tape. But i use bzip2 for my backup database to easily search for files in the backups because bzip2 saves me a lot of disk space compared to gzip.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Michael G</title>
		<link>http://changelog.complete.org/archives/931-how-to-think-about-compression-part-2/comment-page-1#comment-3253</link>
		<dc:creator>Michael G</dc:creator>
		<pubDate>Wed, 18 Feb 2009 17:19:55 +0000</pubDate>
		<guid isPermaLink="false">http://changelog.complete.org/?p=931#comment-3253</guid>
		<description>Great analysis.  

I&#039;d like to see the numbers for decompression as well.  There are a lot of cases where I&#039;d be willing to pay a lot of computing time during compression as long as decompression isn&#039;t too bad.  If I&#039;m putting something up for public download, for example, it&#039;s going to be compressed just once, but decompressed by every single person who downloads it.</description>
		<content:encoded><![CDATA[<p>Great analysis.  </p>
<p>I&#8217;d like to see the numbers for decompression as well.  There are a lot of cases where I&#8217;d be willing to pay a lot of computing time during compression as long as decompression isn&#8217;t too bad.  If I&#8217;m putting something up for public download, for example, it&#8217;s going to be compressed just once, but decompressed by every single person who downloads it.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: suscal</title>
		<link>http://changelog.complete.org/archives/931-how-to-think-about-compression-part-2/comment-page-1#comment-3250</link>
		<dc:creator>suscal</dc:creator>
		<pubDate>Wed, 18 Feb 2009 04:55:52 +0000</pubDate>
		<guid isPermaLink="false">http://changelog.complete.org/?p=931#comment-3250</guid>
		<description>&gt; For part 2, I will be compressing each individual file 
&gt; contained in that tarball individually. This is a good 
&gt; test if you back up to hard disk and want quick 
&gt; access to your files.

For this cases, I like squashfs!</description>
		<content:encoded><![CDATA[<p>&gt; For part 2, I will be compressing each individual file<br />
&gt; contained in that tarball individually. This is a good<br />
&gt; test if you back up to hard disk and want quick<br />
&gt; access to your files.</p>
<p>For this cases, I like squashfs!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: How To Think About Compression &#124; The Changelog</title>
		<link>http://changelog.complete.org/archives/931-how-to-think-about-compression-part-2/comment-page-1#comment-3249</link>
		<dc:creator>How To Think About Compression &#124; The Changelog</dc:creator>
		<pubDate>Wed, 18 Feb 2009 03:20:00 +0000</pubDate>
		<guid isPermaLink="false">http://changelog.complete.org/?p=931#comment-3249</guid>
		<description>[...] Part 2 of this story is now available, which considers more compression tools, and looks at performance compressing files individually rather than the large tar file. [...]</description>
		<content:encoded><![CDATA[<p>[...] Part 2 of this story is now available, which considers more compression tools, and looks at performance compressing files individually rather than the large tar file. [...]</p>
]]></content:encoded>
	</item>
</channel>
</rss>

