Well,it’s just a few days until our baby is born! We’re starting to get excited. And things are pretty busy, too — so that explains why I haven’t had many blog posts lately.
Monthly Archives: September 2006
Disk encryption support in Etch
Well, I got my new MacBook Pro 15″ in yesterday. I’ll write something about that shortly. The main OS for this machine is not Mac OS X, though, but Debian.
I decided that, being a laptop, I would like to run dm-crypt on here. Much to my delight, the etch installers support dm-crypt out of the box.
Not only that, but they supported this setup out of the box, too:
- Two partitions for Debian — one for /boot, everything else on the second one
- The second partition is completely encrypted
- Inside the encrypted container is an LVM physical volume
- Inside the LVM physical volume are logical volumes for /, /home, /usr, /var, and swap
- XFS is used for each filesystem
Not only that, but it set up proper boot sequence for all of this out of the box, too.
So I turn on the unit, enter the password for the encrypted partition, and then the system continues booting.
Nice. Very nice.
Kudos to the debian-installer and initramfs teams.
Renovation: Weeks 10-12
Well, it’s been awhile since I’ve posted photos of our renovation project. We’ve been very busy, and I’ll write about that sometime soon, too.
But anyway, I’ve now posted photos of weeks 10 – 12 of the renovation.
Some of the highlights include: new windows, a new roof, two new staircases, three old oil heaters removed from the attic (some still with oil in them), and a little joking around from the workers.
Click above to see all of the photos.
Here are two samples from this set. First, how the house looks from the oustide now:
And this one:
Someone found that old commode from the attic. They put it on the dirt pile, and put a bucket under it for good measure.
Nobody actually used it.
That we know of.
Two Stores I Like
There are a lot of places to buy stuff from online. Two favorites are cat5ecablguy and B&H Photo & Video.
First, cat5ecableguy. Well, despite the name, he sells cat3, cat5e, cat6, cable — both patch cables and bulk — as well as other things like speaker wire, jacks, coaxial cable, etc. He’s got quality products and almost always the cheapest around. It’s not unusual to see him selling things at about half the normal price.
I bought a lot of stuff from cat5ecableguy.com when we remodeled one room in our current house — bulk cable, keystone jacks, and keystone plates, mostly. We also buy from him at work — mostly patch cables there. He has prompt shipping and is great to work with. He’s even been helping me track down black CAT6 keystone jacks for our remodel project.
Second, B&H Photo. It seems like if you use a service like shopping.com or froogle to find good prices on things like cameras, TVs, etc., that there are a whole bunch of companies that advertise good prices but then call you to “confirm” an order, only to use high-pressure tactics to sell you way overpriced accessories. Some of them even go so far as to cancel orders for people that don’t purchase the accessories. Others just have bad prices or sell “grey market” merchandise (stuff intended for a non-US market that doesn’t have a valid US warranty) without mentioning it.
This is not B&H. B&H has a bunch of high-quality stuff, good prices (often not the absolute lowest, but certainly among the lowest, and usually the lowest trustworthy store), and good service. They have some large print catalogs you can request as well.
I was surprised to see recently that they’ve added pro audio to their lineup, so when I got a digitial audio recorder, I got it from them. It was a $500 item or so. They called me to confirm the order — on a *Sunday* morning no less — but with them, that really is all it is. “Did you place an order for xyz from us?” “Yes.” “OK, I’ve released it for shipping. Do you have any questions for us?” “No.” “OK, thanks for using B&H.”
I’m not quite sure why they call to confirm some orders, but I think it’s a fraud protection measure.
So anyway, I haven’t been paid to write these comments, and neither place even knows I’m doing this. I just wanted to pass along my experiences with two very nice, clueful, and honest retailers.
Another Haskell Solution to Lars’ Problem
Yesterday, I posted an 18-line solution to Lars’ language problem. One problem with it was that it was not very memory-efficient (or time-efficient, for that matter). In other words, it was optimized for elegance.
Here is a 22-line solution that is much more memory-efficient and works well with his “huge” test case. Note to Planet readers: Planet seems to corrupt code examples at times; click on the original story to see the correct code.
import System.Environment import Data.List import Data.Char import qualified Data.Map as Map custwords = filter (/= "") . lines . map (conv . toLower) where iswordchar x = isAlphaNum x && isAscii x conv x = if iswordchar x then x else '\n' wordfreq inp = Map.toList $ foldl' updmap (Map.empty::Map.Map String Int) inp where updmap nm word = case Map.lookup word nm of Nothing -> Map.insert word 1 nm Just x -> (Map.insert word $! x + 1) nm freqsort (w1, c1) (w2, c2) = if c1 == c2 then compare w1 w2 else compare c2 c1 showit (word, count) = show count ++ " " ++ word main = do args <- getArgs interact $ unlines . map showit . take (read . head $ args) . sortBy freqsort . wordfreq . custwords
The main change from the previous example to this one is using a Map to keep track of the frequency of each word.
A Haskell solution to Lars’ Problem
Thanks to a little glitch in planet, one of Lars’ posts from 2004 came to my attention. In it, he proposes a test for language benchmarking:
Read text from the standard input and count the number of times each word occurs. Convert letters to lower case. Order the words according to frequency, words with the same frequency should be ordered in ascending lexicographic order according to character code. Print out the top N words, where N is a decimal number given on the command line. Each output line must contain the count, a space, and the word (in lower case), and end in an ASCII LINE FEED character. Output must contain exactly N such output lines and no other output lines.
A word contains only ASCII letters A through Z and a through z (convert upper case to lower case) and ASCII digits 0 through 9 and is not empty. All other characters separate words and are ignored except to notice word boundaries. Word boundaries only occur at the beginning and end of the file and at non-word characters. You may not assume a maximum length for the word, line, or input file.
He provides a tarball with sample implementations in C, Python, and Shell.
His C code is 183 lines long, Python 57, and Shell 11. The specs for this test seem particularly suited for shell.
I wrote a version in Haskell, commented and formatted approximately the same as his Python version, but using an algorithm more like the shell version. It comes in at 18 lines. Here it is:
import System.Environment import Data.List import Data.Char custwords = filter (/= "") . lines . map (conv . toLower) where iswordchar x = isAlphaNum x && isAscii x conv x = if iswordchar x then x else '\n' wordfreq = map (\x -> (head x, length x)) . group . sort freqsort (w1, c1) (w2, c2) = if c1 == c2 then compare w1 w2 else compare c2 c1 showit (word, count) = show count ++ " " ++ word main = do args <- getArgs interact $ unlines . map showit . take (read . head $ args) . sortBy freqsort . wordfreq . custwords
Taking a look at this, one thing that might strike you is the function composition in main. This takes the output from one function and feeds it into the next -- and the Haskell syntactic sugar for this makes it look a lot like pipes in the shell version. The interact call takes, as a parameter, a function that takes a string and returns a string. interact supplies stdin as the input and prints the output to stdout. Note that, since Haskell is lazily, this does not mean buffering up the entire input or output -- it is read and written on demand.
The rest of the functions are also standard in Haskell, and you can find them in the index to the library reference if you want to learn more.
I understand and agree that short code doesn't necessarily mean good code, but I think that Haskell provides a very elegant and expressive solution to many problems -- one that also happens to be remarkably concise.
Updated 9/4: Changed isLower to isAlphaNum to fix a bug, and removed unnecessary Data.Map import
Lazy big-O and Haskell Answers
First, Evan has a host of interesting articles about Haskell, and I found his lazy big-O article particulary interesting.
Next, Eric Warmenhoven has recently taken up Haskell and posted some Haskell questions on his blog. Eric, here are some answers for you.
First, regarding shared libraries. While Haskell can be compiled to machine code, and GHC is a popular way to do that, a standard C way of representing information about a library (.h and .so files) is not really rich enough for Haskell. Consider, for instance, that functions may accept arguments of a wide range of types (or even things such as lists of any type). Haskell also performs type checking, and thus must know the type of arguments a function expects, as well as its return type, at compile time. So you do not generally compile Haskell code directly to .so files, but rather use the compiler’s module or package support to do that. See Cabal for more information on packages. Through the FFI (Foreign Function Interface), it is possible to both call into C and be called from C with Haskell code, if that’s where you want to go. It is actually easier in Haskell than in any other high-level language I’ve dealt with before.
Regarding circular module deps — I’ve never used them and can’t really comment. I can say, though, that the .boot files are internal files created by GHC.
Regarding practical stuff in tutorials — I share your complaint there. I have found a few that are better than the others: Yet Another Haskell Tutorial, and Haskell: The Craft of Functional Programming, 2nd ed., by Simon Thompson. Several of us are working intermittently on a project called Haskell V8 — take a look and darcs send me patches! I would say that Haskell’s I/O system is the most powerful I’ve seen in many ways — especially with regard to laziness — and in the upcoming GHC 6.6 release, it will be both lazy *and* blazingly fast. Very nice.
There isn’t much Debian-specific documentation, but there is a draft policy and a mailing list (link to it is in the policy doc).
Hope this helps!