Haskell for all: The category design pattern

Saturday, August 18, 2012

The category design pattern

Functional programming is all the rage these days, but in this post I want to emphasize that functional programming is a subset of a more important overarching programming paradigm: compositional programming.

If you've ever used Unix pipes, you'll understand the importance and flexibility of composing small reusable programs to get powerful and emergent behaviors. Similarly, if you program functionally, you'll know how cool it is to compose a bunch of small reusable functions into a fully featured program.

Category theory codifies this compositional style into a design pattern, the category. Moreover, category theory gives us a precise prescription for how to create our own abstractions that follow this design pattern: the category laws. These laws set the category apart from other design patterns by providing rigorous criteria for what does and does not qualify as compositional.

One could easily dismiss this compositional ideal as just that: an ideal, something unsuitable for "real-world" scenarios. However, category theory supplies the substance behind the ideal: it shows that composition appears everywhere and can rise to the challenge of messy problems and complex business logic.

This post is the first of many in which I will demonstrate how to use this compositional style in practice, even for problems that seem like they couldn't possibly lend themselves to compositional programming. It starts off by introducing the category as a compositional design pattern.

The function category

Let's define our first category: the category of Haskell functions!

id  :: (a -> a)
id x = x

(.) :: (b -> c) -> (a -> b) -> (a -> c)
(f . g) x = f (g x)

Let's prove to ourselves that these obey the category laws, which require id to be a left and right identity for (.) and (.) to be associative:

-- Left identity: id . f = f
id . f
= \x -> id (f x)
= \x -> f x
= f

-- Right identity: f . id = f
f . id
= \x -> f (id x)
= \x -> f x
= f

-- Associativity: (f . g) . h = f . (g . h)
(f . g) . h
= \x -> (f . g) (h x)
= \x -> f (g (h x))
= \x -> f ((g . h) x)
= \x -> (f . (g . h)) x
= f . (g . h)
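The equational proofs above cover every input. As a much weaker sanity check, the same laws can be spot-checked pointwise, since functions themselves cannot be compared for equality. This is a minimal sketch: the sample functions `addOne`, `double`, and `minusThree`, the `samples` list, and the three `...Holds` names are assumptions made for illustration and are not part of the post.

```haskell
import Prelude hiding (id, (.))

infixr 9 .

-- Same definitions as in the post; the Prelude versions are hidden to avoid a clash.
id :: a -> a
id x = x

(.) :: (b -> c) -> (a -> b) -> (a -> c)
(f . g) x = f (g x)

-- Arbitrary sample functions (hypothetical, chosen only for illustration).
addOne, double, minusThree :: Int -> Int
addOne     = (+ 1)
double     = (* 2)
minusThree = subtract 3

-- Inputs at which both sides of each law are compared.
samples :: [Int]
samples = [-5 .. 5]

-- Left identity: id . f = f, checked pointwise.
leftIdentityHolds :: Bool
leftIdentityHolds = all (\x -> (id . addOne) x == addOne x) samples

-- Right identity: f . id = f, checked pointwise.
rightIdentityHolds :: Bool
rightIdentityHolds = all (\x -> (addOne . id) x == addOne x) samples

-- Associativity: (f . g) . h = f . (g . h), checked pointwise.
associativityHolds :: Bool
associativityHolds =
  all (\x -> ((addOne . double) . minusThree) x
          == (addOne . (double . minusThree)) x) samples
```

Loading this in GHCi and evaluating the three `...Holds` values gives `True` for each. A spot check like this can catch a broken law but never proves one; only the equational reasoning does that.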

Function composition is very easy to use, yet so powerful, precisely because it forms a category! This lets us express complex transformations simply by composing a bunch of reusable parts:

bestLangs :: [Language] -> [Language]
bestLangs = take 3 . sortBy (comparing speed) . filter isCool
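The pipeline above leaves `Language`, `speed`, and `isCool` undefined. To try it out, here is a self-contained sketch with hypothetical definitions: the record fields, the reading of `speed` as a time score where lower is better, and the sample data are all assumptions made up for illustration.

```haskell
import Data.List (sortBy)
import Data.Ord (comparing)

-- Hypothetical record; the post never defines Language.
data Language = Language
  { name   :: String
  , speed  :: Int   -- assumed to be a time score, so lower means faster
  , isCool :: Bool
  } deriving (Show, Eq)

-- Unchanged from the post: three reusable parts composed right to left.
bestLangs :: [Language] -> [Language]
bestLangs = take 3 . sortBy (comparing speed) . filter isCool

-- Invented sample data with fictional language names.
sampleLangs :: [Language]
sampleLangs =
  [ Language "Alpha"   30 True
  , Language "Beta"    10 True
  , Language "Gamma"   20 False
  , Language "Delta"    5 True
  , Language "Epsilon" 40 True
  ]
```

In GHCi, `map name (bestLangs sampleLangs)` evaluates to `["Delta","Beta","Alpha"]`: the filter drops Gamma, the sort orders the rest by ascending `speed`, and `take 3` keeps the first three.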

Unfortunately, we can't express all of our programs as chains of ordinary functions. I guess we just give up, right? Wrong!

The Kleisli category

The next most common category we encounter on a daily basis is the category of monadic functions, which generalize ordinary functions:

return :: (Monad m) => (a -> m a)

(<=<)  :: (Monad m) => (b -> m c) -> (a -> m b) -> (a -> m c)

Mathematicians call this the "Kleisli" category, and Control.Monad provides both of the above functions.

Notice how the type signatures of return and (<=<) resemble their functional counterparts:

id     ::              (a ->   a)
return :: (Monad m) => (a -> m a)

(.)    ::              (b ->   c) -> (a ->   b) -> (a ->   c)
(<=<)  :: (Monad m) => (b -> m c) -> (a -> m b) -> (a -> m c)

The implementation for (<=<) also resembles the implementation for function composition:

(f  .  g) x = f     (g x)
(f <=< g) x = f =<< (g x)

-- Note (=<<) is the same as (>>=), but with the arguments flipped

Not a coincidence! Monadic functions just generalize ordinary functions and the Kleisli category demonstrates that monadic functions are composable, too. They just use a different composition operator: (<=<), and a different identity: return.
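To see the resemblance in running code, here is a small sketch that writes Kleisli composition by hand and uses it next to the library's `(<=<)`. The names `composeK`, `halve`, and `decrement` are hypothetical helpers introduced only for this example.

```haskell
import Control.Monad ((<=<))

-- Hand-written Kleisli composition, mirroring the definition of (<=<) above.
composeK :: Monad m => (b -> m c) -> (a -> m b) -> (a -> m c)
composeK f g x = f =<< g x

-- Two partial operations expressed as monadic functions in Maybe.
halve :: Int -> Maybe Int
halve n
  | even n    = Just (n `div` 2)
  | otherwise = Nothing

decrement :: Int -> Maybe Int
decrement n
  | n > 0     = Just (n - 1)
  | otherwise = Nothing

-- Compose right to left, exactly as with (.): decrement first, then halve.
decrementThenHalve :: Int -> Maybe Int
decrementThenHalve = halve <=< decrement

-- The same pipeline built with the hand-written operator.
decrementThenHalve' :: Int -> Maybe Int
decrementThenHalve' = halve `composeK` decrement
```

Both versions agree: `decrementThenHalve 5` is `Just 2`, while `decrementThenHalve 4` is `Nothing` because halving 3 fails. The failure handling lives entirely in the composition operator, so the individual functions stay small.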

Well, let's assume that category theorists aren't bullshitting us and that (<=<) really is some sort of composition and return really is its identity. If that were true, we'd expect the following laws to hold:

return <=< f = f                   -- Left  identity

f <=< return = f                   -- Right identity

(f <=< g) <=< h = f <=< (g <=< h)  -- Associativity

Well, we already have the definition for (<=<):

(f <=< g) x = f =<< (g x)

... so let's use that definition to expand those laws:

return =<< (f x) = (f x)

f =<< (return x) = f x

(\y -> f =<< (g y)) =<< h x = f =<< (g =<< (h x))

If we simplify those a little and use (>>=) to flip the order of arguments, we get:

m >>= return = m

return x >>= f = f x

m >>= (\y -> g y >>= f) = (m >>= g) >>= f

Look familiar? Those are just the monad laws, which all Monad instances are required to satisfy. If you have ever wondered where those monad laws came from, now you know! They are just the category laws in disguise.
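The Kleisli form of the laws can also be spot-checked for a concrete monad. The sketch below does so for the list monad at a handful of inputs. The arrows `f`, `g`, `h` and the `samples` list are arbitrary choices made for illustration, and a check like this is evidence rather than proof.

```haskell
import Control.Monad ((<=<))

-- Arbitrary Kleisli arrows for the list monad (hypothetical examples).
f, g, h :: Int -> [Int]
f x = [x, x + 1]
g x = [x * 2]
h x = if x > 0 then [x - 1] else []

-- Inputs at which both sides of each law are compared.
samples :: [Int]
samples = [-3 .. 3]

-- return <=< f = f
leftIdentityHolds :: Bool
leftIdentityHolds = all (\x -> (return <=< f) x == f x) samples

-- f <=< return = f
rightIdentityHolds :: Bool
rightIdentityHolds = all (\x -> (f <=< return) x == f x) samples

-- (f <=< g) <=< h = f <=< (g <=< h)
associativityHolds :: Bool
associativityHolds =
  all (\x -> ((f <=< g) <=< h) x == (f <=< (g <=< h)) x) samples
```

Each of the three values evaluates to `True`. Swapping in a type whose `Monad` instance breaks the laws would make at least one of these checks fail for suitable sample inputs.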

Consequently, every new Monad we define gives us a category for free! Let's try out some of these brave new categories:

-- The Kleisli category for the Maybe monad
lookup       :: Eq k => k -> [(k, v)] -> Maybe v
maximumByMay :: (a -> a -> Ordering) -> [a] -> Maybe a

bestSuitor :: [(String, [Suitor])] -> Maybe Suitor
bestSuitor = maximumByMay (comparing handsome) <=< lookup "Tall"


-- The Kleisli category for the [] monad
children :: Person -> [Person]

greatGrandChildren :: Person -> [Person]
greatGrandChildren = children <=< children <=< children


-- The Kleisli category for the IO monad
-- * Stolen from /r/haskell today
spawn      ::  IO a  -> IO (IO a)
mapM spawn :: [IO a] -> IO [IO a]
sequence   :: [IO a] -> IO    [a]

concurrentSequence :: [IO a] -> IO [a]
concurrentSequence = sequence <=< mapM spawn
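The Maybe and list examples above rely on types and helpers that the post does not define. Here is a runnable sketch of both, under stated assumptions: `Suitor`, its fields, the `familyTree` data, and the `Person` synonym are invented for illustration, and `maximumByMay` is defined locally as a stand-in for the library function of the same name.

```haskell
import Control.Monad ((<=<))
import Data.List (maximumBy)
import Data.Maybe (fromMaybe)
import Data.Ord (comparing)

-- Hypothetical record; the post defines neither Suitor nor handsome.
data Suitor = Suitor { suitorName :: String, handsome :: Int }
  deriving (Show, Eq)

-- Local stand-in: like maximumBy, but Nothing on an empty list.
maximumByMay :: (a -> a -> Ordering) -> [a] -> Maybe a
maximumByMay _   [] = Nothing
maximumByMay cmp xs = Just (maximumBy cmp xs)

-- Kleisli composition in Maybe: either step may fail.
bestSuitor :: [(String, [Suitor])] -> Maybe Suitor
bestSuitor = maximumByMay (comparing handsome) <=< lookup "Tall"

-- Invented sample data.
suitors :: [(String, [Suitor])]
suitors =
  [ ("Short", [Suitor "Ed" 4])
  , ("Tall",  [Suitor "Al" 7, Suitor "Bo" 9, Suitor "Cy" 3])
  ]

-- Kleisli composition in []: each step may return many results.
type Person = String

familyTree :: [(Person, [Person])]
familyTree =
  [ ("Ann", ["Bob", "Cat"])
  , ("Bob", ["Dan"])
  , ("Cat", ["Eve"])
  , ("Dan", ["Fay"])
  , ("Eve", ["Gus", "Hal"])
  ]

-- People missing from the tree are assumed to have no children.
children :: Person -> [Person]
children p = fromMaybe [] (lookup p familyTree)

greatGrandChildren :: Person -> [Person]
greatGrandChildren = children <=< children <=< children
```

With this data, `bestSuitor suitors` is `Just (Suitor "Bo" 9)`, `bestSuitor []` is `Nothing`, and `greatGrandChildren "Ann"` is `["Fay","Gus","Hal"]`. The same operator handles possible failure in one case and multiple results in the other.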

Monads that don't observe these laws are buggy and unintuitive. Don't believe me? Just ask the people who tried to use ListT, which breaks the monad laws.

The `Pipe` category

Not all categories are functions. I'll use a primitive version of my Pipe type (from the pipes package) with effects removed to simplify the example:

data Pipe a b r
  = Pure r
  | Await (a -> Pipe a b r)
  | Yield b (Pipe a b r)

Pure    r  <-< _          = Pure r
Yield b p1 <-< p2         = Yield b (p1 <-< p2)
Await   f  <-< Yield b p2 = f b <-< p2
p1         <-< Await   f  = Await $ \a -> p1 <-< f a
_          <-< Pure    r  = Pure r

cat = Await $ \a -> Yield a cat

Let's check out what types the compiler infers:

cat   :: Pipe a a r

(<-<) :: Pipe b c r -> Pipe a b r -> Pipe a c r

Those look an awful lot like an identity and composition. I leave it as an exercise for the reader to prove that they actually do form a category:

cat <-< p = p                            -- Left  identity

p <-< cat = p                            -- Right identity

(p1 <-< p2) <-< p3 = p1 <-< (p2 <-< p3)  -- Associativity
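Short of a proof, the laws can be spot-checked by running pipes on concrete input. The sketch below repeats the post's definitions and adds a hypothetical interpreter, `feed`, that drives a pipe with a finite list and collects what it yields. The pipes `double`, `increment`, and `takeN` are also assumptions made up for this example, and `feed` observes only the yielded outputs, not the return value.

```haskell
-- Definitions repeated from the post.
data Pipe a b r
  = Pure r
  | Await (a -> Pipe a b r)
  | Yield b (Pipe a b r)

(<-<) :: Pipe b c r -> Pipe a b r -> Pipe a c r
Pure    r  <-< _          = Pure r
Yield b p1 <-< p2         = Yield b (p1 <-< p2)
Await   f  <-< Yield b p2 = f b <-< p2
p1         <-< Await   f  = Await $ \a -> p1 <-< f a
_          <-< Pure    r  = Pure r

cat :: Pipe a a r
cat = Await $ \a -> Yield a cat

-- Hypothetical interpreter: feed inputs from a list, collect the outputs,
-- and stop when the pipe returns or awaits with no input left.
feed :: [a] -> Pipe a b r -> [b]
feed _        (Pure _)    = []
feed xs       (Yield b p) = b : feed xs p
feed []       (Await _)   = []
feed (x : xs) (Await f)   = feed xs (f x)

-- Hypothetical sample pipes.
double, increment :: Pipe Int Int r
double    = Await $ \a -> Yield (a * 2) double
increment = Await $ \a -> Yield (a + 1) increment

takeN :: Int -> Pipe a a ()
takeN n
  | n <= 0    = Pure ()
  | otherwise = Await $ \a -> Yield a (takeN (n - 1))

inputs :: [Int]
inputs = [1, 2, 3]

leftIdentityHolds, rightIdentityHolds, associativityHolds :: Bool
leftIdentityHolds  = feed inputs (cat <-< double) == feed inputs double
rightIdentityHolds = feed inputs (double <-< cat) == feed inputs double
associativityHolds =
  feed inputs ((takeN 2 <-< double) <-< increment)
    == feed inputs (takeN 2 <-< (double <-< increment))
```

All three checks evaluate to `True`, and `feed inputs (takeN 2 <-< (double <-< increment))` gives `[4,6]`. Because pipes such as `cat` are infinite, a real proof cannot rely on plain structural induction, so this remains an observational test on finite inputs.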

Pipes show how more complicated things that don't fit neatly into the functional programming paradigm can still be achieved with a compositional programming style. I won't belabor the compositionality of pipes, though, since my tutorial already does that.

So if you find something that doesn't seem like it could be compositional, don't give up! Chances are that a compositional solution exists just beneath the surface!

Conclusions

All category theory says is that composition is the best design pattern, but then leaves it up to you to define what precisely composition is. It's up to you to discover new and interesting ways to compose things besides just composing functions. As long as the composition operator you define obeys the category laws, you're golden.

Also, I'm really glad to see a resurgence in functional programming (since functions form a category), but in the long run we really need to think about more interesting composition operators than just function composition if we are serious about tackling more complicated problem domains.

Hopefully this post gets you a little bit excited about category theory. In future posts, I will expand upon this post with the following topics:

Why the category laws ensure that code is easy, intuitive, and free of edge cases
How functors let you mix and match different categories
How to use categories to optimize your code
How to use categories to simplify equational reasoning

32 comments:

myzoski, August 18, 2012 at 9:45 AM
"I leave it as an exercise for the reader to prove that they actually do form a category"

OK, I'll bite. But let's try a special case of the identity laws, i.e. showing that idP <+< idP = idP:

idP <+< idP
= Await (\a -> Yield a idP) <+< Await (\a' -> Yield a' idP)
= Await (\x -> idP <+< (\a' -> Yield a' idP) x)
= Await (\x -> idP <+< Yield x idP)
= Await (\x -> Await (\a -> Yield a idP) <+< Yield x idP)
= Await (\x -> (\a -> Yield a idP) x <+< idP)
= Await (\x -> Yield x idP <+< idP)
= Await (\x -> Yield x (idP <+< idP))

Now, if I had (idP <+< idP) = idP on the inside, this would reduce to
= Await (\x -> Yield x idP)
= idP,

but that would just be assuming what I want to prove.

More generally, when I try to prove the category laws for Pipe a b r, I run into cases where if I could prove the law for a sub-expression, I could prove it for the whole expression, which makes me want to do induction; but presumably I also want to be able to prove results for circular, self-recursive pipes like idP in which there is no guarantee that the sub-expressions are 'smaller' than the original expression, so I don't know what to do.

I figure this must have something to do with domain theory and least fixed points and supremums and all that, but I haven't worked all the way through that material so I'm not sure how the proof would look...

j2kun, August 18, 2012 at 9:42 PM
I hate to be the snooty mathematician who corrects terminology, but here goes.

When you say something is the "category of X" you really mean the objects in that category are X and you have to define the morphisms separately. For instance, your first category of Haskell functions is actually (probably) the category of Haskell types, and you defined the morphisms to be Haskell functions. If your category was really the category of Haskell functions, then your morphisms would need to be morphisms of Haskell functions.

Of course, with the right tinkering you can blur the distinction between an object and the identity morphism on that object, but then the objects are still not *all* morphisms.

Another thing to note is that the Kleisli category is defined per monad. So it's not the category of all monadic functions across all possible monads.

I like the explanations though! I need to get back into Haskell myself.

Paul, August 21, 2012 at 2:29 AM
Hi Gabriel, nice post.

I was planning to write something on the same topic, but in French, sometime in the near future. Haskell documentation is lacking in that language, and I hope that adding more content will create more opportunities for people to get into the purely functional way of programming.

If you agree, I could also translate this good article, with clear credit of course. What do you think about that?

stephen@xemacs, October 19, 2012 at 2:28 AM
By the way, talking about "order of grouping" is a little awkward (composition is generally not commutative). I think if you rephrase to something like "nesting of groups" the flow will be more natural for the reader.

Unknown, November 8, 2015 at 3:07 PM
Great post. I really love math and especially category theory, so seeing it applied to Haskell means I'm immediately going to take another look at the language. Just a question, though. I really want to learn Haskell, but the Rubyist in me just finds it easier to bang out a quick script when I need something. Are there any things I can build with Haskell while I'm learning? I find it helpful to learn by building, so any ideas are welcome. Thanks, and awesome post.

Juan Pablo, November 13, 2015 at 10:51 AM
Awesome post. To me, this explains the monad laws: they arise naturally if you want to define a category where monadic functions compose. Is that fair to say?

Unknown, April 14, 2016 at 8:54 PM
This is truly an awesome post!
In the end you've mentioned a few follow up posts. Can you point me to those? So far I've only found 'Functor Design Pattern'.

DocKlobi, December 28, 2020 at 1:58 AM
Yep, not *bad*. I like the reasoning.
I, myself, usually do something well known from *simple* (additive) algebra.
"(f . g) x = f (g x)" can be translated into "(f . g) = \x -> f (g x)" in the same way that "a + b = c" can be translated to "a = (-b) + c".
All function arguments on the left side of the equation can be made into lambda arguments on the right side. It is *mechanical*. Just follow the (reverse) order.