\gdef\TUnit{\texttt{Unit}} \gdef\fst{\texttt{fst}} \gdef\snd{\texttt{snd}} \gdef\step{\Rightarrow} \gdef\id{\text{id}} \gdef\pair#1#2{\langle {#1}, {#2}\rangle} \gdef\infer#1#2#3{\dfrac{#2}{#3}\texttt{#1}} \gdef\semant#1{\llbracket {#1} \rrbracket}

Introduction

Programs are complicated. How do we develop a better understanding of how they work? Denotational semantics is the subarea of programming languages where people study the behaviors of programs by mapping them into some semantic domain of mathematical objects.

The lambda calculus is one of the simplest programming languages there is. Written in Backus-Naur form (BNF), it only has three kinds of expressions:

e\;:=\;x\;|\;\lambda x.e\;|\;e_1\;e_2

which stands for variable, function abstraction, and function application. Despite its simplicity, the lambda calculus is Turing-complete, and in particular can represent infinite loops. There has been interesting work in giving semantics to the lambda calculus via domain theory, but that’s the topic for another day.

Computational trilogy

We will focus our attention on simply typed lambda calculus(STLC). STLC adds types to the (pure) lambda calculus. As a programming language, STLC is strongly normalizing, meaning that every well-typed term will eventually evaluate to a normal form. That shouldn’t be discouraging, since STLC is closely connected to intuitionistic logic, via the Curry-Howard correspondence. It turns out that logic and type systems are both connected to category theory via the so-called Curry-Howard-Lambek correspondence.

The goal of this post is to explicitly spell out a categorical semantics of STLC. I was unable to find proofs for the following table taken from this nlab page on “computational trilogy”, so I figured it will be a fun exercise to spell them out here.

logic	type theory	category theory
propositions	types	objects
proofs	terms	/
impl-intro	$\lambda$ abstraction	counit from tensor-hom adjunction
impl-elim	application	unit from tensor-hom adjunction
cut elimination	$\beta$ reduction	zigzag identities
identity elimination	$\eta$ conversion	other zigzag

Categorical semantics

Some of the following definitions are adapted from Benjamin Pierce’s Basic Category for Computer Scientists.

The type theory

We define the grammar for types in STLC:

A\;:=\;\TUnit\;|\;A\to B\;|\;A\;\times\;B

And the grammar for terms:

e\;:=\;\texttt{unit}\;|\;\lambda x:\tau.e\;|\;e_1\;e_2\;|\;(e_1,e_2)\;|\;\fst\;e\;|\;\snd\;e

Here, $\TUnit$ is the type with one element $\texttt{unit}$ . we also consider the following typing rules:

\infer{Axiom}{\;}{\Gamma,x:A\vdash x:A} \quad \infer{Weaken}{\Gamma\vdash e:A}{\Gamma,y:B\vdash e:A} \quad \infer{Unit-I}{\;}{\Gamma\vdash\texttt{unit}:\TUnit}

\infer{Arrow-I}{\Gamma, x:A\vdash e:B}{\Gamma\vdash\lambda x:A. e:A\to B} \quad \infer{Arrow-E}{\Gamma\vdash f:A\to B \quad \Gamma\vdash e:A}{\Gamma\vdash f\;e:B}

\infer{Prod-I}{\Gamma\vdash e_1:A \quad \Gamma\vdash e_2:B}{\Gamma\vdash (e_1,e_2):A\times B} \quad \infer{Prod-E1}{\Gamma\vdash e:A\times B}{\Gamma\vdash \fst~e:A} \quad \infer{Prod-E2}{\Gamma\vdash e:A\times B}{\Gamma\vdash \snd~e:B}

Here, the -I and -E refer to introduction rules and elimination rules. Introduction rules describe how to produce terms of a certain type, whereas elimination rules corresponds to how to consume terms of a certain type. We call the syntax associated with them introduction forms and elimination forms. For example, $\lambda x: A.~e$ and $(e_1,e_2)$ are introduction forms, whereas $e_1~e_2$ , or say $\fst~e$ , are elimination forms.

A Cartesian-closed category

For the other piece of the translation, we fix any Cartesian-closed category (CCC) $\mathbf{C}$ . Recall that a Cartesian-closed category is a category with finite products, such that for each object $A$ , the right product functor $(-\times A):\mathbf{C}\to\mathbf{C}$ has a right adjoint $(-^A):\mathbf{C}\to\mathbf{C}$ called the exponential.

The definition of adjunctions come with a unit natural transformation $\epsilon_A:(-^A\times A)\to \text{Id}_\mathbf{C}$ and a counit natural transformation $\eta_A:\text{Id}_\mathbf{C}\to (-\times A)^A$ such that the following “zigzag” equalities hold:

\begin{equation}\mathcal{Id}_{(-\times A)}=\epsilon_A (-\times A)\circ (-\times A)\eta_A\end{equation}

\begin{equation}\mathcal{Id}_{(-^A)}=(-^A)\epsilon_A\circ \eta_A (-^A)\end{equation}

Here $\text{Id}_\mathbf{C}$ is the identity functor on $\mathbf{C}$ , and the caligraphic $\mathcal{Id}$ denotes identity natural transformations on the respective functors.

An alternative formulation of the unit equation is useful as follows. For each object $B\in\mathbf{C}$ , the component $\epsilon_{A,B}$ of the unit $\epsilon_A$ has the universal property such that, for any $C\in\mathbf{C}$ and morphism $f$ from $(\underline{C}\times A)$ (the product functor applied to $C$ ) to $B$ , there exists a unique morphism $curry(f)$ from $C$ to $B^A$ (the exponential functor applied to $B$ ) making the diagram commute:

Test

The translation

We are finally ready to define the semantic translation! The double brackets $\semant{ \;}$ takes something from the type theory and maps it into the CCC we specified. On types, it is defined inductive on the syntax and maps each type to an object in the CCC. Again, the following translations are taken from Pierce’s Basic Category for Computer Scientists:

\begin{align*} \semant{ \TUnit} &\triangleq 1_\mathbf{C}\\ \semant{ A\times B} &\triangleq \semant{ A} \times \semant{ B} \\ \semant{ A\to B} &\triangleq \semant{ B} ^{\semant{ A} } \end{align*}

The translation for the typing context is inductively defined on the number of variables: $\semant{ \varnothing} \triangleq 1_\mathbf{C}$ and $\semant{ \Gamma,x:A} \triangleq \semant{ \Gamma} \times \semant{ A}$ .

Finally, we inductively define the translation on the typing derivations:

Derivation	Translation	Domain	Target
$\semant{ \Gamma\vdash \texttt{unit}:\TUnit}$	$!_{\semant{ \Gamma} }$	$\semant{ \Gamma}$	$1_\mathbf{C}$
$\semant{ \Gamma,x:A\vdash x:A}$	$\pi_2$	$\semant{ \Gamma} \times\semant{ A}$	$\semant{ A}$
$\semant{ \Gamma,y:B\vdash e:A}$	$\semant{ \Gamma\vdash e:A} \circ\pi_1$	$\semant{ \Gamma} \times\semant{ A}$	$\semant{ B}$
$\semant{ \Gamma\vdash \lambda x:A.e:A\to B}$	$curry(\semant{ \Gamma,x:A\vdash e:B} )$	$\semant{ \Gamma}$	$\semant{ B} ^{\semant{ A} }$
$\semant{ \Gamma\vdash e_1\;e_2:B}$	$\epsilon_{A,B}\circ\langle\semant{ \Gamma\vdash e_1:A\to B} ,\semant{ \Gamma\vdash e_2:A} \rangle$	$\semant{ \Gamma}$	$\semant{ B}$
$\semant{ \Gamma\vdash (e_1,e_2):A\times B}$	$\langle\semant{ \Gamma\vdash e_1:A} ,\semant{ \Gamma\vdash e_2:B} \rangle$	$\semant{ \Gamma}$	$\semant{ A} \times\semant{ B}$
$\semant{ \Gamma\vdash \fst\;e:A}$	$\pi_1\circ \semant{ \Gamma\vdash e:A\times B}$	$\semant{ \Gamma}$	$\semant{ A}$
$\semant{ \Gamma\vdash \snd\;e:B}$	$\pi_2\circ \semant{ \Gamma\vdash e:A\times B}$	$\semant{ \Gamma}$	$\semant{ B}$

For this particular formulation of STLC, it just so happens that for any fixed context, if a term is typeable then there is a unique typing rule that applies to it. This can be seen just from the fact that each term grammar branch appears in the conclusion of exactly one typing rule. Thus, we can be a little hand-wavy and define the translations on the term syntax directly.

Semantics respects equivalence

$\beta$ -equivalence

$\beta$ -equivalence is about how to reduce expressions to simpler ones. First define a Call-by-value small-step operational semantics:

\infer{Subst}{}{(\lambda x: A.\;e_1)\;e_2 \step e_1[x\mapsto e_2]} \quad \infer{Func}{e_1 \step e_1'}{e_1\;e_2 \step e_1' e_2} \quad \dots

As we can see, these rules tell us how elimination forms can “eliminate” introduction forms. The $e[x\mapsto e']$ notation denotes (capture-avoiding) substitution of free occurences of $x$ for $e'$ inside $e$ . And we can denote $\beta$ -equivalence, written $\equiv_\beta$ , as the reflexive-symmetric-transitive closure of this relation. However, proving that beta equivalence plays well with the semantics is now try because substitution is defined inductively on the syntax!

We will prove the following helpful lemma: If $\Gamma\vdash \lambda x: A.\;e_1$ and $\Gamma\vdash e_2 : A$ , then

\semant{\Gamma\vdash e_2[x \mapsto e_1] : B}=\semant{\Gamma,x: A\vdash e_1: B}\circ \pair{\id_{\semant{\Gamma}}}{\semant{\Gamma\vdash e_1: A}}

Proof: Induction on the typing derivation of $B$ .

$\eta$ -equivalence

The $\eta$ -equivalence for function types says that for whatever types $A,B$ , a function $h:A\to B$ should be considered “the same” as $\lambda x:A.~(h~x)$ . Indeed, the latter seems to be just ” $h$ with extra steps”. It is an instance of a more general idea called contextual equivalence, or even more abstractly, extensional equivalence. In short, it means that things are identified by “extension”, i.e. how they act from the outside perspective.

One might notice that $\lambda x:A. (h\;x)$ is an introduction form immediately wrapped around an elimination form, both for arrow types. In general, $\eta$ -equivalences almost all look like this.

$\eta$ -equivalence for product types

This amounts to claiming that $\semant{\Gamma\vdash (\fst~e, \snd~e): A\times B}=\semant{\Gamma \vdash e:A\times B}$ , which follows immediately from the universal property of the product:

Product

$\eta$ -equivalence for arrow types

Suppose $\Gamma\vdash \lambda x:A.e\;x:A\to B$ . Then $e$ must be of type $A\to B$ , but it doesn’t have to be in normal form. The semantics of the typing derivations defined earlier tells us

\begin{align*} &\semant{ \Gamma\vdash \lambda x:A.e\;x:A\to B} \\ =&curry(\semant{ \Gamma,x:A \vdash e\;x:B} )\\ =&curry(\epsilon_{A,B}\circ \langle\semant{ \Gamma,x:A\vdash e:A\to B} ,\semant{ \Gamma,x:A\vdash x:A} \rangle)\\ =&curry(\epsilon_{A,B}\circ \langle\semant{ \Gamma,x:A\vdash e:A\to B} ,\pi_2\rangle)\\ =&curry(\epsilon_{A,B}\circ \langle\semant{ \Gamma\vdash e:A\to B} \circ\pi_1,\pi_2\rangle) \end{align*}

Tho a mouthful, $\epsilon_{A,B}\circ\langle\semant{ \Gamma\vdash e:A\to B} \circ\pi_1,\pi_2\rangle$ is a morphism from $\semant{ \Gamma} \times \semant{ A}$ to $\semant{ B}$ . If we set $C$ to be $\semant{ \Gamma}$ , and $f$ as $\epsilon_{A,B}\circ\langle\semant{ \Gamma\vdash e:A\to B} \circ\pi_1,\pi_2\rangle$ , then the diagram for the unit equation becomes

Unit

Where did the mystery $g$ come from? Well, since $f$ is of the form $\epsilon_{A,B}$ composed with something, we can just define $g$ to be the something, aka $\langle\semant{ \Gamma\vdash e:A\to B} \circ\pi_1,\pi_2\rangle$ , and the above diagram will trivially commute. But, note $g$ is actually equal to $\semant{ \Gamma\vdash e:A\to B} \times \text{id}_{\semant{ A} }$ .

Finally, for the real kicker, we invoke the uniqueness of $curry$ in the universal property of $\epsilon_A$ , so that $\semant{ \Gamma\vdash e:A\to B} =curry(\epsilon_{A,B}\circ \langle\semant{ \Gamma\vdash e:A\to B} \circ\pi_1,\pi_2\rangle)$ . Thus,

\semant{ \Gamma\vdash \lambda x:A.e\;x:A\to B} =\semant{ \Gamma\vdash e:A\to B}

Conclusion

This was mostly just an exercise for me to make sure that the categorical semantics of STLC is sensible. Indeed, we have verified parts of the analogy from the nlab “computational trilogy” table, that the semantic respects $\beta$ and $\eta$ equivalence. Actually, the proof came down exactly to the unit and counit equations, just as in the table.

Note that our version of STLC doesn’t contain any “ground types” besides $\TUnit$ . One can simply add more ground types like $\texttt{Int}, \texttt{Bool}, \texttt{Number}$ , etc. The translation will then be incorporated to map each ground type to some designated object in our CCC, and the proof still works because it was defined inductively on the typing derivations. However, in this way the categorical semantics will not be able to enforce fine-grained equational constraints like 40 + 2 = 42 on the operational semantics.

References

B. C. Pierce, Basic Category Theory for Computer Scientists. The MIT Press, 1991. doi: 10.7551/mitpress/1524.001.0001.
S. Mac Lane, Categories for the Working Mathematician, vol. 5. in Graduate Texts in Mathematics, vol. 5. New York, NY: Springer, 1978. doi: 10.1007/978-1-4757-4721-8.
https://ncatlab.org/nlab/show/computational+trilogy
https://hustmphrrr.github.io/asset/pdf/comp-exam.pdf

Categorical Semantics of Simply Typed Lambda Calculus