Type Class Derivation
Type class derivation is a way to automatically generate given instances for type classes which satisfy some simple conditions. A type class in this sense is any trait or class with a single type parameter determining the type being operated on, and the special case CanEqual
. Common examples are Eq
, Ordering
, or Show
. For example, given the following Tree
algebraic data type (ADT):
enum Tree[T] derives Eq, Ordering, Show:
case Branch(left: Tree[T], right: Tree[T])
case Leaf(elem: T)
The derives
clause generates the following given instances for the Eq
, Ordering
and Show
type classes in the companion object of Tree
:
given [T: Eq] => Eq[Tree[T]] = Eq.derived
given [T: Ordering] => Ordering[Tree[T]] = Ordering.derived
given [T: Show] => Show[Tree[T]] = Show.derived
We say that Tree
is the deriving type and that the Eq
, Ordering
and Show
instances are derived instances.
Note: derived
can be used manually, this is useful when you do not have control over the definition. For example we can implement Ordering
for Option
s like so:
given [T: Ordering] => Ordering[Option[T]] = Ordering.derived
It is discouraged to directly refer to the derived
member if you can use a derives
clause instead.
All data types can have a derives
clause. This document focuses primarily on data types which also have a given instance of the Mirror
type class available.
Exact mechanism
In the following, when type arguments are enumerated and the first index evaluates to a larger value than the last, then there are actually no arguments, for example: A[T_2, ..., T_1]
means A
.
For a class/trait/object/enum DerivingType[T_1, ..., T_N] derives TC
, a derived instance is created in DerivingType
's companion object (or DerivingType
itself if it is an object).
The general "shape" of the derived instance is as follows:
given [...] => (...) => TC[ ... DerivingType[...] ... ] = TC.derived
TC.derived
should be an expression that conforms to the expected type on the left, potentially elaborated using term and/or type inference.
Note: TC.derived
is a normal access, therefore if there are multiple definitions of TC.derived
, overloading resolution applies.
What the derived instance precisely looks like depends on the specifics of DerivingType
and TC
, we first examine TC
:
TC
takes 1 parameter F
Therefore TC
is defined as TC[F[A_1, ..., A_K]]
(TC[F]
if K == 0
) for some F
. There are two further cases depending on the kinds of arguments:
F
and all arguments of DerivingType
have kind *
Note: K == 0
in this case.
The generated instance is then:
given [T_1: TC, ..., T_N: TC] => TC[DerivingType[T_1, ..., T_N]] = TC.derived
This is the most common case, and is the one that was highlighted in the introduction.
Note: The [T_i: TC, ...]
introduces a (using TC[T_i], ...)
, more information in Context Bounds. This allows the derived
member to access these evidences.
Note: If N == 0
the above means:
given TC[DerivingType] = TC.derived
For example, the class
case class Point(x: Int, y: Int) derives Ordering
generates the instance
object Point:
...
given Ordering[Point] = Ordering.derived
F
and DerivingType
have parameters of matching kind on the right
This section covers cases where you can pair arguments of F
and DerivingType
starting from the right such that they have the same kinds pairwise, and all arguments of F
or DerivingType
(or both) are used up. F
must also have at least one parameter.
The general shape will then be:
given [...] => TC[ [...] =>> DerivingType[...] ] = TC.derived
Where of course TC
and DerivingType
are applied to types of the correct kind.
To make this work, we split it into 3 cases:
If F
and DerivingType
take the same number of arguments (N == K
):
given TC[DerivingType] = TC.derived
// simplified form of:
given TC[ [A_1, ..., A_K] =>> DerivingType[A_1, ..., A_K] ] = TC.derived
If DerivingType
takes fewer arguments than F
(N < K
), we use only the rightmost parameters from the type lambda:
given TC[ [A_1, ..., A_K] =>> DerivingType[A_(K-N+1), ..., A_K] ] = TC.derived
// if DerivingType takes no arguments (N == 0), the above simplifies to:
given TC[ [A_1, ..., A_K] =>> DerivingType ] = TC.derived
If F
takes fewer arguments than DerivingType
(K < N
), we fill in the remaining leftmost slots with type parameters of the given:
given [T_1, ... T_(N-K)] => TC[[A_1, ..., A_K] =>> DerivingType[T_1, ... T_(N-K), A_1, ..., A_K]] = TC.derived
TC
is the CanEqual
type class
We have therefore: DerivingType[T_1, ..., T_N] derives CanEqual
.
Let U_1
, ..., U_M
be the parameters of DerivingType
of kind *
. (These are a subset of the T_i
s)
The generated instance is then:
given [T_1L, T_1R, ..., T_NL, T_NR] // every parameter of DerivingType twice
(using CanEqual[U_1L, U_1R], ..., CanEqual[U_ML, U_MR]): // only parameters of DerivingType with kind *
CanEqual[DerivingType[T_1L, ..., T_NL], DerivingType[T_1R, ..., T_NR]] = // again, every parameter
CanEqual.derived
The bounds of T_i
s are handled correctly, for example: T_2 <: T_1
becomes T_2L <: T_1L
.
For example, the class
class MyClass[A, G[_]](a: A, b: G[B]) derives CanEqual
generates the following given instance:
object MyClass:
...
given [A_L, A_R, G_L[_], G_R[_]] => CanEqual[A_L, A_R] => CanEqual[MyClass[A_L, G_L], MyClass[A_R, G_R]] = CanEqual.derived
TC
is not valid for automatic derivation
Throw an error.
The exact error depends on which of the above conditions failed. As an example, if TC
takes more than 1 parameter and is not CanEqual
, the error is DerivingType cannot be unified with the type argument of TC
.
All data types can have a derives
clause. The rest of this document focuses primarily on data types which also have a given instance of the Mirror
type class available.
Mirror
scala.deriving.Mirror
type class instances provide information at the type level about the components and labelling of the type. They also provide minimal term-level infrastructure to allow higher-level libraries to provide comprehensive derivation support.
Instances of the Mirror
type class are generated automatically by the compiler unconditionally for:
- enums and enum cases,
- case objects.
Instances for Mirror
are also generated conditionally for:
- case classes where the constructor is visible at the callsite (always true if the companion is not a case object)
- sealed classes and sealed traits where:
- there exists at least one child case,
- each child case is reachable from the parent's definition,
- if the sealed trait/class has no companion, then each child case is reachable from the callsite through the prefix of the type being mirrored,
- and where the compiler can generate a
Mirror
type class instance for each child case.
The scala.deriving.Mirror
type class definition is as follows:
sealed trait Mirror:
/** the type being mirrored */
type MirroredType
/** the type of the elements of the mirrored type */
type MirroredElemTypes
/** The mirrored *-type */
type MirroredMonoType
/** The name of the type */
type MirroredLabel <: String
/** The names of the elements of the type */
type MirroredElemLabels <: Tuple
object Mirror:
/** The Mirror for a product type */
trait Product extends Mirror:
/** Create a new instance of type `T` with elements
* taken from product `p`.
*/
def fromProduct(p: scala.Product): MirroredMonoType
trait Sum extends Mirror:
/** The ordinal number of the case class of `x`.
* For enums, `ordinal(x) == x.ordinal`
*/
def ordinal(x: MirroredMonoType): Int
end Mirror
Product types (i.e. case classes and objects, and enum cases) have mirrors which are subtypes of Mirror.Product
. Sum types (i.e. sealed class or traits with product children, and enums) have mirrors which are subtypes of Mirror.Sum
.
For the Tree
ADT from above the following Mirror
instances will be automatically provided by the compiler,
// Mirror for Tree
new Mirror.Sum:
type MirroredType = Tree
type MirroredElemTypes[T] = (Branch[T], Leaf[T])
type MirroredMonoType = Tree[_]
type MirroredLabel = "Tree"
type MirroredElemLabels = ("Branch", "Leaf")
def ordinal(x: MirroredMonoType): Int = x match
case _: Branch[_] => 0
case _: Leaf[_] => 1
// Mirror for Branch
new Mirror.Product:
type MirroredType = Branch
type MirroredElemTypes[T] = (Tree[T], Tree[T])
type MirroredMonoType = Branch[_]
type MirroredLabel = "Branch"
type MirroredElemLabels = ("left", "right")
def fromProduct(p: Product): MirroredMonoType =
new Branch(...)
// Mirror for Leaf
new Mirror.Product:
type MirroredType = Leaf
type MirroredElemTypes[T] = Tuple1[T]
type MirroredMonoType = Leaf[_]
type MirroredLabel = "Leaf"
type MirroredElemLabels = Tuple1["elem"]
def fromProduct(p: Product): MirroredMonoType =
new Leaf(...)
If a Mirror cannot be generated automatically for a given type, an error will appear explaining why it is neither a supported sum type nor a product type. For example, if A
is a trait that is not sealed,
No given instance of type deriving.Mirror.Of[A] was found for parameter x of method summon in object Predef. Failed to synthesize an instance of type deriving.Mirror.Of[A]:
* trait A is not a generic product because it is not a case class
* trait A is not a generic sum because it is not a sealed trait
Note the following properties of Mirror
types,
- Properties are encoded using types rather than terms. This means that they have no runtime footprint unless used and also that they are a compile-time feature for use with Scala 3's metaprogramming facilities.
- There is no restriction against the mirrored type being a local or inner class.
- The kinds of
MirroredType
andMirroredElemTypes
match the kind of the data type the mirror is an instance for. This allowsMirror
s to support ADTs of all kinds. - There is no distinct representation type for sums or products (ie. there is no
HList
orCoproduct
type as in Scala 2 versions of Shapeless). Instead the collection of child types of a data type is represented by an ordinary, possibly parameterized, tuple type. Scala 3's metaprogramming facilities can be used to work with these tuple types as-is, and higher-level libraries can be built on top of them. - For both product and sum types, the elements of
MirroredElemTypes
are arranged in definition order (i.e.Branch[T]
precedesLeaf[T]
inMirroredElemTypes
forTree
becauseBranch
is defined beforeLeaf
in the source file). This means thatMirror.Sum
differs in this respect from Shapeless's generic representation for ADTs in Scala 2, where the constructors are ordered alphabetically by name. - The methods
ordinal
andfromProduct
are defined in terms ofMirroredMonoType
which is the type of kind-*
which is obtained fromMirroredType
by wildcarding its type parameters.
Implementing derived
with Mirror
As seen before, the signature and implementation of a derived
method for a type class TC[_]
are arbitrary, but we expect it to typically be of the following form:
import scala.deriving.Mirror
inline def derived[T](using Mirror.Of[T]): TC[T] = ...
That is, the derived
method takes a context parameter of (some subtype of) type Mirror
which defines the shape of the deriving type T
, and computes the type class implementation according to that shape. This is all that the provider of an ADT with a derives
clause has to know about the derivation of a type class instance.
Note that derived
methods may have context Mirror
parameters indirectly (e.g. by having a context argument which in turn has a context Mirror
parameter, or not at all (e.g. they might use some completely different user-provided mechanism, for instance using Scala 3 macros or runtime reflection). We expect that (direct or indirect) Mirror
based implementations will be the most common and that is what this document emphasises.
Type class authors will most likely use higher-level derivation or generic programming libraries to implement derived
methods. An example of how a derived
method might be implemented using only the low-level facilities described above and Scala 3's general metaprogramming features is provided below. It is not anticipated that type class authors would normally implement a derived
method in this way, however this walkthrough can be taken as a guide for authors of the higher-level derivation libraries that we expect typical type class authors will use (for a fully worked out example of such a library, see Shapeless 3).
How to write a type class derived
method using low-level mechanisms
The low-level technique we will use to implement a type class derived
method in this example exploits three new type-level constructs in Scala 3: inline methods, inline matches, and implicit searches via summonInline
or summonFrom
. Given this definition of the Eq
type class,
trait Eq[T]:
def eqv(x: T, y: T): Boolean
we need to implement a method Eq.derived
on the companion object of Eq
that produces a given instance for Eq[T]
given a Mirror[T]
. Here is a possible implementation,
import scala.deriving.Mirror
inline def derived[T](using m: Mirror.Of[T]): Eq[T] =
lazy val elemInstances = summonInstances[T, m.MirroredElemTypes] // (1)
inline m match // (2)
case s: Mirror.SumOf[T] => eqSum(s, elemInstances)
case p: Mirror.ProductOf[T] => eqProduct(p, elemInstances)
Note that derived
is defined as an inline def
. This means that the method will be inlined at all call sites (for instance the compiler-generated instance definitions in the companion objects of ADTs which have a deriving Eq
clause).
Inlining of complex code is potentially expensive if overused (meaning slower compile times) so we should be careful to limit how many times
derived
is called for the same type. For example, when computing an instance for a sum type, it may be necessary to callderived
recursively to compute an instance for each one of its child cases. That child case may in turn be a product type, that declares a field referring back to the parent sum type. To compute the instance for this field, we should not callderived
recursively, but instead summon from the context. Typically, the found given instance will be the root given instance that initially calledderived
.
The body of derived
(1) first materializes the Eq
instances for all the child types of type the instance is being derived for. This is either all the branches of a sum type or all the fields of a product type. The implementation of summonInstances
is inline
and uses Scala 3's summonInline
construct to collect the instances as a List
,
inline def summonInstances[T, Elems <: Tuple]: List[Eq[?]] =
inline erasedValue[Elems] match
case _: (elem *: elems) => deriveOrSummon[T, elem] :: summonInstances[T, elems]
case _: EmptyTuple => Nil
inline def deriveOrSummon[T, Elem]: Eq[Elem] =
inline erasedValue[Elem] match
case _: T => deriveRec[T, Elem]
case _ => summonInline[Eq[Elem]]
inline def deriveRec[T, Elem]: Eq[Elem] =
inline erasedValue[T] match
case _: Elem => error("infinite recursive derivation")
case _ => Eq.derived[Elem](using summonInline[Mirror.Of[Elem]]) // recursive derivation
with the instances for children in hand the derived
method uses an inline match
to dispatch to methods which can construct instances for either sums or products (2). Note that because derived
is inline
the match will be resolved at compile-time and only the right-hand side of the matching case will be inlined into the generated code with types refined as revealed by the match.
In the sum case, eqSum
, we use the runtime ordinal
values of the arguments to eqv
to first check if the two values are of the same subtype of the ADT (3) and then, if they are, to further test for equality based on the Eq
instance for the appropriate ADT subtype using the auxiliary method check
(4).
import scala.deriving.Mirror
def eqSum[T](s: Mirror.SumOf[T], elems: => List[Eq[?]]): Eq[T] =
new Eq[T]:
def eqv(x: T, y: T): Boolean =
val ordx = s.ordinal(x) // (3)
(s.ordinal(y) == ordx) && check(x, y, elems(ordx)) // (4)
In the product case, eqProduct
, we test the runtime values of the arguments to eqv
for equality as products based on the Eq
instances for the fields of the data type (5),
import scala.deriving.Mirror
def eqProduct[T](p: Mirror.ProductOf[T], elems: => List[Eq[?]]): Eq[T] =
new Eq[T]:
def eqv(x: T, y: T): Boolean =
iterable(x).lazyZip(iterable(y)).lazyZip(elems).forall(check)
Both eqSum
and eqProduct
have a by-name parameter elems
, because the argument passed is the reference to the lazy elemInstances
value.
Pulling this all together we have the following complete implementation,
import scala.collection.AbstractIterable
import scala.compiletime.{erasedValue, error, summonInline}
import scala.deriving.*
inline def summonInstances[T, Elems <: Tuple]: List[Eq[?]] =
inline erasedValue[Elems] match
case _: (elem *: elems) => deriveOrSummon[T, elem] :: summonInstances[T, elems]
case _: EmptyTuple => Nil
inline def deriveOrSummon[T, Elem]: Eq[Elem] =
inline erasedValue[Elem] match
case _: T => deriveRec[T, Elem]
case _ => summonInline[Eq[Elem]]
inline def deriveRec[T, Elem]: Eq[Elem] =
inline erasedValue[T] match
case _: Elem => error("infinite recursive derivation")
case _ => Eq.derived[Elem](using summonInline[Mirror.Of[Elem]]) // recursive derivation
trait Eq[T]:
def eqv(x: T, y: T): Boolean
object Eq:
given Eq[Int]:
def eqv(x: Int, y: Int) = x == y
def check(x: Any, y: Any, elem: Eq[?]): Boolean =
elem.asInstanceOf[Eq[Any]].eqv(x, y)
def iterable[T](p: T): Iterable[Any] = new AbstractIterable[Any]:
def iterator: Iterator[Any] = p.asInstanceOf[Product].productIterator
def eqSum[T](s: Mirror.SumOf[T], elems: => List[Eq[?]]): Eq[T] =
new Eq[T]:
def eqv(x: T, y: T): Boolean =
val ordx = s.ordinal(x)
(s.ordinal(y) == ordx) && check(x, y, elems(ordx))
def eqProduct[T](p: Mirror.ProductOf[T], elems: => List[Eq[?]]): Eq[T] =
new Eq[T]:
def eqv(x: T, y: T): Boolean =
iterable(x).lazyZip(iterable(y)).lazyZip(elems).forall(check)
inline def derived[T](using m: Mirror.Of[T]): Eq[T] =
lazy val elemInstances = summonInstances[T, m.MirroredElemTypes]
inline m match
case s: Mirror.SumOf[T] => eqSum(s, elemInstances)
case p: Mirror.ProductOf[T] => eqProduct(p, elemInstances)
end Eq
we can test this relative to a simple ADT like so,
enum Lst[+T] derives Eq:
case Cns(t: T, ts: Lst[T])
case Nl
extension [T](t: T) def ::(ts: Lst[T]): Lst[T] = Lst.Cns(t, ts)
@main def test(): Unit =
import Lst.*
val eqoi = summon[Eq[Lst[Int]]]
assert(eqoi.eqv(23 :: 47 :: Nl, 23 :: 47 :: Nl))
assert(!eqoi.eqv(23 :: Nl, 7 :: Nl))
assert(!eqoi.eqv(23 :: Nl, Nl))
In this case the code that is generated by the inline expansion for the derived Eq
instance for Lst
looks like the following, after a little polishing,
given derived$Eq[T] => (eqT: Eq[T]) => Eq[Lst[T]] =
eqSum(summon[Mirror.Of[Lst[T]]], {/* cached lazily */
List(
eqProduct(summon[Mirror.Of[Cns[T]]], {/* cached lazily */
List(summon[Eq[T]], summon[Eq[Lst[T]]])
}),
eqProduct(summon[Mirror.Of[Nl.type]], {/* cached lazily */
Nil
})
)
})
The lazy
modifier on elemInstances
is necessary for preventing infinite recursion in the derived instance for recursive types such as Lst
.
Alternative approaches can be taken to the way that derived
methods can be defined. For example, more aggressively inlined variants using Scala 3 macros, whilst being more involved for type class authors to write than the example above, can produce code for type classes like Eq
which eliminate all the abstraction artefacts (eg. the Lists
of child instances in the above) and generate code which is indistinguishable from what a programmer might write by hand. As a third example, using a higher-level library such as Shapeless, the type class author could define an equivalent derived
method as,
given eqSum: [A] => (inst: => K0.CoproductInstances[Eq, A]) => Eq[A]:
def eqv(x: A, y: A): Boolean = inst.fold2(x, y)(false)(
[t] => (eqt: Eq[t], t0: t, t1: t) => eqt.eqv(t0, t1)
)
given eqProduct: [A] => (inst: => K0.ProductInstances[Eq, A]) => Eq[A]:
def eqv(x: A, y: A): Boolean = inst.foldLeft2(x, y)(true: Boolean)(
[t] => (acc: Boolean, eqt: Eq[t], t0: t, t1: t) =>
Complete(!eqt.eqv(t0, t1))(false)(true)
)
inline def derived[A](using gen: K0.Generic[A]): Eq[A] =
gen.derive(eqProduct, eqSum)
The framework described here enables all three of these approaches without mandating any of them.
For a brief discussion on how to use macros to write a type class derived
method, please read more at How to write a type class derived
method using macros.
Syntax
Template ::= InheritClauses [TemplateBody]
EnumDef ::= id ClassConstr InheritClauses EnumBody
InheritClauses ::= [‘extends’ ConstrApps] [‘derives’ QualId {‘,’ QualId}]
ConstrApps ::= ConstrApp {‘with’ ConstrApp}
| ConstrApp {‘,’ ConstrApp}
Note: To align extends
clauses and derives
clauses, Scala 3 also allows multiple extended types to be separated by commas. So the following is now legal:
class A extends B, C { ... }
It is equivalent to the old form
class A extends B with C { ... }
Discussion
This type class derivation framework is intentionally very small and low-level. There are essentially two pieces of infrastructure in compiler-generated Mirror
instances,
- type members encoding properties of the mirrored types.
- a minimal value-level mechanism for working generically with terms of the mirrored types.
The Mirror
infrastructure can be seen as an extension of the existing Product
infrastructure for case classes: typically, Mirror
types will be implemented by the ADTs companion object, hence the type members and the ordinal
or fromProduct
methods will be members of that object. The primary motivation for this design decision, and the decision to encode properties via types rather than terms was to keep the bytecode and runtime footprint of the feature small enough to make it possible to provide Mirror
instances unconditionally.
Whilst Mirrors
encode properties precisely via type members, the value-level ordinal
and fromProduct
are somewhat weakly typed (because they are defined in terms of MirroredMonoType
) just like the members of Product
. This means that code for generic type classes has to ensure that type exploration and value selection proceed in lockstep and it has to assert this conformance in some places using casts. If generic type classes are correctly written these casts will never fail.
As mentioned, however, the compiler-provided mechanism is intentionally very low-level and it is anticipated that higher-level type class derivation and generic programming libraries will build on this and Scala 3's other metaprogramming facilities to hide these low-level details from type class authors and general users. Type class derivation in the style of both Shapeless and Magnolia are possible (a prototype of Shapeless 3, which combines aspects of both Shapeless 2 and Magnolia has been developed alongside this language feature) as is a more aggressively inlined style, supported by Scala 3's new quote/splice macro and inlining facilities.