diff options
author | Richard Eisenberg <rae@richarde.dev> | 2021-04-05 17:40:29 -0400 |
---|---|---|
committer | Ryan Scott <rscott@galois.com> | 2021-04-13 10:13:00 -0400 |
commit | 5968b7ff5f209875834a1c51b48ea88657693417 (patch) | |
tree | 03a6278fc5b2044edad6d69ff2d5f70be402d7af | |
parent | 1f2681a91f2076765a884e5d3d5dcf92c54c0f33 (diff) | |
download | haskell-5968b7ff5f209875834a1c51b48ea88657693417.tar.gz |
Clarify commentary around the constraint solver
No changes to code; no changes to theory. Just better
explanation.
-rw-r--r-- | compiler/GHC/Tc/Solver/Monad.hs | 128 | ||||
-rw-r--r-- | compiler/GHC/Tc/Solver/Rewrite.hs | 3 | ||||
-rw-r--r-- | compiler/GHC/Tc/Utils/TcType.hs | 5 |
3 files changed, 104 insertions, 32 deletions
diff --git a/compiler/GHC/Tc/Solver/Monad.hs b/compiler/GHC/Tc/Solver/Monad.hs index e7e58f6599..6e432ec580 100644 --- a/compiler/GHC/Tc/Solver/Monad.hs +++ b/compiler/GHC/Tc/Solver/Monad.hs @@ -943,8 +943,8 @@ Notation: repeated application. S^0(f,t) = t S^(n+1)(f,t) = S(f, S^n(t)) -Definition: inert generalised substitution -A generalised substitution S is "inert" iff +Definition: terminating generalised substitution +A generalised substitution S is *terminating* iff (IG1) there is an n such that for every f,t, S^n(f,t) = S^(n+1)(f,t) @@ -952,13 +952,13 @@ A generalised substitution S is "inert" iff By (IG1) we define S*(f,t) to be the result of exahaustively applying S(f,_) to t. ----------------------------------------------------------------- +----------------------------------------------------------------------------- Our main invariant: - the inert CEqCans should be an inert generalised substitution ----------------------------------------------------------------- + the CEqCans in inert_eqs should be a terminating generalised substitution +----------------------------------------------------------------------------- -Note that inertness is not the same as idempotence. To apply S to a -type, you may have to apply it recursively. But inertness does +Note that termination is not the same as idempotence. To apply S to a +type, you may have to apply it recursively. But termination does guarantee that this recursive use will terminate. Note [Why R2?] @@ -974,21 +974,76 @@ the two equalities cannot rewrite one another. R2 actually restricts our ability to accept user-written programs. See Note [Deriveds do rewrite Deriveds] in GHC.Tc.Types.Constraint for an example. +Note [Rewritable] +~~~~~~~~~~~~~~~~~ +This Note defines what it means for a type variable or type family application +(that is, a CanEqLHS) to be rewritable in a type. This definition is used +by the anyRewritableXXX family of functions and is meant to model the actual +behaviour in GHC.Tc.Solver.Rewrite. + +Ignoring roles (for now): A CanEqLHS lhs is *rewritable* in a type t if the +lhs tree appears as a subtree within t without traversing any of the following +components of t: + * coercions (whether they appear in casts CastTy or as arguments CoercionTy) + * kinds of variables +The check for rewritability *does* look in kinds of the bound variable of a +ForAllTy. + +Goal: If lhs is not rewritable in t, then t is a fixpoint of the generalised +substitution containing lhs -f*-> t', where f* is a flavour such that f* >= f +for all f. + +The reason for this definition is that the rewriter does not rewrite in coercions +or variables' kinds. In turn, the rewriter does not need to rewrite there because +those places are never used for controlling the behaviour of the solver: these +places are not used in matching instances or in decomposing equalities. + +There is one exception: we sometimes do an occurs-check to decide e.g. how to +orient an equality. (See the comments on GHC.Tc.Solver.Canonical.canEqTyVarFunEq.) +Accordingly, the presence of a variable in a kind or coercion just might influence +the solver. Here is an example: + + type family Const x y where + Const x y = x + + AxConst :: forall x y. Const x y ~# x + + alpha :: Const Type Nat + [W] alpha ~ Int |> (sym (AxConst Type alpha) ;; + AxConst Type alpha ;; + sym (AxConst Type Nat)) + +The cast is clearly ludicrous, but we can't quite rule it out. (See (EQ1) from +Note [Respecting definitional equality] in GHC.Core.TyCo.Rep to see why we need +the Const Type Nat bit.) And yet this cast will (quite rightly) prevent alpha +from unifying with the RHS. I (Richard E) don't have an example of where this +problem can arise from a Haskell program, but we don't have an air-tight argument +for why the definition of *rewritable* given here is correct. + +Taking roles into account: we must consider a rewrite at a given role. That is, +a rewrite arises from some equality, and that equality has a role associated +with it. As we traverse a type, we track what role we are allowed to rewrite with. + +For example, suppose we have an inert [G] b ~R# Int. Then b is rewritable in +Maybe b but not in F b, where F is a type function. This role-aware logic is +present in both the anyRewritableXXX functions and in the rewriter. +See also Note [anyRewritableTyVar must be role-aware] in GHC.Tc.Utils.TcType. + Note [Extending the inert equalities] ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Main Theorem [Stability under extension] Suppose we have a "work item" lhs -fw-> t - and an inert generalised substitution S, + and a terminating generalised substitution S, THEN the extended substitution T = S+(lhs -fw-> t) - is an inert generalised substitution + is a terminating generalised substitution PROVIDED (T1) S(fw,lhs) = lhs -- LHS of work-item is a fixpoint of S(fw,_) (T2) S(fw,t) = t -- RHS of work-item is a fixpoint of S(fw,_) (T3) lhs not in t -- No occurs check in the work item -- If lhs is a type family application, we require only that - -- lhs is not *rewritable* in t. See Note [CEqCan occurs check] - -- in GHC.Tc.Types.Constraint + -- lhs is not *rewritable* in t. See Note [Rewritable] and + -- Note [CEqCan occurs check] in GHC.Tc.Types.Constraint. AND, for every (lhs1 -fs-> s) in S: (K0) not (fw >= fs) @@ -998,12 +1053,13 @@ Main Theorem [Stability under extension] so the kick-out achieved nothing -- From here, we can assume fw >= fs - OR { (K1) lhs is not rewritable in lhs1. That is, lhs does not occur - in lhs1 (except perhaps in a cast or coercion). + OR (K4) lhs1 is a tyvar AND fs >= fw + + OR { (K1) lhs is not rewritable in lhs1. See Note [Rewritable]. Reason: if fw >= fs, WF1 says we can't have both lhs0 -fw-> t and F lhs0 -fs-> s - AND (K2): guarantees inertness of the new substitution + AND (K2): guarantees termination of the new substitution { (K2a) not (fs >= fs) OR (K2b) lhs not in s } @@ -1012,7 +1068,6 @@ Main Theorem [Stability under extension] (K3b) If the role of fs is representational: s is not of form (lhs t1 .. tn) } } - OR (K4) lhs1 is a tyvar AND fs >= fw Conditions (T1-T3) are established by the canonicaliser Conditions (K1-K3) are established by GHC.Tc.Solver.Monad.kickOutRewritable @@ -1057,9 +1112,9 @@ The idea is that - (K1) plus (L1) guarantee that the extended substitution satisfies (WF1). - (T3) guarantees (WF2). -* (K2) is about inertness. Intuitively, any infinite chain S^0(f,t), +* (K2) and (K4) are about termination. Intuitively, any infinite chain S^0(f,t), S^1(f,t), S^2(f,t).... must pass through the new work item infinitely - often, since the substitution without the work item is inert; and must + often, since the substitution without the work item is terminating; and must pass through at least one of the triples in S infinitely often. - (K2a): if not(fs>=fs) then there is no f that fs can rewrite (fs>=f) @@ -1071,7 +1126,7 @@ The idea is that - (K2b): if lhs not in s, we have no further opportunity to apply the work item - NB: Dimitrios has a PDF that does this in more detail + - See Note [K4] about (K4) * Lemma (L3). Suppose we have f* such that, for all f, f* >= f. Then if we are adding lhs -fw-> t (where T1, T2, and T3 hold), we will keep a -f*-> s. @@ -1090,23 +1145,31 @@ Here is the scenario: * We are considering adding (lhs -fw-> t) to the inert set S. * S already has (lhs1 -fs-> s). -* We know S(fw, lhs) = lhs, S(fw, t) = t, and lhs is not rewritable in t. These - are (T1), (T2), and (T3). +* We know S(fw, lhs) = lhs, S(fw, t) = t, and lhs is not rewritable in t. + See Note [Rewritable]. These are (T1), (T2), and (T3). * We further know fw >= fs. (If not, then we short-circuit via (K0).) K4 says that we may keep lhs1 -fs-> s in S if: lhs1 is a tyvar AND fs >= fw -Let's consider the two possible shapes of lhs1 (which, recall, is a CanEqLHS: -either a tyvar or an exactly-saturated type family application). - -+ lhs1 is a tyvar a: +Why K4 guarantees termination: * If fs >= fw, we know a is not rewritable in t, because of (T2). * We further know lhs /= a, because of (T1). * Accordingly, a use of the new inert item lhs -fw-> t cannot create the conditions - for a use of a -fs-> s (precisely because t does not mention a). + for a use of a -fs-> s (precisely because t does not mention a), and hence, + the extended substitution (with lhs -fw-> t in it) is a terminating + generalised substitution. - * Now, suppose not (fs >= fw). It might be the case that t mentions a, and this +Recall that the termination generalised substitution includes only mappings that +pass an occurs check. This is (T3). At one point, we worried that the +argument here would fail if s mentioned a, but (T3) rules out this possibility. +Put another way: the terminating generalised substitution considers only the inert_eqs, +not other parts of the inert set (such as the irreds). + +Can we liberalise K4? No. + +Why we cannot drop the (fs >= fw) condition: + * Suppose not (fs >= fw). It might be the case that t mentions a, and this can cause a loop. Example: Work: [G] b ~ a @@ -1118,7 +1181,7 @@ either a tyvar or an exactly-saturated type family application). * Note that the above example is different if the inert is a Given G, because (T1) won't hold. -+ lhs1 is a type family application F tys: +Why we cannot drop the tyvar condition: * Presume fs >= fw. Thus, F tys is not rewritable in t, because of (T2). * Can the use of lhs -fw-> t create the conditions for a use of F tys -fs-> s? Yes! This can happen if t appears within tys. @@ -1157,9 +1220,9 @@ a top-level K condition. Note [K3: completeness of solving] ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ (K3) is not necessary for the extended substitution -to be inert. In fact K1 could be made stronger by saying +to be terminating. In fact K1 could be made stronger by saying ... then (not (fw >= fs) or not (fs >= fs)) -But it's not enough for S to be inert; we also want completeness. +But it's not enough for S to be terminating; we also want completeness. That is, we want to be able to solve all soluble wanted equalities. Suppose we have @@ -1993,19 +2056,20 @@ kick_out_rewritable new_fr new_lhs | not (fr_may_rewrite fs) = False -- (K0) Keep it in the inert set if the new thing can't rewrite it + -- Below here (fr_may_rewrite fs) is True + | TyVarLHS _ <- lhs , fs `eqMayRewriteFR` new_fr = False -- (K4) Keep it in the inert set if the LHS is a tyvar and -- it can rewrite the work item. See Note [K4] - -- Below here (fr_may_rewrite fs) is True | fr_can_rewrite_ty eq_rel (canEqLHSType lhs) = True -- (K1) -- The above check redundantly checks the role & flavour, -- but it's very convenient - | kick_out_for_inertness = True - | kick_out_for_completeness = True + | kick_out_for_inertness = True -- (K2) + | kick_out_for_completeness = True -- (K3) | otherwise = False where diff --git a/compiler/GHC/Tc/Solver/Rewrite.hs b/compiler/GHC/Tc/Solver/Rewrite.hs index 6300215d2a..6fd4b85da1 100644 --- a/compiler/GHC/Tc/Solver/Rewrite.hs +++ b/compiler/GHC/Tc/Solver/Rewrite.hs @@ -333,6 +333,9 @@ forgetful -- that is, omits one or more type variables in its RHS. If so, it expands the synonym and proceeds; if not, it simply returns the unexpanded synonym. See also Note [Rewriting synonyms]. +Where do we actually perform rewriting within a type? See Note [Rewritable] in +GHC.Tc.Solver.Monad. + Note [rewrite_args performance] ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ In programs with lots of type-level evaluation, rewrite_args becomes diff --git a/compiler/GHC/Tc/Utils/TcType.hs b/compiler/GHC/Tc/Utils/TcType.hs index c7d2b3e09d..886d120661 100644 --- a/compiler/GHC/Tc/Utils/TcType.hs +++ b/compiler/GHC/Tc/Utils/TcType.hs @@ -840,6 +840,8 @@ any_rewritable :: Bool -- Ignore casts and coercions -- This looks like it should use foldTyCo, but that function is -- role-agnostic, and this one must be role-aware. We could make -- foldTyCon role-aware, but that may slow down more common usages. +-- +-- See Note [Rewritable] in GHC.Tc.Solver.Monad for a specification for this function. {-# INLINE any_rewritable #-} -- this allows specialization of predicates any_rewritable ignore_cos role tv_pred tc_pred should_expand = go role emptyVarSet @@ -888,6 +890,7 @@ anyRewritableTyVar :: Bool -- Ignore casts and coercions -> EqRel -- Ambient role -> (EqRel -> TcTyVar -> Bool) -- check tyvar -> TcType -> Bool +-- See Note [Rewritable] in GHC.Tc.Solver.Monad for a specification for this function. anyRewritableTyVar ignore_cos role pred = any_rewritable ignore_cos role pred (\ _ _ _ -> False) -- no special check for tyconapps @@ -904,12 +907,14 @@ anyRewritableTyFamApp :: EqRel -- Ambient role -- should return True only for type family applications -> TcType -> Bool -- always ignores casts & coercions +-- See Note [Rewritable] in GHC.Tc.Solver.Monad for a specification for this function. anyRewritableTyFamApp role check_tyconapp = any_rewritable True role (\ _ _ -> False) check_tyconapp (not . isFamFreeTyCon) -- This version is used by shouldSplitWD. It *does* look in casts -- and coercions, and it always expands type synonyms whose RHSs mention -- type families. +-- See Note [Rewritable] in GHC.Tc.Solver.Monad for a specification for this function. anyRewritableCanEqLHS :: EqRel -- Ambient role -> (EqRel -> TcTyVar -> Bool) -- check tyvar -> (EqRel -> TyCon -> [TcType] -> Bool) -- check type family |