Change stack alignment to 16+8 bytes in STG code

This patch changes the STG code so that %rsp to be aligned to a 16-byte boundary + 8. This is the alignment required by the x86_64 ABI on entry to a function. Previously we kept %rsp aligned to a 16-byte boundary, but this was causing problems for the LLVM backend (see #4211). We now don't need to invoke llvm stack mangler on x86_64 targets. Since the stack is now 16+8 byte algined in STG land on x86_64, we don't need to mangle the stack manipulations with the llvm mangler. This patch only modifies the alignement for x86_64 backends. Signed-off-by: David Terei <davidterei@gmail.com>
author: David M Peixotto <dmp@rice.edu> 2011-10-19 15:49:06 -0500
committer: David Terei <davidterei@gmail.com> 2011-11-01 03:18:40 -0700
commit: a9ce36118f0de3aeb427792f8f2c5ae097c94d3f (patch)
tree: d03c1697a04df842b21bafa214f22140473a2e0d /rts/StgCRun.c
parent: f0ae3f31277ebfe2384fca3f89867f340ae9b492 (diff)
download: haskell-a9ce36118f0de3aeb427792f8f2c5ae097c94d3f.tar.gz
1 files changed, 26 insertions, 20 deletions
diff --git a/rts/StgCRun.c b/rts/StgCRun.c
index 7251e64253..11e0543475 100644
--- a/rts/StgCRun.c
+++ b/rts/StgCRun.c
@@ -267,29 +267,36 @@ StgRunIsImplementedInAssembler(void)
 	"addq %0, %%rsp\n\t"
 	"retq"
 
-	: : "i"(RESERVED_C_STACK_BYTES+48+8 /*stack frame size*/));
+	: : "i"(RESERVED_C_STACK_BYTES+48 /*stack frame size*/));
     /* 
-       HACK alert!
-
-       The x86_64 ABI specifies that on a procedure call, %rsp is
+       The x86_64 ABI specifies that on entry to a procedure, %rsp is
        aligned on a 16-byte boundary + 8.  That is, the first
        argument on the stack after the return address will be
-       16-byte aligned.  
-       
-       Which should be fine: RESERVED_C_STACK_BYTES+48 is a multiple
-       of 16 bytes.  
+       16-byte aligned.
+
+       We maintain the 16+8 stack alignment throughout the STG code.
+
+       When we call STG_RUN the stack will be aligned to 16+8. We used
+       to subtract an extra 8 bytes so that %rsp would be 16 byte
+       aligned at all times in STG land. This worked fine for the
+       native code generator which knew that the stack was already
+       aligned on 16 bytes when it generated calls to C functions.
+
+       This arrangemnt caused problems for the LLVM backend. The LLVM
+       code generator would assume that on entry to each function the
+       stack is aligned to 16+8 as required by the ABI. However, since
+       we only enter STG functions by jumping to them with tail calls,
+       the stack was actually aligned to a 16-byte boundary. The LLVM
+       backend had its own mangler that would post-process the
+       assembly code to fixup the stack manipulation code to mainain
+       the correct alignment (see #4211).
+
+       Therefore, we now now keep the stack aligned to 16+8 while in
+       STG land so that LLVM generates correct code without any
+       mangling. The native code generator can handle this alignment
+       just fine by making sure the stack is aligned to a 16-byte
+       boundary before it makes a C-call.
        
-       BUT... when we do a C-call from STG land, gcc likes to put the
-       stack alignment adjustment in the prolog.  eg. if we're calling
-       a function with arguments in regs, gcc will insert 'subq $8,%rsp'
-       in the prolog, to keep %rsp aligned (the return address is 8
-       bytes, remember).  The mangler throws away the prolog, so we
-       lose the stack alignment.
-
-       The hack is to add this extra 8 bytes to our %rsp adjustment
-       here, so that throughout STG code, %rsp is 16-byte aligned,
-       ready for a C-call.  
-
        A quick way to see if this is wrong is to compile this code:
 
           main = System.Exit.exitWith ExitSuccess
@@ -300,7 +307,6 @@ StgRunIsImplementedInAssembler(void)
        stack isn't aligned, and calling exitWith from Haskell invokes
        shutdownHaskellAndExit using a C call.
 
-       Future gcc releases will almost certainly break this hack...
     */
 }
author	David M Peixotto <dmp@rice.edu>	2011-10-19 15:49:06 -0500
committer	David Terei <davidterei@gmail.com>	2011-11-01 03:18:40 -0700
commit	a9ce36118f0de3aeb427792f8f2c5ae097c94d3f (patch)
tree	d03c1697a04df842b21bafa214f22140473a2e0d /rts/StgCRun.c
parent	f0ae3f31277ebfe2384fca3f89867f340ae9b492 (diff)
download	haskell-a9ce36118f0de3aeb427792f8f2c5ae097c94d3f.tar.gz