summaryrefslogtreecommitdiff
path: root/gcc/PROJECTS
diff options
context:
space:
mode:
Diffstat (limited to 'gcc/PROJECTS')
-rw-r--r--gcc/PROJECTS48
1 files changed, 30 insertions, 18 deletions
diff --git a/gcc/PROJECTS b/gcc/PROJECTS
index 2f508e695e1..2c2111a2a6e 100644
--- a/gcc/PROJECTS
+++ b/gcc/PROJECTS
@@ -22,14 +22,9 @@ Haifa scheduler (haifa-sched.c, loop.[ch], unroll.[ch], genattrtab.c):
opinion that gcc already has too many -foptions, and haifa doesn't help
that situation.
- * Testing and benchmarking. Haifa has received little testing inside
- Cygnus -- it needs to be throughly tested on a wide variety of platforms
- which benefit from instruction scheduling (sparc, alpha, pa, ppc, mips, x86,
- i960, m88k, sh, etc). It needs to be benchmarked -- my tests showed
- haifa was very much a hit or miss in terms of performance improvements.
-
- Some benchmarks ran significantly fasters, other significantly slower.
- We need to work on making haifa generate better overall code.
+ * Testing and benchmarking. We've converted a few ports to using the
+ Haifa scheduler (hppa, sparc, ppc, alpha). We need to continue testing
+ and benchmarking the new scheduler on additional targets.
We need to have some kind of docs for how to best describe a machine to
the haifa scheduler to get good performance. Some existing ports have
@@ -38,6 +33,26 @@ Haifa scheduler (haifa-sched.c, loop.[ch], unroll.[ch], genattrtab.c):
+Improvements to global cse and partial redundancy elimination:
+
+The current implementation of global cse uses partial redundancy elimination
+as described in Chow's thesis.
+
+Long term we want to use lazy code motion as the basis for partial redundancy
+elimination. lcm will find as many (or more) redunancies *and* it will
+place the remaining computations at computationally optimal placement points
+within the function. This reduces the number of redundant operations performed
+as well as reducing register lifetimes. My experiments have shown that the
+cases were the current PRE code hurts performance are greatly helped by using
+lazy code motion.
+
+lcm also provides the underlying framework for several additional optimizations
+such as shrink wrapping, spill code motion, dead store elimination, and generic
+load/store motion (all the other examples are subcases of load/store motion).
+
+It can probably also be used to improve the reg-stack pass of the compiler.
+
+Contact law@cygnus.com if you're interested in working on lazy code motion.
-------------
@@ -57,14 +72,6 @@ The difficulty is in finding a clean way for the RTL which refers
to the constant (currently, only by an assembler symbol name)
to point to the constant and cause it to be output.
-* More cse
-
-The techniques for doing full global cse are described in the red
-dragon book, or (a different version) in Frederick Chow's thesis from
-Stanford. It is likely to be slow and use a lot of memory, but it
-might be worth offering as an additional option. Contact dje@cygnus.com
-before doing any work on CSE.
-
* Optimize a sequence of if statements whose conditions are exclusive.
It is possible to optimize
@@ -207,6 +214,10 @@ the same location; and, in between, there is no reference to anything
that might be that location (including no reference to a variable
address).
+This can be modeled as a partial redundancy elimination/lazy code motion
+problem. Contact law@cygnus.com before working on dead store elimination
+optimizations.
+
* Loop optimization.
Strength reduction and iteration variable elimination could be
@@ -275,8 +286,9 @@ to the order in which to generate code for subexpressions of an expression.
* More code motion.
-Consider hoisting common code up past conditional branches or
-tablejumps.
+Consider hoisting common code up past conditional branches or tablejumps.
+
+Contact law@cygnus.com before working on code hoisting.
* Trace scheduling.