delta/colm.git - github.com: adriandt/colm.git

	Commit message (Collapse)	Author	Age	Files	Lines
*	set the pubdatecolm-0.6 0.6	Adrian Thurston	2012-06-20	1	-1/+1
\|
*	moved away closed issues	Adrian Thurston	2012-06-16	48	-0/+0
\|
*	converted issues to text and split by id	Adrian Thurston	2012-06-09	79	-1643/+824
\|
*	cleanup of region creation	Adrian Thurston	2012-05-29	2	-61/+35
\|
*	flattened the reg lang name tree down to a list for regions	Adrian Thurston	2012-05-29	2	-44/+14
\|
*	only need regions in the name tree.	Adrian Thurston	2012-05-28	2	-138/+0
\|
*	cleanup in token region code	Adrian Thurston	2012-05-28	2	-68/+27
\| \| \| \| \|	Use the same name for the RegionDef and TokenRegion. Eventually should be able to unify these two structs.
*	don't need labels in the regular language tree	Adrian Thurston	2012-05-28	2	-30/+0
\|
*	code cleanup	Adrian Thurston	2012-05-28	4	-251/+3
\| \| \| \| \| \| \|	Eliminated the name resolution walk within the state machine. This is from ragel and is not needed. Also removed some top level code for constructing state machines not in a scanner. We don't have this in colm, all state machines are in a scanner.
*	code cleanup	Adrian Thurston	2012-05-28	5	-59/+42
\| \| \| \| \|	The JoinOrLm structs are no longer needed. VarDef and RegionDef reference the Join and the TokenRegion, respectively.
*	specializing graph dicts and lists for regions and regular language defs	Adrian Thurston	2012-05-28	6	-77/+171
\| \| \| \| \| \|	Previously used a single graph dictionary for regions and regular language defs because we were derived from ragel. Splitting these The split goes down to VarDef and JoinOrLm.
*	separating graph dict for regular language defs and scanners	Adrian Thurston	2012-05-27	2	-30/+24
\| \| \| \| \| \|	Renamed the def function for regions to reflect that it is for regions. Took the isInstance out of both functions. All lexical regions are instances (in the ragel sense) and all regular language definitions are not.
*	cleanup of ragel-derived code	Adrian Thurston	2012-05-27	4	-15/+40
\| \| \| \| \| \| \|	The scanner code was derivied from ragel, where the same map of names to graphs is used for regular language defintions and scanners. Some of the regular lanuage defintiions are instantiations, meaning they create states. Starting to retire this by creating a separate map for regular language defs (rlMap).
*	some fixes for this test, but not, but not funcional yet	Adrian Thurston	2012-05-27	1	-22/+26
\|
*	added shell script test harness	Adrian Thurston	2012-05-27	3	-10/+166
\| \| \| \| \| \| \|	Added a shell script test harness that executes all the lm tests in the current directory. Doesn't require any makefile generation, TESTS file, etc. The shell script be easily programmed to add extra steps during testing, such as pre/post execution, linking, etc. Just the better way to go.
*	cleanup: file renaming	Adrian Thurston	2012-05-26	4	-3/+3
\| \| \| \| \| \|	codegen.cc for writing the colm program compiler.cc for the main compiler logic synthesis.cc for the bytecode program generation
*	code movement	Adrian Thurston	2012-05-26	6	-22/+55
\| \| \| \| \|	The exports.cc file is for writing the C++ interface. Write for generic code writing. Currently has only the main file.
*	cleanup: code movement	Adrian Thurston	2012-05-26	3	-124/+83
\| \| \| \|	Merged parsedata.cc and analysis.cc and renamed it colm.cc
*	class name change ParseData -> Compiler	Adrian Thurston	2012-05-26	18	-444/+444
\|
*	minor code cleanup	Adrian Thurston	2012-05-26	1	-2/+4
\| \| \| \| \|	Allocate scanners for included files on the heap. Consistent with the main line.
*	cleanup of the mainline	Adrian Thurston	2012-05-26	4	-35/+37
\| \| \| \| \|	Allocating the primary processing objects in the mainline and calling them, previously had scanners allocating parser and parsers allocating parse data.
*	removed the opt_collect_ignore productions, not using	Adrian Thurston	2012-05-25	1	-40/+10
\|
*	test the capture-ignore mechanism	Adrian Thurston	2012-05-25	3	-5/+16
\|
*	putting collect-ignores in the grammar as zero-length tokens	Adrian Thurston	2012-05-25	10	-72/+75
\| \| \| \| \| \|	PDA construction and execution is complicated too much by the automatic insertion of collect ignore tokens when the collect-ignore property is set. Instead put the collect ignores into grammars, patterns and replacements.
*	Bump to 0.6. Will start depending on this version.	Adrian Thurston	2012-05-25	1	-1/+1
\|
*	cleanup of collect-ignore	Adrian Thurston	2012-05-25	6	-10/+38
\| \| \| \| \|	Suppress code generation of types that are duplicates into the ignore/token/ci regions. Removed some print statements used for debugging.
*	collect-ignore implementation	Adrian Thurston	2012-05-24	6	-21/+52
\| \| \| \| \| \| \|	Now possible to parse patterns that have collect-ignores. Sometimes you need them present in the input stream when you pass over the production. Other times you don't when you pass over the nonterminal. Built skipping of them into the backtracker.
*	experimenting with use of a nonterm for collecting ignores.	Adrian Thurston	2012-05-24	13	-33/+233
\| \| \| \| \| \| \| \| \|	Can say that a production should collect ignores from a region. There is a collect ignore region created, but the states from the ignore-version of the region is used. When the scanner fails to produce a token from the collect-ignore region, the collect-ignore token is generated and accepted by the fsm. Need to take it out of the data tree on reductions and put it into an ignore list. Reverse this during unparsing.
*	removed old print statement	Adrian Thurston	2012-05-23	1	-2/+2
\|
*	added a syntax for specifying no ignores	Adrian Thurston	2012-05-23	12	-49/+83
\| \| \| \| \| \|	Added the keyword 'ni', which can go ahead of or before a token pattern (literal or usual), which means no-ignore. Sets the noPreIgnore and noPostIgnore bits in the token, which affect the ignore scanning and attaching.
*	fixed botched initialization of TokenDef::dupOf	Adrian Thurston	2012-05-23	1	-1/+1
\|
*	fix for right ignore attaching	Adrian Thurston	2012-05-22	1	-16/+12
\| \| \| \| \| \|	The right ignore attaching failed to take into account that the accume ignore list is in reverse order. Need to take the tail that is for right ignore, not the head.
*	updated tests for latest parser changes	Adrian Thurston	2012-05-22	29	-34/+44
\|
*	added another ignore test	Adrian Thurston	2012-05-22	7	-7/+64
\| \| \| \| \|	Exercises the attaching of tokens to the side that the ignore definitions came from.
*	improvements to ignore handling in the parser	Adrian Thurston	2012-05-22	14	-60/+214
\| \| \| \| \| \| \| \| \| \| \| \|	Every region now also has a duplicate scanning region that is only for tokens. The duplicate ignores and tokens generate the original tokens through a TokenDef ignore mechanism. Can turn off post ignore parsing and pre-igore parsing on a token-by-token basis. Probably want to move it into the productions and specify it there. Currently don't have a specification mechanism. If an ignore is a post-token ignore it is not right-attached.
*	added text_notrim() to the C++ interface.	Adrian Thurston	2012-05-22	1	-3/+4
\| \| \| \| \|	The text_notrim() functions retrieve the text of a token without automatically trimming off whitespace.
*	added trim control flag to print code, auto-trimming all colm print calls	Adrian Thurston	2012-05-22	17	-43/+54
\| \| \| \| \| \| \| \|	The print implemenation now takes a trim flag. The colm print function now sets this flag by default. This is a change to the colm language back to 0.5 semantics. The $ conversion uses this flag too (also 0.5 semantics), in the previous commit it issued a tree trim operation. The % operation gives a string conversion without triming.
*	force DEF_PAT names to be unique.	Adrian Thurston	2012-05-21	1	-1/+2
\|
*	took out the trim before str conversion	Adrian Thurston	2012-05-21	1	-1/+1
\| \| \| \| \| \|	No longer need to trim trees before converting to strings because the string conversion does it automatically. To convert to a string without trimming the % operator is used. This may change.
*	moved repeat -> repeat1, added repeat2	Adrian Thurston	2012-05-21	9	-12/+7438
\| \| \| \| \|	The repeat2 test is the current doc gen program from ragel. There is a custom language with a traversal that calls prints (no real transformation).
*	removed empty fsmrun.c	Adrian Thurston	2012-05-21	2	-21/+1
\|
*	auto trim before $ string conversion	Adrian Thurston	2012-05-21	3	-3/+15
\| \| \| \| \|	The $ operation automatically adds a TRIM. The '%' opertion was added, which is the original $ conversion without the trim.
*	clone elimination/refactoring of ignore functions	Adrian Thurston	2012-05-21	1	-39/+2
\| \| \| \|	Eliminated final clone of the push ignores, this one was in the trim operation.
*	clone elimination.	Adrian Thurston	2012-05-21	1	-36/+2
\| \| \| \|	Clone elimination on calls pushIgnore.
*	ongoing refactoring cleanup	Adrian Thurston	2012-05-21	3	-16/+18
\| \| \| \| \| \|	Removed sp from the pushIgnore functions. Need to use it in contexts where it is not currently available. Actually not needed because we can directly access refs when trees are moved around during the push.
*	more clone removal surrounding ignore handling	Adrian Thurston	2012-05-21	3	-70/+88
\|
*	clone removal	Adrian Thurston	2012-05-21	3	-130/+141
\| \| \| \|	The ignore node handling code has been frequently cloned. Cleaning that up.
*	eliminated the IgnoreTree struct.	Adrian Thurston	2012-05-21	8	-93/+45
\| \| \| \| \|	Eliminated the IgnoreTree struct because it no longer contains any extensions of Tree. Just using Tree.
*	eliminated generation from IgnoreList	Adrian Thurston	2012-05-21	5	-12/+0
\| \| \| \| \|	This field was the only field that extends the basic tree. Can now eliminate the structure and just use Tree.
*	test cases updated for no-kid-flags and no-dup-ignors	Adrian Thurston	2012-05-21	21	-43/+51
\| \| \| \| \|	Whitespace is shifting. Most of the updates involve triming whitespace where it was previously trimmed automatically.