delta/colm.git - github.com: adriandt/colm.git

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	separating graph dict for regular language defs and scanners	Adrian Thurston	2012-05-27	2	-30/+24
\| \| \| \| \| \|	Renamed the def function for regions to reflect that it is for regions. Took the isInstance out of both functions. All lexical regions are instances (in the ragel sense) and all regular language definitions are not.
*	cleanup of ragel-derived code	Adrian Thurston	2012-05-27	4	-15/+40
\| \| \| \| \| \| \|	The scanner code was derived from ragel, where the same map of names to graphs is used for regular language definitions and scanners. Some of the regular language definitions are instantiations, meaning they create states. Starting to retire this by creating a separate map for regular language defs (rlMap).
*	some fixes for this test, but not functional yet	Adrian Thurston	2012-05-27	1	-22/+26
\|
*	added shell script test harness	Adrian Thurston	2012-05-27	3	-10/+166
\| \| \| \| \| \| \|	Added a shell script test harness that executes all the lm tests in the current directory. Doesn't require any makefile generation, TESTS file, etc. The shell script can easily be programmed to add extra steps during testing, such as pre/post execution, linking, etc. Just the better way to go.
*	cleanup: file renaming	Adrian Thurston	2012-05-26	4	-3/+3
\| \| \| \| \| \|	codegen.cc for writing the colm program; compiler.cc for the main compiler logic; synthesis.cc for the bytecode program generation
*	code movement	Adrian Thurston	2012-05-26	6	-22/+55
\| \| \| \| \|	The exports.cc file is for writing the C++ interface. Write for generic code writing. Currently has only the main file.
*	cleanup: code movement	Adrian Thurston	2012-05-26	3	-124/+83
\| \| \| \|	Merged parsedata.cc and analysis.cc and renamed it colm.cc
*	class name change ParseData -> Compiler	Adrian Thurston	2012-05-26	18	-444/+444
\|
*	minor code cleanup	Adrian Thurston	2012-05-26	1	-2/+4
\| \| \| \| \|	Allocate scanners for included files on the heap. Consistent with the main line.
*	cleanup of the mainline	Adrian Thurston	2012-05-26	4	-35/+37
\| \| \| \| \|	Allocating the primary processing objects in the mainline and calling them there. Previously the scanners allocated the parsers and the parsers allocated the parse data.
*	removed the opt_collect_ignore productions, not using	Adrian Thurston	2012-05-25	1	-40/+10
\|
*	test the capture-ignore mechanism	Adrian Thurston	2012-05-25	3	-5/+16
\|
*	putting collect-ignores in the grammar as zero-length tokens	Adrian Thurston	2012-05-25	10	-72/+75
\| \| \| \| \| \|	PDA construction and execution are overly complicated by the automatic insertion of collect-ignore tokens when the collect-ignore property is set. Instead put the collect ignores into grammars, patterns and replacements.
*	Bump to 0.6. Will start depending on this version.	Adrian Thurston	2012-05-25	1	-1/+1
\|
*	cleanup of collect-ignore	Adrian Thurston	2012-05-25	6	-10/+38
\| \| \| \| \|	Suppress code generation of types that are duplicates into the ignore/token/ci regions. Removed some print statements used for debugging.
*	collect-ignore implementation	Adrian Thurston	2012-05-24	6	-21/+52
\| \| \| \| \| \| \|	Now possible to parse patterns that have collect-ignores. Sometimes you need them present in the input stream when you pass over the production. Other times you don't when you pass over the nonterminal. Built skipping of them into the backtracker.
*	experimenting with use of a nonterm for collecting ignores.	Adrian Thurston	2012-05-24	13	-33/+233
\| \| \| \| \| \| \| \| \|	Can say that a production should collect ignores from a region. There is a collect-ignore region created, but the states from the ignore-version of the region are used. When the scanner fails to produce a token from the collect-ignore region, the collect-ignore token is generated and accepted by the fsm. Need to take it out of the data tree on reductions and put it into an ignore list. Reverse this during unparsing.
*	removed old print statement	Adrian Thurston	2012-05-23	1	-2/+2
\|
*	added a syntax for specifying no ignores	Adrian Thurston	2012-05-23	12	-49/+83
\| \| \| \| \| \|	Added the keyword 'ni', which can go before or after a token pattern (literal or usual), which means no-ignore. Sets the noPreIgnore and noPostIgnore bits in the token, which affect the ignore scanning and attaching.
*	fixed botched initialization of TokenDef::dupOf	Adrian Thurston	2012-05-23	1	-1/+1
\|
*	fix for right ignore attaching	Adrian Thurston	2012-05-22	1	-16/+12
\| \| \| \| \| \|	The right ignore attaching failed to take into account that the accumulated ignore list is in reverse order. Need to take the tail that is for right ignore, not the head.
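The fix described above can be illustrated with a minimal C++ sketch. It is not colm's code: `splitRightIgnore`, the string tokens, and the `nRight` count are hypothetical, and the sketch only assumes that the accumulated list stores the most recently scanned ignore first.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical helper, not taken from colm. `accum` holds ignore tokens in
// reverse scan order: the most recently scanned token is at the front.
// The `nRight` tokens that were scanned first (nearest the preceding token)
// therefore sit at the tail. Returns {right, rest}, both in scan order.
std::pair<std::vector<std::string>, std::vector<std::string>>
splitRightIgnore(const std::vector<std::string> &accum, std::size_t nRight)
{
    if (nRight > accum.size())
        nRight = accum.size();

    // Walking the reversed list backwards restores scan order, so the
    // first nRight items of that walk are the tail of `accum`.
    auto mid = accum.rbegin() + static_cast<std::ptrdiff_t>(nRight);
    std::vector<std::string> right(accum.rbegin(), mid);
    std::vector<std::string> rest(mid, accum.rend());
    return {right, rest};
}
```

Taking the first `nRight` entries of `accum` directly would pick the most recently scanned ignores, which is the head-versus-tail mistake the commit message describes.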
*	updated tests for latest parser changes	Adrian Thurston	2012-05-22	29	-34/+44
\|
*	added another ignore test	Adrian Thurston	2012-05-22	7	-7/+64
\| \| \| \| \|	Exercises the attaching of tokens to the side that the ignore definitions came from.
*	improvements to ignore handling in the parser	Adrian Thurston	2012-05-22	14	-60/+214
\| \| \| \| \| \| \| \| \| \| \| \|	Every region now also has a duplicate scanning region that is only for tokens. The duplicate ignores and tokens generate the original tokens through a TokenDef ignore mechanism. Can turn off post-ignore parsing and pre-ignore parsing on a token-by-token basis. Probably want to move it into the productions and specify it there. Currently don't have a specification mechanism. If an ignore is a post-token ignore it is not right-attached.
*	added text_notrim() to the C++ interface.	Adrian Thurston	2012-05-22	1	-3/+4
\| \| \| \| \|	The text_notrim() functions retrieve the text of a token without automatically trimming off whitespace.
*	added trim control flag to print code, auto-trimming all colm print calls	Adrian Thurston	2012-05-22	17	-43/+54
\| \| \| \| \| \| \| \|	The print implementation now takes a trim flag. The colm print function now sets this flag by default. This is a change to the colm language back to 0.5 semantics. The $ conversion uses this flag too (also 0.5 semantics); in the previous commit it issued a tree trim operation. The % operation gives a string conversion without trimming.
*	force DEF_PAT names to be unique.	Adrian Thurston	2012-05-21	1	-1/+2
\|
*	took out the trim before str conversion	Adrian Thurston	2012-05-21	1	-1/+1
\| \| \| \| \| \|	No longer need to trim trees before converting to strings because the string conversion does it automatically. To convert to a string without trimming the % operator is used. This may change.
*	moved repeat -> repeat1, added repeat2	Adrian Thurston	2012-05-21	9	-12/+7438
\| \| \| \| \|	The repeat2 test is the current doc gen program from ragel. There is a custom language with a traversal that calls prints (no real transformation).
*	removed empty fsmrun.c	Adrian Thurston	2012-05-21	2	-21/+1
\|
*	auto trim before $ string conversion	Adrian Thurston	2012-05-21	3	-3/+15
\| \| \| \| \|	The $ operation automatically adds a TRIM. The '%' operation was added, which is the original $ conversion without the trim.
*	clone elimination/refactoring of ignore functions	Adrian Thurston	2012-05-21	1	-39/+2
\| \| \| \|	Eliminated final clone of the push ignores, this one was in the trim operation.
*	clone elimination.	Adrian Thurston	2012-05-21	1	-36/+2
\| \| \| \|	Clone elimination on calls to pushIgnore.
*	ongoing refactoring cleanup	Adrian Thurston	2012-05-21	3	-16/+18
\| \| \| \| \| \|	Removed sp from the pushIgnore functions. Need to use it in contexts where it is not currently available. Actually not needed because we can directly access refs when trees are moved around during the push.
*	more clone removal surrounding ignore handling	Adrian Thurston	2012-05-21	3	-70/+88
\|
*	clone removal	Adrian Thurston	2012-05-21	3	-130/+141
\| \| \| \|	The ignore node handling code has been frequently cloned. Cleaning that up.
*	eliminated the IgnoreTree struct.	Adrian Thurston	2012-05-21	8	-93/+45
\| \| \| \| \|	Eliminated the IgnoreTree struct because it no longer contains any extensions of Tree. Just using Tree.
*	eliminated generation from IgnoreList	Adrian Thurston	2012-05-21	5	-12/+0
\| \| \| \| \|	This field was the only field that extends the basic tree. Can now eliminate the structure and just use Tree.
*	test cases updated for no-kid-flags and no-dup-ignores	Adrian Thurston	2012-05-21	21	-43/+51
\| \| \| \| \|	Whitespace is shifting. Most of the updates involve trimming whitespace where it was previously trimmed automatically.
*	added missing downref in detach ignore	Adrian Thurston	2012-05-21	1	-0/+2
\| \| \| \| \|	Added a missing downref in the detach of right ignores. This was lost when converting to the no-dupe of ignores.
*	code and expected output changes for no-kf-dupign	Adrian Thurston	2012-05-21	9	-8/+23
\| \| \| \| \|	Explicitly suppress leading and trailing ignores using the TRIM operation. Adjusted output for the no-dupignores too. Whitespace shifts around.
*	improvements to the delayed ignore-tree printing	Adrian Thurston	2012-05-21	1	-27/+29
\| \| \| \| \| \|	Need to put the visitType on the stack if we are to reference it past a recursive call. Implement the suppress left when going backwards to reverse the list. Implement suppress-right on the pass forward for printing.
*	refcounting fix in TREE_TRIM	Adrian Thurston	2012-05-21	1	-2/+0
\| \| \| \| \|	Don't need to adjust the refcounts in TREE_TRIM. The trim function will split the tree, making it safe to write to. Just pop, split, modify, push back.
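The pop, split, modify, push-back sequence can be sketched in C++ as a copy-on-write pattern. This is an illustration only: `Node`, `split`, `trimText`, and `trimTop` are hypothetical names, and `std::shared_ptr` stands in for colm's own reference counting, which is not shown in this log.

```cpp
#include <cassert>
#include <memory>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for a refcounted tree.
struct Node { std::string text; };

// Return a tree that is safe to write to: the same one if the caller holds
// the only reference, otherwise a private copy.
std::shared_ptr<Node> split(const std::shared_ptr<Node> &tree)
{
    if (tree.use_count() == 1)
        return tree;
    return std::make_shared<Node>(*tree);
}

// Strip leading and trailing whitespace in place.
void trimText(std::string &s)
{
    const char *ws = " \t\n";
    std::string::size_type first = s.find_first_not_of(ws);
    if (first == std::string::npos) {
        s.clear();
        return;
    }
    std::string::size_type last = s.find_last_not_of(ws);
    s = s.substr(first, last - first + 1);
}

// Pop, split, modify, push back. No manual refcount adjustment is needed
// because split() already guarantees an unshared tree.
void trimTop(std::vector<std::shared_ptr<Node>> &stack)
{
    std::shared_ptr<Node> tree = std::move(stack.back());
    stack.pop_back();
    std::shared_ptr<Node> writable = split(tree);
    trimText(writable->text);
    stack.push_back(std::move(writable));
}
```

Other holders of the original tree keep seeing the untrimmed text, since the modification only ever touches a tree with a single owner.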
*	added the trim operation (^)	Adrian Thurston	2012-05-21	8	-21/+135
\| \| \| \| \| \| \| \| \|	The trim operation wraps the ignores with nodes that have the suppress-left and suppress-right flags set in the ignore list struct. These are now added to the list of ignores to output once a terminal is hit, but are not printed since they have no data. They are there just for their flags. Implemented the suppress-left and suppress-right operations in a walk of the ignores to output by altering the list.
*	Buffer ignore data until a terminal is hit.	Adrian Thurston	2012-05-20	1	-36/+65
\|
*	tests useful on the no-kf-dupign branch	Adrian Thurston	2012-05-20	9	-6/+116
\| \| \| \|	These tests are helpful when testing the no-kf-dupign branch.
*	first checkin on no-kid-flags and no-ignore-dupes branch	Adrian Thurston	2012-05-20	7	-131/+215
\| \| \| \| \| \| \| \| \| \| \| \|	Trying out an elimination of the kid flags and the duplicate ignore tokens. The kid flags waste a lot of space. There is also a bug WRT iterators. Some kids in the ref chain are actually on the stack; only the tree is safe to edit because the stack variables are not full kids. The duplicate ignores cause a complicated implementation in the print function. It may actually be unneeded now that we have follow ignores. We can also provide some control over where the ignore tokens go, to the right of the left token or to the left of the right token.
*	added two test cases causing segfaults	Adrian Thurston	2012-05-20	9	-5/+235
\|
*	final test updates for follow-ignore	Adrian Thurston	2012-05-20	3	-6/+5
\| \| \| \|	Final adjustments related to shifting ignore token placements.
*	updated context2 test for follow-ignore	Adrian Thurston	2012-05-20	2	-4/+4
\| \| \| \| \|	Removed the token matching the empty string. Adjusted the output for shifting placement of ignore tokens.