298 Commits

Author SHA1 Message Date
ca8a360c0e Provide matched text to lexer user code block 2022-10-15 13:37:34 -04:00
623c644e74 Work on real D strings instead of ubyte pointer and length
Also fix a couple UTF-8 decoder bugs!
2022-10-15 13:32:33 -04:00
de93d23585 Add test for non-LALR grammar failing to generate parser 2022-10-13 05:17:06 -04:00
ad09ff039a Add spec to test parsing lists 2022-10-13 05:02:05 -04:00
727c8cd1ea Execute rule user code blocks when reducing the rule 2022-10-13 04:56:50 -04:00
74beaf7ed8 Store final parse result and give access with .result parser property 2022-10-12 21:33:09 -04:00
31970522de Store parse result; add result_type grammar keyword 2022-10-12 20:56:14 -04:00
02be6de48e Add lexer modes and $mode() code expansion 2022-10-09 22:49:01 -04:00
b2d11321fe Add grammar syntax to specify lexer mode for tokens and patterns 2022-10-04 22:23:39 -04:00
6bd9d4a09b Remove _TOKEN_NONE and use _TOKEN_COUNT instead 2022-10-03 21:40:34 -04:00
ced5a56424 Support global user code blocks 2022-10-03 21:24:13 -04:00
66d654b6b9 Add $token() user code block expansion 2022-10-02 10:43:47 -04:00
43fb74fe4b Capture and verify stdout results from tests 2022-10-02 10:31:07 -04:00
01ef4fc27c Rename some test files 2022-10-02 10:10:04 -04:00
f46b5b3f4d Remove TOKEN_EOF; define EOF token and start rule in Generator 2022-10-02 10:07:44 -04:00
150be33826 Assign pattern code IDs in Generator instead of Grammar 2022-10-01 10:36:35 -04:00
e7e30c4f28 Add pattern statement 2022-09-30 21:05:18 -04:00
04367db0ac Add forward slashes around patterns and parse more robustly 2022-09-28 23:05:01 -04:00
1547528ecf Split grammar statement parsing into separate methods 2022-09-27 18:49:04 -04:00
62cb5c03f2 Refactor grammar parsing a bit 2022-09-27 15:28:39 -04:00
4b148f3797 Add Grammar spec 2022-09-27 14:45:29 -04:00
a4c5546876 Disable parser debug output 2022-09-27 12:33:23 -04:00
48704295bb Rename lexer state "accepts" to "token" 2022-09-25 14:57:46 -04:00
4d716f6c10 Remove Token#c_name; use given token case in token constants 2022-09-25 14:48:49 -04:00
01c9340819 Rename lexer variables to reflect trying to get longest match 2022-09-25 14:44:44 -04:00
672098ad32 Execute user code blocks assigned to tokens 2022-09-24 17:31:40 -04:00
92ce30f354 Add tokenid keyword 2022-09-19 22:16:57 -04:00
38ae5ac7a1 Split Token class into Token/Pattern 2022-09-15 22:46:44 -04:00
bf075a69f6 Test matching a semicolon 2022-07-23 22:25:17 -04:00
b682c72b17 Add semicolon to end of all grammar statements 2022-07-23 22:09:19 -04:00
382e17804c Test SLR grammar 2022-06-27 21:06:03 -04:00
30f4cfcc99 Write parser log file
Fix bug of skipping rule set IDs.
Remove unneeded out_sets from ItemSet class.
2022-06-26 11:06:55 -04:00
2fbe13e071 Do not consume lookahead token when reducing 2022-06-25 21:35:54 -04:00
f2cc5b112e Handle shifting states after reducing 2022-06-25 16:16:20 -04:00
84c4a16ce6 Start on Parser.parse() 2022-06-21 23:03:00 -04:00
f17efe8c82 Add RuleSet#id to use when reducing
Parser will know what state to go to after reducing a Rule based on the
RuleSet ID.
Start on Parser class.
2022-06-21 20:07:27 -04:00
60e2818075 Determine the reduce actions for each parser state 2022-06-17 01:45:48 -04:00
5af3179ff2 Add RuleSet#start_token_set 2022-06-17 01:35:02 -04:00
11ea02fb86 Determine the possibly-empty RuleSets 2022-06-17 01:35:02 -04:00
8f38d0b017 Clean up ItemSet#close! 2022-06-17 01:35:02 -04:00
a91bf2f250 Clean up parser ItemSet processing 2022-06-14 22:57:07 -04:00
0f1c00d1a6 Add RuleSet#could_be_empty? 2022-06-14 07:35:58 -04:00
291509f005 Add Rule#empty? 2022-06-14 07:35:38 -04:00
c1769503a8 Add Item#complete? 2022-06-14 07:35:07 -04:00
8b152abaa7 Add ItemSet#leading_item_sets 2022-06-14 07:34:35 -04:00
d6e5c4325d Add new start rule ending with EOF token rather than appending EOF token to all user-specified start rules 2022-06-07 22:25:31 -04:00
2837dfda6b Add Rule#id 2022-06-06 22:30:53 -04:00
9d850294a9 Clean up and comment Item and ItemSet 2022-06-05 18:12:14 -04:00
df8088c3c6 Clean up rule format in grammar files 2022-06-05 16:28:35 -04:00
ba74d0a20a Reduce maximum code point value to not interfere with magic code point values used by parser 2022-06-05 15:24:40 -04:00