325 Commits

Author SHA1 Message Date
b78827746a Return a Token object from Lexer.user_code() 2023-03-17 20:24:48 -04:00
ce80e00e60 Remove TOKEN_DECODE_ERROR 2023-03-17 20:01:55 -04:00
01b45df1f9 Remove TOKEN_DROP 2023-03-17 19:58:54 -04:00
ffccc659aa Clean up Lexer.State a bit 2023-03-14 19:38:42 -04:00
e432f62b05 Add UserCodeID struct to represent user code IDs 2023-03-13 21:56:18 -04:00
058945e08b Add Token struct with validity checks 2023-03-13 16:14:50 -04:00
f402315201 Name internally generated tokens with double-underscore 2023-03-13 14:13:13 -04:00
d10e982fc8 Reserve double-underscore identifiers for internal use 2023-03-13 14:12:12 -04:00
64974cc1e2 Refactor some of Lexer.attempt_lex_token() into find_longest_match() 2023-03-12 21:19:03 -04:00
b92679e0c2 Replace LexedToken with Result struct 2023-03-11 21:16:55 -05:00
1af018b103 MatchInfo: store pointer to the accepting state rather than copying its fields 2023-03-09 20:16:57 -05:00
aabb574fea Use CodePoint type for code point values 2023-03-09 19:10:17 -05:00
30004c571d Use result type query methods on Decoder.Result 2023-03-09 19:04:15 -05:00
b00f53050b Clean up Decoder result handling 2023-03-06 20:32:01 -05:00
2e5083d823 Add explicit base types to the enums 2023-03-06 19:58:24 -05:00
7f844711d9 Start on more of the user guide 2023-02-28 20:53:31 -05:00
e2100f6ccd Move JSON parser source to standalone file 2023-01-16 14:59:43 -05:00
3057173370 Update license copyright year 2023-01-04 20:43:25 -05:00
90d08e3554 Add script to build distributable propane script 2023-01-04 20:43:25 -05:00
f5756e9523 Add script to generate user guide HTML 2023-01-04 20:43:25 -05:00
dbc5560aec Add JSON parser test case 2022-11-14 23:07:10 -05:00
62451f3a92 Allow multiple test files to be compiled 2022-11-14 21:00:49 -05:00
2e48921bb1 Fix $n expansion for rules to use rule component ptype 2022-11-14 21:00:49 -05:00
c6ea4f83c2 Store parser values according to the rule/pattern type 2022-11-13 22:20:30 -05:00
e4a160f918 Allow defining multiple parser types and assigning parser types to tokens and rules 2022-11-13 13:35:19 -05:00
8dc27686aa Rename result_type to ptype, parser result to pvalue 2022-11-12 13:29:10 -05:00
bca0a14371 Allow storing a result value for a token from a lexer code block 2022-10-16 21:40:25 -04:00
ca8a360c0e Provide matched text to lexer user code block 2022-10-15 13:37:34 -04:00
623c644e74 Work on real D strings instead of ubyte pointer and length
Also fix a couple UTF-8 decoder bugs!
2022-10-15 13:32:33 -04:00
de93d23585 Add test for non-LALR grammar failing to generate parser 2022-10-13 05:17:06 -04:00
ad09ff039a Add spec to test parsing lists 2022-10-13 05:02:05 -04:00
727c8cd1ea Execute rule user code blocks when reducing the rule 2022-10-13 04:56:50 -04:00
74beaf7ed8 Store final parse result and give access with .result parser property 2022-10-12 21:33:09 -04:00
31970522de Store parse result; add result_type grammar keyword 2022-10-12 20:56:14 -04:00
02be6de48e Add lexer modes and $mode() code expansion 2022-10-09 22:49:01 -04:00
b2d11321fe Add grammar syntax to specify lexer mode for tokens and patterns 2022-10-04 22:23:39 -04:00
6bd9d4a09b Remove _TOKEN_NONE and use _TOKEN_COUNT instead 2022-10-03 21:40:34 -04:00
ced5a56424 Support global user code blocks 2022-10-03 21:24:13 -04:00
66d654b6b9 Add $token() user code block expansion 2022-10-02 10:43:47 -04:00
43fb74fe4b Capture and verify stdout results from tests 2022-10-02 10:31:07 -04:00
01ef4fc27c Rename some test files 2022-10-02 10:10:04 -04:00
f46b5b3f4d Remove TOKEN_EOF; define EOF token and start rule in Generator 2022-10-02 10:07:44 -04:00
150be33826 Assign pattern code IDs in Generator instead of Grammar 2022-10-01 10:36:35 -04:00
e7e30c4f28 Add pattern statement 2022-09-30 21:05:18 -04:00
04367db0ac Add forward slashes around patterns and parse more robustly 2022-09-28 23:05:01 -04:00
1547528ecf Split grammar statement parsing into separate methods 2022-09-27 18:49:04 -04:00
62cb5c03f2 Refactor grammar parsing a bit 2022-09-27 15:28:39 -04:00
4b148f3797 Add Grammar spec 2022-09-27 14:45:29 -04:00
a4c5546876 Disable parser debug output 2022-09-27 12:33:23 -04:00
48704295bb Rename lexer state "accepts" to "token" 2022-09-25 14:57:46 -04:00