432 Commits

Author SHA1 Message Date
0d0da49cd5 Just return integer result code from Lexer.find_longest_match() 2023-07-05 17:07:48 -04:00
d56cc2deeb Just return integer result code from Decoder.decode_code_point() 2023-07-05 16:54:52 -04:00
9e33a32930 Store code_point_length in a ubyte 2023-07-05 16:45:03 -04:00
02d99082b2 Allow switching on Decoder.Result 2023-07-04 12:49:54 -04:00
9895733a05 Switch unit tests from gdc to ldc2 2023-06-29 19:16:36 -04:00
11348ca351 Add FindLongestMatchResult 2023-06-04 21:14:07 -04:00
6b0fb4cb12 Add return type for transition() 2023-06-03 20:38:42 -04:00
b78827746a Return a Token object from Lexer.user_code() 2023-03-17 20:24:48 -04:00
ce80e00e60 Remove TOKEN_DECODE_ERROR 2023-03-17 20:01:55 -04:00
01b45df1f9 Remove TOKEN_DROP 2023-03-17 19:58:54 -04:00
ffccc659aa Clean up Lexer.State a bit 2023-03-14 19:38:42 -04:00
e432f62b05 Add UserCodeID struct to represent user code IDs 2023-03-13 21:56:18 -04:00
058945e08b Add Token struct with validity checks 2023-03-13 16:14:50 -04:00
f402315201 Name internally generated tokens with double-underscore 2023-03-13 14:13:13 -04:00
d10e982fc8 Reserve double-underscore identifiers for internal use 2023-03-13 14:12:12 -04:00
64974cc1e2 Refactor some of Lexer.attempt_lex_token() into find_longest_match() 2023-03-12 21:19:03 -04:00
b92679e0c2 Replace LexedToken with Result struct 2023-03-11 21:16:55 -05:00
1af018b103 MatchInfo: store pointer to the accepting state rather than copying its fields 2023-03-09 20:16:57 -05:00
aabb574fea Use CodePoint type for code point values 2023-03-09 19:10:17 -05:00
30004c571d Use result type query methods on Decoder.Result 2023-03-09 19:04:15 -05:00
b00f53050b Clean up Decoder result handling 2023-03-06 20:32:01 -05:00
2e5083d823 Add explicit base types to the enums 2023-03-06 19:58:24 -05:00
7f844711d9 Start on more of the user guide 2023-02-28 20:53:31 -05:00
e2100f6ccd Move JSON parser source to standalone file 2023-01-16 14:59:43 -05:00
3057173370 Update license copyright year 2023-01-04 20:43:25 -05:00
90d08e3554 Add script to build distributable propane script 2023-01-04 20:43:25 -05:00
f5756e9523 Add script to generate user guide HTML 2023-01-04 20:43:25 -05:00
dbc5560aec Add JSON parser test case 2022-11-14 23:07:10 -05:00
62451f3a92 Allow multiple test files to be compiled 2022-11-14 21:00:49 -05:00
2e48921bb1 Fix $n expansion for rules to use rule component ptype 2022-11-14 21:00:49 -05:00
c6ea4f83c2 Store parser values according to the rule/pattern type 2022-11-13 22:20:30 -05:00
e4a160f918 Allow defining multiple parser types and assigning parser types to tokens and rules 2022-11-13 13:35:19 -05:00
8dc27686aa Rename result_type to ptype, parser result to pvalue 2022-11-12 13:29:10 -05:00
bca0a14371 Allow storing a result value for a token from a lexer code block 2022-10-16 21:40:25 -04:00
ca8a360c0e Provide matched text to lexer user code block 2022-10-15 13:37:34 -04:00
623c644e74 Work on real D strings instead of ubyte pointer and length
Also fix a couple UTF-8 decoder bugs!
2022-10-15 13:32:33 -04:00
de93d23585 Add test for non-LALR grammar failing to generate parser 2022-10-13 05:17:06 -04:00
ad09ff039a Add spec to test parsing lists 2022-10-13 05:02:05 -04:00
727c8cd1ea Execute rule user code blocks when reducing the rule 2022-10-13 04:56:50 -04:00
74beaf7ed8 Store final parse result and give access with .result parser property 2022-10-12 21:33:09 -04:00
31970522de Store parse result; add result_type grammar keyword 2022-10-12 20:56:14 -04:00
02be6de48e Add lexer modes and $mode() code expansion 2022-10-09 22:49:01 -04:00
b2d11321fe Add grammar syntax to specify lexer mode for tokens and patterns 2022-10-04 22:23:39 -04:00
6bd9d4a09b Remove _TOKEN_NONE and use _TOKEN_COUNT instead 2022-10-03 21:40:34 -04:00
ced5a56424 Support global user code blocks 2022-10-03 21:24:13 -04:00
66d654b6b9 Add $token() user code block expansion 2022-10-02 10:43:47 -04:00
43fb74fe4b Capture and verify stdout results from tests 2022-10-02 10:31:07 -04:00
01ef4fc27c Rename some test files 2022-10-02 10:10:04 -04:00
f46b5b3f4d Remove TOKEN_EOF; define EOF token and start rule in Generator 2022-10-02 10:07:44 -04:00
150be33826 Assign pattern code IDs in Generator instead of Grammar 2022-10-01 10:36:35 -04:00