|
11348ca351
|
Add FindLongestMatchResult
|
2023-06-04 21:14:07 -04:00 |
|
|
6b0fb4cb12
|
Add return type for transition()
|
2023-06-03 20:38:42 -04:00 |
|
|
b78827746a
|
Return a Token object from Lexer.user_code()
|
2023-03-17 20:24:48 -04:00 |
|
|
ce80e00e60
|
Remove TOKEN_DECODE_ERROR
|
2023-03-17 20:01:55 -04:00 |
|
|
01b45df1f9
|
Remove TOKEN_DROP
|
2023-03-17 19:58:54 -04:00 |
|
|
ffccc659aa
|
Clean up Lexer.State a bit
|
2023-03-14 19:38:42 -04:00 |
|
|
e432f62b05
|
Add UserCodeID struct to represent user code IDs
|
2023-03-13 21:56:18 -04:00 |
|
|
058945e08b
|
Add Token struct with validity checks
|
2023-03-13 16:14:50 -04:00 |
|
|
f402315201
|
Name internally generated tokens with double-underscore
|
2023-03-13 14:13:13 -04:00 |
|
|
64974cc1e2
|
Refactor some of Lexer.attempt_lex_token() into find_longest_match()
|
2023-03-12 21:19:03 -04:00 |
|
|
b92679e0c2
|
Replace LexedToken with Result struct
|
2023-03-11 21:16:55 -05:00 |
|
|
1af018b103
|
MatchInfo: store pointer to the accepting state rather than copying its fields
|
2023-03-09 20:16:57 -05:00 |
|
|
aabb574fea
|
Use CodePoint type for code point values
|
2023-03-09 19:10:17 -05:00 |
|
|
30004c571d
|
Use result type query methods on Decoder.Result
|
2023-03-09 19:04:15 -05:00 |
|
|
b00f53050b
|
Clean up Decoder result handling
|
2023-03-06 20:32:01 -05:00 |
|
|
2e5083d823
|
Add explicit base types to the enums
|
2023-03-06 19:58:24 -05:00 |
|
|
c6ea4f83c2
|
Store parser values according to the rule/pattern type
|
2022-11-13 22:20:30 -05:00 |
|
|
8dc27686aa
|
Rename result_type to ptype, parser result to pvalue
|
2022-11-12 13:29:10 -05:00 |
|
|
bca0a14371
|
Allow storing a result value for a token from a lexer code block
|
2022-10-16 21:40:25 -04:00 |
|
|
ca8a360c0e
|
Provide matched text to lexer user code block
|
2022-10-15 13:37:34 -04:00 |
|
|
623c644e74
|
Work on real D strings instead of ubyte pointer and length
Also fix a couple UTF-8 decoder bugs!
|
2022-10-15 13:32:33 -04:00 |
|
|
727c8cd1ea
|
Execute rule user code blocks when reducing the rule
|
2022-10-13 04:56:50 -04:00 |
|
|
74beaf7ed8
|
Store final parse result and give access with .result parser property
|
2022-10-12 21:33:09 -04:00 |
|
|
31970522de
|
Store parse result; add result_type grammar keyword
|
2022-10-12 20:56:14 -04:00 |
|
|
02be6de48e
|
Add lexer modes and $mode() code expansion
|
2022-10-09 22:49:01 -04:00 |
|
|
6bd9d4a09b
|
Remove _TOKEN_NONE and use _TOKEN_COUNT instead
|
2022-10-03 21:40:34 -04:00 |
|
|
ced5a56424
|
Support global user code blocks
|
2022-10-03 21:24:13 -04:00 |
|
|
66d654b6b9
|
Add $token() user code block expansion
|
2022-10-02 10:43:47 -04:00 |
|
|
f46b5b3f4d
|
Remove TOKEN_EOF; define EOF token and start rule in Generator
|
2022-10-02 10:07:44 -04:00 |
|
|
e7e30c4f28
|
Add pattern statement
|
2022-09-30 21:05:18 -04:00 |
|
|
a4c5546876
|
Disable parser debug output
|
2022-09-27 12:33:23 -04:00 |
|
|
48704295bb
|
Rename lexer state "accepts" to "token"
|
2022-09-25 14:57:46 -04:00 |
|
|
4d716f6c10
|
Remove Token#c_name; use given token case in token constants
|
2022-09-25 14:48:49 -04:00 |
|
|
01c9340819
|
Rename lexer variables to reflect trying to get longest match
|
2022-09-25 14:44:44 -04:00 |
|
|
672098ad32
|
Execute user code blocks assigned to tokens
|
2022-09-24 17:31:40 -04:00 |
|
|
2fbe13e071
|
Do not consume lookahead token when reducing
|
2022-06-25 21:35:54 -04:00 |
|
|
f2cc5b112e
|
Handle shifting states after reducing
|
2022-06-25 16:16:20 -04:00 |
|
|
84c4a16ce6
|
Start on Parser.parse()
|
2022-06-21 23:03:00 -04:00 |
|
|
f17efe8c82
|
Add RuleSet#id to use when reducing
Parser will know what state to go to after reducing a Rule based on the
RuleSet ID.
Start on Parser class.
|
2022-06-21 20:07:27 -04:00 |
|
|
ca6a93a4c5
|
Move generation logic to new Generator class
|
2022-06-05 14:57:58 -04:00 |
|
|
f37801ec9e
|
Store tokens and drop tokens separately
|
2022-06-05 14:36:19 -04:00 |
|
|
6f1ce32775
|
Move Lexer::DFA#build_tables to Lexer
|
2022-06-05 14:18:35 -04:00 |
|
|
7598c589fe
|
Detect other invalid UTF-8 encodings
|
2022-05-31 22:26:09 -04:00 |
|
|
f3ed678fe1
|
Store tokens in Hash by name
|
2021-09-27 21:40:12 -04:00 |
|
|
00016f16b3
|
Combine Grammar and Generator into top-level Imbecile class
|
2021-08-22 21:04:46 -04:00 |
|
|
f295acb593
|
Generator builds a Lexer, not a Lexer::DFA
|
2021-08-19 13:11:12 -04:00 |
|
|
3158e51059
|
Add length field to LexedToken
|
2021-07-06 11:59:35 -04:00 |
|
|
d9e4f64d2e
|
Fix returning TOKEN_EOF when lexing at EOF
|
2021-07-06 11:55:44 -04:00 |
|
|
ec2dcf9a72
|
Fix not progressing through input while lexing a token
|
2021-07-06 11:47:33 -04:00 |
|
|
230c324209
|
Fix iterating through all transitions in a state
|
2021-07-06 11:09:13 -04:00 |
|