Premium Only Content
Compiler From Scratch: Phase 1 - Tokenizer Generator 017: Fixing encoding issues, more build testing
Streamed on 2024-11-08 (https://www.twitch.tv/thediscouragerofhesitancy)
Zero Dependencies Programming!
Picking up where we left off last week, the build permutation testing script is mostly working and has revealed that the different encodings supported aren't equally well supported. So I started out by turning the UTF8CharRef into just CharRef, where each CharRef has a flag for its own encoding, and a statically set default encoding. The VVProject sets the CharRef default encoding once it parses that encoding and then the rest of the parsing works the same as before. Fixing a couple of other things here and there got all the encodings to build.
I added some code for testing. Each encoding has a pre-set string to tokenize. Then the Tokenizer generated by our test script is compiled and run. Regardless of build option, the expected outputs are the same. It verifies the number of tokens, the number of lines and the length (in bytes) of the longest token.
With an actual test actually running it was time to run all the permutations. This testing revealed at the end of the stream that there is a bug when hitting the end of the text buffer while using LAZY processing. We tried a couple of things, but didn't have time to debug it. That is where we'll pick up next week.
-
LIVE
LFA TV
21 hours agoLIVE & BREAKING NEWS! | THURSDAY 11/20/25
1,024 watching -
LIVE
freecastle
8 hours agoTAKE UP YOUR CROSS- Let your eyes LOOK directly FORWARD, and your gaze be STRAIGHT before YOU!
30 watching -
1:25:51
DeVory Darkins
6 hours agoDemocrats caught in corruption scheme as JD Vance issues MAJOR UPDATE
155K76 -
1:48:48
MattMorseTV
6 hours ago $41.25 earned🔴Sedition Charges INBOUND.🔴WH Press Conference.🔴
69K174 -
9:06
Jamesons Travels
22 hours ago $5.39 earnedMilitary Veterans in Congress Tell Troops to Refuse Trump's Orders
39.1K62 -
1:42:47
The Bold Lib
4 hours agoBOLDCHAT: Unemployment | AI | Birth Rates w/ANGELA BELCAMINO
27K2 -
20:34
ArynneWexler
8 hours agoNew Poll: Women Are Done With America | NN8
29.1K47 -
59:30
The White House
6 hours agoPress Secretary Karoline Leavitt Briefs Members of the Media, Nov. 20, 2025
67.1K30 -
2:06:37
Steven Crowder
9 hours agoJasmine Crockett's Epstein Idiocy & the Absolute State of the Democrat Party
636K433 -
33:37
The Boomer Effect
19 hours agoBeyond Convenience: The Tyranny Behind Digital IDs
19.6K2