@jaytaph · 2 years ago
I ran into a bit of trouble with the document tree generation. This is where a good grasp of the borrow checker comes into play, and I'm not there ye…
CodeMusings is a community where developers share their current projects, plans, and thoughts —
a modern take on the classic Unix .plan file.
Write in Markdown, connect with fellow coders.
@jaytaph · 2 years ago
I ran into a bit of trouble with the document tree generation. This is where a good grasp of the borrow checker comes into play, and I'm not there ye…
@jaytaph · 2 years ago
I'm starting to get the hang of Rust. The borrow checker isn't yelling at me as much as it used to do, and I'm starting to see why things aren't work…
@jaytaph · 2 years ago
Now that the tokenizer is done, I seem to be able to tokenize most of the html files i've tested. That's good! So i'm spending my time on creating t…
@jaytaph · 2 years ago
Finally, the tokenizer passes all html5lib-tests and is merger into the main branch [https://github.com/jaytaph/gosub-browser/](https://github.com/ja…
@jaytaph · 2 years ago
It seems I hit (another) snag: more than 5000 of the 6000 token tests from html5lib-tests are passing, so that's a big win. There are 2 issues that…
@jaytaph · 2 years ago
Happy times with the tokenizer! I've setup a tester that will actually test the tokenizer again the html5 testsuite. More and more tests found are p…
@jaytaph · 2 years ago
Now that I'm filling more and more data states, i can actually parse more and more html5 code. To test all this, i've generated some testcases throug…
@jaytaph · 2 years ago
As character entities are "completed", i can finally tokenize whole strings like "this is © 2023" and have it parsed correctly with the copyrigh…
@jaytaph · 2 years ago
Finally done with the named character references. This took too long because I could not get the businesslogic right. Different code bases uses a dif…
@jaytaph · 2 years ago
I've spend some time fighting against Rust. It's not clear who won though. The biggest problem I'm currently facing is the fact that I have to discov…
@jaytaph · 2 years ago
A few days in, and lots of things learned. Rust is less explicit and relies more on syntax stuff than for instance go. Which means it can be hard to…
@jaytaph · 2 years ago
So, I've decided to start a project that I wanted to do for a long time: write a browser. There are a few TINY issues with this: * First, I don't kn…
@jaytaph · 3 years ago
I'm having a hard time trying to fit hierachical documents into elasticsearch. Normally, one would denormalize this and have a flat structure, but th…
@jaytaph · 3 years ago
I'm doing some hobby work on spacetraders.io. A (simple) space trading game where the only interface is an HTTP API. I've setup a PHP SDK, and tryi…
@jaytaph · 3 years ago
This weekend I've created a proof-of-concept of a system that crawls PDFs, extract the text with Tika, stored it into Elasticsearch and created a sim…