mirror of
https://github.com/kjanat/articulate-parser.git
synced 2026-02-13 18:25:41 +01:00
Replaces the fragile regex-based HTML cleaning logic with a proper HTML parser using `golang.org/x/net/html`.

The previous implementation was unreliable and could not correctly handle malformed tags, script content, or a wide range of HTML entities.

This new approach provides several key improvements:

- Skips the content of `