|Re: [asciidoc-lang-dev] Whitespace handling|
On 03/03/2021 02:14, Lex Trotman wrote:
> Interesting question since the spacing is context, not part of the
> markup itself, just like the character on the other side.
Correct. In my own experiments, to identify the context, I used the
lookahead/lookbehind features of the Parsing _expression_ grammar (PEG --
) I implemented. This adds some "context sensibility" on top of an
otherwise context-free grammar.
> Thinking about it, (as well as some defined code points) the non-spacing
> context character must be able to be any Unicode letter code point or it
> prevents the markup being used on some non-English languages, and so I
> don't see why the spacing context should not be any code point with the
> appropriate spacing Unicode property as well. If non-ASCII context on
> one side is valid, there is no reason it should not be valid on both sides.
The valid spacing/non-spacing character around constrained markups needs
clarifications to me. Especially if we consider non-Latin scripts. This
is something we should discuss in its own thread. Or shouldn't we?
Back to the top