Aharon Robbins
2024-07-26 07:31:53 UTC
Hi All.
I've been working with Mike Haertel (the original author of GNU grep)
for a number of months now. He is writing a new regexp matcher for
use in gawk (and other places, as people desire).
[ clipped ]
My system complains about -std=c++20 so I cannot test it. (I thinkI've been working with Mike Haertel (the original author of GNU grep)
for a number of months now. He is writing a new regexp matcher for
use in gawk (and other places, as people desire).
[ clipped ]
I'll wait for a native C release.)
on a Linux system.
Questions, comments, and *bug reports* are welcome.
Well, I skimmed through the txt file on Mike's git page to learnabout the algorithm; especially the algorithm and its complexity
is of interest to me. The document was not quite clear about that
(or at least made me doubt) beyond the general and typical O(N*M)
characteristics.
beyond me. Or, just open an issue on the GitHub and ask questions there.
I forgot to mention what is likely the most important point about
the new matcher, which is that it is fully POSIX-compliant. The
existing GNU matchers are not, and likely never will be. There's at
least one bug I reported a few years back in the GNU matchers
that MinRX doesn't have, also.
This matcher also has advantages for me as the maintainer.
Algorithm simplicity is nice but as I understand there's not yet
performance comparisons done?
They will be done. By the time MinRX is in gawk for real, it willperformance comparisons done?
be performant, and in C.
Unless it was a deliberate offer to use GNU Awk as a test bed.
And "nearly-feature-complete implementation" (section Features)
is not quite a fruitful marketing concept.
As far as I'm concerned, it is feature complete. However, itAnd "nearly-feature-complete implementation" (section Features)
is not quite a fruitful marketing concept.
doesn't support POSIX BREs.
I also wonder why BSD and GNU extensions are supported but not
the very useful abbreviations for {some,all} Perl RE shortcuts.
Because they're just window dressing. I have no desire to bethe very useful abbreviations for {some,all} Perl RE shortcuts.
perl compatible. My needs are to be able to do what gawk currently
does, no more.
HTH.
--
Aharon (Arnold) Robbins arnold AT skeeve DOT com
Aharon (Arnold) Robbins arnold AT skeeve DOT com