how do you find the end of the clause with regex? I can only think of ways to do it with a natural language parser and my brain literally won't tell me the dumb way to do it...
You look up whatever regex library you're using's eagerness/laziness settings and pay very careful attention, because no two regex systems are the same, and anyone who claims they are is a smug script kiddy.
Now that has the be the most cursed shit I've heard in a while comrade.
Why not make it compile into befunge while we're at it?
No but more seriously compiling code into regex is not always possible and when it is possible it it tends to be an absolute nightmare. I think a basic parser would be more efficient to implement, even if it means coding it in Rust or C instead of python.
Oh god. I thought we were just going for the next word (chain of unicode letter characters, I know that's not actually correct all the time either). If you want to identify noun phrases... yeah don't use a regex. But also just don't get into a situation where you need to do that in the first place.
The person who got me into programming basically told me "don't get yourself into situations where you need regex, just find something else to do." While not really possible in a lot of cases, it does stand as good, practical advice.
deleted by creator
how do you find the end of the clause with regex? I can only think of ways to do it with a natural language parser and my brain literally won't tell me the dumb way to do it...
deleted by creator
You look up whatever regex library you're using's eagerness/laziness settings and pay very careful attention, because no two regex systems are the same, and anyone who claims they are is a smug script kiddy.
yea but <noun phrase> is a hard pattern to match for, whatever your laziness settings
deleted by creator
Screams in compute costs
deleted by creator
Now that has the be the most cursed shit I've heard in a while comrade.
Why not make it compile into befunge while we're at it?
No but more seriously compiling code into regex is not always possible and when it is possible it it tends to be an absolute nightmare. I think a basic parser would be more efficient to implement, even if it means coding it in Rust or C instead of python.
deleted by creator
Oh god. I thought we were just going for the next word (chain of unicode letter characters, I know that's not actually correct all the time either). If you want to identify noun phrases... yeah don't use a regex. But also just don't get into a situation where you need to do that in the first place.
The person who got me into programming basically told me "don't get yourself into situations where you need regex, just find something else to do." While not really possible in a lot of cases, it does stand as good, practical advice.
"now you have two problems"