• eduardog3000 [he/him]
      ·
      4 years ago

      wdym, this is so easy to read:

      https?:\/\/(www\.)?[-a-zA-Z0-9@:%._\+~#=]{1,256}\.[a-zA-Z0-9()]{1,6}\b([-a-zA-Z0-9()@:%_\+.~#?&//=]*)
      
      • alcoholicorn [comrade/them, doe/deer]
        ·
        edit-2
        4 years ago

        You joke, but that's very straightforward to me.

        https, or nothing

        ://

        www., or nothing

        1-256 of these chars: [-a-zA-Z0-9@:%._+~#=]

        Until you get to a "." followed by:

        1-6 of these chars:[a-zA-Z0-9()]

        Any line break

        0+ of these chars: [-a-zA-Z0-9()@:%_+.~#?&//=]

        • eduardog3000 [he/him]
          ·
          4 years ago

          I mean, yeah it's really not too bad when you really dig into it. But it's still just a mess.

          Although your "https, or nothing" is actually "http, optionally s"

          That one isn't even completely right. It gets a lot of false positives AFAIK (example: https://..com). The real thing is even worse.