Have been trying to set it up for hours now. Nothing works.

  • Latest version does not seem to have winutils support, and using it causes errors when using some important methods. (EDIT: this is likely wrong, and the winutils stuff that I have should probably be fine.)
  • Older versions require to be built with Maven. However, that just gives me a PluginExecutionException.

I need to do this ASAP, preferably within the next 3 hours.

I have nowhere else to ask for help, it seems, especially considering that reddit-logo suspended an account I set up specifically for asking questions after I edited a relevant post.

Highly doubt that anybody will be able to help me.

EDIT2: the issue has, thankfully, been resolved. I was using Python 3.12, and switched to 3.11.8. That made the problem go away.

  • Tomorrow_Farewell [any, they/them]
    hexagon
    ·
    2 months ago

    I do not have a programming setup on my Linux OS yet, although I do consider on trying to do this inside a VM. That will be a bit painful, though, as Virtual Box doesn't seem to allow for much graphics memory, meaning that the framerate will be low.

    If things go dire and I will fail to find any sort of way out, I will just have to ask other people to run my code instead, ugh.

    • bunnygirl [she/her]
      ·
      2 months ago

      You could try WSL, it's basically just a headless Linux VM so it's ideal for stuff like this. The terminal itself is just running on windows so no issues with framerate or anything https://learn.microsoft.com/en-us/windows/wsl/install

      • Tomorrow_Farewell [any, they/them]
        hexagon
        ·
        2 months ago

        I'm not really sure how to actually run and debug relevant python files using WSL, at least on account of me not having used WSL much (I have only installed one distribution for it and set up the first user). Any help in this regard?

        • bunnygirl [she/her]
          ·
          2 months ago

          Sorry for the late response,

          I can't help for PySpark specifically cause I have no experience with it. In general tho you'll have to install the tooling you need to compile/run the program in WSL, I haven't used Spark in years so I don't know specifics but you'll want to have at least Java and Python installed here. On Ubuntu, you'll want the packages default-jdk, python3, python3-pip, python3-venv (if you're using venv), as well as python-is-python3 for convenience. If you're using venv, you might want to rerun python -m venv env again to make sure it has the files Bash needs, then do source env/bin/activate to activate the venv. You might also have to install pyspark from the Bash shell in case it needs to build anything platform specific. You can set environment variables in ~/.bashrc (It's the home dir in the Linux VM, not Windows so use the terminal to change this e.g. nano ~/.bashrc or vim ~/.bashrc if you're familiar with vi) with the shape export VARIABLE=VALUE (put quotes around VALUE if it has spaces etc), then start a new shell to load those (do exec bash to replace the currently running shell with a new process)

          From there you should be able to just run the code normally but in WSL instead

          • Tomorrow_Farewell [any, they/them]
            hexagon
            ·
            2 months ago

            Thankfully, the problem has been resolved, so, after I finish with this project, I will have more time to get an actual programming set up on a non-VM NixOS that I have already installed.