While racial bias in training data is a massive issue, it's really a symptom of a much larger problem with neural networks: we have no idea why they do what they do. There is no reliable way to be certain why the model produced a given output. Even if you trace the data from one side of the model to the other, no individual parameter necessarily means anything in particular. It is, after all, just a statistical model, prone to the same issues as simple linear regression, except it's no longer as simple as saying correlation doesn't imply causation. Often there are as many features as training examples, if not more, and far more parameters than either. That makes overfitting a massive issue, and the model can't be trusted to make inferences on data that falls in any way outside the domain it was trained on. So the question becomes: what are neural networks actually learning? Are they learning at all, or are we just handing them enough randomly initialized parameters that a 'golden ticket' path through the network happens to exist? All we can really say is that this network, with these weights, hits a (local!) minimum on some loss function.
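To make the overfitting point concrete, here's a minimal sketch using scikit-learn's MLPRegressor. The layer sizes and toy dataset are arbitrary choices of mine, picked only so the parameter count dwarfs the 20 training examples:

```python
import numpy as np
from sklearn.neural_network import MLPRegressor

rng = np.random.default_rng(0)

# 20 noisy training points drawn from a simple underlying function
X_train = rng.uniform(-3, 3, size=(20, 1))
y_train = np.sin(X_train).ravel() + rng.normal(0, 0.1, size=20)

# Held-out test points from the same function
X_test = rng.uniform(-3, 3, size=(200, 1))
y_test = np.sin(X_test).ravel()

# Two hidden layers of 100 units: far more parameters than the
# 20 training examples, with the L2 penalty switched off
net = MLPRegressor(hidden_layer_sizes=(100, 100), alpha=0.0,
                   max_iter=20000, tol=1e-7, random_state=0)
net.fit(X_train, y_train)

print("train R^2:", net.score(X_train, y_train))  # typically near 1.0
print("test  R^2:", net.score(X_test, y_test))    # typically much worse
```

The gap between the two scores is the whole story: the network memorized the noise in the training set rather than the function underneath it.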
Deepfakes and the like seem like crazy technology, but ultimately they're a parlor trick: the network is literally trained to do this one thing, for one face and one video clip. NNs demonstrate nothing of substance here, and they should NOT be used in any manner where they are depended on for the safety of a person, let alone the public as a whole, until these issues are solved.
Yeah, machine learning is at the level of being good for sorting data.
If you're using bad feature engineering or you aren't using cross validation, then yeah, you're probably going to overfit, but that's not really a problem with neural networks themselves. They're not an all-purpose tool that can solve all of humanity's problems, but they definitely have their uses.
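For what it's worth, cross validation is cheap to add. A minimal sketch with scikit-learn, where the dataset and model are arbitrary stand-ins:

```python
from sklearn.datasets import load_diabetes
from sklearn.model_selection import cross_val_score
from sklearn.neural_network import MLPRegressor
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_diabetes(return_X_y=True)

# Scale inputs, since neural nets are sensitive to feature ranges
model = make_pipeline(StandardScaler(),
                      MLPRegressor(hidden_layer_sizes=(50,),
                                   max_iter=5000, random_state=0))

# 5-fold cross validation: each fold is held out exactly once, so
# every score comes from data the model never saw during that fit
scores = cross_val_score(model, X, y, cv=5)
print("R^2 per fold:", scores)
print("mean / std:", scores.mean(), scores.std())
```

If the per-fold scores are all over the place, that's your warning sign that the model isn't generalizing, long before it ships anywhere.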
The problem is that humans are too stupid to understand this and will try to use them for everything.
Yep, investors think it's going to solve all of humanity's problems.
I know there are ways to mitigate overfitting, but I figured explaining them wasn't important for a comment aimed at the layman (I also have my doubts about the real-world effectiveness of some of these techniques, but that's just based on my own experience, so I didn't mention it). Things have gotten better since I first started playing with NNs, but overfitting is still an issue, especially since good, large datasets are hard to come by. The lack of generalization, whether through overfitting or as a natural consequence of how NNs work, is a very large issue if they were to be used in a potentially life-threatening scenario. But again, without the ability to know why a neural network arrived at a given answer, they cannot be trusted with decision making.
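For anyone curious, the standard mitigations look something like this in scikit-learn; the specific values here are placeholders of mine, not tuned recommendations:

```python
import numpy as np
from sklearn.neural_network import MLPRegressor

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(200, 1))
y = np.sin(X).ravel() + rng.normal(0, 0.1, size=200)

# The two most common brakes on overfitting: an L2 penalty on the
# weights (alpha), and early stopping, which holds out a slice of
# the training data and halts once the score there stops improving
net = MLPRegressor(
    hidden_layer_sizes=(100, 100),
    alpha=1e-3,               # L2 regularization strength
    early_stopping=True,      # monitor a held-out validation slice
    validation_fraction=0.1,  # fraction of training data held out
    n_iter_no_change=20,      # patience before stopping
    max_iter=5000,
    random_state=0,
)
net.fit(X, y)
```

Whether those knobs actually buy you generalization on a hard real-world dataset is, as above, another question entirely.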