A friend had the brilliant idea of asking one of those image AIs to draw a diagram of the anatomy of a human hand, since that seems to be the thing they have the most trouble drawing. The AI then produced these diagrams, which are frankly impressive in some ways, while 100% confirming that they cannot draw human hands. Never mind getting the number of fingers right; look at these skeletal structures and tell me there aren't deeper issues at work.

  • laziestflagellant [they/them]
    ·
    2 years ago

    The AI can recognize what fingers look like. It can recreate its saved mathematical vectors of finger images. It knows that images of hands usually have multiple fingers connected to them.

    The AI does not know how to count.

    • Awoo [she/her]
      ·
      2 years ago

      The AI does not know how to count.

      It also does not have a sense of the image as a 3D object like we do. It cannot and does not understand that a limb continues in a set direction when it passes behind an object, coming out the other side as you would expect. This causes a lot of the problems with hands: if a gesture makes fingers pass behind other fingers, it has no idea which finger is which or where the fingers should pop back into view. This leads to finger monstrosities.

      These AIs need to be able to process an image as an idea. They need to understand what a hand is and what makes up a hand, and they need to understand and recognise what the human is in an image and what parts make up a human.

        • Awoo [she/her]
          ·
          2 years ago

          Bit idea: Medical AI that insists the problem causing your stomach ache is that you do not have enough fingers.

    • Utter_Karate [he/him, comrade/them]
      ·
      2 years ago

      Same problem you get by asking OpenGPT "If it takes 5 machines 5 minutes to make 5 boxes, how many minutes does it take for 100 machines to make 100 boxes?". That one is so focused on understanding language that it manages to be bad at math: it understands the question well enough to answer with a really wrong intuitive guess. This one is so focused on art styles that it can make an incredibly accurate imitation of a 19th-century medical diagram while failing kindergarten tasks like counting to five or singing that song about which bone is connected to which bone.
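
For anyone who wants to check the boxes puzzle rather than trust their gut, here is a rough sketch of the arithmetic. It assumes every machine works independently at the same constant rate, which is the usual reading of the question; the function name `minutes_needed` is made up for illustration.

```python
from fractions import Fraction

def minutes_needed(machines: int, boxes: int) -> Fraction:
    """Minutes for `machines` identical machines to make `boxes` boxes.

    Assumption: the rate comes from the puzzle statement, where
    5 machines make 5 boxes in 5 minutes.
    """
    # boxes per machine per minute: 5 boxes / (5 machines * 5 minutes) = 1/5
    rate_per_machine = Fraction(5, 5 * 5)
    # total output per minute scales with the number of machines
    return Fraction(boxes) / (machines * rate_per_machine)

print(minutes_needed(5, 5))      # 5
print(minutes_needed(100, 100))  # 5
```

Each machine takes 5 minutes per box no matter how many machines run next to it, so 100 machines still need 5 minutes for 100 boxes, not the tempting 100.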