• Veganhydride [he/him]
    ·
    4 years ago

    Don't get me wrong, it's an arbitrary calculation, but my numbers were 17 billion parameters (Microsoft DeepSpeed) / 86 billion neurons.
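
    For anyone who wants to redo the division, here is a minimal sketch using only the two figures quoted above. The variable names are made up for illustration, and reading "86 billion neurons" as the usual human-brain estimate is an assumption; the comparison is as arbitrary as stated.

```python
# Figures quoted in the comment; the variable names are illustrative only.
parameters = 17e9  # parameters attributed to Microsoft DeepSpeed in the comment
neurons = 86e9     # neuron count quoted in the comment (assumed: human brain)

# Parameters per neuron, an arbitrary comparison as the comment itself notes.
ratio = parameters / neurons
print(f"{ratio:.3f}")  # 0.198, i.e. roughly one parameter per five neurons
```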