Over the years I've noticed that much of Japanese video content is presented in the format of a dialogue between two anime girl pngs using text to speech voices as opposed to just some guy appearing on camera or narrating whatever the video is about.
Like if you wanted to watch a review of a new monitor, it would be one anime character explaining the pros and cons to another character
Is it a greater cultural preference for anonymity, or are people just more likely to ingest information if it's presented by Touhou characters? Note that I'm not talking about Vtubers here, this trend is much older and covers pretty much all possible video topics
such things exist, the vtuber Zentraya uses something like this.
Zen uses standard TTS as far as I know (technically, speech-to-text -> text-to-speech) which is why there's a significant delay.
IDK, it's fast enough I thought it was a speech to speech. The delay is like a second longer than usual streaming delay, just feels a bit longer because she doesn't make smaller noises between words like most streamers.
She previously stated that she uses speech-to-text-to-speech, and the voice she uses is a standard TTS voice you'll hear on other streams (not sure which voice exactly)