• @taladar@sh.itjust.works
    link
    fedilink
    English
    011 months ago

    Most of these languages dont even have enough professional voice actors to cover the bandwidth.

    And you think anyone is training AI voice models for those languages? Have you even seen how long it takes even large companies like Google to support the languages with hundreds of millions of speakers?

    • JohnEdwa
      link
      fedilink
      English
      0
      edit-2
      11 months ago

      That’s the benefit of using AI and machine learning - once you have enough source material, you can throw it all in and it’ll eventually spit out a model.
      Which is exactly what Meta did with their Massively Multilingual Speech project which supports text-to-speech and speech-to-text for 1107 different languages.

      Is it actually any good in 99% of them, I don’t have a clue, but it exists.

    • Dr. Moose
      link
      fedilink
      English
      011 months ago

      It becomes easier and cheaper every day. Today’s open source LLMs are better than last year’s best model.

      • @Jhex@lemmy.world
        link
        fedilink
        English
        011 months ago

        Is it? I just tried again yesterday for a simple script since coding is the one thing apparently AI will replace people like me and it could not put together a working JavaScript script.

        I have yet to see tangible results not announced by the people with sunken cost exploding their balls.

        • Dr. Moose
          link
          fedilink
          English
          0
          edit-2
          11 months ago

          Sounds like a skill issue my dude. While you struggle to get a js script people are putting out entire programs with AI assistants so sure - you’re right and they’re wrong

            • Dr. Moose
              link
              fedilink
              English
              011 months ago

              Yes, to effectively use AI you actually have to understand the medium you’re in to describe the problem you’re trying to solve. You can get there with prompting but it’ll take you much longer if you just don’t understand code yourself.

              Thats why most senior software devs are not afraid of LLMs cause they need strong oversight and thats exactly what years of software dev experience trained you to do.

      • @ExperiencedWinter@lemmy.world
        link
        fedilink
        English
        011 months ago

        You’re fundamentally misunderstanding the comment you replied to, they are not saying that voice AI are bad, they are saying there is not enough training data to improve the AI for these languages. How will it improve without good training data?

        • Dr. Moose
          link
          fedilink
          English
          011 months ago

          Thats not how AI training works and even then there’s absolutely enough data. Also training data can be created and even synthesized. There are many techniques to extract make training value from datasets that we discover every year - It’s really not a problem you think it is.

          I’m genuinely confused how AI illiterate users here are. It’s just blind leading the blind.