• mormund@feddit.org
    link
    fedilink
    arrow-up
    42
    arrow-down
    2
    ·
    2 days ago

    Yeah, transcription is one of the only good uses for LLMs imo. Of course they can still produce nonsense, but bad subtitles are better none at all.

    • kautau@lemmy.world
      link
      fedilink
      arrow-up
      1
      ·
      edit-2
      3 hours ago

      Just an important note, speech to text models aren’t LLMs, which are literally “conversational” or “text generation from other text” models. Things like https://github.com/openai/whisper are their own, separate types of models, specifically for transcription.

      That being said, I totally agree, accessibility is an objectively good use for “AI”