[HTML payload içeriği buraya]
29.9 C
Jakarta
Monday, May 18, 2026

Meta’s new AI mannequin can translate speech from greater than 100 languages


“Meta has achieved a terrific job having a breadth of various issues they help, like text-to-speech, speech-to-text, even automated speech recognition,” says Chetan Jaiswal, a professor of laptop science at Quinnipiac College, who was not concerned within the analysis. “The mere variety of languages they’re supporting is an amazing achievement.”

Human translators are nonetheless a significant a part of the interpretation course of, the researchers say within the paper, as a result of they will grapple with various cultural contexts and ensure the identical that means is conveyed from one language into one other. This step is essential, says Lynne Bowker, Canada Analysis Chair in Translation, Applied sciences and Society at Université Laval in Quebec, who didn’t work on Seamless. “Languages are a mirrored image of cultures, and cultures have their very own methods of realizing issues,” she says. 

On the subject of functions like drugs or regulation, machine translations have to be totally checked by a human, she says. If not, misunderstandings may end up. For instance, when Google Translate was used to translate public well being details about the covid-19 vaccine from the Virginia Division of Well being in January 2021, it translated “not necessary” in English into “not crucial” in Spanish, altering the entire that means of the message.

AI fashions have far more examples to coach on in some languages than others. This implies present speech-to-speech fashions might be able to translate a language like Greek into English, the place there could also be many examples, however can’t translate from Swahili to Greek. The staff behind Seamless aimed to resolve this drawback by pre-training the mannequin on tens of millions of hours of spoken audio in several languages. This pre-training allowed it to acknowledge common patterns in language, making it simpler to course of much less broadly spoken languages as a result of it already had some baseline for what spoken language is meant to sound like.  

The system is open-source, which the researchers hope will encourage others to construct upon its present capabilities. However some are skeptical of how helpful it might be in contrast with obtainable options. “Google’s translation mannequin is just not as open-source as Seamless, but it surely’s far more responsive and quick, and it doesn’t value something as an instructional,” says Jaiswal.

Essentially the most thrilling factor about Meta’s system is that it factors to the potential of on the spot interpretation throughout languages within the not-too-distant future—just like the Babel fish in Douglas Adams’ cult novel The Hitchhiker’s Information to the Galaxy. SeamlessM4T is quicker than present fashions however nonetheless not on the spot. That mentioned, Meta claims to have a more recent model of Seamless that’s as quick as human interpreters. 

“Whereas having this type of delayed translation is okay and helpful, I feel simultaneous translation shall be much more helpful,” says Kenny Zhu, director of the Arlington Computational Linguistics Lab on the College of Texas at Arlington, who is just not affiliated with the brand new analysis.

Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest Articles