英語閱讀 學(xué)英語,練聽力,上聽力課堂! 注冊 登錄
> 輕松閱讀 > 科學(xué)前沿 >  內(nèi)容

科技巨頭打響語音助手大戰(zhàn) Siri將學(xué)說上海話

所屬教程:科學(xué)前沿

瀏覽:

2017年03月15日

手機(jī)版
掃描二維碼方便學(xué)習(xí)和分享
據(jù)路透社報道,隨著谷歌助理上周開始大范圍登陸安卓設(shè)備,語音助手行業(yè)的競爭進(jìn)入白熱化。現(xiàn)在,蘋果、亞馬遜、微軟以及谷歌都在提供能夠接受用戶指令的虛擬助手。專家表示,要想打贏語音助手大戰(zhàn),必須跨越語言障礙的鴻溝,而蘋果Siri作為業(yè)內(nèi)老前輩,已經(jīng)遙遙領(lǐng)先其他幾家語音助手,它不僅會說36個國家的21種語言,如今還要學(xué)習(xí)上海方言。
科技巨頭打響語音助手大戰(zhàn) Siri將學(xué)說上海話

With the broad release of Google Assistant last week, the voice-assistant wars are in full swing, with Apple Inc, Amazon.com Inc, Microsoft Corp and now Alphabet Inc's Google all offering electronic assistants to take your commands.

隨著谷歌公司上周全面推出谷歌助手服務(wù),蘋果、亞馬遜、微軟以及谷歌都推出了接受人類指令的語音助手服務(wù),一場語音助手戰(zhàn)已全面打響。

Siri會21種語言 現(xiàn)在將學(xué)說上海話

Siri is the oldest of the bunch, and researchers including Oren Etzioni, chief executive officer of the Allen Institute for Artificial Intelligence in Seattle, said Apple has squandered its lead when it comes to understanding speech and answering questions.

Siri是這些服務(wù)中的老前輩。許多研究人員,包括位于西雅圖的艾倫人工智能研究所的首席執(zhí)行官奧倫•埃齊奧尼在內(nèi),都認(rèn)為在語音識別和回答問題方面,蘋果的領(lǐng)先優(yōu)勢已消耗殆盡。

But there is at least one thing Siri can do that the other assistants cannot: speak 21 languages localized for 36 countries, a very important capability in a smartphone market where most sales are outside the United States.

不過,至少有一件事是Siri可以但其競爭對手做不到的,即會說36個國家的21種語言。這個功能在智能手機(jī)市場至關(guān)重要,因?yàn)榇蟛糠种悄苁謾C(jī)都是銷往美國以外的地區(qū)的。

Microsoft Cortana, by contrast, has eight languages tailored for 13 countries. Google's Assistant, which began in its Pixel phone but has moved to other Android devices, speaks four languages. Amazon's Alexa features only English and German. Siri will even soon start to learn Shanghainese, a special dialect of Wu Chinese spoken only around Shanghai.

微軟“小娜”為13個國家定制了8種語言。谷歌語音助手會說4種語言,該服務(wù)一開始只在谷歌自家的Pixel手機(jī)里有,后來又應(yīng)用到其他安卓設(shè)備。亞馬遜的Alexa只會說英語和德語。而Siri甚至馬上要開始學(xué)說上海話了,這是一種只在上海及其周邊地區(qū)使用的吳語方言。

科技巨頭打響語音助手大戰(zhàn) Siri將學(xué)說上海話

The language issue shows the type of hurdle that digital assistants still need to clear if they are to become ubiquitous tools for operating smartphones and other devices.

這說明,智能助手要想成為無所不在的智能手機(jī)和其他設(shè)備操控工具,就必須跨越語言的障礙。

At Microsoft, an editorial team of 29 people works to customise Cortana for local markets. Google and Amazon said they plan to bring more languages to their assistants but declined to comment further.

微軟公司一個29人的編輯團(tuán)隊(duì)正著手推出面向地方市場的定制“小娜”服務(wù)。谷歌和亞馬遜都表示計(jì)劃給它們的語音助手添加更多種語言,但兩家公司均拒絕透露更多消息。

你知道蘋果是怎么教Siri學(xué)習(xí)方言嗎?

At Apple, the company starts working on a new language by bringing in humans to read passages in a range of accents and dialects, which are then transcribed by hand so the computer has an exact representation of the spoken text to learn from, said Alex Acero, head of the speech team at Apple. Apple also captures a range of sounds in a variety of voices. From there, an acoustic model is built that tries to predict words sequences.

蘋果語音團(tuán)隊(duì)負(fù)責(zé)人亞歷克斯•阿賽羅說,當(dāng)公司研發(fā)人員開始教Siri一門新的語言時,會讓真人用各種方言和口音來閱讀文本段落,然后再手動轉(zhuǎn)錄,這樣計(jì)算機(jī)就可以擁有準(zhǔn)確學(xué)習(xí)樣本。蘋果還會從不同的聲音中捕捉各種語音。然后,蘋果會建立起一個聲學(xué)模型,并通過這個模型開始嘗試預(yù)測字符序列。

科技巨頭打響語音助手大戰(zhàn) Siri將學(xué)說上海話

Then Apple deploys "dictation mode," its text-to-speech translator, in the new language, Acero said. When customers use dictation mode, Apple captures a small percentage of the audio recordings and makes them anonymous. The recordings, complete with background noise and mumbled words, are transcribed by humans, a process that helps cut the speech recognition error rate in half.

蘋果隨后會在新語言中部署“聽寫模式”,這是一種文本和語音之間的翻譯器。當(dāng)用戶使用聽寫模式時,蘋果會捕獲音頻錄音中的一小部分,然后對其匿名處理。由于這些錄音有背景噪音以及模糊不清的詞語,將會由專人轉(zhuǎn)錄,這一過程可以將語音識別的錯誤率降低一半。

After enough data has been gathered and a voice actor has been recorded to play Siri in a new language, Siri is released with answers to what Apple estimates will be the most common questions, Acero said. Once released, Siri learns more about what real-world users ask and is updated every two weeks with more tweaks.

阿賽羅說,在收集了足夠的數(shù)據(jù),并讓配音員為Siri錄制了新語言之后,Siri就會向公眾開放,剛發(fā)布時Siri能夠回答蘋果認(rèn)為最常見的問題。發(fā)布之后,Siri將會從用戶的實(shí)際問題中學(xué)習(xí),并且會每兩周更新一次,作出一些調(diào)整。

But script-writing does not scale, said Charles Jolley, creator of an intelligent assistant named Ozlo. “You can't hire enough writers to come up with the system you'd need in every language. You have to synthesize the answers,” he said. That is years off, he said.

不過,智能助理Ozlo的創(chuàng)始人查爾斯•喬利表示,撰寫腳本無法做到規(guī)?;?,“你不可能聘請足夠多的作者為每一種語言撰寫需要的腳本,你必須人工合成答案。”他認(rèn)為,這還需要多年的時間。

The founders of Viv, a startup founded by Siri's original creators that Samsung acquired last year, is working on just that.

三星去年收購了“Siri之父”創(chuàng)建的人工智能公司Viv,這家初創(chuàng)公司的創(chuàng)始人正在著手解決這個問題。

"Viv was built to specifically address the scaling issue for intelligent assistants," said Dag Kittlaus, the CEO and co-founder of Viv. "The only way to leapfrog today's limited functionality versions is to open the system up and let the world teach them."

Viv的聯(lián)合創(chuàng)始人兼CEO戴格•吉特拉斯說:“Viv旨在專攻智能助理的規(guī)?;瘑栴},要想跨過目前版本的功能局限,唯一的方法就是開放系統(tǒng),讓世界來教它們。”
 


用戶搜索

瘋狂英語 英語語法 新概念英語 走遍美國 四級聽力 英語音標(biāo) 英語入門 發(fā)音 美語 四級 新東方 七年級 賴世雄 zero是什么意思廣州市江燕花園英語學(xué)習(xí)交流群

  • 頻道推薦
  • |
  • 全站推薦
  • 推薦下載
  • 網(wǎng)站推薦