语音技术能否颠覆沟通方式

知识 陈清泉的英语补习班 第26期 2017-02-06 创建 播放:112

介绍: Voice techonology Now we’re talking 
Voice technology is making computers less daunting and more accessible
daunting英 [dɔ:ntɪŋ] 美 [dɔ:ntɪŋ]adj.令人畏惧的;使人气馁的;令人怯步的v.威吓(daunt的现在分词);使气馁;使害怕;使失去信心

Any sufficiently advanced technology, noted Arthur C. Clarke, a British science-fi...

介绍: Voice techonology Now we’re talking 
Voice technology is making computers less daunting and more accessible
daunting英 [dɔ:ntɪŋ] 美 [dɔ:ntɪŋ]adj.令人畏惧的;使人气馁的;令人怯步的v.威吓(daunt的现在分词);使气馁;使害怕;使失去信心

Any sufficiently advanced technology, noted Arthur C. Clarke, a British science-fiction writer, is indistinguishable from magic. The fast-emerging technology of voice computing proves his point. Using it is just like casting a spell: say a few words into the air, and a nearby device can grant your wish.

sufficiently英 [sə'fɪʃntlɪ] 美 [səˈfɪʃəntlɪ]adv.足够地,充分地;十分,相当

The Amazon Echo, a voice-driven cylindrical computer that sits on a table top and answers to the name Alexa, can call up music tracks and radio stations, tell jokes, answer trivia questions and control smart appliances; even before Christmas it was already resident in about 4% of American households. Voice assistants are proliferating in smartphones, too: Apple’s Siri handles over 2bn commands a week, and 20% of Google searches on Android-powered handsets in America are input by voice. Dictating e-mails and text messages now works reliably enough to be useful. Why type when you can talk?

cylindrical英 [səˈlɪndrɪkl] 美 [səˈlɪndrɪkəl]adj.圆筒形;圆柱形的,圆筒状的,气缸(或滚筒)的
trivia英 [ˈtrɪviə] 美 [ˈtrɪviə]n.琐事,不足道的细枝末节;(中世纪大学的)三学科( trivium的名词复数);薄物细故
proliferate英 [prəˈlɪfəreɪt] 美 [prəˈlɪfəˌret]vi.扩散;激增;增生,增殖vt.使激增;使扩散
dictate英 ['dɪkˌteɪt] 美 ['dɪkˌteɪtɪŋ]v.口授;支配;大声讲或读;摆布

This is a huge shift. Simple though it may seem, voice has the power to transform computing, by providing a natural means of interaction. Windows, icons and menus, and then touchscreens, were welcomed as more intuitive ways to deal with computers than entering complex keyboard commands. But being able to talk to computers abolishes the need for the abstraction of a “user interface” at all. Just as mobile phones were more than existing phones without wires, and cars were more than carriages without horses, so computers without screens and keyboards have the potential to be more useful, powerful and ubiquitous than people can imagine today.
ubiquitous英 [ju:ˈbɪk wɪtəs] 美 [juˈbɪkwɪtəs]adj.无所不在的;普遍存在的

Voice will not wholly replace other forms of input and output. Sometimes it will remain more convenient to converse with a machine by typing rather than talking . But voice is destined to account for a growing share of people’s interactions with the technology around them, from washing machines that tell you how much of the cycle they have left to virtual assistants in corporate call-centres. However, to reach its full potential, the technology requires further breakthroughs—and a resolution of the tricky questions it raises around the trade-off between convenience and privacy.

Computer-dictation systems have been around for years. But they were unreliable and required lengthy training to learn a specific user’s voice. Computers’ new ability to recognise almost anyone’s speech dependably without training is the latest manifestation of the power of “deep learning”, an artificial-intelligence technique in which a software system is trained using millions of examples, usually culled from the internet. Thanks to deep learning, machines now nearly equal humans in transcription accuracy, computerised translation systems are improving rapidly and text-to-speech systems are becoming less robotic and more natural-sounding. Computers are, in short, getting much better at handling natural language in all its forms .
cull英 [kʌl] 美 [kʌl]vt.精选;挑选,剔除;采,摘(花)n.剔除;拣出的东西
transcription英 [trænˈskrɪpʃn] 美 [trænˈskrɪpʃən]n.抄写;抄本;录音;翻译

Although deep learning means that machines can recognise speech more reliably and talk in a less stilted manner, they still don’t understand the meaning of language. That is the most difficult aspect of the problem and, if voice-driven computing is truly to flourish, one that must be overcome. Computers must be able to understand context in order to maintain a coherent conversation about something, rather than just responding to simple, one-off voice commands, as they mostly do today . Researchers in universities and at companies large and small are working on this very problem, building “bots” that can hold more elaborate conversations about more complex tasks, from retrieving information to advising on mortgages to making travel arrangements.

stilted英 [ˈstɪltɪd] 美 [ˈstɪltɪd]adj.(动作或言语)生硬的,不自然的
coherent英 [kəʊˈhɪərənt] 美 [koʊˈhɪrənt]adj.连贯的;一致的;条理分明的;清楚明白的

Consumers and regulators also have a role to play in determining how voice computing develops. Even in its current, relatively primitive form, the technology poses a dilemma: voice-driven systems are most useful when they are personalised, and are granted wide access to sources of data such as calendars, e-mails and other sensitive information. That raises privacy and security concerns.
dilemma英 [dɪˈlemə] 美 [dɪˈlɛmə]n.进退两难;窘境,困境

To further complicate matters, many voice-driven devices are always listening, waiting to be activated. Some people are already concerned about the implications of internet-connected microphones listening in every room and from every smartphone. Not all audio is sent to the cloud—devices wait for a trigger phrase before they start relaying the user’s voice to the servers that actually handle the requests—but when it comes to storing audio, it is unclear who keeps what and when.

Police investigating a murder in Arkansas, which may have been overheard by an Amazon Echo, have asked the company for access to any audio that might have been captured. Amazon has refused to co-operate, arguing that the legal status of such requests is unclear. The situation is analogous to Apple’s refusal in 2016 to help FBI investigators unlock a terrorist’s iPhone; both cases highlight the need for rules that specify when and what intrusions into personal privacy are justified in the interests of security.

analogous英 [əˈnæləgəs] 美 [əˈnæləɡəs]adj.相似的,可比拟的;<生>同功的;模拟式
intrusion英 [ɪnˈtru:ʒn] 美 [ɪnˈtruʒən]n.闯入;打扰;(对某事的)干扰;干涉

Consumers will adopt voice computing even if such issues remain unresolved. In many situations voice is far more convenient and natural than any other means of communication. Uniquely, it can also be used while doing something else. It can extend the power of computing to people unable, for one reason or another, to use screens and keyboards. And it could have a dramatic impact not just on computing, but on the use of language itself. Computerised simultaneous translation could render the need to speak a foreign language irrelevant for many people; and in a world where machines can talk, minor languages may be more likely to survive. The arrival of the touchscreen was the last big shift in the way humans interact with computers. The leap to speech matters more.
simultaneous英 [ˌsɪmlˈteɪniəs] 美 [ˌsaɪmlˈteɪniəs]adj.同时的;同时发生的,同时存在的;联立的n.同时译员

  • 音乐开放平台
  • 云村交易所
  • X StudioAI歌手
  • 用户认证
  • AI 免费写歌
  • 云推歌
  • 赞赏

廉正举报 不良信息举报邮箱: 51jubao@service.netease.com

互联网宗教信息服务许可证:浙(2022)0000120 增值电信业务经营许可证:浙B2-20150198 粤B2-20090191-18  浙ICP备15006616号-4  工业和信息化部备案管理系统网站

网易公司版权所有©1997-2025杭州乐读科技有限公司运营:浙网文[2024] 0900-042号 浙公网安备 33010802013307号 算法服务公示信息