ABOUT THE SPEAKER

Rupal Patel - Speech scientist
People relying on synthetic speech use the voice they’re given, not their own. Rupal Patel created the vocaliD project to change that.

Why you should listen

Northeastern University computer science professor Rupal Patel looks for ways to give voice to the voiceless. As founder and director of the Communication Analysis and Design Laboratory (CadLab), she developed a technology that combines real human voices with the characteristics of individual speech patterns. The result is VocaliD, an innovation that gives people who can't speak the ability to communicate in a voice all their own.

"There's nothing better than seeing the person who's actually going to use it, seeing their reaction, seeing their smile," says Patel.

More profile about the speaker
Rupal Patel | Speaker | TED.com

TEDWomen 2013

Rupal Patel: Synthetic voices, as unique as fingerprints

ルパル・パテル: 指紋のようにユニークな合成音声

Filmed: 2013-12-05

Readability: 3.9

944,754 views

重度の言語障害をもつ多くの患者はコンピューターを駆使してコミュニケーションをはかっています。でもその音声の選択肢には限りがあります。そのため、イギリス人のスティーヴン・ホーキングの声はアメリカ訛りで、多くの人達が同じ声を使い、しばしば不似合いな声で我慢しているのです。スピーチ・サイエンティストのルパル・パテルはこの現状をどうにか変えたいと願いました。素晴らしい講演の中で、パテルは声なき人達のためにユニークな声を生み出す方法について紹介します。

Rupal Patel - Speech scientist
People relying on synthetic speech use the voice they’re given, not their own. Rupal Patel created the vocaliD project to change that. Full bio

Double-click the English transcript below to play the video.

00:12

I'd like to talk today今日

0

719

1490

今日皆さんにお話したいのは

00:14

about a powerful強力な and fundamental基本的な aspectアスペクト

1

2209

2927

私たちのあり方を決める
パワフルで

00:17

of who we are: our voice音声.

2

5136

3598

基礎的なもの―
「声」についてです

00:20

Each各 one of us has a uniqueユニークな voiceprint声紋

3

8734

2746

私たち一人一人に独特の声紋があり

00:23

that reflects反映する our age年齢, our sizeサイズ,

4

11480

2289

私たちの年齢､体格

00:25

even our lifestyleライフスタイル and personality人.

5

13769

3237

生活習慣や個性までも映し出します

00:29

In the words言葉 of the poet詩人 Longfellowロングフェロー,

6

17006

2142

ヘンリー・ワーズワース・ロングフェローは

00:31

"the human人間 voice音声 is the organ器官 of the soul魂."

7

19148

3870

「人の声は心のオルガン（心の臓器）である」と
詩でつづりました

00:35

As a speechスピーチ scientist科学者, I'm fascinated魅惑的な

8

23018

2747

スピーチ・サイエンティストである私は

00:37

by how the voice音声 is produced生産された,

9

25765

1829

発声の仕組みに魅せられ

00:39

and I have an ideaアイディア for how it can be engineered設計された.

10

27594

3658

これを人工的に作り出す方法を
見つけました

00:43

That's what I'd like to shareシェア with you.

11

31252

2210

これを皆さんと共有いたします

00:45

I'm going to start開始 by playing遊ぶ you a sampleサンプル

12

33462

1814

まずは皆さんが
ご存知かもしれない

00:47

of a voice音声 that you mayかもしれない recognize認識する.

13

35276

1871

声のサンプルを流します

00:49

(Recording録音) Stephenスティーヴン Hawkingホーキング: "I would have thought

14

37147

1304

（音声）スティーヴン・ホーキング：
「私が意図することは

00:50

it was fairlyかなり obvious明らか what I meant意味した."

15

38451

2749

かなり明確だと思っていました」

00:53

Rupalルパール Patelパテル: That was the voice音声

16

41200

1280

お聞きいただいたのは

00:54

of Professor教授 Stephenスティーヴン Hawkingホーキング.

17

42480

2086

スティーヴン・ホーキング教授の声です

00:56

What you mayかもしれない not know is that same同じ voice音声

18

44566

3849

皆さんが
ご存知ないかもしれないのは

01:00

mayかもしれない alsoまた、 be used by this little girl女の子

19

48415

2478

同じ声を
こちらの女の子のような

01:02

who is unableできない to speak話す

20

50893

1697

神経疾患で話すことができない

01:04

because of a neurological神経学 condition調子.

21

52590

2597

子供も使っている
可能性があることです

01:07

In fact事実, all of these individuals個人

22

55187

2068

実はこのような方々は

01:09

mayかもしれない be usingを使用して the same同じ voice音声,

23

57255

2012

声の選択肢がごく限られているため

01:11

and that's because there's
only a few少数 optionsオプション available利用可能な.

24

59267

3557

同じ声を使っていることがあるのです

01:14

In the U.S. alone単独で, there are 2.5 million百万 Americansアメリカ人

25

62824

4317

アメリカだけでも
話すことができない人達が

01:19

who are unableできない to speak話す,

26

67141

1610

250万人もいます

01:20

and manyたくさんの of whom誰 use computerizedコンピュータ化された devicesデバイス

27

68751

2622

その多くの人達が
コミュニケーション手段として

01:23

to communicate通信する.

28

71373

1522

コンピューターを使用します

01:24

Now that's millions何百万 of people worldwide世界的に

29

72895

3479

世界規模で
数百万の人々が

01:28

who are usingを使用して genericジェネリック voices声,

30

76374

1652

人工音声を使っているのです

01:30

includingを含む Professor教授 Hawkingホーキング,

31

78026

1446

ホーキング教授もその１人で

01:31

who uses用途 an American-accentedアメリカンアクセント型 voice音声.

32

79472

4833

アメリカ訛りの音声を使っていますね

01:36

This lack欠如 of individuation個性化 of the synthetic合成 voice音声

33

84305

3328

この個性に欠けた
合成音声には

01:39

really hitヒット home

34

87633

1416

本当にショックを受けました

01:41

when I was at an assistive補助的 technology技術 conference会議

35

89049

2472

数年前に障害を持つ人の
技術支援に関する会議に

01:43

a few少数 years年 ago前,

36

91521

1850

参加した時のことです

01:45

and I recall想起 walking歩く into an exhibit示す hallホール

37

93371

3604

展示ホールに足を入れると

01:48

and seeing見る a little girl女の子 and a grown成長した man

38

96975

3044

小さい女の子から
成人男性まで

01:52

having持つ a conversation会話 usingを使用して their彼らの devicesデバイス,

39

100019

2916

それぞれの機器を使って
話しているんですが

01:54

different異なる devicesデバイス, but the same同じ voice音声.

40

102935

4284

機器は違えど
同じ声でした

01:59

And I looked見た around and I saw this happeningハプニング

41

107219

1909

周りを見回すと
私の周りでも

02:01

all around me, literally文字通り hundreds数百 of individuals個人

42

109128

4190

同じことが起こっていました
文字通り数百人の人達が

02:05

usingを使用して a handful一握りの of voices声,

43

113318

2738

ごく限られた音声を使っていて

02:08

voices声 that didn't fitフィット their彼らの bodies体

44

116056

3091

それぞれの身体や個性に

02:11

or their彼らの personalities人格.

45

119147

2082

合っていないんです

02:13

We wouldn'tしないだろう dream夢 of fittingフィッティング a little girl女の子

46

121229

2727

小さい女の子に
成人男性用の義足を

02:15

with the prosthetic補綴 limb手足 of a grown成長した man.

47

123956

3396

あてがうなんて想像できませんよね

02:19

So why then the same同じ prosthetic補綴 voice音声?

48

127352

3304

ではなぜ人工音声もそうしないのか？

02:22

It really struck打たれた me,

49

130656

1291

これが大変気に掛かり

02:23

and I wanted to do something about this.

50

131947

3151

この状況を何とかしたいと思ったのです

02:27

I'm going to play遊びます you now a sampleサンプル

51

135098

1953

これからお聞きいただくのは

02:29

of someone誰か who has, two people actually実際に,

52

137051

3288

重度の言語障害を患っている

02:32

who have severe厳しい speechスピーチ disorders障害.

53

140339

1768

２人の音声サンプルです

02:34

I want you to take a listen to how they sound音.

54

142107

3230

どのように聞こえるか
お聞きください

02:37

They're saying言って the same同じ utterance発話.

55

145337

2357

同じ内容を発話しています

02:39

(First voice音声)

56

147694

2432

（第１音声）

02:42

(Second二番目 voice音声)

57

150126

3617

（第２音声）

02:45

You probably多分 didn't understandわかる what they said,

58

153743

2412

話の内容までは
分からなかったかもしれませんが

02:48

but I hope希望 that you heard聞いた

59

156155

1854

２人の個性的な

02:50

their彼らの uniqueユニークな vocalボーカル identitiesアイデンティティ.

60

158009

4283

音声はお分かりいただけたでしょう

02:54

So what I wanted to do next次 is,

61

162292

2813

次に私がやりたかったことは

02:57

I wanted to find out how we could harnessハーネス

62

165105

2384

このように残された
発話能力を

02:59

these residual残余 vocalボーカル abilities能力

63

167489

1821

活かして

03:01

and buildビルドする a technology技術

64

169310

2016

使用者に合わせてカスタマイズできる

03:03

that could be customizedカスタマイズされた for them,

65

171326

2143

テクノロジー
つまり彼らのために

03:05

voices声 that could be customizedカスタマイズされた for them.

66

173469

2429

カスタマイズできる声を
開発することでした

03:07

So I reached到達した out to my collaborator協力者, Timティム Bunnellブンネル.

67

175898

2685

そこで協力者の
ティム・バンネルに助言を仰ぎました

03:10

DrDr. Bunnellブンネル is an expert専門家 in speechスピーチ synthesis合成,

68

178583

3063

バンネル博士は音声合成の
第一人者で

03:13

and what he'd彼は been doing is building建物

69

181646

2033

彼がやっているのは

03:15

personalizedパーソナライズド voices声 for people

70

183679

1881

事前に録音してあった

03:17

by puttingパッティング together一緒に

71

185560

2097

本人の音声サンプルを用いて

03:19

pre-recorded事前に録音した samplesサンプル of their彼らの voice音声

72

187657

2150

音声を復元することで

03:21

and reconstructing再構築する a voice音声 for them.

73

189807

2879

個人用の音声を作っているのです

03:24

These are people who had lost失われた their彼らの voice音声

74

192686

1712

対象となるのは後天性の障害で

03:26

later後で in life.

75

194398

1911

声を失った人達です

03:28

We didn't have the luxury贅沢

76

196309

1394

生まれながらに

03:29

of pre-recorded事前に録音した samplesサンプル of speechスピーチ

77

197703

1774

言語障害がある人達には

03:31

for those bornうまれた with speechスピーチ disorder障害.

78

199477

2292

「事前に録音した音声サンプル」なんてありません

03:33

But I thought, there had to be a way

79

201769

2537

でも私が考えたのは
残されたかすかな声から

03:36

to reverse逆 engineerエンジニア a voice音声

80

204306

1944

その人の声を

03:38

from whateverなんでも little is left over.

81

206250

2291

蘇らせることができるはずだと

03:40

So we decided決定しました to do exactly正確に that.

82

208541

2714

そこでこれに取り組むことにしたのです

03:43

We setセット out with a little bitビット of funding資金調達
from the Nationalナショナル Science科学 Foundation財団,

83

211255

3403

アメリカ国立科学財団から
わずかな資金援助を受け

03:46

to create作成する custom-craftedカスタムメイド voices声 that captured捕獲

84

214658

3565

話者の独特な声の特徴を反映した

03:50

their彼らの uniqueユニークな vocalボーカル identitiesアイデンティティ.

85

218223

1536

個人用音声の開発を始めました

03:51

We call this projectプロジェクト VocaliDVocaliD, or vocalボーカル I.D.,

86

219759

3203

私たちはこのプロジェクトを
“VocaliD”や“vocal I.D.”と

03:54

for vocalボーカル identity身元.

87

222962

2033

名づけました

03:56

Now before I get into the details詳細 of how

88

224995

2674

これから皆さんに
この特注の声がどのように作られ

03:59

the voice音声 is made製 and let you listen to it,

89

227669

2048

実際の声をお聞きいただく前に

04:01

I need to give you a realリアル quickクイック
speechスピーチ science科学 lessonレッスン. Okay?

90

229717

3350

音声科学についてのごく簡単な講義をします
いいですか？

04:05

So first, we know that the voice音声 is changing変化

91

233067

3159

まず私たちの音声は
成長過程において

04:08

dramatically劇的に over the courseコース of development開発.

92

236226

2854

劇的に変化します

04:11

Children子供 sound音 different異なる from teens十代の若者たち

93

239080

2090

小さな子供の声は
十代の人達と異なりますし

04:13

who sound音 different異なる from adults大人.

94

241170

1463

成人の人達も異なります

04:14

We've私たちは all experienced経験豊富な this.

95

242633

2642

皆さんこれを経験しますね

04:17

Fact事実 number数 two is that speechスピーチ

96

245275

3363

２つ目の事実は発声とは

04:20

is a combination組み合わせ of the sourceソース,

97

248638

2553

皆さんの喉頭から発せられた

04:23

whichどの is the vibrations振動 generated生成された by your voice音声 boxボックス,

98

251191

3479

振動による音源が

04:26

whichどの are then pushed押された throughを通して

99

254670

1939

残りの声道を通過することで

04:28

the rest残り of the vocalボーカル tract管.

100

256609

2437

起こります

04:31

These are the chambers部屋 of your head頭 and neckネック

101

259046

2484

皆さんの頭と首の中にある
スペースが

04:33

that vibrate振動する,

102

261530

1239

振動することで

04:34

and they actually実際に filterフィルタ that sourceソース sound音

103

262769

2110

音源をフィルターにかけて

04:36

to produce作物 consonants子音 and vowels母音.

104

264879

2537

母音と子音が発音されるのです

04:39

So the combination組み合わせ of sourceソース and filterフィルタ

105

267416

3860

つまり音源がフィルターにかかることが

04:43

is how we produce作物 speechスピーチ.

106

271276

2630

発声のメカニズムなのです

04:45

And that happens起こる in one individual個人.

107

273906

3026

これが一人一人に起きているわけです

04:48

Now I told you earlier先に that I'd spent過ごした

108

276932

2626

先ほど申し上げたように

04:51

a good part部 of my careerキャリア

109

279558

2025

私は重い言語障害を患う人達の

04:53

understanding理解 and studying勉強する

110

281583

2453

音源の特性についての

04:56

the sourceソース characteristics特性 of people

111

284036

1958

理解と研究に
長いこと

04:57

with severe厳しい speechスピーチ disorder障害,

112

285994

2301

携わってきました

05:00

and what I've found見つけた

113

288295

1465

そこで気づいたのは

05:01

is that even thoughしかし their彼らの filtersフィルター were impaired障害者,

114

289760

3366

彼らのフィルターに障害があっても

05:05

they were ableできる to modulate変調する their彼らの sourceソース:

115

293126

2961

音源は調節可能であるということで

05:08

the pitchピッチ, the loudnessラウドネス, the tempoテンポ of their彼らの voice音声.

116

296087

3262

それは声のピッチ､大きさ､テンポです

05:11

These are calledと呼ばれる prosody韻律, and
I've been documenting文書化する for years年

117

299349

3368

これらはプロソディー（韻律）と呼ばれるもので
長年の調査で

05:14

that the prosodic韻律 abilities能力 of these individuals個人

118

302717

2277

言語障害者のプロソディーが
健在であることを

05:16

are preserved保存された.

119

304994

1575

実証してきました

05:18

So when I realized実現した that those same同じ cues合図

120

306569

4087

ですからこれらの表現が
話し手のアイデンティティにも

05:22

are alsoまた、 important重要 for speakerスピーカー identity身元,

121

310656

2769

重要だと気づいた時

05:25

I had this ideaアイディア.

122

313425

2015

このアイデアを思いついたのです

05:27

Why don't we take the sourceソース

123

315440

2516

それは発話させたい人の

05:29

from the person人 we want the voice音声 to sound音 like,

124

317956

2213

音源を使い

05:32

because it's preserved保存された,

125

320169

1463

―これは残っているんですね

05:33

and borrowかりて the filterフィルタ

126

321632

2135

対象となる人と同じ年齢で

05:35

from someone誰か about the same同じ age年齢 and sizeサイズ,

127

323767

3229

同じ体格の人から
フィルターを借りて

05:39

because they can articulate明瞭な speechスピーチ,

128

327011

2407

この明瞭な音声と

05:41

and then mixミックス them?

129

329418

1791

混ぜたらどうかと考えたのです

05:43

Because when we mixミックス them,

130

331209

1787

合成した声は

05:44

we can get a voice音声 that's as clearクリア

131

332996

1698

フィルターを借りた

05:46

as our surrogate代理 talkerトーカー --

132

334694

1754

代理話者と同じくらい

05:48

that's the person人 we borrowed借りた the filterフィルタ from—

133

336448

2595

明瞭な声で
私たちがターゲットとしている話者の

05:51

and is similar類似 in identity身元 to our targetターゲット talkerトーカー.

134

339043

4649

アイデンティティにも
類似しているんです

05:55

It's that simple単純.

135

343692

1427

こんなに簡単なんです

05:57

That's the science科学 behind後ろに what we're doing.

136

345119

2934

これが私たちがやっていることの
裏にある科学です

06:00

So once一度 you have that in mindマインド,

137

348053

3533

ではアイデアが思いついたところで

06:03

how do you go about building建物 this voice音声?

138

351586

2258

どうやって実際に声を構築したらいいでしょう？

06:05

Well, you have to find someone誰か

139

353844

1480

まずはフィルターを提供してくれる人を
探す必要がありました

06:07

who is willing喜んで to be a surrogate代理.

140

355324

2400

まずはフィルターを提供してくれる人を
探す必要がありました

06:09

It's not suchそのような an ominous不吉 thing.

141

357724

2264

全然難しいことではないんです

06:11

Beingであること a surrogate代理 donorドナー

142

359988

1523

提供者になるということは

06:13

only requires要求する you to say a few少数 hundred百

143

361511

2788

数百から数千の言葉を

06:16

to a few少数 thousand千 utterances発話.

144

364299

2242

発声するだけです

06:18

The processプロセス goes行く something like this.

145

366541

2003

この過程はこんな感じです

06:20

(Videoビデオ) Voice音声: Things happen起こる in pairsペア.

146

368544

2190

声：物事は対になって起こります

06:22

I love to sleep睡眠.

147

370734

1925

寝るのが大好きです

06:24

The sky空 is blue青 withoutなし clouds雲.

148

372659

3882

雲一つない青い空です

06:28

RPRP: Now she's going to go on like this

149

376541

2002

これを３時間から

06:30

for about three三 to four4つの hours時間,

150

378543

1919

４時間ほど続けます

06:32

and the ideaアイディア is not for her to say everything

151

380462

3005

ここでのポイントは
対象となる人が話したい文章を

06:35

that the targetターゲット is going to want to say,

152

383467

2045

代理人に言わせるのではなく

06:37

but the ideaアイディア is to coverカバー all the different異なる combinations組み合わせ

153

385512

3395

言葉の中で生じる
全ての異なる

06:40

of the sounds音 that occur発生する in the language言語.

154

388907

3271

音の組み合わせを
拾っていくことです

06:44

The more speechスピーチ you have,

155

392178

1638

サンプルが多ければ多いほど

06:45

the better sounding聞こえる voice音声 you're going to have.

156

393816

2305

より質の良い声を得ることができます

06:48

Once一度 you have those recordings録音,

157

396121

1673

収録が終わったら

06:49

what we need to do

158

397794

1413

次に必要なのは

06:51

is we have to parse解析する these recordings録音

159

399207

2718

読まれた文章を解析し

06:53

into little snippetsスニペット of speechスピーチ,

160

401925

2449

言語の要素に分割することです

06:56

one-1- or two-sound2音色 combinations組み合わせ,

161

404374

2337

１つの音や２つの音の組み合わせや

06:58

sometimes時々 even whole全体 words言葉

162

406711

1883

時には単語全体を

07:00

that start開始 populatingポピュレートする a datasetデータセット or a databaseデータベース.

163

408594

4516

データセットすなわちデータベースに
集積していきます

07:05

We're going to call this databaseデータベース a voice音声 bankバンク.

164

413110

3717

このデータベースを
音声バンクと呼びましょう

07:08

Now the powerパワー of the voice音声 bankバンク

165

416827

2096

音声バンクのパワフルな点は

07:10

is that from this voice音声 bankバンク,

166

418923

2014

この音声バンクから

07:12

we can now say any new新しい utterance発話,

167

420937

2011

新しい言葉を発声できることで

07:14

like, "I love chocolateチョコレート" --

168

422948

1424

「チョコレートが好き」とか

07:16

everyoneみんな needsニーズ to be ableできる to say that—

169

424372

1739

これは誰でも言いたいですよね

07:18

fish魚 throughを通して that databaseデータベース

170

426111

1831

データベースを駆使して

07:19

and find all the segmentsセグメント necessary必要

171

427942

1940

その言葉の発声に必要な

07:21

to say that utterance発話.

172

429882

1929

全ての断片を見つけるのです

07:23

(Videoビデオ) Voice音声: I love chocolateチョコレート.

173

431811

1789

声：チョコレートが好きです

07:25

RPRP: So that's speechスピーチ synthesis合成.

174

433600

1391

これが音声合成です

07:26

It's calledと呼ばれる concatenative連結 synthesis合成,
and that's what we're usingを使用して.

175

434991

2573

波形接続合成という
私たちが使っている手法です

07:29

That's not the novel小説 part部.

176

437564

1533

これは目新しくありませんが

07:31

What's novel小説 is how we make it sound音

177

439097

2221

新しい点はどうやって
この若い女性が

07:33

like this young若い woman女性.

178

441318

1457

話すような音声にするかです

07:34

This is Samanthaサマンサ.

179

442775

1524

彼女の名前はサマンサです

07:36

I met会った her when she was nine9人,

180

444299

2346

私が彼女に出会ったのは

07:38

and since以来 then, my teamチーム and I

181

446645

1897

彼女が９歳の時で

07:40

have been trying試す to buildビルドする her a personalizedパーソナライズド voice音声.

182

448542

2714

私のチームは
彼女のための声を構築してきました

07:43

We first had to find a surrogate代理 donorドナー,

183

451256

3099

まずは代理ドナーを探して

07:46

and then we had to have Samanthaサマンサ

184

454355

1818

サマンサにもいくつかの

07:48

produce作物 some utterances発話.

185

456173

1929

発声をお願いしました

07:50

What she can produce作物 are mostly主に vowel-like母音のような sounds音,

186

458102

2379

彼女が発声できるのは
主に母音だけですが

07:52

but that's enough十分な for us to extractエキス

187

460481

2479

彼女の音源特性を引き出すのには

07:54

her sourceソース characteristics特性.

188

462960

2285

十分な情報でした

07:57

What happens起こる next次 is bestベスト described記載された

189

465245

3271

次のステップは

08:00

by my daughter's娘の analogy類推. She's six6.

190

468516

2767

私の６歳の娘が上手く例えています

08:03

She callsコール it mixing混合 colors色 to paintペイント voices声.

191

471283

5422

娘は「声を色づかせるために
絵の具を混ぜているんだね」と

08:08

It's beautiful綺麗な. It's exactly正確に that.

192

476705

2555

きれいですよね
まさにその通りなんです

08:11

Samantha'sサマンサ voice音声 is like a concentrated濃縮 sampleサンプル

193

479260

2860

サマンサの声は
濃縮された食紅のように

08:14

of red赤 foodフード dye染料 whichどの we can infuse注入する

194

482120

2609

彼女の代理ドナーの
録音した声に混ぜることで

08:16

into the recordings録音 of her surrogate代理

195

484729

2540

彼女の代理ドナーの
録音した声に混ぜることで

08:19

to get a pinkピンク voice音声 just like this.

196

487269

4387

ピンク色の声になるのです
まさにこんな風に

08:23

(Videoビデオ) Samanthaサマンサ: Aaaaaahアアアア.

197

491656

4491

サマンサ：ああああああ

08:28

RPRP: So now, Samanthaサマンサ can say this.

198

496147

2808

今ではこんな風に話せます

08:30

(Videoビデオ) Samanthaサマンサ: This voice音声 is only for me.

199

498955

3069

サマンサ：この声は私だけのもの

08:34

I can't wait to use my new新しい voice音声 with my friends友達.

200

502024

6305

友達と新しい声で話すのが楽しみ

08:40

RPRP: Thank you. (Applause拍手)

201

508329

6417

ありがとう　（拍手）

08:46

I'll never forget忘れる the gentle優しい smileスマイル

202

514746

2333

彼女が最初に
この声を聞いた時の

08:49

that spread普及 across横断する her face面

203

517079

1902

顔いっぱいに広がった

08:50

when she heard聞いた that voice音声 for the first time.

204

518981

3649

優しい笑みはずっと忘れないでしょう

08:54

Now there's millions何百万 of people

205

522630

1882

世界中には数百万人もの

08:56

around the world世界 like Samanthaサマンサ, millions何百万,

206

524512

2833

サマンサのような人々がいます
数百万ですよ

08:59

and we've私たちは only begun始まった to scratchスクラッチ the surface表面.

207

527345

3440

私たちの取り組みは
まだまだ始まったばかりです

09:02

What we've私たちは done完了 so far遠い is we have

208

530785

1642

これまでの取り組みは

09:04

a few少数 surrogate代理 talkers話す人 from around the U.S.

209

532427

3859

アメリカ国内で
声を提供してくれる人々を

09:08

who have donated寄付された their彼らの voices声,

210

536286

1507

数名集めて

09:09

and we have been usingを使用して those

211

537793

1928

私たちの初の試みとなる

09:11

to buildビルドする our first few少数 personalizedパーソナライズド voices声.

212

539721

4472

個人用の声の構築に
利用しています

09:16

But there's so much more work to be done完了.

213

544193

1756

でもやることは山ほどあります

09:17

For Samanthaサマンサ, her surrogate代理

214

545949

2188

例えばサマンサの代理ドナーは

09:20

came来た from somewhereどこかで in the Midwest中西部, a strangerストレンジャー

215

548137

3046

中西部の出身で
見ず知らずの他人が

09:23

who gave与えた her the gift贈り物 of voice音声.

216

551183

3841

声の贈り物をしてくれたのです

09:27

And as a scientist科学者, I'm so excited興奮した

217

555024

2153

私が科学者として
とても楽しみなのは

09:29

to take this work out of the laboratory研究室

218

557177

1935

研究室でやっていた仕事を

09:31

and finally最後に into the realリアル world世界

219

559112

1800

ついに実用化して

09:32

so it can have real-world現実の世界 impact影響.

220

560912

3165

実社会に影響を与えることです

09:36

What I want to shareシェア with you next次

221

564077

1582

次に皆さんと共有させていただくのは

09:37

is how I envision想像 taking取る this work

222

565659

2175

この成果を
どうやって次のレベルに

09:39

to that next次 levelレベル.

223

567834

2711

進めるかです

09:42

I imagine想像する a whole全体 world世界 of surrogate代理 donorsドナー

224

570545

3887

私が考えているのは
世界中のあらゆる階層の人々

09:46

from all walksあるきます of life, different異なる sizesサイズ, different異なる ages年齢,

225

574432

3260

異なる体格や
違う年齢層の人々が

09:49

coming到来 together一緒に in this voice音声 driveドライブ

226

577692

3058

代理ドナーとなって

09:52

to give people voices声

227

580750

2270

個性と同じくらい

09:55

that are as colorfulカラフル as their彼らの personalities人格.

228

583020

3799

色彩に富んだ声を
人々に贈ることです

09:58

To do that as a first stepステップ,

229

586819

2300

これを叶えるための第一歩として

10:01

we've私たちは put together一緒に this websiteウェブサイト, VocaliDVocaliD.org組織,

230

589119

3275

『VocaliD.org』というウェブサイトを
立ち上げました

10:04

as a way to bring持参する together一緒に those

231

592394

1624

声や専門知識の提供を

10:06

who want to join参加する us as voice音声 donorsドナー,

232

594018

2675

募るためのサイトで
私たちのビジョンを

10:08

as expertise専門知識 donorsドナー,

233

596693

1772

いろいろな形で

10:10

in whateverなんでも way to make this visionビジョン a reality現実.

234

598465

5339

支援してくれる人たちを
集める試みです

10:15

They say that giving与える blood血液 can saveセーブ lives人生.

235

603804

4153

献血で他人の命を救うことができますね

10:19

Well, giving与える your voice音声 can change変化する lives人生.

236

607957

4982

声を提供することで
他人の人生を変えることができます

10:24

All we need is a few少数 hours時間 of speechスピーチ

237

612939

3050

ほんの数時間分の
代理話者の

10:27

from our surrogate代理 talkerトーカー,

238

615989

1491

音声サンプルと

10:29

and as little as a vowel母音 from our targetターゲット talkerトーカー,

239

617480

4733

声を受け取る人の発声した
母音が１つでもあれば

10:34

to create作成する a uniqueユニークな vocalボーカル identity身元.

240

622213

3711

独特な声のアイデンティティを
作れます

10:37

So that's the science科学 behind後ろに what we're doing.

241

625924

2626

これが私たちがやっている裏にある
科学なんです

10:40

I want to end終わり by circling旋回する back to the human人間 side側

242

628550

4455

この仕事に
インスピレーションをもたらしてくれた

10:45

that is really the inspirationインスピレーション for this work.

243

633005

4102

人間的な部分に立ち返ることで
締めくくります

10:49

About five五 years年 ago前, we built建てられた our very first voice音声

244

637107

3699

約５年前のことです
私たちが最初に作った声は

10:52

for a little boy男の子 named名前 Williamウィリアム.

245

640806

2501

ウィリアムという男の子のためでした

10:55

When his momママ first heard聞いた this voice音声,

246

643307

2357

母親がこの声を始めて耳にした時

10:57

she said, "This is what Williamウィリアム

247

645664

2345

「まさにウィリアムの声だ

11:00

would have sounded聞こえた like

248

648009

1546

もしこの子が話せていたら

11:01

had he been ableできる to speak話す."

249

649555

2449

きっとこんな声だったに違いない」と

11:04

And then I saw Williamウィリアム typingタイピング a messageメッセージ

250

652004

2418

するとウィリアムが
彼の機器で

11:06

on his deviceデバイス.

251

654422

1362

メッセージをタイプするんです

11:07

I wondered疑問に思った, what was he thinking考え?

252

655784

3293

私は彼が何を考えているのか
思いを馳せました

11:11

Imagine想像する carrying運ぶ around someone誰か else's他の voice音声

253

659077

3590

９年間も他人の声を使っていた

11:14

for nine9人 years年

254

662667

2193

男の子が

11:16

and finally最後に finding所見 your own自分の voice音声.

255

664860

4844

ついに自分の声を手に入れたのです

11:21

Imagine想像する that.

256

669704

1377

どんな気分だと思いますか

11:23

This is what Williamウィリアム said:

257

671081

2797

ウィリアムはこう言いました

11:25

"Never heard聞いた me before."

258

673878

4463

「自分の声でしゃべったのは初めてだ」

11:32

Thank you.

259

680417

1619

ありがとうございました

11:34

(Applause拍手)

260

682036

4724

（拍手）

Translated by Mari Arimitsu
Reviewed by Akiko Hicks

ABOUT THE SPEAKER

Rupal Patel - Speech scientist
People relying on synthetic speech use the voice they’re given, not their own. Rupal Patel created the vocaliD project to change that.

Why you should listen

Northeastern University computer science professor Rupal Patel looks for ways to give voice to the voiceless. As founder and director of the Communication Analysis and Design Laboratory (CadLab), she developed a technology that combines real human voices with the characteristics of individual speech patterns. The result is VocaliD, an innovation that gives people who can't speak the ability to communicate in a voice all their own.

"There's nothing better than seeing the person who's actually going to use it, seeing their reaction, seeing their smile," says Patel.

More profile about the speaker
Rupal Patel | Speaker | TED.com

THE ORIGINAL VIDEO ON TED.COM

ルパル・パテル: 指紋のようにユニークな合成音声 | TED Talk | TED.com