ABOUT THE SPEAKER
Ajit Narayanan - Visual grammar engine inventor
Ajit Narayanan is the inventor of Avaz, an affordable, tablet-based communication device for people who are speech-impaired.

Why you should listen

Ajit Narayanan is the founder and CEO of Invention Labs, and the inventor of Avaz AAC, the first assistive device aimed at an Indian market that helps people with speech disabilities -- such as cerebral palsy, autism, intellectual disability, aphasia and learning disabilities -- to communicate. Avaz is also available as an iPad app, aimed at children with autism. In 2010, Avaz won the National Award for Empowerment of People with Disabilities from the president of India, and in 2011, Narayanan was listed in MIT Technology Review 35 under 35.
 
Narayanan is a prolific inventor with more than 20 patent applications. He is an electrical engineer with degrees from IIT Madras. His research interests are embedded systems, signal processing and understanding how the brain perceives language and communication.

More profile about the speaker
Ajit Narayanan | Speaker | TED.com
TED2013

Ajit Narayanan: A word game to communicate in any language

Filmed:
1,391,245 views

While working with kids who have trouble speaking, Ajit Narayanan sketched out a way to think about language in pictures, to relate words and concepts in "maps." The idea now powers the FreeSpeech app, which can help nonverbal people communicate.
- Visual grammar engine inventor
Ajit Narayanan is the inventor of Avaz, an affordable, tablet-based communication device for people who are speech-impaired. Full bio

Double-click the English transcript below to play the video.

00:12
I work with children with autism.
0
721
2670
00:15
Specifically, I make technologies
1
3391
1914
00:17
to help them communicate.
2
5305
2171
00:19
Now, many of the problems that children
3
7476
1539
00:21
with autism face, they have a common source,
4
9015
3763
00:24
and that source is that they find it difficult
5
12778
2094
00:26
to understand abstraction, symbolism.
6
14872
5260
00:32
And because of this, they have
a lot of difficulty with language.
7
20132
4652
00:36
Let me tell you a little bit about why this is.
8
24784
3015
00:39
You see that this is a picture of a bowl of soup.
9
27799
3934
00:43
All of us can see it. All of us understand this.
10
31733
2485
00:46
These are two other pictures of soup,
11
34218
2312
00:48
but you can see that these are more abstract
12
36530
2067
00:50
These are not quite as concrete.
13
38597
1856
00:52
And when you get to language,
14
40453
2174
00:54
you see that it becomes a word
15
42627
1868
00:56
whose look, the way it looks and the way it sounds,
16
44495
3261
00:59
has absolutely nothing to do
with what it started with,
17
47756
2912
01:02
or what it represents, which is the bowl of soup.
18
50668
2830
01:05
So it's essentially a completely abstract,
19
53498
2900
01:08
a completely arbitrary representation of something
20
56398
2576
01:10
which is in the real world,
21
58974
1163
01:12
and this is something that children with autism
22
60137
1791
01:13
have an incredible amount of difficulty with.
23
61928
3164
01:17
Now that's why most of the people
that work with children with autism --
24
65092
2751
01:19
speech therapists, educators --
25
67843
1878
01:21
what they do is, they try to help children with autism
26
69721
2633
01:24
communicate not with words, but with pictures.
27
72354
3229
01:27
So if a child with autism wanted to say,
28
75583
1930
01:29
"I want soup," that child would pick
29
77513
2458
01:31
three different pictures, "I," "want," and "soup,"
30
79971
2260
01:34
and they would put these together,
31
82231
1609
01:35
and then the therapist or the parent would
32
83840
1867
01:37
understand that this is what the kid wants to say.
33
85707
1887
01:39
And this has been incredibly effective;
34
87594
1778
01:41
for the last 30, 40 years
35
89372
2141
01:43
people have been doing this.
36
91513
1613
01:45
In fact, a few years back,
37
93126
1349
01:46
I developed an app for the iPad
38
94475
2675
01:49
which does exactly this. It's called Avaz,
39
97150
2255
01:51
and the way it works is that kids select
40
99405
2279
01:53
different pictures.
41
101684
1321
01:55
These pictures are sequenced
together to form sentences,
42
103005
2570
01:57
and these sentences are spoken out.
43
105575
1719
01:59
So Avaz is essentially converting pictures,
44
107294
3025
02:02
it's a translator, it converts pictures into speech.
45
110319
3960
02:06
Now, this was very effective.
46
114279
1718
02:07
There are thousands of children using this,
47
115997
1384
02:09
you know, all over the world,
48
117381
1430
02:10
and I started thinking about
49
118811
2175
02:12
what it does and what it doesn't do.
50
120986
2654
02:15
And I realized something interesting:
51
123640
1684
02:17
Avaz helps children with autism learn words.
52
125324
4203
02:21
What it doesn't help them do is to learn
53
129527
2405
02:23
word patterns.
54
131932
2748
02:26
Let me explain this in a little more detail.
55
134680
2472
02:29
Take this sentence: "I want soup tonight."
56
137152
3057
02:32
Now it's not just the words
here that convey the meaning.
57
140209
4080
02:36
It's also the way in which these words are arranged,
58
144289
3140
02:39
the way these words are modified and arranged.
59
147429
2515
02:41
And that's why a sentence like "I want soup tonight"
60
149959
2306
02:44
is different from a sentence like
61
152265
1984
02:46
"Soup want I tonight," which
is completely meaningless.
62
154249
3312
02:49
So there is another hidden abstraction here
63
157561
2619
02:52
which children with autism find
a lot of difficulty coping with,
64
160180
3557
02:55
and that's the fact that you can modify words
65
163737
2840
02:58
and you can arrange them to have
66
166577
2101
03:00
different meanings, to convey different ideas.
67
168678
2895
03:03
Now, this is what we call grammar.
68
171573
3459
03:07
And grammar is incredibly powerful,
69
175032
2036
03:09
because grammar is this one component of language
70
177068
3157
03:12
which takes this finite vocabulary that all of us have
71
180225
3489
03:15
and allows us to convey an
infinite amount of information,
72
183714
4531
03:20
an infinite amount of ideas.
73
188245
2134
03:22
It's the way in which you can put things together
74
190379
2002
03:24
in order to convey anything you want to.
75
192381
2168
03:26
And so after I developed Avaz,
76
194549
2127
03:28
I worried for a very long time
77
196676
1568
03:30
about how I could give grammar
to children with autism.
78
198244
3910
03:34
The solution came to me from
a very interesting perspective.
79
202154
2275
03:36
I happened to chance upon a child with autism
80
204429
3449
03:39
conversing with her mom,
81
207878
2109
03:41
and this is what happened.
82
209987
2094
03:44
Completely out of the blue, very spontaneously,
83
212081
2186
03:46
the child got up and said, "Eat."
84
214267
2463
03:48
Now what was interesting was
85
216730
1770
03:50
the way in which the mom was trying to tease out
86
218500
4244
03:54
the meaning of what the child wanted to say
87
222744
2213
03:56
by talking to her in questions.
88
224957
2260
03:59
So she asked, "Eat what? Do
you want to eat ice cream?
89
227217
2593
04:01
You want to eat? Somebody else wants to eat?
90
229810
2112
04:03
You want to eat cream now? You
want to eat ice cream in the evening?"
91
231922
3313
04:07
And then it struck me that
92
235235
1514
04:08
what the mother had done was something incredible.
93
236749
2028
04:10
She had been able to get that child to communicate
94
238777
1994
04:12
an idea to her without grammar.
95
240771
4138
04:16
And it struck me that maybe this is what
96
244909
2696
04:19
I was looking for.
97
247605
1385
04:20
Instead of arranging words in an order, in sequence,
98
248990
4142
04:25
as a sentence, you arrange them
99
253132
2172
04:27
in this map, where they're all linked together
100
255304
3811
04:31
not by placing them one after the other
101
259115
2143
04:33
but in questions, in question-answer pairs.
102
261258
3284
04:36
And so if you do this, then what you're conveying
103
264542
2358
04:38
is not a sentence in English,
104
266900
1986
04:40
but what you're conveying is really a meaning,
105
268886
2966
04:43
the meaning of a sentence in English.
106
271852
1511
04:45
Now, meaning is really the underbelly,
in some sense, of language.
107
273363
2932
04:48
It's what comes after thought but before language.
108
276295
3821
04:52
And the idea was that this particular representation
109
280116
2503
04:54
might convey meaning in its raw form.
110
282619
3261
04:57
So I was very excited by this, you know,
111
285880
1771
04:59
hopping around all over the place,
112
287651
1493
05:01
trying to figure out if I can convert
113
289144
1771
05:02
all possible sentences that I hear into this.
114
290915
2524
05:05
And I found that this is not enough.
115
293439
1773
05:07
Why is this not enough?
116
295212
1385
05:08
This is not enough because if you wanted to convey
117
296597
1711
05:10
something like negation,
118
298308
2250
05:12
you want to say, "I don't want soup,"
119
300558
1736
05:14
then you can't do that by asking a question.
120
302294
2220
05:16
You do that by changing the word "want."
121
304514
2285
05:18
Again, if you wanted to say,
122
306799
1637
05:20
"I wanted soup yesterday,"
123
308436
1980
05:22
you do that by converting
the word "want" into "wanted."
124
310416
2737
05:25
It's a past tense.
125
313153
1666
05:26
So this is a flourish which I added
126
314819
2103
05:28
to make the system complete.
127
316922
1576
05:30
This is a map of words joined together
128
318498
1977
05:32
as questions and answers,
129
320475
1656
05:34
and with these filters applied on top of them
130
322131
2264
05:36
in order to modify them to represent
131
324395
1817
05:38
certain nuances.
132
326212
1709
05:39
Let me show you this with a different example.
133
327921
1951
05:41
Let's take this sentence:
134
329872
1254
05:43
"I told the carpenter I could not pay him."
135
331126
1980
05:45
It's a fairly complicated sentence.
136
333106
1792
05:46
The way that this particular system works,
137
334898
1893
05:48
you can start with any part of this sentence.
138
336791
2578
05:51
I'm going to start with the word "tell."
139
339369
1698
05:53
So this is the word "tell."
140
341067
1462
05:54
Now this happened in the past,
141
342529
1600
05:56
so I'm going to make that "told."
142
344129
2223
05:58
Now, what I'm going to do is,
143
346352
1708
06:00
I'm going to ask questions.
144
348060
1756
06:01
So, who told? I told.
145
349816
2364
06:04
I told whom? I told the carpenter.
146
352180
1927
06:06
Now we start with a different part of the sentence.
147
354107
1751
06:07
We start with the word "pay,"
148
355858
1867
06:09
and we add the ability filter to it to make it "can pay."
149
357725
4577
06:14
Then we make it "can't pay,"
150
362302
2101
06:16
and we can make it "couldn't pay"
151
364403
1599
06:18
by making it the past tense.
152
366002
1663
06:19
So who couldn't pay? I couldn't pay.
153
367665
1923
06:21
Couldn't pay whom? I couldn't pay the carpenter.
154
369588
2676
06:24
And then you join these two together
155
372264
1731
06:25
by asking this question:
156
373995
1350
06:27
What did I tell the carpenter?
157
375345
1737
06:29
I told the carpenter I could not pay him.
158
377082
4049
06:33
Now think about this. This is
159
381131
1937
06:35
—(Applause)—
160
383068
3542
06:38
this is a representation of this sentence
161
386610
3672
06:42
without language.
162
390282
2435
06:44
And there are two or three
interesting things about this.
163
392717
2192
06:46
First of all, I could have started anywhere.
164
394909
3131
06:50
I didn't have to start with the word "tell."
165
398040
2243
06:52
I could have started anywhere in the sentence,
166
400283
1416
06:53
and I could have made this entire thing.
167
401699
1507
06:55
The second thing is, if I wasn't an English speaker,
168
403206
2776
06:57
if I was speaking in some other language,
169
405982
2175
07:00
this map would actually hold true in any language.
170
408157
3156
07:03
So long as the questions are standardized,
171
411313
1990
07:05
the map is actually independent of language.
172
413303
4287
07:09
So I call this FreeSpeech,
173
417590
2115
07:11
and I was playing with this for many, many months.
174
419705
2935
07:14
I was trying out so many
different combinations of this.
175
422640
2726
07:17
And then I noticed something very
interesting about FreeSpeech.
176
425366
2289
07:19
I was trying to convert language,
177
427655
3243
07:22
convert sentences in English
into sentences in FreeSpeech,
178
430898
2384
07:25
and vice versa, and back and forth.
179
433282
1752
07:27
And I realized that this particular configuration,
180
435034
2255
07:29
this particular way of representing language,
181
437289
2026
07:31
it allowed me to actually create very concise rules
182
439315
4395
07:35
that go between FreeSpeech on one side
183
443710
2734
07:38
and English on the other.
184
446444
1488
07:39
So I could actually write this set of rules
185
447932
2180
07:42
that translates from this particular
representation into English.
186
450112
3395
07:45
And so I developed this thing.
187
453507
1831
07:47
I developed this thing called
the FreeSpeech Engine
188
455338
2232
07:49
which takes any FreeSpeech sentence as the input
189
457570
2561
07:52
and gives out perfectly grammatical English text.
190
460131
3930
07:56
And by putting these two pieces together,
191
464061
1605
07:57
the representation and the engine,
192
465666
1881
07:59
I was able to create an app, a
technology for children with autism,
193
467547
3796
08:03
that not only gives them words
194
471343
2499
08:05
but also gives them grammar.
195
473842
3941
08:09
So I tried this out with kids with autism,
196
477783
2360
08:12
and I found that there was an
incredible amount of identification.
197
480143
5013
08:17
They were able to create sentences in FreeSpeech
198
485156
2720
08:19
which were much more complicated
but much more effective
199
487876
2558
08:22
than equivalent sentences in English,
200
490434
2899
08:25
and I started thinking about
201
493333
1682
08:27
why that might be the case.
202
495015
1969
08:28
And I had an idea, and I want to
talk to you about this idea next.
203
496984
4287
08:33
In about 1997, about 15 years back,
204
501271
3142
08:36
there were a group of scientists that were trying
205
504413
2011
08:38
to understand how the brain processes language,
206
506424
2389
08:40
and they found something very interesting.
207
508813
1779
08:42
They found that when you learn a language
208
510592
1872
08:44
as a child, as a two-year-old,
209
512464
2912
08:47
you learn it with a certain part of your brain,
210
515376
2366
08:49
and when you learn a language as an adult --
211
517742
1600
08:51
for example, if I wanted to
learn Japanese right now —
212
519342
3911
08:55
a completely different part of my brain is used.
213
523253
2707
08:57
Now I don't know why that's the case,
214
525960
1831
08:59
but my guess is that that's because
215
527791
1991
09:01
when you learn a language as an adult,
216
529782
2437
09:04
you almost invariably learn it
217
532219
1616
09:05
through your native language, or
through your first language.
218
533835
4266
09:10
So what's interesting about FreeSpeech
219
538101
3252
09:13
is that when you create a sentence
220
541353
1802
09:15
or when you create language,
221
543155
1695
09:16
a child with autism creates
language with FreeSpeech,
222
544850
3070
09:19
they're not using this support language,
223
547920
1833
09:21
they're not using this bridge language.
224
549753
2211
09:23
They're directly constructing the sentence.
225
551964
2657
09:26
And so this gave me this idea.
226
554621
2193
09:28
Is it possible to use FreeSpeech
227
556814
2024
09:30
not for children with autism
228
558838
2510
09:33
but to teach language to people without disabilities?
229
561348
6262
09:39
And so I tried a number of experiments.
230
567610
1978
09:41
The first thing I did was I built a jigsaw puzzle
231
569588
2948
09:44
in which these questions and answers
232
572536
1970
09:46
are coded in the form of shapes,
233
574506
1835
09:48
in the form of colors,
234
576341
1138
09:49
and you have people putting these together
235
577479
1849
09:51
and trying to understand how this works.
236
579328
1773
09:53
And I built an app out of it, a game out of it,
237
581101
2376
09:55
in which children can play with words
238
583477
2661
09:58
and with a reinforcement,
239
586138
1704
09:59
a sound reinforcement of visual structures,
240
587842
2585
10:02
they're able to learn language.
241
590427
2013
10:04
And this, this has a lot of potential, a lot of promise,
242
592440
2736
10:07
and the government of India recently
243
595176
1975
10:09
licensed this technology from us,
244
597151
1404
10:10
and they're going to try it out
with millions of different children
245
598555
2074
10:12
trying to teach them English.
246
600629
2605
10:15
And the dream, the hope, the vision, really,
247
603234
2614
10:17
is that when they learn English this way,
248
605848
3082
10:20
they learn it with the same proficiency
249
608930
2643
10:23
as their mother tongue.
250
611573
3718
10:27
All right, let's talk about something else.
251
615291
3816
10:31
Let's talk about speech.
252
619107
1997
10:33
This is speech.
253
621104
1271
10:34
So speech is the primary mode of communication
254
622375
1962
10:36
delivered between all of us.
255
624337
1613
10:37
Now what's interesting about speech is that
256
625950
1855
10:39
speech is one-dimensional.
257
627805
1245
10:41
Why is it one-dimensional?
258
629050
1359
10:42
It's one-dimensional because it's sound.
259
630409
1568
10:43
It's also one-dimensional because
260
631977
1539
10:45
our mouths are built that way.
261
633516
1205
10:46
Our mouths are built to create
one-dimensional sound.
262
634721
3512
10:50
But if you think about the brain,
263
638233
2866
10:53
the thoughts that we have in our heads
264
641099
1764
10:54
are not one-dimensional.
265
642863
2102
10:56
I mean, we have these rich,
266
644965
1459
10:58
complicated, multi-dimensional ideas.
267
646424
3028
11:01
Now, it seems to me that language
268
649452
1690
11:03
is really the brain's invention
269
651142
2332
11:05
to convert this rich, multi-dimensional thought
270
653474
3096
11:08
on one hand
271
656570
1587
11:10
into speech on the other hand.
272
658157
1923
11:12
Now what's interesting is that
273
660080
1762
11:13
we do a lot of work in information nowadays,
274
661842
2568
11:16
and almost all of that is done
in the language domain.
275
664410
3079
11:19
Take Google, for example.
276
667489
1939
11:21
Google trawls all these
countless billions of websites,
277
669428
2677
11:24
all of which are in English,
and when you want to use Google,
278
672105
2725
11:26
you go into Google search, and you type in English,
279
674830
2450
11:29
and it matches the English with the English.
280
677280
4163
11:33
What if we could do this in FreeSpeech instead?
281
681443
3583
11:37
I have a suspicion that if we did this,
282
685026
2301
11:39
we'd find that algorithms like searching,
283
687327
2068
11:41
like retrieval, all of these things,
284
689395
2325
11:43
are much simpler and also more effective,
285
691720
3075
11:46
because they don't process
the data structure of speech.
286
694795
4417
11:51
Instead they're processing
the data structure of thought.
287
699212
5976
11:57
The data structure of thought.
288
705188
2808
11:59
That's a provocative idea.
289
707996
2076
12:02
But let's look at this in a little more detail.
290
710072
2142
12:04
So this is the FreeSpeech ecosystem.
291
712214
2366
12:06
We have the Free Speech
representation on one side,
292
714580
2884
12:09
and we have the FreeSpeech
Engine, which generates English.
293
717464
2228
12:11
Now if you think about it,
294
719694
1725
12:13
FreeSpeech, I told you, is completely
language-independent.
295
721419
2544
12:15
It doesn't have any specific information in it
296
723963
2087
12:18
which is about English.
297
726050
1228
12:19
So everything that this system knows about English
298
727278
2800
12:22
is actually encoded into the engine.
299
730078
4620
12:26
That's a pretty interesting concept in itself.
300
734698
2237
12:28
You've encoded an entire human language
301
736935
3604
12:32
into a software program.
302
740539
2645
12:35
But if you look at what's inside the engine,
303
743184
2531
12:37
it's actually not very complicated.
304
745715
2358
12:40
It's not very complicated code.
305
748073
2105
12:42
And what's more interesting is the fact that
306
750178
2672
12:44
the vast majority of the code in that engine
307
752850
2203
12:47
is not really English-specific.
308
755053
2412
12:49
And that gives this interesting idea.
309
757465
1895
12:51
It might be very easy for us to actually
310
759360
2038
12:53
create these engines in many,
many different languages,
311
761398
3826
12:57
in Hindi, in French, in German, in Swahili.
312
765224
6354
13:03
And that gives another interesting idea.
313
771578
2799
13:06
For example, supposing I was a writer,
314
774377
2654
13:09
say, for a newspaper or for a magazine.
315
777031
2122
13:11
I could create content in one language, FreeSpeech,
316
779153
5011
13:16
and the person who's consuming that content,
317
784164
2056
13:18
the person who's reading that particular information
318
786220
3061
13:21
could choose any engine,
319
789281
2495
13:23
and they could read it in their own mother tongue,
320
791776
2736
13:26
in their native language.
321
794512
3939
13:30
I mean, this is an incredibly attractive idea,
322
798451
2722
13:33
especially for India.
323
801173
1999
13:35
We have so many different languages.
324
803172
1690
13:36
There's a song about India, and there's a description
325
804862
2142
13:39
of the country as, it says,
326
807004
2344
13:41
(in Sanskrit).
327
809348
2360
13:43
That means "ever-smiling speaker
328
811708
2773
13:46
of beautiful languages."
329
814481
4519
13:51
Language is beautiful.
330
819000
1964
13:52
I think it's the most beautiful of human creations.
331
820964
2454
13:55
I think it's the loveliest thing
that our brains have invented.
332
823418
3978
13:59
It entertains, it educates, it enlightens,
333
827396
3584
14:02
but what I like the most about language
334
830980
2044
14:05
is that it empowers.
335
833024
1500
14:06
I want to leave you with this.
336
834524
1838
14:08
This is a photograph of my collaborators,
337
836362
2385
14:10
my earliest collaborators
338
838747
997
14:11
when I started working on language
339
839744
1462
14:13
and autism and various other things.
340
841206
1502
14:14
The girl's name is Pavna,
341
842708
1417
14:16
and that's her mother, Kalpana.
342
844125
1902
14:18
And Pavna's an entrepreneur,
343
846027
2138
14:20
but her story is much more remarkable than mine,
344
848165
2371
14:22
because Pavna is about 23.
345
850536
2400
14:24
She has quadriplegic cerebral palsy,
346
852936
2552
14:27
so ever since she was born,
347
855488
1640
14:29
she could neither move nor talk.
348
857128
3600
14:32
And everything that she's accomplished so far,
349
860728
2403
14:35
finishing school, going to college,
350
863131
2227
14:37
starting a company,
351
865358
1416
14:38
collaborating with me to develop Avaz,
352
866774
2140
14:40
all of these things she's done
353
868914
1892
14:42
with nothing more than moving her eyes.
354
870806
5523
14:48
Daniel Webster said this:
355
876329
2689
14:51
He said, "If all of my possessions were taken
356
879018
2940
14:53
from me with one exception,
357
881958
2988
14:56
I would choose to keep the power of communication,
358
884946
2981
14:59
for with it, I would regain all the rest."
359
887927
3903
15:03
And that's why, of all of these incredible
applications of FreeSpeech,
360
891830
5116
15:08
the one that's closest to my heart
361
896946
2080
15:11
still remains the ability for this
362
899026
2068
15:13
to empower children with disabilities
363
901094
2380
15:15
to be able to communicate,
364
903474
1773
15:17
the power of communication,
365
905247
1789
15:19
to get back all the rest.
366
907036
2240
15:21
Thank you.
367
909276
1397
15:22
(Applause)
368
910673
1332
15:24
Thank you. (Applause)
369
912005
4199
15:28
Thank you. Thank you. Thank you. (Applause)
370
916204
5323
15:33
Thank you. Thank you. Thank you. (Applause)
371
921527
4000

▲Back to top

ABOUT THE SPEAKER
Ajit Narayanan - Visual grammar engine inventor
Ajit Narayanan is the inventor of Avaz, an affordable, tablet-based communication device for people who are speech-impaired.

Why you should listen

Ajit Narayanan is the founder and CEO of Invention Labs, and the inventor of Avaz AAC, the first assistive device aimed at an Indian market that helps people with speech disabilities -- such as cerebral palsy, autism, intellectual disability, aphasia and learning disabilities -- to communicate. Avaz is also available as an iPad app, aimed at children with autism. In 2010, Avaz won the National Award for Empowerment of People with Disabilities from the president of India, and in 2011, Narayanan was listed in MIT Technology Review 35 under 35.
 
Narayanan is a prolific inventor with more than 20 patent applications. He is an electrical engineer with degrees from IIT Madras. His research interests are embedded systems, signal processing and understanding how the brain perceives language and communication.

More profile about the speaker
Ajit Narayanan | Speaker | TED.com