1
00:00:00,160 --> 00:00:04,414
Will robots ever turn against us and take over
the world?
2
00:00:04,414 --> 00:00:07,359
I'm going to tell you all about it.
3
00:00:08,000 --> 00:00:10,379
Will robots outsmart us?
4
00:00:16,139 --> 00:00:18,944
This the university of the
Netherlands
5
00:00:18,944 --> 00:00:23,078
New technologies emerge quickly and movies show somehow...
6
00:00:23,078 --> 00:00:27,131
frightening scenarios in which
robots are smarter than humans...
7
00:00:27,131 --> 00:00:28,954
and are taking over control.
8
00:00:28,954 --> 00:00:32,514
Some movies even go a step further and predict...
9
00:00:32,514 --> 00:00:34,480
that robots can replicate themselves.
10
00:00:35,120 --> 00:00:40,442
However will this ever become reality? Will robots outsmart us?
11
00:00:40,442 --> 00:00:44,299
As an applied mathematician and mechanical engineer...
12
00:00:44,299 --> 00:00:46,620
I'm trying to answer this question.
13
00:00:47,367 --> 00:00:52,061
Together with the team we work on recreating intelligence in robots.
14
00:00:52,061 --> 00:00:57,756
We strive to build robots that can learn, make decisions, recognize environments...
15
00:00:57,756 --> 00:01:01,231
and possibly create without
human intervention.
16
00:01:01,231 --> 00:01:05,920
This is known as artificial intelligence or AI.
17
00:01:05,920 --> 00:01:11,938
If Ai can think as a human or
basically if AI is conscious...
18
00:01:11,938 --> 00:01:15,114
then we call such AI general AI.
19
00:01:15,727 --> 00:01:21,280
On the other hand, if AI can
replicate itself, then we call it super AI.
20
00:01:22,400 --> 00:01:26,020
However our society is far
from achieving these two.
21
00:01:26,447 --> 00:01:31,202
Currently we mostly work on the so-called weak artificial intelligence.
22
00:01:31,202 --> 00:01:36,111
In weak AI we strive
to use human reasoning...
23
00:01:36,111 --> 00:01:40,272
as the model for the robot's training,
but not for its end goal.
24
00:01:40,272 --> 00:01:47,491
Basically we do not try to replicate full human mental capabilities in a robot...
25
00:01:47,491 --> 00:01:51,985
but instead we train robots
to perform simple operations.
26
00:01:51,985 --> 00:01:56,314
For example that a robot is saying hi when somebody is passing by
27
00:01:56,314 --> 00:01:58,800
or that is laughing at on one of your jokes.
28
00:02:00,480 --> 00:02:04,989
At this moment probably most of you would think that AI and robotics...
29
00:02:04,989 --> 00:02:05,840
have the same goal.
30
00:02:06,640 --> 00:02:12,749
However this is not true. In robotics we strive to mimic physical human actions.
31
00:02:12,749 --> 00:02:16,964
We build robots that can pick up objects for example a ball...
32
00:02:16,964 --> 00:02:21,840
and place it on a specific position
that is predefined by a human.
33
00:02:22,960 --> 00:02:25,999
In artificial intelligence
on the other hand
34
00:02:25,999 --> 00:02:29,334
we are trying to recreate
intelligence in robots.
35
00:02:29,334 --> 00:02:34,839
Basically we train robots to recognize
objects, to understand that the object...
36
00:02:34,839 --> 00:02:40,203
is a ball and also to place them accordingly to a specific position...
37
00:02:40,203 --> 00:02:42,080
without human interaction.
38
00:02:43,840 --> 00:02:47,967
Therefore if we have AI and
robotics together...
39
00:02:47,967 --> 00:02:50,720
then we get an artificially intelligent robot.
40
00:02:52,000 --> 00:02:57,208
In which AI acts as a brain and
robotics acts as a body.
41
00:02:58,035 --> 00:03:01,269
So now you know what is artificial intelligence.
42
00:03:01,269 --> 00:03:06,447
However how can we train robots to be artificially intelligent?
43
00:03:06,447 --> 00:03:11,202
How can we train robots to make decisions, to recognize environments...
44
00:03:11,202 --> 00:03:16,638
or to act? Well for this we need to understand human behaviour...
45
00:03:16,638 --> 00:03:21,760
and therefore we need to understand
how the human brain is functioning.
46
00:03:23,120 --> 00:03:27,411
by zooming in on the human brain, we see that it consists...
47
00:03:27,411 --> 00:03:29,520
of billions of brain cells.
48
00:03:30,720 --> 00:03:32,554
They are also called neurons.
49
00:03:32,554 --> 00:03:37,411
Each of these cells is an
electrochemical structure.
50
00:03:37,411 --> 00:03:42,202
Basically each neuron starts
with a set of input antennas...
51
00:03:42,202 --> 00:03:47,628
which are called dendrites. And in these antennas we receive signals...
52
00:03:47,628 --> 00:03:50,630
from the outside world or from the
other cells.
53
00:03:50,631 --> 00:03:55,520
Once when the input signal is high enough...
54
00:03:56,320 --> 00:04:02,626
then the cell starts being activated and it's releasing a set of neurotransmitters...
55
00:04:02,626 --> 00:04:07,813
that travel right to the back of the cell to the synaps...
56
00:04:07,813 --> 00:04:11,040
which is a terminal for communication with another cell.
57
00:04:12,400 --> 00:04:17,208
On that moment we say that our cell is firing or being activated.
58
00:04:17,208 --> 00:04:24,140
Connecting cells into large networks, we can fire specific patterns...
59
00:04:24,140 --> 00:04:29,931
of brain cells and those are then understood as specific situations.
60
00:04:29,932 --> 00:04:35,200
For example if we have a spider in front of us,
61
00:04:35,840 --> 00:04:41,195
we will suddenly get afraid and some of us will run, the others will start screaming.
62
00:04:41,196 --> 00:04:48,999
Simply explained, this visual stimuli or spider is activating a set of cells...
63
00:04:48,999 --> 00:04:54,974
in our brain that are recognized as a pattern that means dangerous situation.
64
00:04:54,974 --> 00:05:00,852
In order to mimic this in robots, we have to understand this pattern...
65
00:05:00,852 --> 00:05:02,878
and we have to program it.
66
00:05:04,159 --> 00:05:10,240
To do so we actually mimic brain cells
by artificial cells
67
00:05:10,800 --> 00:05:13,611
And then we connect them into large
networks...
68
00:05:13,611 --> 00:05:15,520
which are similar to natural ones.
69
00:05:16,720 --> 00:05:22,515
However note that new artificial networks are not electrochemical devices.
70
00:05:22,515 --> 00:05:28,659
They are mathematical devices. Each cell is a mathematical element...
71
00:05:28,659 --> 00:05:32,080
with a set of inputs and a set
of output signals
72
00:05:32,960 --> 00:05:35,522
If the input signal is strong enough
73
00:05:35,522 --> 00:05:39,692
then our mathematical element
will get activated.
74
00:05:39,692 --> 00:05:44,761
It will release an output signal which then can activate some other cell.
75
00:05:46,174 --> 00:05:50,210
In this way we build patterns in large neural networks.
76
00:05:50,211 --> 00:05:55,056
However note that we need to train these patterns...
77
00:05:55,056 --> 00:05:57,347
because we need to train the neural network...
78
00:05:57,347 --> 00:05:59,871
that specific situations mean something.
79
00:05:59,871 --> 00:06:04,772
For example that a spider means dangerous situation.
80
00:06:04,960 --> 00:06:10,366
To do so we have to provide neural networks with huge amounts of data.
81
00:06:10,367 --> 00:06:16,497
For example in order that a robot learns that a spider means dangerous situation...
82
00:06:16,497 --> 00:06:21,455
we have to provide the robot with a lot of images of different spiders.
83
00:06:21,455 --> 00:06:27,120
In this way the robot can recognize features of an animal called spider.
84
00:06:29,040 --> 00:06:32,495
However in this way we are only training robots...
85
00:06:32,495 --> 00:06:36,080
to recognize objects, situations or environments.
86
00:06:37,200 --> 00:06:42,392
But not to act to it. For example if we see a spider we immediately run.
87
00:06:42,392 --> 00:06:45,195
So we would like that our robot does the same.
88
00:06:45,195 --> 00:06:52,137
To do so we have to actually
understand human actions...
89
00:06:52,137 --> 00:06:59,760
in order to train robots to act and for this we are searching solutions in psychology.
90
00:07:01,280 --> 00:07:06,799
In 19th century Ivan Pavlov has studied
something called classical conditioning.
91
00:07:07,680 --> 00:07:11,861
In his famous experiment he let the dog hear a bell sound...
92
00:07:11,861 --> 00:07:13,280
whenever he got some food.
93
00:07:14,240 --> 00:07:17,218
In this way the dog started drooling.
94
00:07:17,218 --> 00:07:22,331
Over time the association between food and bell sound became so strong...
95
00:07:22,331 --> 00:07:26,480
that actually the dog started drooling by only hearing the bell sound.
96
00:07:28,000 --> 00:07:30,848
And this is exactly conditioned learning.
97
00:07:30,848 --> 00:07:36,913
In conditioned learning we are learning an association between two stimuli...
98
00:07:36,913 --> 00:07:38,875
natural and neutral one.
99
00:07:38,875 --> 00:07:42,280
natural is food, neutral one is bell sound.
100
00:07:42,280 --> 00:07:48,871
Over time they are actually having a strong association among each other...
101
00:07:48,871 --> 00:07:51,120
and they will give the same response.
102
00:07:52,000 --> 00:07:56,213
Classical conditioning can also be translated to the robot's world.
103
00:07:56,213 --> 00:08:00,565
For example if we have a robot and two objects, car and a ball...
104
00:08:00,565 --> 00:08:03,402
and we would like that the robot is tracing a ball...
105
00:08:03,402 --> 00:08:08,758
then whenever the robot sees a ball, we actually can play a specific melody...
106
00:08:08,758 --> 00:08:12,640
that is implemented in the computer program as a reward.
107
00:08:13,520 --> 00:08:16,727
In this way whenever a robot
is tracking a ball...
108
00:08:16,727 --> 00:08:19,119
he will know that he's doing good job.
109
00:08:21,440 --> 00:08:26,240
Over time the association between ball
and melody will become so strong
110
00:08:26,240 --> 00:08:29,484
that actually the robot will start tracking any ball...
111
00:08:29,484 --> 00:08:31,919
without hearing any melody.
112
00:08:34,000 --> 00:08:38,001
In classical conditioning therefore we are learning association...
113
00:08:38,001 --> 00:08:44,308
between two stimuli: ball and melody,
food and a bell sound.
114
00:08:44,308 --> 00:08:49,437
In machine learning this is known
as prediction algorithm...
115
00:08:49,437 --> 00:08:52,918
in which we predict
the outcome of each situation.
116
00:08:52,918 --> 00:08:57,520
However there is also a main issue in this learning technique
117
00:08:58,880 --> 00:09:02,989
If we repeat Pavlov's experiment and then give to a dog food...
118
00:09:02,989 --> 00:09:08,367
without any bell sound and if we do this repeatedly, the dog will forget...
119
00:09:08,367 --> 00:09:10,560
to drool whenever hearing a bell sound.
120
00:09:11,680 --> 00:09:14,479
Therefore this learning process
can be diminished.
121
00:09:15,120 --> 00:09:18,616
And this is something we do not
want to use for robots...
122
00:09:18,616 --> 00:09:22,319
because they will forget what
they were trained for.
123
00:09:24,080 --> 00:09:27,223
Hence we need a more complex learning technique.
124
00:09:27,223 --> 00:09:30,079
And this is reinforcement learning.
125
00:09:30,960 --> 00:09:36,061
In reinforcement learning an animal or human will exhibit more frequently...
126
00:09:36,061 --> 00:09:40,805
an action that has led to a reward in the past and less frequently...
127
00:09:40,805 --> 00:09:42,800
actions that have led to punishments.
128
00:09:43,680 --> 00:09:48,829
For example, if you let a cat into a cage with four different buttons...
129
00:09:48,830 --> 00:09:54,480
and let's say that one of these buttons is providing a reward or food to a cat...
130
00:09:54,480 --> 00:09:58,453
and the three other buttons are representing punishments.
131
00:09:58,453 --> 00:10:04,841
For example by electrical current. Then
after some time the cat will realize...
132
00:10:04,841 --> 00:10:09,589
That actually the fourth button is the one that provides food...
133
00:10:09,589 --> 00:10:13,269
and it will start using this button more often.
134
00:10:13,269 --> 00:10:17,535
This can be translated to
the robot's world as well.
135
00:10:17,535 --> 00:10:23,188
In such a case we have to build an algorithm that is based on law of effect.
136
00:10:23,188 --> 00:10:28,610
That means that the algorithm has to
compare different actions...
137
00:10:28,610 --> 00:10:32,817
by their outcomes and also
has to build associations.
138
00:10:32,817 --> 00:10:37,280
It has to associate each situation
with one pre-learned action.
139
00:10:39,120 --> 00:10:44,412
And this is allowing us then to train
robots for unknown environments.
140
00:10:44,412 --> 00:10:47,465
For example if you want
to send the robot to mars...
141
00:10:47,465 --> 00:10:52,080
then you would like that this robot can move without hitting any obstacles.
142
00:10:52,960 --> 00:10:55,730
And for this we use
reinforcement learning.
143
00:10:55,730 --> 00:10:59,840
However the biggest issue in reinforcement learning
144
00:11:00,400 --> 00:11:03,429
is that it has to interact
with an environment.
145
00:11:03,429 --> 00:11:06,319
We need to get feedback
from the environment.
146
00:11:07,520 --> 00:11:09,697
And this is not always possible.
147
00:11:09,697 --> 00:11:13,744
For example we have environments that give us very sparse rewards...
148
00:11:13,744 --> 00:11:15,935
or no reward at all.
149
00:11:16,522 --> 00:11:21,680
For example if you want to train a robot to automatically play a computer game,
150
00:11:22,240 --> 00:11:26,603
then you can't use reinforcement learning if you are winning or losing...
151
00:11:26,603 --> 00:11:27,839
only in the end of the game.
152
00:11:30,160 --> 00:11:33,376
In such situations we need
a new type of learning.
153
00:11:33,376 --> 00:11:35,680
This is called imitation learning
154
00:11:37,120 --> 00:11:42,661
As its name says in imitation learning we bring an imitator or teacher...
155
00:11:42,661 --> 00:11:46,511
that is actually demonstrating
an action to a robot.
156
00:11:46,511 --> 00:11:50,201
In this way the robots
gets more data...
157
00:11:50,201 --> 00:11:52,400
and the learning process
becomes shorter.
158
00:11:53,680 --> 00:11:58,800
Imitation learning can be used for
example if we send our robots to Mars.
159
00:11:58,800 --> 00:12:01,040
And we would like them to build settlements.
160
00:12:02,880 --> 00:12:05,262
By imitating human behaviour...
161
00:12:05,262 --> 00:12:09,839
robots will start building these
structures in acollaborative manner.
162
00:12:10,880 --> 00:12:16,667
So now we are learning that actually it's not enough to train one robot individually...
163
00:12:16,667 --> 00:12:19,155
but we have to train them in a group.
164
00:12:19,155 --> 00:12:22,680
This is similar to human society.
165
00:12:22,681 --> 00:12:28,581
Humans are grouping together in order to collaborate for better survival.
166
00:12:28,581 --> 00:12:32,080
Therefore if we want to build
human free factories...
167
00:12:32,640 --> 00:12:35,443
we have to use collaborative robotics.
168
00:12:35,443 --> 00:12:39,821
Imagine two robots that are welding together or two robots...
169
00:12:39,821 --> 00:12:44,650
that are printing material together into some very precise structure.
170
00:12:44,650 --> 00:12:47,477
In such case we can get a lot of issues.
171
00:12:47,477 --> 00:12:52,764
For example one robot could recognize an object and the other not.
172
00:12:52,764 --> 00:12:56,900
One is processing information
much faster than the other.
173
00:12:56,900 --> 00:13:01,011
Then they are not aware of
its own physical constraints...
174
00:13:01,011 --> 00:13:03,471
or physical constraints of other robots.
175
00:13:03,471 --> 00:13:06,743
And also they need to be
aware of all actions....
176
00:13:06,743 --> 00:13:09,629
that all other robots
in the group are taking.
177
00:13:09,629 --> 00:13:14,320
Therefore we need to enhance AI
to be collaborative
178
00:13:15,840 --> 00:13:19,840
It doesn't matter if we train an
individual robot or a group of them.
179
00:13:20,400 --> 00:13:24,173
We still need to use big data sets to train robots.
180
00:13:24,173 --> 00:13:27,040
And therefore we need
huge computing power.
181
00:13:27,920 --> 00:13:30,627
For this purpose we use supercomputers.
182
00:13:30,627 --> 00:13:34,159
However even a bee's brain is more powerful...
183
00:13:34,159 --> 00:13:36,938
than the largest supercomputer on earth.
184
00:13:36,938 --> 00:13:41,723
Of course supercomputers are
faster in processing information.
185
00:13:41,723 --> 00:13:47,183
In this case this would be around
200 000 times faster than a bee's brain.
186
00:13:47,183 --> 00:13:50,960
However they require more energy.
187
00:13:51,920 --> 00:13:57,323
For processing the same information the bee's brain needs 10 microwatts of energy
188
00:13:57,323 --> 00:14:01,266
while the supercomputer
needs 13 megawatts.
189
00:14:01,267 --> 00:14:06,804
Therefore AI algorithms require
a lot of energy...
190
00:14:06,804 --> 00:14:12,420
and we need to build new ones that would be based only on small data sets...
191
00:14:12,420 --> 00:14:15,078
and hence use less energy.
192
00:14:15,078 --> 00:14:18,978
On the other hand when we train robots...
193
00:14:18,978 --> 00:14:22,387
we train them based on
our previous knowledge.
194
00:14:22,388 --> 00:14:26,853
That means that a robot can make
a new piece of music or a new drawing.
195
00:14:26,853 --> 00:14:30,319
However this is all based
on our previous knowledge.
196
00:14:32,320 --> 00:14:35,156
And therefore they can't
create anything new.
197
00:14:35,156 --> 00:14:41,771
Some researchers argue that then artificial intelligence is not intelligent at all.
198
00:14:41,772 --> 00:14:47,752
However I do believe that we will make robots more intelligent than they are today.
199
00:14:47,752 --> 00:14:51,824
For this we need to pass from building associative relationships...
200
00:14:51,824 --> 00:14:54,854
between stimuli or stimulus and action...
201
00:14:54,855 --> 00:14:58,124
and start building
cause-effect relationships.
202
00:14:58,124 --> 00:15:02,425
For example, our robots today
can very precisely move...
203
00:15:02,425 --> 00:15:06,400
but they don't know enough to
predict possible accidents.
204
00:15:08,080 --> 00:15:12,189
Their knowledge is lower than
knowledge of a 16 month old baby.
205
00:15:12,189 --> 00:15:18,645
Therefore we need to build AI that can build such cause-effect relationships.
206
00:15:18,645 --> 00:15:23,852
Today we have learned that
the machine and robot intelligence...
207
00:15:23,852 --> 00:15:30,276
depends on big data sets, complex
algorithms and huge computing power.
208
00:15:30,276 --> 00:15:35,430
However is this enough to
achieve general or super AI?
209
00:15:35,431 --> 00:15:39,840
Well, only time will tell.
However one thing is for sure.
210
00:15:40,720 --> 00:15:44,408
Robot intelligence will be
different from human intelligence...
211
00:15:44,408 --> 00:15:49,065
in the same way as animal intelligence is different from human one.
212
00:15:49,065 --> 00:15:54,283
Robots will just excel in different
types of skills than humans.
213
00:15:54,283 --> 00:15:57,890
And I do believe that one day
they will make more...
214
00:15:57,890 --> 00:16:07,840
than what human species have
done so far. Thank you.