Name: 第1講 | 視覺識別的卷積神經網絡簡介 (Lecture 1 | Introduction to Convolutional Neural Networks for Visual Recognition)
Uploaded: 2021-01-14T07:28:44.000Z
Duration: 57 min 57 s
Description: 【看影片學英語】數萬部 YouTube 影片，搭配英漢字典即點即查，輕鬆掌握單字發音與用法，長久累積看電影不必再看字幕。

I'm super excited to offer this class again

It seems that every time we offer this class

it's growing exponentially unlike most things in the world.

This is the third time we're teaching this class.

Last year, we had 350 students, so it doubled.

This year we've doubled again to about 730 students

So anyone who was not able to fit into the lecture hall

But, the videos will be up on the SCPD website

then you can still check it out within a couple hours.

So this class CS231n is really about computer vision.

Computer vision is really the study of visual data.

Since there's so many people enrolled in this class,

I think I probably don't need to convince you

but I'm still going to try to do that anyway.

has really exploded to a ridiculous degree

And, this is largely a result of the large number

So I think on average there's even more cameras

And, as a result of all of these sensors,

there's just a crazy large, massive amount

of visual data being produced out there in the world

So one statistic that I really like to kind of put

which is where we are now that roughly 80%

of all traffic on the internet would be video.

and other types of visual data on the web.

But, just from a pure number of bits perspective,

the majority of bits flying around the internet

So it's really critical that we develop algorithms

that can utilize and understand this data.

However, there's a problem with visual data,

and that's that it's really hard to understand.

Sometimes we call visual data the dark matter

of the internet in analogy with dark matter in physics.

So for those of you who have heard of this in physics

before, dark matter accounts for some astonishingly large

and we know about it due to the existence

of gravitational pulls on various celestial bodies

and what not, but we can't directly observe it.

And, visual data on the internet is much the same

flying around the internet, but it's very difficult

for algorithms to actually go in and understand

and see what exactly is comprising all the visual data

Another statistic that I like is that of Youtube.

that happens in the world, there's something like five hours

one, two, three, now there's 15 more hours

Google has a lot of employees, but there's no way

that they could ever have an employee sit down

and watch and understand and annotate every video.

relevant videos and maybe monetize by putting ads

on those videos, it's really crucial that we develop

technologies that can dive in and automatically understand

truly an interdisciplinary field, and it touches

So obviously, computer vision's the center of the universe,

around computer vision, we touch on areas like physics

because we need to understand optics and image formation

and how images are actually physically formed.

We need to understand biology and psychology

to understand how animal brains physically see

We of course draw a lot on computer science,

mathematics, and engineering as we actually strive

So a little bit more about where I'm coming from

and about where the teaching staff of this course

字幕列表影片播放

第1講 | 視覺識別的卷積神經網絡簡介 (Lecture 1 | Introduction to Convolutional Neural Networks for Visual Recognition)

sort

assume

process

recognize