Name: 使用TensorFlow的自然語言處理2--初級課程 (Natural Language Processing with TensorFlow 2 - Beginner's Course)
Uploaded: 2021-01-14T10:22:21.000Z
Duration: 1 h 35 min 44 s
Description: 【看影片學英語】數萬部 YouTube 影片，搭配英漢字典即點即查，輕鬆掌握單字發音與用法，長久累積看電影不必再看字幕。

Practical introduction to Natural Language Processing with Tensorflow, too.

I got my PhD in experimental condensed matter physics and went to work for Intel Corporation as a back end drying process engineer.

I left there in 2015 to pursue my own interests and have been studying artificial intelligence and deep burning ever since.

If you're unfamiliar with natural language processing, it is the application of deep neural networks to text processing allows us to do things such as text generation.

You may have heard the hubbub in recent months over the open A I G p T to our them that allow them to produce fake news and also allows us to do things like sentiment classification as well as something more mathematical, which is representing strings of characters, words as mathematical constructs that allow us to determine relationships between those words.

It would be most helpful if you have some background in deporting.

No networks, but it's not really required.

We're gonna walk through everything in the tutorial, so you'll be able to go from start to finish without any prior knowledge.

Although of course, it would be helpful if you would like to see him or deep learning, reinforcement, learning and natural language processing content.

Check man here on YouTube at machine learning with Phil, I hope to see you there.

In this tutorial, you are going to do word in beddings with tensorflow two point.

Oh, if you don't know what that means, don't worry.

I'll explain what it is and why it's important As we go along.

Let's get started before we begin with our imports a couple of housekeeping items.

First of all, I am basically working through the tensorflow tutorial from their website.

So gonna link that in the description s I'm not claiming this code is my own, although I do some cleaning up at the end to kind of make it my own.

We need iota handle dumping the word and beddings to a file so that we can visualize later.

We'll need Matt plot live handle plotting.

So this is tensorflow 2.1 dot zero RC one released candidate one.

So this is, as far as I'm aware, the latest build so attention for two point.

Oh, throw some really weird warnings and 2.1 seems to deal with that.

So if you're running tensorflow 2.0, and you get funny errors.

Funny warnings, but you still get functional code and learning.

That is why you wanna update to the newest version of TENSORFLOW.

Course we needed care us to handle pretty much everything.

We also need the layers for our embedding and dense layers, and we're also going to use the tensorflow data sets.

So I'm not gonna have you download your own data set.

We're going to use the I am D B movie data set for this particular tutorial.

So of course, that is an additional independency for this tutorial.

So now that we've handled our imports, let's talk a little bit about what word and weddings are.

So how could you represent a word for a machine?

And more importantly, instead of a string of characters.

How can you represent a collection of words?

One way is to take the entire set of all the words that you have in your same movie reviews.

You know, you just take all the words and find all the unique words and that becomes your dictionary, and you can represent that as a one hot encoding.

So if you have, let's say 10,000 words, then you would have a vector for each word with 10,000 elements, which are predominant heroes except for the one correspondent to whichever word it is.

The problem with this and coding is that while it does work, it is incredibly inefficient.

And because it is sparse, you know, the majority of the data is zero and the only one important bit and the whole thing so not very efficient.

And another option is to do imager and coding, so you could just rank order the numbers.

You could just assign a number to each unique word, and then every time that word appears in a review.

You would have that imager in an array, so you end up with a set of variable length of Ray's, where the length of the array corresponds.

The number of words in the review and the members of the array correspond to the words that appear within that review.

This is far more efficient, but it's still not quite ideal, right?

So it doesn't tell you anything about the relationships between the words.

So if you think of the word, let's say King.

So there's some relationship between the king and a man.

So there is also the connotation of owning land and having control over that land.

King males have a queen, so it has some sort of relationship to a queen as well may have a prince and princess.

You know all these kinds of different relationships between words that are not incorporated into the er imager encoding of our dictionary.

The reason is that the image a recording of our dictionary forms a basis in some higher dimensional space.

But all of those vectors are orthogonal, so if we take their dot product, they are essentially at right angles to each other in a hybrid dimensional space.

And southern dot product is zero, so there's no projection of one vector one word onto another.

There's no overlap in the meaning between the words, at least in this higher dimensional space.

Now word M beddings fix this problem by keeping the in injuring coding but then doing a transformation to a totally different space.

So we introduce a new space of vector of some arbitrary length.

It's a hybrid parameter of your model, much like the number of neurons in a dense layer is hyper primitive.

Your model, the length of the embedding layer is a hyper parameter, and we'll just say it's eight.

So the word King then has eight floating point elements that describe its relationship to all the other vectors in that space.

And so what that allows you to do is to take dot products between two arbitrary words in your dictionary and you get non zero components, and so that what that means, in practical terms, is that you get a sort of semantic relationship between words that emerges is a consequence of training your model.

So the way it works in practice is we're gonna have a whole bunch of reviews from the IMDB data set, and they will have some classifications as a good or bad review.

So, for instance, you know, uh, for the Star Wars last Jedi movie.

I don't think it's in the in there, but, you know, my review would be that it was terrible, awful, no good, totally ruined Luke Luke's character.

And so you would see on I'm not alone in that.

So if you did a huge number of reviews for the last 10 eye, you would see a strong correlation of words such as horrible bad.

Wouldn't characters Mary sue things like that?

And so the model would then, uh, take those words running through the embedding layer and try to come up with a prediction for whether not that is a good or bad review and match it up to the training label and then do back propagation to vary those weights in that embedding layer.

So say eight elements and by training over the data set multiple times.

You are able to predict whether or not a review is positive or negative about a particular movie.

But also it shows you the relationship between the words because the model learns the correlations between words within reviews that give it either a positive or negative context.

So that is word M beddings in a nutshell, and we're gonna go ahead and get started coding that.

So the first thing we're gonna have is a on embedding layer, and this is just gonna be for illustration purposes.

I'm gonna be layers start embedding And let's say there's 1000 and five elements, so we'll see results.

So then let's print the result, uh dot numb pie.

Okay, so let's head to the terminal and execute this and see precisely what we get.

字幕列表影片播放

使用TensorFlow的自然語言處理2--初級課程 (Natural Language Processing with TensorFlow 2 - Beginner's Course)

stuff

process

bunch

positive