The model is passed in label data that is, input data, along with the associative prediction and since in this case we know what the right answer is, we're able to calculate the error that is, how many times is the model getting it wrong and by how much we used these errors to improve the model, and this process is repeated many, many times until we reach the point that we think that the model is good enough or that this is the best that we can do.