Let's start the discussion from the beginning. What does a background subtraction process look like? Consider the following image:

The previous image represents the background scene. Now, let's introduce a new object into this scene:

As we can see, there is a new object in the scene. So, if we compute the difference between this image and our background model, you should be able to identify the location of the TV remote:

The overall process looks like this:
