This task type uses CLIP to crop a previously defined VOI (Volume-of-Interest) out of the input image.
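To make the cropping step concrete, here is a minimal sketch of extracting a box-shaped VOI from a 3D image with NumPy slicing. It does not use or reproduce CLIP's interface. The function name `crop_voi`, the parameters `start` and `size`, and the (z, y, x) axis order are assumptions made only for illustration.

```python
import numpy as np


def crop_voi(image, start, size):
    """Return a copy of the box-shaped sub-volume of `image`.

    Hypothetical helper, not part of CLIP: `start` is the first voxel
    of the VOI and `size` its extent, both given as (z, y, x).
    """
    if image.ndim != 3:
        raise ValueError("expected a 3D image")
    start = np.asarray(start, dtype=int)
    size = np.asarray(size, dtype=int)
    stop = start + size
    # Reject a VOI that is empty or extends past the image bounds.
    if np.any(start < 0) or np.any(size <= 0) or np.any(stop > image.shape):
        raise ValueError("VOI lies outside the image bounds")
    z, y, x = (slice(int(a), int(b)) for a, b in zip(start, stop))
    # Copy so that later edits to the crop do not modify the input image.
    return image[z, y, x].copy()


# Toy input: a 4 x 5 x 6 volume with unique voxel values.
image = np.arange(4 * 5 * 6).reshape(4, 5, 6)
voi = crop_voi(image, start=(1, 2, 3), size=(2, 2, 2))
```

The sketch assumes the VOI was defined beforehand as a start position and an extent. A VOI stored as a mask or in physical coordinates would first need converting to voxel indices.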