Tackling multiple tasks with a single visual language model