Microsoft has released a new multimodal reasoning model: Phi-4-reasoning-vision-15B. The model combines two existing models using a mid-fusion approach and can analyze images, scientific graphs, ...