Detect and label objects in images and videos
Chat with Eagle2-VL to generate text based on text and images