Visual Objects Tutorials

19h

Humanoid Robotics Emerges as Solution for Not-Well-Defined Real-World Problems

New Analysis Platform Explores Why Household Tasks and Physical Automation Require Embodied Intelligence Beyond Traditional Computer Approaches The next wave of AI is physical AI. AI that understands ...

IEEE

Object-Aware Image Augmentation for Audio-Visual Zero-Shot Learning

Abstract: Audio-visual zero-shot learning (ZSL) leverages both video and audio information for model training, aiming to classify new video categories that were not seen during the training. However, ...

IEEE

CTRL-O: Language-Controllable Object-Centric Visual Representation Learning

Abstract: Object-centric representation learning aims to decompose visual scenes into fixed-size vectors called "slots" or "object files", where each slot captures a distinct object. Current ...

GitHub

T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Note: This model has been trained for approximately 2.7M steps (batch size = 1) and is still in the training process. I have attached a .ipynb file in the repository. You can refer to it to know how ...

GitHub

R1-Track: Direct Application of MLLMs to Visual Object Tracking via Reinforcement Learning

Visual (Single) Object Tracking aims to continuously localize and estimate the scale of a target in subsequent video frames, given only its initial state in the first frame. This task can be ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results