<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>ArXiv | Khoa Vo</title><link>https://vhvkhoa.github.io/tags/arxiv/</link><atom:link href="https://vhvkhoa.github.io/tags/arxiv/index.xml" rel="self" type="application/rss+xml"/><description>ArXiv</description><generator>Hugo Blox Builder (https://hugoblox.com)</generator><language>en-us</language><lastBuildDate>Sat, 27 Dec 2025 00:00:00 +0000</lastBuildDate><image><url>https://vhvkhoa.github.io/media/icon_hu4529995727383835976.png</url><title>ArXiv</title><link>https://vhvkhoa.github.io/tags/arxiv/</link></image><item><title>Clutter-Resistant Vision-Language-Action Models through Object-Centric and Geometry Grounding</title><link>https://vhvkhoa.github.io/publication/2025obeyedvla/</link><pubDate>Sat, 27 Dec 2025 00:00:00 +0000</pubDate><guid>https://vhvkhoa.github.io/publication/2025obeyedvla/</guid><description>&lt;h2 id="tldr">TL;DR&lt;/h2>
&lt;ul>
&lt;li>&lt;strong>Status:&lt;/strong> Under submission at IEEE Transactions on Robotics (T-RO).&lt;/li>
&lt;li>&lt;strong>Problem:&lt;/strong> End-to-end VLAs often lose reliable language-vision grounding in cluttered real-world scenes, when the target is absent, under background shifts, and with unseen objects.&lt;/li>
&lt;li>&lt;strong>Idea:&lt;/strong> OBEYED-VLA decouples perception from control using frozen VLM-based object-centric grounding plus masked-depth geometric grounding, then fine-tunes a VLA only on clean single-object demonstrations.&lt;/li>
&lt;li>&lt;strong>Results:&lt;/strong> OBEYED-VLA improves robustness in cluttered scenes, absent-target rejection, background appearance changes, and unseen-object manipulation. Ablations show that both semantic and geometry-aware grounding are critical.&lt;/li>
&lt;/ul>
&lt;h2 id="project-page">Project Page&lt;/h2>
&lt;p>See the full project page for the method overview, videos, and experiments: &lt;a href="https://uark-aicv.github.io/OBEYED_VLA/" target="_blank" rel="noopener">OBEYED-VLA&lt;/a>.&lt;/p></description></item></channel></rss>