Contact-Anchored Policies

Contact Conditioning Creates Strong Robot Utility Models

Zichen Jeff Cui, Omar Rayyan, Haritheja Etukuru, Bowen Tan, Zavier Andrianarivo, Zicheng Teng, Yihang Zhou, Krish Mehta, Nicholas Wojno, Kevin Yuanbo Wu, Manan H Anjaria, Ziyuan Wu, Manrong Mao, Guangxun Zhang, Binit Shah, Yejin Kim, Soumith Chintala, Lerrel Pinto, Nur Muhammad Mahi Shafiullah

Corresponding author: jeff dot cui at nyu dot edu

CAPs in action

Read the paper

GitHub (Coming Soon)

iPhone App

The prevalent paradigm in robot learning attempts to generalize across environments, embodiments, and tasks with language prompts at runtime. A fundamental tension limits this approach: language is often too abstract to guide the concrete physical understanding required for robust manipulation. In this work, we introduce Contact-Anchored Policies (CAP), a new class of general robotic behavior models, which replace language conditioning with points of physical contact in space. Simultaneously, we structure CAP as a library of modular utility models rather than a monolithic generalist policy. This factorization allows us to implement an efficient real-to-sim iteration cycle: we build EgoGym, a lightweight simulation benchmark, to rapidly identify failure modes and refine our models and datasets prior to real-world deployment.

We show that by conditioning on contact and iterating via simulation, CAP generalizes to novel environments and embodiments out of the box on three fundamental manipulation skills while using only 23 hours of demonstration data, and outperforms large, state-of-the-art VLAs in zero-shot evaluations by 56%. All model checkpoints, codebase, hardware, simulation, and datasets will be open-sourced.

Contact-anchored policies (CAPs) are trained on a diverse dataset of contact-anchored interactions:

Contact Anchoring

Data Collection Setup — Handheld gripper

Automatic contact anchor labeling

Diverse in-the-wild data

Deployment across embodiments in real and sim

UR3e

xArm

iPhone App

EgoGym Pick

Videos

CAPs in action

We evaluate CAPs zero-shot on Pick (general object pickup), Open (opening cabinet doors and drawers), and Close (closing). We then chain these lower-level skills to perform more complex tasks, such as cleaning a table until it's empty, or fetching a bag of coffee beans from within a cabinet.

Zero-shot Success Compilation

Long-Horizon Tasks

CAPs can be composed to perform complex, long-horizon behaviors. In the following cases, CAPs are controlled with a high-level controller using a VLM that sequences and retries the tasks as necessary.

Cross-Embodiment

We also deploy CAPs on different robot embodiments (xArm, UR3e, iPhone demo app) without any environment or embodiment specific fine-tuning. For the robots, we just need to build an appropriate mount and write an inverse kinematics controller for the robot. For the iPhone, the app instructs the user to move to the next position and closes the simulated gripper when the policy does so.

Zero-shot home deployment

We deploy CAPs zero-shot in a real New York apartment. This policy has never seen this apartment (no new training data or fine-tuning). The robot performs the following tasks zero-shot.

EgoGym Sim Environment

One of the biggest costs of developing general policies is running real-world evaluations. To address this, we develop EgoGym, a lightweight simulation suite that allows us to evaluate CAPs across diverse environments within our training loop. We ran single-blind checkpoint evaluations and found that the simulated success rate is coarsely indicative of real world performance improvement.

Failure Modes

Hardware

Robot Gripper/iPhone Mount

Make a gripper for your own robot arm with our 3D-printed mount, a Dynamixel motor set, and an iPhone.

Gripper Assembly Guide

Setup Guide

Paper

Contact-Anchored Policies: Contact Conditioning Creates Strong Robot Utility Models

Read the paper (arXiv)

Read the paper (PDF)

Citation (bibtex)

@article{cui2026contact,
  title={Contact-Anchored Policies: Contact Conditioning Creates Strong Robot Utility Models},
  author={Zichen Jeff Cui and Omar Rayyan and Haritheja Etukuru and Bowen Tan and Zavier Andrianarivo and Zicheng Teng and Yihang Zhou and Krish Mehta and Nicholas Wojno and Kevin Yuanbo Wu and Manan H Anjaria and Ziyuan Wu and Manrong Mao and Guangxun Zhang and Binit Shah and Yejin Kim and Soumith Chintala and Lerrel Pinto and Nur Muhammad Mahi Shafiullah},
  journal={arXiv preprint arXiv:2602.09017},
  year={2026}
}

Code

Get the code on github.

GitHub (Coming soon)

Documentation