Curieux.JY
  • JungYeon Lee
  • Post
  • ๐Ÿ•ธ๏ธ Graph
  • Lecture
  • Note

On this page

  • ๐Ÿ” Ping Review
  • ๐Ÿ”” Ring Review
    • ํ•œ ์ค„๋กœ ์‹œ์ž‘ํ•˜๋ฉด
    • ๋ฐฐ๊ฒฝ: ์™œ ์‚ฌ๋žŒ ์‹œ์—ฐ ์ „์ด๊ฐ€ ์–ด๋ ค์šด๊ฐ€
    • ๋ฐฉ๋ฒ• ์ƒ์„ธ
    • ์‹คํ—˜
    • ๋น„ํŒ์ ์œผ๋กœ ๋ณด๋ฉด
      • ๊ฐ•์ 
      • ์•ฝ์ ยทํ•œ๊ณ„
    • ๊ด€๋ จ ์—ฐ๊ตฌ์™€์˜ ์ž๋ฆฌ ๋งค๊น€
    • ์š”์•ฝ

๐Ÿ“ƒCHORD ๋ฆฌ๋ทฐ

dexterity
manipulation
rl
contact
force-closure
human-demonstration
benchmark
humanoid
cross-embodiment
CHORD: Learning Dexterous Manipulation Using Contact Wrench Guidance From Human Demonstration
Published

June 30, 2026

  • Project

  • Paper (PDF)

  • Code ยท arXiv: Coming Soon (2026-06 ๊ธฐ์ค€ ๋ฏธ๊ณต๊ฐœ)

  • Xinghao Zhu*, Zixi Liu*, Shalin Jain*, Chenran Liโ€ , Milad Nooriโ€ , Huihua Zhao, John Welsh, Michael Andres Lin, Wei Liu, Tingwu Wang, Xingye Da, Zhengyi Luo, Vishal Kulkarni, Naema Bhatti, Yuke Zhu, Linxi Fan, Bowen Wen, Danfei Xu, Soha Pouya, Yan Changโ€ก

  • NVIDIA (Isaac ยท video_to_data) โ€” *equal, โ€ core, โ€กproject leadยทcorresponding

  • Preprint, 2026

  1. ๐Ÿ’ก ์‚ฌ๋žŒ ์‹œ์—ฐ์„ ์†์žฌ์ฃผ(dexterous) ๋กœ๋ด‡ ์ •์ฑ…์œผ๋กœ ์˜ฎ๊ธธ ๋•Œ, ์† ๋ชจ์–‘ยท์ ‘์ด‰ ์œ„์น˜๋ฅผ ๊ทธ๋Œ€๋กœ ๋ฒ ๋ผ๋Š” ๋Œ€์‹  ์ ‘์ด‰์ด ๋ฌผ์ฒด์— ๊ฐ€ํ•  ์ˆ˜ ์žˆ๋Š” ํž˜ยทํ† ํฌ(contact wrench) ๋ฅผ ๋งž์ถ”๊ฒŒ ํ•˜๋ฉด ํ˜•ํƒœ๊ฐ€ ๋‹ค๋ฅธ ๋กœ๋ด‡๋„ ๊ฐ™์€ โ€œ๋ฌผ์ฒด ์šด๋™ ํšจ๊ณผโ€๋ฅผ ์žฌํ˜„ํ•  ์ˆ˜ ์žˆ๋‹ค.
  2. โš™๏ธ ์‚ฌ๋žŒ ์‹œ์—ฐ์—์„œ ์ ‘์ด‰์ ยท๋งˆ์ฐฐ์ฝ˜ โ†’ ์ ‘์ด‰ ๋ Œ์น˜ ํ–‰๋ ฌ โ†’ support function์„ ๋ฝ‘์•„, ๋กœ๋ด‡์˜ ๋ Œ์น˜๊ฐ€ ์‚ฌ๋žŒ ๋ Œ์น˜๋ฅผ ์žฌํ˜„ํ•˜๋„๋ก ํ•˜๋Š” contact wrench-space reward ๋ฅผ RL(taskยทimitation reward + VOC)์— ๋”ํ•œ๋‹ค.
  3. ๐ŸŽฏ 4,739๊ฐœ bimanual ํƒœ์Šคํฌ ๋ฒค์น˜๋งˆํฌ๋ฅผ ๊ตฌ์ถ•ํ•˜๊ณ , ๊ทธ์ค‘ 1,831๊ฐœ์—์„œ ํ‰๊ท  ์„ฑ๊ณต๋ฅ  82.12%, whole-body loco-manipulation์—์„œ 90.77%, ์‹ค์„ธ๊ณ„ open/closed-loop ์ „์ด๊นŒ์ง€ ๋ณด์˜€๋‹ค.

๐Ÿ” Ping Review

๐Ÿ” Ping โ€” A light tap on the surface. Get the gist in seconds.

์†์žฌ์ฃผ ์กฐ์ž‘์„ ์‚ฌ๋žŒ ์‹œ์—ฐ์œผ๋กœ ๊ฐ€๋ฅด์น  ๋•Œ ๊ฐ€์žฅ ํ”ํ•œ ํ•จ์ •์€ โ€œ์‚ฌ๋žŒ ์†๋™์ž‘์„ ๊ทธ๋Œ€๋กœ ๋”ฐ๋ผ ํ•˜๊ฒŒ ํ•˜๋Š” ๊ฒƒโ€์ด๋‹ค. ์‚ฌ๋žŒ ์†๊ณผ ๋กœ๋ด‡ ์†์€ ํ˜•ํƒœยท์šด๋™ํ•™ยท์ ‘์ด‰ ๊ธฐํ•˜๊ฐ€ ๋‹ค๋ฅด๋ฏ€๋กœ, ๊ฐ™์€ ๋ฌผ์ฒด ํšจ๊ณผ๋ฅผ ๋‚ด๋ ค๋ฉด ๋‹ค๋ฅธ ์ ‘์ด‰์„ ์จ์•ผ ํ•œ๋‹ค. CHORD์˜ ํ•ต์‹ฌ ํ†ต์ฐฐ์€ ์ ‘์ด‰(contact)์ด ์‚ฌ๋žŒ ์‹œ์—ฐ๊ณผ ๋กœ๋ด‡ ํ–‰๋™์„ ์ž‡๋Š” ์ž์—ฐ์Šค๋Ÿฌ์šด ๋‹ค๋ฆฌ ๋ผ๋Š” ๊ฒƒ์ด๋‹ค. ๋‹จ, 3D ์ ‘์ด‰ ์œ„์น˜ ๋ฅผ ๋งž์ถ”๋Š” ๊ฒƒ์œผ๋กœ๋Š” ๋ถ€์กฑํ•˜๋‹ค โ€” ๊ฐ™์€ ํ‘œ๋ฉด์„ ๋งŒ์ ธ๋„ ์ ‘์ด‰ ๋ฒ•์„ ยทํž˜ ๋ฐฉํ–ฅ์— ๋”ฐ๋ผ ๋ฌผ์ฒด์— ์ƒ๊ธฐ๋Š” ์šด๋™์ด ๋‹ฌ๋ผ์ง€๊ธฐ ๋•Œ๋ฌธ์ด๋‹ค. ๊ทธ๋ž˜์„œ CHORD๋Š” ์ ‘์ด‰์„ object-centric wrench space(์ ‘์ด‰์ด ๋ฌผ์ฒด์— ๊ฐ€ํ•  ์ˆ˜ ์žˆ๋Š” forceโ€“torque ๋ฐฉํ–ฅ์˜ ๊ณต๊ฐ„)์—์„œ ํ‘œํ˜„ํ•˜๊ณ , ์‚ฌ๋žŒ๊ณผ ๋กœ๋ด‡์˜ ์ ‘์ด‰์„ โ€œ์–ด๋–ค ๋ฌผ์ฒด ์šด๋™์„ ์œ ๋ฐœํ•  ์ˆ˜ ์žˆ๋Š”๊ฐ€โ€๋กœ ๋น„๊ตํ•œ๋‹ค. ์ ‘์ด‰ ์œ„์น˜ยท๊ฐœ์ˆ˜ยท์† ๋ชจ์–‘์ด ๋‹ฌ๋ผ๋„ wrench space์—์„œ๋Š” ๋น„๊ต๊ฐ€ ๋œ๋‹ค.


CHORD ๊ฐœ์š”(Fig. 1) โ€” (a) ์‚ฌ๋žŒ ์‹œ์—ฐ์˜ hand-object ๊ถค์ ์„ ์ž…๋ ฅ์œผ๋กœ (b) ์‹œ๋ฎฌ๋ ˆ์ด์…˜์—์„œ ๋กœ๋ด‡ ์ •์ฑ…์„ ํ•™์Šตํ•ด (c) ์‹ค๋กœ๋ด‡์œผ๋กœ ์ „์ดํ•œ๋‹ค. (d) articulated, (e) rigid ๋ฌผ์ฒด ์กฐ์ž‘๊ณผ (f, g) whole-body ์ž„๋ฒ ๋””๋จผํŠธ๋กœ ์ผ๋ฐ˜ํ™”. ๋ฐฐ๊ฒฝ์€ 4,739๊ฐœ bimanual ํƒœ์Šคํฌ ๋Œ€๊ทœ๋ชจ ๋ฒค์น˜๋งˆํฌ.

ํ•ต์‹ฌ ๋ฐฉ๋ฒ•๋ก :

๊ฐ ์‹œ์ ยท๊ฐ•์ฒด ๋ถ€๋ถ„ k์— ๋Œ€ํ•ด ์‚ฌ๋žŒ ์‹œ์—ฐ์—์„œ ์ ‘์ด‰์  p^{h,k}_i์™€ ๋ฒ•์„  n^{h,k}_i๋ฅผ ๋ฝ‘๋Š”๋‹ค. ์ ‘์ด‰ i์—์„œ ๊ฐ€๋Šฅํ•œ ์ ‘์ด‰๋ ฅ f๊ฐ€ ๋งŒ๋“œ๋Š” primitive wrench ๋Š” force์™€ ๊ทธ ๋ชจ๋ฉ˜ํŠธ๋ฅผ ์Œ“์€ 6D ๋ฒกํ„ฐ๋‹ค:

w^{i,j}_{h,k} = \left[\, f^{j}_{h,k},\; p^{i}_{h,k}\times f^{j}_{h,k} \,\right]^{\top}\in\mathbb{R}^6 .

Coulomb ๋งˆ์ฐฐ์ฝ˜์„ d๊ฐœ์˜ ๋ชจ์„œ๋ฆฌ ํž˜์œผ๋กœ ๊ทผ์‚ฌํ•ด ๋ชจ๋“  primitive wrench๋ฅผ ๋ชจ์œผ๋ฉด wrench matrix \mathcal{W}_{h,k}\in\mathbb{R}^{6\times(c_{h,k}d)} ๊ฐ€ ๋œ๋‹ค โ€” ์ด ํ–‰๋ ฌ์ด โ€œ์‚ฌ๋žŒ ์ ‘์ด‰์ด ๋ฌผ์ฒด์— ๊ฐ€ํ•  ์ˆ˜ ์žˆ๋Š” forceโ€“torque ๋ฐฉํ–ฅโ€ ์ „์ฒด๋ฅผ ๋‹ด๋Š”๋‹ค. ๋‘ wrench matrix๋Š” ์—ด ๊ฐœ์ˆ˜ยท์ˆœ์„œ๊ฐ€ ๋‹ฌ๋ผ ์ง์ ‘ ๋น„๊ต๊ฐ€ ์–ด๋ ต๋‹ค. ๊ทธ๋ž˜์„œ ๋ฏธ๋ฆฌ ๋ฝ‘์€ b๊ฐœ์˜ ๋‹จ์œ„ ๋ฐฉํ–ฅ \mathcal{B}\in\mathbb{R}^{6\times b}์— ๋Œ€ํ•œ support function ์œผ๋กœ ๊ธฐํ•˜๋ฅผ ์š”์•ฝํ•œ๋‹ค:

\sigma_{h,k} = \max_{\mathrm{col}}\big(\mathcal{B}^{\top}\mathcal{W}_{h,k}\big)\in\mathbb{R}^{b}.

๋กœ๋ด‡ support \sigma_{r,k}๋ฅผ ๊ฐ™์€ ๋ฐฉ์‹์œผ๋กœ ๊ตฌํ•ด, ์ƒ๋Œ€ ํ—ˆ์šฉ์˜ค์ฐจ \beta ์•ˆ์—์„œ ์‚ฌ๋žŒ reference์™€ ๋น„๊ตํ•˜๋Š” ๊ฒƒ์ด contact wrench-space(CWS) reward ๋‹ค:

r^{k}_{\mathrm{cws}} = \exp\!\left(-\frac{\lVert\max(0,(1-\beta)\sigma_{h,k}-\sigma_{r,k})\rVert_2^2}{v_{\mathrm{cws}}} - \frac{\lVert\max(0,\sigma_{r,k}-(1+\beta)\sigma_{h,k})\rVert_2^2}{v_{\mathrm{cws}}}\right).

์•ž ํ•ญ์€ ๋กœ๋ด‡ support๊ฐ€ ํ•˜ํ•œ๋ณด๋‹ค ์ž‘์œผ๋ฉด, ๋’ค ํ•ญ์€ ์ƒํ•œ์„ ๋„˜์œผ๋ฉด ๋ฒŒ์ ์„ ์ค€๋‹ค. ์ถ”๊ฐ€๋กœ ์‚ฌ๋žŒ ์ ‘์ด‰์ด ์—†๋Š”๋ฐ(\sigma_{h,k}=0) ๋กœ๋ด‡์ด ์ ‘์ด‰ํ•˜๋ฉด(r_{\mathrm{unintend}}), ์‚ฌ๋žŒ ์ ‘์ด‰์ด ์žˆ๋Š”๋ฐ ๋กœ๋ด‡์ด ๋†“์น˜๋ฉด(r_{\mathrm{miss}}) ๋”ฐ๋กœ ํŽ˜๋„ํ‹ฐ๋ฅผ ์ค€๋‹ค. ์ „์ฒด ๋ณด์ƒ์€ r = r_{\mathrm{task}} + r_{\mathrm{imit}} + r_{\mathrm{contact}}์ด๋ฉฐ, ์—ฌ๊ธฐ์— DexMachina์˜ virtual object controller(VOC) ๋ฅผ ์ปค๋ฆฌํ˜๋Ÿผ์œผ๋กœ annealingํ•ด ํƒ์ƒ‰์„ ๋•๋Š”๋‹ค.

์ฃผ์š” ๊ฒฐ๊ณผ: (PDF์—์„œ ํ™•์ธํ•œ ์ˆ˜์น˜๋งŒ)

  • ๋‹จ์ผ ํ•˜์ดํผํŒŒ๋ผ๋ฏธํ„ฐ๋กœ ๋ฒค์น˜๋งˆํฌ 1,831๊ฐœ ํƒœ์Šคํฌ ํ‰๊ท  ์„ฑ๊ณต๋ฅ  82.12% โ€” ์ €์ž ์ฃผ์žฅ์ƒ ์ด ๊ทœ๋ชจ๋กœ ํ‰๊ฐ€๋œ ์ฒซ RL ๊ธฐ๋ฐ˜ ์†์žฌ์ฃผ ์กฐ์ž‘.
  • baseline์˜ ์›๋ž˜ ํƒœ์Šคํฌ ์Šค์œ„ํŠธยท์ง€ํ‘œ๋กœ ๋น„๊ต ์‹œ(Table 1) DexMachina ์Šค์œ„ํŠธ AUC 0.232โ†’0.687, ManipTrans SR 0.428โ†’0.639; ์ž์ฒด ์Šค์œ„ํŠธ์—์„œ๋Š” Ours-1 AUC 0.211โ†’0.895, Ours-2 SP-SR 0.533โ†’0.982 ๋“ฑ baseline์„ ๋งค์นญยท์ƒํšŒ.
  • reward ํ˜•ํƒœ ablation(Table 2): box grab SR 0.702(CHORD) vs 0.334(Position Only) vs 0.384(No Contact), mixer use 0.894 vs 0.624 vs 0.423.
  • CWS reward์™€ ํƒœ์Šคํฌ ์„ฑ๊ณต์˜ ์ƒ๊ด€: Pearson r\approx0.80(๋ฐ์ดํ„ฐ์…‹๋ณ„ 0.76โ€“0.89), ๋‹จ์กฐยทํฌํ™” ๊ด€๊ณ„.
  • whole-body loco-manipulation 90.77%, ๋‹ค๋ฅธ ์† ํ˜•ํƒœ(G1 + Dex3 3์ง€)๋กœ์˜ cross-embodiment ์ „์ด์—์„œ position reward(0.217)๋ฅผ ํฌ๊ฒŒ ์ƒํšŒ(0.925, Table 3).
  • Dexmate + Sharpa ๋‘ ์† ์‹ค๋กœ๋ด‡์—์„œ open-loopยทclosed-loop ์ „์ด ์„ฑ๊ณต(Fig. 9).

๊ฒฐ๋ก : CHORD๋Š” โ€œ์ ‘์ด‰ ์œ„์น˜โ€๊ฐ€ ์•„๋‹ˆ๋ผ โ€œ์ ‘์ด‰์ด ๋งŒ๋“œ๋Š” ๋ฌผ์ฒด ์šด๋™(wrench)โ€์„ ๋งž์ถ”๋Š” ๋ณด์ƒ์œผ๋กœ, ์‚ฌ๋žŒ ์‹œ์—ฐ โ†’ ์†์žฌ์ฃผ RL ์ •์ฑ… ์ „์ด๋ฅผ ์ž„๋ฒ ๋””๋จผํŠธยท์ ‘์ด‰ ํ˜•ํƒœ์— ๋ฌด๊ด€ํ•˜๊ฒŒ ํ™•์žฅ ๊ฐ€๋Šฅํ•˜๊ฒŒ ๋งŒ๋“ ๋‹ค. ๋Œ€๊ทœ๋ชจ ๋ฒค์น˜๋งˆํฌ์™€ ์ผ๊ด€๋œ ํ•˜์ดํผํŒŒ๋ผ๋ฏธํ„ฐ ํ‰๊ฐ€๋กœ ํ™•์žฅ์„ฑ์„ ์‹ค์ฆํ•œ๋‹ค.

๐Ÿ”” Ring Review

๐Ÿ”” Ring โ€” An idea that echoes. Grasp the core and its value.

ํ•œ ์ค„๋กœ ์‹œ์ž‘ํ•˜๋ฉด

โ€œ๊ฐ™์€ ๊ณณ์„ ๋งŒ์ง€๋ผโ€๊ฐ€ ์•„๋‹ˆ๋ผ โ€œ๊ฐ™์€ ๋ฌผ์ฒด ์šด๋™์„ ๋งŒ๋“ค ์ˆ˜ ์žˆ๊ฒŒ ๋งŒ์ง€๋ผโ€ โ€” ์ ‘์ด‰์„ wrench space์—์„œ ๋น„๊ตํ•˜๋Š” ๋ณด์ƒ ํ•˜๋‚˜๋กœ ์‚ฌ๋žŒ ์‹œ์—ฐ์„ ํ˜•ํƒœ๊ฐ€ ๋‹ค๋ฅธ ์†์žฌ์ฃผ ๋กœ๋ด‡์— ์˜ฎ๊ธฐ๊ณ , ๊ทธ ํšจ๊ณผ๋ฅผ 4,739๊ฐœ ํƒœ์Šคํฌ ๊ทœ๋ชจ๋กœ ๊ฒ€์ฆํ•œ ์—ฐ๊ตฌ๋‹ค.

๋ฐฐ๊ฒฝ: ์™œ ์‚ฌ๋žŒ ์‹œ์—ฐ ์ „์ด๊ฐ€ ์–ด๋ ค์šด๊ฐ€

์†์žฌ์ฃผ ์กฐ์ž‘์€ ์‚ฌ๋žŒ ์‹œ์—ฐ์ด ํ’๋ถ€ํ•˜๋‹ค๋Š” ์ด์ ์ด ์žˆ์ง€๋งŒ, ๊ทธ ์‹œ์—ฐ์„ ๋กœ๋ด‡ ์ •์ฑ…์œผ๋กœ ์˜ฎ๊ธฐ๋Š” ์ผ์€ ์—ฌ์ „ํžˆ ์–ด๋ ต๋‹ค. CHORD๋Š” ๋‘ ๊ฐˆ๋ž˜์˜ ๊ธฐ์กด ์ ‘๊ทผ์ด ๋ชจ๋‘ ํ•œ๊ณ„๊ฐ€ ์žˆ๋‹ค๊ณ  ๋ณธ๋‹ค. โ‘  ์ตœ์ ํ™”์— ์‹œ์—ฐ์„ ์“ฐ๋Š” ๋ฐฉ๋ฒ• ์€ โ€œ์‹œ์—ฐ์„ ์–ด๋–ป๊ฒŒ ์ „์ดํ• ์ง€โ€์— ๋Œ€ํ•œ brittleํ•œ ๊ฐ€์ •์— ์˜์กดํ•˜๊ณ , โ‘ก ํ‘œํ˜„ ํ•™์Šต(representation learning) ๋ฐฉ๋ฒ• ์€ ํƒœ์Šคํฌยท๋ฌผ์ฒด๋งˆ๋‹ค ์ •๋ ฌ๋œ human-robot ๋ฐ์ดํ„ฐ๋ฅผ ์š”๊ตฌํ•ด curated ์„ธํŒ… ๋ฐ–์œผ๋กœ ํ™•์žฅํ•˜๊ธฐ ์–ด๋ ต๋‹ค. ๊ทผ๋ณธ ๋‚œ์ ์€ ํ˜•ํƒœยท์šด๋™ํ•™ยท์† ๊ธฐํ•˜์˜ ์ฐจ์ด ๋•Œ๋ฌธ์— ๋กœ๋ด‡์ด ์‚ฌ๋žŒ ์†๋™์ž‘์„ ๊ทธ๋Œ€๋กœ replay ํ•ด์„œ๋Š” ๊ฐ™์€ ์กฐ์ž‘์„ ์žฌํ˜„ํ•  ์ˆ˜ ์—†๋‹ค ๋Š” ๋ฐ ์žˆ๋‹ค.

์ ‘์ด‰ ๊ฐ€์ด๋“œ๋ฅผ ์“ฐ๋Š” ์ตœ๊ทผ RL ์—ฐ๊ตฌ๋“ค๋„ ํ•œ๊ณ„๊ฐ€ ์žˆ๋‹ค. ManipTrans๋Š” ์‹œ์—ฐ๋œ hand-object ์ƒํ˜ธ์ž‘์šฉ ๊ทผ์ฒ˜์˜ ์ ‘์ด‰๋ ฅ ์— ๋ณด์ƒ์„ ์ฃผ๊ณ , DexMachinaยทSPIDER๋Š” ์‹œ์—ฐ๋œ ์ ‘์ด‰ ์œ„์น˜ ์™€ VOC ๊ฐ™์€ ์ปค๋ฆฌํ˜๋Ÿผ์„ ํ•จ๊ป˜ ์“ด๋‹ค. VOC๋Š” ์ดˆ๊ธฐ ํ•™์Šต ๋™์•ˆ ๋ณด์กฐ wrench๋กœ ๋ฌผ์ฒด๋ฅผ reference ๊ถค์ ์„ ๋”ฐ๋ผ ์›€์ง์—ฌ, ์ •์ฑ…์ด ์ •ํ™•ํ•œ ์ ‘์ด‰ ํƒ€์ด๋ฐยทํž˜์„ ์ฆ‰์‹œ ์ฐพ์ง€ ์•Š์•„๋„ ๋˜๊ฒŒ ํ•ด ํƒ์ƒ‰์„ ๋งค๋„๋Ÿฝ๊ฒŒ ํ•œ๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ ์œ„์น˜ ๊ธฐ๋ฐ˜ ์ ‘์ด‰ ๋ณด์ƒ์€ ์ ‘์ด‰ ์œ„์น˜๋งŒ์œผ๋กœ๋Š” ์ ‘์ด‰์˜ ํšจ๊ณผ๊ฐ€ ์ •ํ•ด์ง€์ง€ ์•Š๋Š”๋‹ค ๋Š” ๋ณธ์งˆ์  ์•ฝ์ ์„ ์•ˆ๋Š”๋‹ค โ€” ๊ฐ™์€ ๋ฌผ์ฒด ์˜์—ญ๋„ ์ ‘์ด‰ ๋ฒ•์„ ยทํž˜ ๋ฐฉํ–ฅ์— ๋”ฐ๋ผ ๋‹ค๋ฅธ ๋ฌผ์ฒด ์šด๋™์„ ๋‚ธ๋‹ค(์ €์ž๋“ค์˜ box-opening ์˜ˆ์‹œ: ์œ„์น˜๋Š” ์‚ฌ๋žŒ๊ณผ ๊ฐ€๊น์ง€๋งŒ ์ ‘์ด‰ ๋ฒ•์„ ์ด ๊ฑฐ์˜ ์ˆ˜์ง์œผ๋กœ ์–ด๊ธ‹๋‚˜ ๋ฌผ๋ฆฌ์ ์œผ๋กœ mismatch). ํ•œํŽธ ๊ธฐ์กด grasp ์ชฝ์˜ wrench ๋ณด์ƒ์€ ์ •์  ์•ˆ์ •์„ฑ(force closure) ๋งŒ ์ตœ์ ํ™”ํ•ด grasp์—” ์ข‹์ง€๋งŒ ์ผ๋ฐ˜ ์กฐ์ž‘์—” ๋„ˆ๋ฌด ๊ฒฝ์ง๋ผ ์žˆ๋‹ค. CHORD๋Š” wrench space๋ฅผ ์‚ฌ๋žŒ ์‹œ์—ฐ๊ณผ ๋กœ๋ด‡ ์‹คํ–‰์„ โ€œ์œ ๋ฐœ ์šด๋™โ€์œผ๋กœ ๋น„๊ตํ•˜๋Š” ์ฒ™๋„ ๋กœ ์ฒ˜์Œ ์“ด๋‹ค๊ณ  ์ฃผ์žฅํ•˜๋ฉฐ, ์ด๋Š” pushingยทleveringยทsliding ๊ฐ™์€ ๋น„-force-closure ๊ตฌ๊ฐ„๊ณผ articulated ๋ฌผ์ฒด๊นŒ์ง€ ํฌ๊ด„ํ•œ๋‹ค.

๋ฐฉ๋ฒ• ์ƒ์„ธ

๋ฌธ์ œ ์„ค์ •. K๊ฐœ ๊ฐ•์ฒด ๋ถ€๋ถ„์„ ๊ฐ€์ง„ ๋ฌผ์ฒด(๋ถ„๋ฆฌ ๊ฐ•์ฒด ๋˜๋Š” articulated)๋ฅผ ๋‹ค๋ฃฌ๋‹ค. ์‹œ์—ฐ \tau^{\mathrm{ref}}=\{x^{\mathrm{human}}_t, x^{\mathrm{object}}_t\}_{t=1}^{H}๋Š” 3D ์† keypoint์™€ ๋ถ€๋ถ„๋ณ„ SE(3) pose๋ฅผ ์ค€๋‹ค. ๋จผ์ € keypoint์—์„œ IK๋กœ ๋กœ๋ด‡ ๊ตฌ์„ฑ x^{\mathrm{robot}}_t๋กœ retargetํ•œ ๋’ค, ์ •์ฑ… \pi(a_t\mid o^{\mathrm{robot}}_t, o^{\mathrm{object}}_t; x^{\mathrm{robot}}_t, x^{\mathrm{object}}_t)์ด rollout์˜ ๋ฌผ์ฒด pose๊ฐ€ reference๋ฅผ ์ถ”์ข…ํ•˜๋„๋ก ํ–‰๋™์„ ๋‚ธ๋‹ค.

๋ณด์ƒ 3์ข… + VOC. r=r_{\mathrm{task}}+r_{\mathrm{imit}}+r_{\mathrm{contact}}.

  • r_{\mathrm{task}}: ๋ถ€๋ถ„๋ณ„ pose ์ถ”์ข… \exp(-\sum_k \lVert x^{\mathrm{object},k}_t\ominus s^{\mathrm{object},k}_t\rVert_2^2/\mathrm{var})์— ๋”ํ•ด, insertionยทpouringยทscoopingยทtool use ์ฒ˜๋Ÿผ ๋ฌผ์ฒด-๋ฌผ์ฒด ๊ธฐํ•˜๊ฐ€ ์ค‘์š”ํ•œ ๊ตฌ๊ฐ„์—์„œ๋งŒ ์ผœ์ง€๋Š” ์ƒ๋Œ€ ๋ณด์ƒ r_{\mathrm{relative}}=m(t)\exp(-e_{\mathrm{object}}/\mathrm{var}_{\mathrm{rel}})๋ฅผ ๋‘”๋‹ค.
  • r_{\mathrm{imit}}: retarget๋œ ์‚ฌ๋žŒ ๋ชจ์…˜ ์ชฝ์œผ๋กœ์˜ ์ •๊ทœํ™” \exp(-\lVert x^{\mathrm{robot}}_t-s^{\mathrm{robot}}_t\rVert_2^2/\mathrm{var}_{\mathrm{imit}}).
  • r_{\mathrm{contact}}: ์œ„ Ping์˜ CWS reward r^k_{\mathrm{cws}} + unintended/missed ์ ‘์ด‰ ํŽ˜๋„ํ‹ฐ.

CHORD ๋ณด์ƒ ๊ตฌ์„ฑ(Fig. 2, mixer-closing ํƒœ์Šคํฌ) โ€” ์™ผ์ชฝ: ์‚ฌ๋žŒ ์‹œ์—ฐ์—์„œ ์ถ”์ถœํ•œ ์ ‘์ด‰ wrench reference(์•„๋ž˜์— ์ ‘์ด‰ ์œ„์น˜ยท๋งˆ์ฐฐ์ฝ˜์€ ๋นจ๊ฐ•). ๊ฐ€์šด๋ฐ: ์‹œ๊ฐ„์— ๋”ฐ๋ฅธ per-hand ์ ‘์ด‰ wrench๋ฅผ force manifold๋กœ ์‹œ๊ฐํ™”, ๋นจ๊ฐ•=์‚ฌ๋žŒ, ํŒŒ๋ž‘=CHORD ์ •์ฑ…์ด ๋งŒ๋“  ์ ‘์ด‰ wrench. ์œ„=์‚ฌ๋žŒ ์‹œ์—ฐ, ์•„๋ž˜=ํ•™์Šต๋œ ๋กœ๋ด‡ ์ •์ฑ….

์™œ support function์ธ๊ฐ€(์ง๊ด€). wrench matrix \mathcal{W}๋Š” ์ ‘์ด‰๋“ค์ด ๋ฌผ์ฒด์— ๊ฐ€ํ•  ์ˆ˜ ์žˆ๋Š” 6D forceโ€“torque์˜ โ€œ๊ตฌ๋ฆ„โ€์ด๋‹ค. ๋‘ ๊ตฌ๋ฆ„์„ ์ง์ ‘ ๋น„๊ตํ•˜๋ ค๋ฉด ์—ด์˜ ๊ฐœ์ˆ˜ยท์ˆœ์„œ๊ฐ€ ๋งž์•„์•ผ ํ•˜๋Š”๋ฐ, ์‚ฌ๋žŒ๊ณผ ๋กœ๋ด‡์€ ์ ‘์ด‰ ๊ฐœ์ˆ˜ยท์ˆœ์„œ๊ฐ€ ๋‹ค๋ฅด๋‹ค. support function \sigma=\max_{\mathrm{col}}(\mathcal{B}^{\top}\mathcal{W})๋Š” ๋ฏธ๋ฆฌ ์ •ํ•œ b๊ฐœ ๋ฐฉํ–ฅ๋งˆ๋‹ค โ€œ๊ทธ ๋ฐฉํ–ฅ์œผ๋กœ ์–ผ๋งˆ๋‚˜ ๋ฉ€๋ฆฌ ๋ฐ€ ์ˆ˜ ์žˆ๋Š”๊ฐ€โ€๋ฅผ ์žฌ โ€” ์ฆ‰ wrench ๋‹คํฌ์ฒด(polytope)์˜ ์ง€์ง€ ํญ ์„ ๋ฐฉํ–ฅ๋ณ„๋กœ ์š”์•ฝํ•œ๋‹ค. ์ด๋ ‡๊ฒŒ ํ•˜๋ฉด ์ ‘์ด‰์˜ ๊ฐœ์ˆ˜ยท์ˆœ์„œยท์œ„์น˜๊ฐ€ ๋‹ฌ๋ผ๋„ โ€œ๋ฌผ์ฒด์— ์–ด๋–ค ์šด๋™์„ ์œ ๋ฐœํ•  ๋Šฅ๋ ฅ์ด ์žˆ๋Š”๊ฐ€โ€๋ผ๋Š” ์ž„๋ฒ ๋””๋จผํŠธ-๋ถˆ๋ณ€ ์ฒ™๋„ ๋กœ ํ™˜์›๋œ๋‹ค. CWS reward๋Š” ๋กœ๋ด‡์˜ ์ง€์ง€ ํญ์ด ์‚ฌ๋žŒ์˜ [(1-\beta)\sigma_h,\,(1+\beta)\sigma_h] ๋ฐด๋“œ ์•ˆ์— ๋“ค๋ฉด ๋ณด์ƒํ•˜๋Š” ๊ตฌ์กฐ๋ผ, ๋„ˆ๋ฌด ์•ฝํ•œ(๋ชป ๋ฏธ๋Š”) ์ ‘์ด‰๊ณผ ๋„ˆ๋ฌด ๊ณผํ•œ(๊ณผ๋„ํ•˜๊ฒŒ ๋ฏธ๋Š”) ์ ‘์ด‰์„ ๋ชจ๋‘ ์–ต์ œํ•œ๋‹ค.

ํšจ์œจยท๊ฐ•๊ฑดยท์ผ๋ฐ˜ํ™” ์žฅ์น˜(3.2). โ‘  reference ๊ถค์ ์˜ ์ž„์˜ ์ƒํƒœ๋กœ simulator๋ฅผ resetํ•˜๋˜ VOC๋ฅผ ์งง์€ stabilization window ๋™์•ˆ ์™„์ „ ํ™œ์„ฑํ™”ํ•ด ์ ‘์ด‰ ํšŒ๋ณต ํ›„ ๋ณด์กฐ๋ฅผ annealing, โ‘ก ๋ฌผ์ฒด ๋ถ€๋ถ„์— \mathcal{W}_{h,k}์—์„œ ์ƒ˜ํ”Œํ•œ wrench๋กœ ํƒœ์Šคํฌ-๊ด€๋ จ ๊ต๋ž€ ์„ ๊ฐ€ํ•ด ๊ฐ•๊ฑดํ™”, โ‘ข retarget ๋ชจ์…˜์„ prior๋กœ ํ•œ residual action space(ManipTrans์‹) + VOC ์ปค๋ฆฌํ˜๋Ÿผ(DexMachina์‹).

๋…ธ์ด์ฆˆ ๋Œ€์‘ โ€” reduced force-closure objective. RGB ๋น„๋””์˜ค ์žฌ๊ตฌ์„ฑ์ฒ˜๋Ÿผ hand-object ์ •ํ•ฉ์ด noisyํ•ด ์ ‘์ด‰ ์ถ”์ •์ด ๋ถˆ์•ˆ์ •ํ•˜๋ฉด, ์‚ฌ๋žŒ wrench๋ฅผ ์ง์ ‘ ๋งž์ถ”๋Š” ๋Œ€์‹  ๊ฐ basis ๋ฐฉํ–ฅ์œผ๋กœ ์–‘์˜ ์ง€์ง€ ๋ฅผ ๋‚ด๊ฒŒ ํ•˜๋Š” ์™„ํ™”๋œ ๋ชฉํ‘œ๋กœ ์ „ํ™˜ํ•œ๋‹ค:

r^{k}_{\mathrm{fc}} = \frac{1}{B}\sum_{b=1}^{B}\mathbb{1}[\sigma_{r,k,b} > \epsilon].

์ด๋ฅผ ์ตœ๋Œ€ํ™”ํ•˜๋ฉด force closure์™€ ๋™์น˜๊ฐ€ ๋œ๋‹ค. Whole-body ํ™•์žฅ ๋„ ๊ฐ™์€ ๊ณจ๊ฒฉ์ด๋‹ค โ€” hand-only reference(egocentric ์žฌ๊ตฌ์„ฑ)๋Š” inpainting ๋ชจ๋“ˆ๋กœ ์ „์‹  ๋ชจ์…˜์„ ์ฑ„์šด ๋’ค CWS reward๋ฅผ, whole-body reference(third-person ์žฌ๊ตฌ์„ฑ)๋Š” ์†๊ฐ€๋ฝ ์žฌ๊ตฌ์„ฑ์ด ๋ถ€์ •ํ™•ํ•˜๋ฏ€๋กœ reduced r^k_{\mathrm{fc}}๋ฅผ ์“ด๋‹ค.

๋ฒค์น˜๋งˆํฌ(3.3). mocap ๋ฐ์ดํ„ฐ์…‹(ARCTICยทOakInk2ยทHOT3DยทTACO ๋“ฑ)๊ณผ in-house ๋น„๋””์˜ค ์žฌ๊ตฌ์„ฑ์„ Isaac Lab์œผ๋กœ ๊ฐ€์ ธ์™€ 4,739๊ฐœ simulatableยทtrainable ํƒœ์Šคํฌ๋กœ ๋ณ€ํ™˜ํ–ˆ๋‹ค. single/multi rigid + articulated bimanual ์กฐ์ž‘์„ ํฌ๊ด„ํ•œ๋‹ค. ๊ธฐ์กด์ž‘ ๋Œ€๋น„ ์‹œ๊ฐ„ horizonยทํƒœ์Šคํฌ๋‹น ์ ‘์ด‰ ์ด๋ฒคํŠธ ์ˆ˜ยทgrasp ์•ˆ์ •์„ฑ(Ferrari-Canny epsilon) ์„ธ ์ง€ํ‘œ์—์„œ ๋” ๊ธธ๊ณ  ๋” denseํ•˜๋‹ค.


๋ฒค์น˜๋งˆํฌ ๋ถ„ํฌ(Fig. 3) โ€” ์‹œ๊ฐ„ horizon, ํƒœ์Šคํฌ๋‹น ์ ‘์ด‰ ์ด๋ฒคํŠธ ์ˆ˜, Ferrari-Canny epsilon ๋ถ„ํฌ. CHORD(์ฃผํ™ฉ)๊ฐ€ DexMachinaยทManipTransยทSPIDER๋ณด๋‹ค ๋” ๋งŽ์€ ํƒœ์Šคํฌยท๋” ๊ธด horizonยท๋” denseํ•œ ์ ‘์ด‰์„ ํฌํ•จ.

์‹คํ—˜

๋Œ€๊ทœ๋ชจ ํ‰๊ฐ€(4.1). 1,831๊ฐœ ํƒœ์Šคํฌ์— ๋™์ผ ํ•˜์ดํผํŒŒ๋ผ๋ฏธํ„ฐ(VOC gainยท์ปค๋ฆฌํ˜๋Ÿผยทreward weight)๋ฅผ ์จ์„œ ํ‰๊ฐ€ํ–ˆ๋‹ค. ์„ฑ๊ณต ํŒ์ •์€ object-centric termination(์œ„์น˜ ์˜ค์ฐจ 15cm ๋˜๋Š” ํšŒ์ „ 40ยฐ ์ดˆ๊ณผ) ์—†์ด ์™„๋ฃŒํ•˜๋ฉด rollout ์„ฑ๊ณต, completion ratio > 0.7์ด๋ฉด ํƒœ์Šคํฌ ์„ฑ๊ณต์ด๋‹ค. ๊ฒฐ๊ณผ๋Š” ๋ฐ์ดํ„ฐ์…‹ยทhorizon ์ „๋ฐ˜์—์„œ ๊ณ ๋ฅด๊ฒŒ ๊ฐ•ํ•˜๋‹ค.


๋ฐ์ดํ„ฐ์…‹๋ณ„ ์„ฑ๊ณต๋ฅ (Fig. 5์— ๋Œ€์‘) โ€” ์™ผ์ชฝ: 1,831 ํƒœ์Šคํฌ(HOT3D-1Obj 0.772, OakInk2-1Obj 0.786, OakInk2-2Obj 0.746, ARCTIC 0.814, TACO 0.935, HOT3D-2Obj 0.753). ์˜ค๋ฅธ์ชฝ: whole-body ํ‰๊ฐ€(ARCTIC 0.994, TPV 0.867, ARCTIC-articulated 0.914, TACO 0.866). ์ƒ‰=rigid/articulated/multi-object.

baseline ๋น„๊ต(4.1, Table 1). ManipTransยทDexMachinaยทSPIDER๋ฅผ ๊ฐ์ž์˜ ์›๋ž˜ ํƒœ์Šคํฌ ์Šค์œ„ํŠธยท์ง€ํ‘œ ๋กœ ๋น„๊ตํ•œ๋‹ค(Sharpa hand, Isaac Lab์—์„œ ์‹ ๋ขฐ์„ฑ ์žˆ๊ฒŒ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๊ฐ€๋Šฅํ•œ ํƒœ์Šคํฌ๋งŒ: MT 8, SP 3, DM 7๊ฐœ + ์ž์ฒด 9๊ฐœ). CHORD๋Š” ๋ชจ๋“  ํ–‰์—์„œ ๋งค์นญยท์ƒํšŒํ•˜๊ณ , ๋ฌด์—‡๋ณด๋‹ค ๊ธฐ์กด์ž‘์ด rigid/articulated/multi-object ์ค‘ ์ผ๋ถ€์— ๊ตญํ•œ๋œ ๋ฐ˜๋ฉด ์„ธ ๋ฒ”์ฃผ ๋ชจ๋‘ ๋ฅผ ํ‘ผ๋‹ค.

์ ‘์ด‰ ๊ฐ€์ด๋“œ ๊ฒ€์ฆ(4.2, Table 2). ๋™์ผํ•œ non-contact reward๋ฅผ ๋‘๊ณ  ์ ‘์ด‰ ๊ฐ๋…๋งŒ ๋ฐ”๊ฟ” ๋น„๊ตํ•œ๋‹ค โ€” CHORD(wrench support) > Position Only(DexMachina์‹ ์œ„์น˜ ๋ณด์ƒ) > No Contact(์ถ”์ข…๋งŒ). box grab 0.702/0.334/0.384, mixer use 0.894/0.624/0.423. ์œ„์น˜๋งŒ ๋งž์ถ”๋Š” ๊ฒƒ์œผ๋กœ๋Š” ์ ‘์ด‰-rich ์กฐ์ž‘์„ ์ถฉ์‹คํžˆ ์žฌํ˜„ํ•˜๊ธฐ ๋ถ€์กฑํ•จ์„ ๋ณด์ธ๋‹ค.

rewardโ€“์„ฑ๊ณต ์ƒ๊ด€(4.3). 1,831 run์—์„œ ์ •๊ทœํ™” CWS reward์™€ ์„ฑ๊ณต๋ฅ ์€ Pearson r\approx0.80(๋ฐ์ดํ„ฐ์…‹๋ณ„ 0.76โ€“0.89). ๋‹จ์กฐ์ด๋˜ ํฌํ™”ํ•˜๋Š” ๊ด€๊ณ„(๊ณ -reward ์˜์—ญ์—์„œ 1์— plateau)๋ผ, OLS ์ง์„ ์€ ์ „์ฒด ์ถ”์„ธ์˜ ~2/3 ๋ถ„์‚ฐ์„ ์„ค๋ช…ํ•˜๋ฉด์„œ ๊ณ -reward ์˜์—ญ์˜ ์ ํ•ฉ๋„๋ฅผ ๊ณผ์†Œํ‰๊ฐ€ํ•œ๋‹ค. CWS reward๊ฐ€ ํ•™์Šต ์‹ ํ˜ธ์ด์ž proxy metric ์œผ๋กœ ์œ ์šฉํ•จ์„ ์‹œ์‚ฌ.

์™ผ์ชฝ: CWS reward์™€ ํƒœ์Šคํฌ ์„ฑ๊ณต๋ฅ ์˜ ์ƒ๊ด€(Fig. 6, Pearson r\approx0.80). ์˜ค๋ฅธ์ชฝ: interaction horizon์— ๋”ฐ๋ฅธ ์ถ”์ข… ์ •ํ™•๋„(Fig. 7, DexMachina ADD-AUC) โ€” CHORD(์ƒ‰)๋Š” ์ตœ์žฅ 40โ€“48์ดˆ์—์„œ๋„ ADD-AUCโ‰ˆ0.85โ€“0.98์„ ์œ ์ง€, baseline์€ horizon์ด ๊ธธ์–ด์งˆ์ˆ˜๋ก ํฌ๊ฒŒ ์ €ํ•˜.

long-horizon(4.4). ์ ‘์ด‰ wrench๋ฅผ ๋‹จ์ผ ์ถ”์ƒ์œผ๋กœ ๋‹ค์–‘ํ•œ ์กฐ์ž‘์„ ํ•œ ๋ฉ”์ปค๋‹ˆ์ฆ˜์œผ๋กœ ํ•ฉ์„ฑํ•˜๋ฏ€๋กœ, ๊ฑฐ์˜ 1๋ถ„์— ์ด๋ฅด๋Š” ํƒœ์Šคํฌ ์‹œํ€€์Šค๊นŒ์ง€ ํ™•์žฅ๋œ๋‹ค. ๋Œ€๋ถ€๋ถ„ ์‹œํ€€์Šค์—์„œ near-saturated ์ถ”์ข…์„ ์œ ์ง€ํ•œ๋‹ค.

whole-body ์ผ๋ฐ˜ํ™”(4.5, Table 3). hand-only reference 12๊ฐœ(rigidยทarticulatedยทmulti ๊ฐ 4) + TPV whole-body 5๊ฐœ loco-manipulation. ์‚ฌ๋žŒ 5์ง€ ์† โ†’ G1 + Dex3 3์ง€ ์†์ด๋ผ๋Š” ํ˜•ํƒœ ์ฐจ์ด์—๋„ wrench-space ์ •๋ ฌ์ด ํšจ๊ณผ์  ์ ‘์ด‰์„ ํ•™์Šตํ•˜๊ฒŒ ํ•œ๋‹ค. ๋™์ผ ๋ ˆ์‹œํ”ผ์—์„œ position reward๋กœ ๋ฐ”๊พธ๋ฉด(rigid 0.460, articulated 0.000, multi 0.192, overall 0.217) ํฌ๊ฒŒ ๋ฌด๋„ˆ์ง€๋Š” ๋ฐ˜๋ฉด CHORD๋Š” 0.994/0.914/0.866/0.925 โ€” ์œ„์น˜ ๋ณด์ƒ์€ ์‹œ์—ฐ ์ž„๋ฒ ๋””๋จผํŠธ์— ๊ฐ•ํ•˜๊ฒŒ ๊ฒฐํ•ฉ๋ผ cross-embodiment์— ์ทจ์•ฝํ•จ์„ ๋ณด์ธ๋‹ค. TPV๋Š” ์žฌ๊ตฌ์„ฑ ๋…ธ์ด์ฆˆ ๋•Œ๋ฌธ์— force-closure ๋ชฉํ‘œ๋กœ ์ผ๊ด€ ์„ฑ๊ณต.

์‹ค์„ธ๊ณ„(4.6). Dexmate + Sharpa ๋‘ ์†, mocap pose tracking. open-loop action-chunk์™€ closed-loop inference ๋ชจ๋‘์—์„œ rigidยทarticulated ๋ฌผ์ฒด๋ฅผ bimanual๋กœ ์กฐ์ž‘.


์‹ค์„ธ๊ณ„ ๋ฐฐ์น˜(Fig. 9) โ€” ์ขŒ์ƒ๋‹จ์ด closed-loop, ๋‚˜๋จธ์ง€๋Š” open-loop ๋ฐฐ์น˜. ๋ฐ•์Šคยทarticulated ๋ฌผ์ฒด๋ฅผ ๋‘ ์† ํ˜‘์‘์œผ๋กœ ์กฐ์ž‘.

teleoperation ๋Œ€์กฐ(4.7). RL์˜ ๊ฐ€์น˜๋ฅผ ๊ฐ€๋Š ํ•˜๋ ค ์†Œ๊ทœ๋ชจ ์ •์„ฑ pilot์„ ํ–ˆ๋‹ค. ์˜์™ธ๋กœ box-lifting์ด ๊ฐ€์žฅ ์–ด๋ ค์› ๋‹ค โ€” grasp ํ˜•์„ฑยท์ ‘์ด‰ ํƒ€์ด๋ฐยทํž˜ ์ ์šฉ์˜ ์ •๋ฐ€ ํ˜‘์‘์ด ํ•„์š”. ์„ธ ๊ฐ€์ง€ ๋‚œ์ : โ‘  IK๊ฐ€ ๋„“์€ aperture๋‚˜ ํ•œ๊ณ„ ๊ทผ์ฒ˜์—์„œ ์˜๋„ํ•œ ์†๊ฐ€๋ฝ ๊ตฌ์„ฑ์„ ๋ณด์กด ๋ชป ํ•จ, โ‘ก hapticยท์ ‘์ด‰ ํ”ผ๋“œ๋ฐฑ ๋ถ€์žฌ๋กœ ์ ‘์ด‰ ์ƒํƒœ๋ฅผ ์‹œ๊ฐ์œผ๋กœ๋งŒ ์ถ”์ •(occlusion์— ์ทจ์•ฝ), โ‘ข joint torque ์ง์ ‘ ์ œ์–ด ๋ถˆ๊ฐ€๋กœ ํŠน์ • ์†๊ฐ€๋ฝ์— ์˜๋„์  ํž˜์„ ๋ชป ์คŒ. mixerยทwaffle-iron์€ ์—ฐ์Šต ํ›„ ๋Œ€๋žต ๋งž๋Š” ๊ถค์ ์€ ์ฐพ์•„๋„ ์ ‘์ด‰ ์ƒํ˜ธ์ž‘์šฉ์ด ๋‹ฌ๋ผ grasp๊ฐ€ fragileํ–ˆ๋‹ค.

๋น„ํŒ์ ์œผ๋กœ ๋ณด๋ฉด

๊ฐ•์ 

  • ํ‘œํ˜„์˜ ํ•ต์‹ฌ์„ ์ •ํ™•ํžˆ ์งš์—ˆ๋‹ค. โ€œ์ ‘์ด‰ ์œ„์น˜ โ‰  ์ ‘์ด‰ ํšจ๊ณผโ€๋ผ๋Š” ๊ด€์ฐฐ์€ ๋‹จ์ˆœํ•˜์ง€๋งŒ ๊ฐ•๋ ฅํ•˜๊ณ , support function์œผ๋กœ wrench polytope๋ฅผ ์ž„๋ฒ ๋””๋จผํŠธ-๋ถˆ๋ณ€ ์ฒ™๋„๋กœ ํ™˜์›ํ•œ ์„ค๊ณ„๊ฐ€ ๊น”๋”ํ•˜๋‹ค. position reward๊ฐ€ articulated cross-embodiment์—์„œ 0.000 ์œผ๋กœ ๋ถ•๊ดดํ•˜๋Š” ๋Œ€๋น„(Table 3)๋Š” ์ด ์ฃผ์žฅ์„ ์„ค๋“๋ ฅ ์žˆ๊ฒŒ ๋งŒ๋“ ๋‹ค.
  • ๊ทœ๋ชจ์™€ ํ†ต์ œ๋œ ํ‰๊ฐ€. ๋‹จ์ผ ํ•˜์ดํผํŒŒ๋ผ๋ฏธํ„ฐ๋กœ 1,831 ํƒœ์Šคํฌ๋ฅผ ๋Œ๋ฆฐ ์ , baseline์„ ๊ฐ์ž์˜ ์›๋ž˜ ์ง€ํ‘œ๋กœ ์žฌํ˜„ํ•ด ๋น„๊ตํ•œ ์ ์€ cherry-picking ์šฐ๋ ค๋ฅผ ์ค„์ธ๋‹ค. rewardโ€“์„ฑ๊ณต ์ƒ๊ด€(r\approx0.80)์€ CWS๊ฐ€ proxy metric์œผ๋กœ ์“ฐ์ผ ์ˆ˜ ์žˆ์Œ์„ ์ •๋Ÿ‰ํ™”ํ•œ๋‹ค.
  • ๋…ธ์ด์ฆˆ์— ๋Œ€ํ•œ graceful degradation. ๊นจ๋—ํ•œ ์‹œ์—ฐ์—” full CWS, noisyํ•˜๋ฉด reduced force-closure๋กœ ์ž๋™ ์ „ํ™˜ํ•˜๋Š” ์„ค๊ณ„๋Š” ๋น„๋””์˜ค ์žฌ๊ตฌ์„ฑยทTPV๊นŒ์ง€ ๊ฐ™์€ ๊ณจ๊ฒฉ์œผ๋กœ ํก์ˆ˜ํ•œ๋‹ค.
  • ํ•œ ๋ฉ”์ปค๋‹ˆ์ฆ˜์œผ๋กœ ๋„“์€ ์ปค๋ฒ„๋ฆฌ์ง€. rigidยทarticulatedยทmulti-object๋ฅผ ๋ชจ๋‘, ๊ทธ๋ฆฌ๊ณ  hand-only/whole-body๊นŒ์ง€ ๋™์ผ ๋ณด์ƒ ์ถ”์ƒ์œผ๋กœ ๋‹ค๋ฃฌ๋‹ค โ€” ๊ธฐ์กด์ž‘์ด ๋ถ€๋ถ„์ง‘ํ•ฉ์— ๊ตญํ•œ๋œ ๊ฒƒ๊ณผ ๋Œ€๋น„๋œ๋‹ค.

์•ฝ์ ยทํ•œ๊ณ„

  • ์ƒํƒœ ๊ธฐ๋ฐ˜ ๊ด€์ธก์— ์˜์กด(์ €์ž ๋ช…์‹œ). ์‹ค์„ธ๊ณ„ ๋ฐฐ์น˜๊ฐ€ mocap์œผ๋กœ ๋ฌผ์ฒดยท๋กœ๋ด‡ pose๋ฅผ ์ถ”์ ํ•˜๋Š” state-based๋‹ค. vision-based ๋ฐฐ์น˜๊ฐ€ ์—†์œผ๋ฏ€๋กœ ์ง„์งœ ์•ผ์™ธ ์ผ๋ฐ˜ํ™”๋Š” ๋ฏธ๊ฒ€์ฆ์ด๋‹ค.
  • ๊นจ๋—ํ•œ ์‹œ์—ฐ ๊ฐ€์ •. ํšจ๊ณผ์  ์ ‘์ด‰ ๊ฐ€์ด๋“œ๋Š” ๋น„๊ต์  ๊นจ๋—ํ•œ ์‹œ์—ฐ์„ ์š”๊ตฌํ•˜๊ณ , noisyํ•˜๋ฉด force-closure ๊ฐ€์ •์œผ๋กœ ํ›„ํ‡ดํ•œ๋‹ค โ€” ์ด๋Š” wrench ๋งค์นญ์˜ ๊ฐ•์ ์„ ์ผ๋ถ€ ํฌ๊ธฐํ•˜๋Š” trade-off๋‹ค. ๋…ธ์ด์ฆˆ๊ฐ€ ๋ณด์ƒ ํ’ˆ์งˆ์„ ์–ผ๋งˆ๋‚˜ ๋–จ์–ด๋œจ๋ฆฌ๋Š”์ง€์˜ ์ฒด๊ณ„์  ๋ถ„์„์€ ๋ถ€์กฑํ•˜๋‹ค.
  • ํ‰๊ฐ€ ์ง€ํ‘œ์˜ ํ•œ๊ณ„(์ €์ž ๋ช…์‹œ). ๋ฌผ์ฒด pose ์˜ค์ฐจ๋Š” ๋ถˆ์™„์ „ํ•œ ์„ฑ๊ณต ์ฒ™๋„๋‹ค โ€” ์ •ํ™•ํ•œ ๋ฐฐ์น˜๊ฐ€ ๋ถˆํ•„์š”ํ•œ ํƒœ์Šคํฌ๋„, ์ž‘์€ pose ์˜ค์ฐจ๊ฐ€ ๊ธฐ๋Šฅ์  ์‹คํŒจ๋กœ ์ด์–ด์ง€๋Š” ํƒœ์Šคํฌ๋„ ์žˆ๋‹ค. completion ratio>0.7, 15cm/40ยฐ ์ž„๊ณ„ ๊ฐ™์€ cutoff์˜ ๋ฏผ๊ฐ๋„๋„ ๋” ๋“ค์—ฌ๋‹ค๋ณผ ์—ฌ์ง€๊ฐ€ ์žˆ๋‹ค.
  • VOCยทIK ๋“ฑ ์™ธ๋ถ€ ๋ถ€ํ’ˆ ์˜์กด. ํƒ์ƒ‰์€ DexMachina VOC, retarget์€ ๋‹จ์ˆœ keypoint IK์— ๊ธฐ๋Œ„๋‹ค. teleop pilot์—์„œ ๋“œ๋Ÿฌ๋‚œ IK์˜ ์†๊ฐ€๋ฝ ๊ตฌ์„ฑ ๋ณด์กด ์‹คํŒจ๋Š” retarget ํ’ˆ์งˆ์ด ์ „์ฒด ํŒŒ์ดํ”„๋ผ์ธ์˜ ์•ฝํ•œ ๊ณ ๋ฆฌ์ผ ์ˆ˜ ์žˆ์Œ์„ ์‹œ์‚ฌํ•œ๋‹ค.
  • ์žฌํ˜„ ๊ฒ€์ฆ ๋ถˆ๊ฐ€(ํ˜„ ์‹œ์ ). ์ฝ”๋“œยทarXiv๊ฐ€ ๋ชจ๋‘ ๋ฏธ๊ณต๊ฐœ๋ผ ๋ณธ ๋ฆฌ๋ทฐ๋Š” PDF๋งŒ์œผ๋กœ ์ž‘์„ฑํ–ˆ๋‹ค. ๋ฒค์น˜๋งˆํฌ 4,739ยทํ‰๊ฐ€ 1,831ยท82.12% ๋“ฑ ํ•ต์‹ฌ ์ˆ˜์น˜๋Š” PDF๋กœ ๊ต์ฐจ๊ฒ€์ฆํ–ˆ์œผ๋‚˜, ์™ธ๋ถ€ ์žฌํ˜„์€ ์•„์ง ๋ถˆ๊ฐ€๋Šฅ ํ•˜๋‹ค.

๊ด€๋ จ ์—ฐ๊ตฌ์™€์˜ ์ž๋ฆฌ ๋งค๊น€

CHORD๋Š” ์‚ฌ๋žŒ ์‹œ์—ฐ ๊ธฐ๋ฐ˜ ์†์žฌ์ฃผ RL ๊ณ„์—ด์—์„œ ์ ‘์ด‰ ํ‘œํ˜„ ์„ ๋ฐ”๊พผ ์ž‘์—…์œผ๋กœ ์ž๋ฆฌํ•œ๋‹ค. ์ง์ ‘ baseline์ธ SPIDERยทDexMachinaยทManipTrans๊ฐ€ ์ ‘์ด‰ ์œ„์น˜/ํž˜ ๊ณผ VOC ์ปค๋ฆฌํ˜๋Ÿผ์„ ์“ฐ๋Š” ๋ฐ ๋น„ํ•ด, CHORD๋Š” ์ ‘์ด‰์„ wrench space์˜ ์œ ๋ฐœ ์šด๋™ ์œผ๋กœ ๋น„๊ตํ•œ๋‹ค. ์‚ฌ๋žŒ ๋น„๋””์˜ค์—์„œ ์†์žฌ์ฃผ ๋ฐ์ดํ„ฐ๋ฅผ ๋งŒ๋“œ๋Š” Do as I Do, egocentric ๋น„๋””์˜ค๋กœ ๋ณดํŽธ ์† ์ œ์–ด๋ฅผ ํ•™์Šตํ•˜๋Š” UniDex์™€๋Š” โ€œ์‚ฌ๋žŒ ์‹œ์—ฐ โ†’ ๋กœ๋ด‡ ์†์žฌ์ฃผโ€ ๋ฌธ์ œ๋ฅผ ๊ณต์œ ํ•˜๋˜, CHORD๋Š” ๋ฐ์ดํ„ฐ ์ƒ์„ฑยทํ‘œํ˜„ ํ•™์Šต๋ณด๋‹ค RL ๋ณด์ƒ ์„ค๊ณ„ ์— ๋ฌด๊ฒŒ๋ฅผ ๋‘”๋‹ค. wrenchยทforce-closure๋ฅผ grasp ํ•ฉ์„ฑ์— ์“ฐ๋Š” GraspQP ๊ณ„์—ด๊ณผ๋Š” wrench space๋ผ๋Š” ๋„๊ตฌ๋ฅผ ๊ณต์œ ํ•˜์ง€๋งŒ, CHORD๋Š” ์ •์  ์•ˆ์ •์„ฑ์ด ์•„๋‹ˆ๋ผ ๋™์  ์กฐ์ž‘์—์„œ์˜ ์œ ๋ฐœ ์šด๋™ ์œ ์‚ฌ๋„ ๋กœ ๊ทธ ๋„๊ตฌ๋ฅผ ์žฌํ•ด์„ํ•œ ์ ์ด ๋‹ค๋ฅด๋‹ค. whole-body ํ™•์žฅ์€ humanoid ๋ชจ์…˜ ์ถ”์ข… ๊ณ„์—ด์ธ WholeBody-Loco์™€ ๋งž๋‹ฟ์•„, hand-only ์‹œ์—ฐ์„ inpainting์œผ๋กœ ์ „์‹  ๋ชจ์…˜์œผ๋กœ ๋“ค์–ด์˜ฌ๋ฆฐ ๋’ค ๊ฐ™์€ ์ ‘์ด‰ ๋ณด์ƒ์„ ์ ์šฉํ•œ๋‹ค.

์š”์•ฝ

CHORD์˜ ํ•œ ๋ฌธ์žฅ์€ โ€œ์ ‘์ด‰์„ ์œ„์น˜๊ฐ€ ์•„๋‹ˆ๋ผ wrench(์œ ๋ฐœ ์šด๋™)๋กœ ๋น„๊ตํ•˜๋ผโ€ ๋‹ค. ์‚ฌ๋žŒ ์ ‘์ด‰์˜ wrench matrix๋ฅผ support function์œผ๋กœ ์š”์•ฝํ•ด ์ž„๋ฒ ๋””๋จผํŠธ-๋ถˆ๋ณ€ ๋ณด์ƒ์œผ๋กœ ์“ฐ๊ณ , ์—ฌ๊ธฐ์— taskยทimitation reward์™€ VOC ์ปค๋ฆฌํ˜๋Ÿผ, noisy ์‹œ์—ฐ์šฉ force-closure ํ›„ํ‡ด๋ฅผ ๋”ํ•ด ์‚ฌ๋žŒ ์‹œ์—ฐ โ†’ ์†์žฌ์ฃผ RL ์ „์ด๋ฅผ ํ™•์žฅ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ–ˆ๋‹ค. 4,739 ํƒœ์Šคํฌ ๋ฒค์น˜๋งˆํฌ์™€ 1,831 ํƒœ์Šคํฌ 82.12%(whole-body 90.77%), rewardโ€“์„ฑ๊ณต ์ƒ๊ด€ r\approx0.80, ์‹ค์„ธ๊ณ„ open/closed-loop ์ „์ด๊ฐ€ ์ด๋ฅผ ๋’ท๋ฐ›์นจํ•œ๋‹ค. ๋‹ค๋งŒ state-based ๋ฐฐ์น˜ยท๊นจ๋—ํ•œ ์‹œ์—ฐ ๊ฐ€์ •ยทpose ๊ธฐ๋ฐ˜ ์ง€ํ‘œ์˜ ํ•œ๊ณ„, ๊ทธ๋ฆฌ๊ณ  ํ˜„์žฌ ์ฝ”๋“œยทarXiv ๋ฏธ๊ณต๊ฐœ๋กœ ์ธํ•œ ์™ธ๋ถ€ ์žฌํ˜„ ๋ถˆ๊ฐ€๋Š” ๋‚จ๋Š” ๊ณผ์ œ๋‹ค.

Copyright 2026, JungYeon Lee