Curieux.JY
  • JungYeon Lee
  • Post
  • Projects
  • Note

On this page

  • 1 Brief Review
  • 2 Detail Review
    • 2.1 1. ์„œ๋ก  โ€” โ€œ์™œ ์ด๊ฒŒ ์–ด๋ ค์šด๊ฐ€?โ€๋ฅผ ๋จผ์ € ์ดํ•ดํ•˜์ž
      • 2.1.1 1.1 ์™œ ์ง€๊ธˆ์ธ๊ฐ€? โ€” ์—ฐ๊ตฌ ๋ฐฐ๊ฒฝ
    • 2.2 2. ๋ฐฉ๋ฒ• โ€” SeqMultiGrasp์˜ ๊ตฌ์กฐ
      • 2.2.1 2.1 ํ•ต์‹ฌ ์ง๊ด€ โ€” ์†์„ โ€œ๋ฐ˜๋ฐ˜ ๋‚˜๋ˆ  ์“ฐ๊ธฐโ€
      • 2.2.2 2.2 ํŒŒ์ง€ ํฌ์ฆˆ์˜ ์ˆ˜ํ•™์  ํ‘œํ˜„
      • 2.2.3 2.3 ๋‹จ์ผ ๋ฌผ์ฒด ํŒŒ์ง€ ํ•ฉ์„ฑ โ€” DFC ์•Œ๊ณ ๋ฆฌ์ฆ˜
      • 2.2.4 2.4 ๋‹ค์ค‘ ๋ฌผ์ฒด ํŒŒ์ง€ ์„ค์ • ์ƒ์„ฑ โ€” ๋ณ‘ํ•ฉ(Merging)
      • 2.2.5 2.5 ํ™•์‚ฐ ๋ชจ๋ธ ๊ธฐ๋ฐ˜ ํŒŒ์ง€ ์ œ์•ˆ โ€” ์‹ค์„ธ๊ณ„ ์ผ๋ฐ˜ํ™”
      • 2.2.6 2.6 ์‹คํ–‰ ์ „๋žต โ€” ํœด๋ฆฌ์Šคํ‹ฑ ๊ธฐ๋ฐ˜ ์ˆœ์ฐจ ์‹คํ–‰
    • 2.3 3. ์‹คํ—˜ โ€” ์ˆซ์ž๋กœ ๋ณด๋Š” ์„ฑ๋Šฅ
      • 2.3.1 3.1 ์‹คํ—˜ ์„ค์ •
      • 2.3.2 3.2 ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๊ฒฐ๊ณผ
      • 2.3.3 3.3 ์‹ค์„ธ๊ณ„ ๊ฒฐ๊ณผ
      • 2.3.4 3.4 ์‹คํŒจ ์‚ฌ๋ก€ ๋ถ„์„
    • 2.4 4. ๋น„ํŒ์  ๊ณ ์ฐฐ โ€” ๊ฐ•์ ๊ณผ ํ•œ๊ณ„
      • 2.4.1 4.1 ๊ฐ•์ 
      • 2.4.2 4.2 ํ•œ๊ณ„์™€ ์•ฝ์ 
    • 2.5 5. ๊ด€๋ จ ์—ฐ๊ตฌ์™€์˜ ๋น„๊ต
      • 2.5.1 5.1 ๋™์‹œ๋Œ€ ์œ ์‚ฌ ์—ฐ๊ตฌ: SeqGrasp (arXiv:2503.22370)
      • 2.5.2 5.2 ์ด์ „ ์—ฐ๊ตฌ: MultiGrasp (arXiv:2310.15599)
      • 2.5.3 5.3 DFC ๊ธฐ๋ฐ˜ ๋ฐฉ๋ฒ•๋ก ๊ณผ์˜ ๊ด€๊ณ„
    • 2.6 6. ์ƒˆ๋กœ์šด ๋กœ๋ด‡ ์† ํ”Œ๋žซํผ์—์„œ SeqMultiGrasp๋ฅผ ์žฌํ˜„ํ•˜๋ ค๋ฉด
      • 2.6.1 6.1 Step 1 โ€” ํ•˜๋“œ์›จ์–ด ํŠน์„ฑ ๋ถ„์„๊ณผ ํŒŒ์ง€ ์ „๋žต ์žฌ์„ค๊ณ„
      • 2.6.2 6.2 Step 2 โ€” URDF ๋“ฑ๋ก๊ณผ ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ ์„ค์ •
      • 2.6.3 6.3 Step 3 โ€” ํ•ฉ์„ฑ ๋ฐ์ดํ„ฐ์…‹ ์žฌ๊ตฌ์ถ•
      • 2.6.4 6.4 Step 4 โ€” ํ™•์‚ฐ ๋ชจ๋ธ ์žฌํ•™์Šต
      • 2.6.5 6.5 Step 5 โ€” ์‹ค์„ธ๊ณ„ Sim-to-Real ๊ฐญ ์ค„์ด๊ธฐ
      • 2.6.6 6.6 ์š”์•ฝ: ์ด์‹ ๋‚œ์ด๋„ ์ฒดํฌ๋ฆฌ์ŠคํŠธ
    • 2.7 7. ํ–ฅํ›„ ์—ฐ๊ตฌ ๋ฐฉํ–ฅ
    • 2.8 8. ์š”์•ฝ ๋ฐ ๊ฒฐ๋ก 
    • 2.9 ์ฐธ๊ณ  ๋ฌธํ—Œ

๐Ÿ“ƒSeqMultiGrasp ๋ฆฌ๋ทฐ

grasp
diffusion
multi-objects
Sequential Multi-Object Grasping with One Dexterous Hand
Published

August 6, 2025

  • Paper Link
  • Project Link
  1. ๐Ÿค–๋ณธ ๋…ผ๋ฌธ์€ Allegro Hand๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ์—ฌ๋Ÿฌ ๊ฐ์ฒด๋ฅผ ํ•œ ์†์œผ๋กœ ์ˆœ์ฐจ์ ์œผ๋กœ ํŒŒ์ง€ํ•˜๋Š” ๋กœ๋ด‡ ์‹œ์Šคํ…œ์ธ SeqMultiGrasp์„ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค.
  2. โœ‹์ด ์‹œ์Šคํ…œ์€ ๋จผ์ € ์†์˜ ํŠน์ • ๋งํฌ์— ์ œ์•ฝ๋œ ๋‹จ์ผ ๊ฐ์ฒด ํŒŒ์ง€ ํ›„๋ณด๋ฅผ ํ•ฉ์„ฑํ•˜๊ณ  ์‹œ๋ฎฌ๋ ˆ์ด์…˜์—์„œ ๊ฒ€์ฆํ•œ ํ›„, ์ด๋ฅผ ๋ณ‘ํ•ฉํ•˜์—ฌ ๋‹ค์ค‘ ๊ฐ์ฒด ํŒŒ์ง€ ๊ตฌ์„ฑ์„ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค.
  3. โœ…์‹ค์ œ ํ™˜๊ฒฝ ๋ฐฐํฌ๋ฅผ ์œ„ํ•ด Point Cloud ๊ธฐ๋ฐ˜์˜ Diffusion Model์ด ํŒŒ์ง€ ์ž์„ธ๋ฅผ ์ œ์•ˆํ•˜๊ณ  ํœด๋ฆฌ์Šคํ‹ฑ ๊ธฐ๋ฐ˜์˜ ์‹คํ–‰ ์ „๋žต์„ ํ†ตํ•ด ์‹œ๋ฎฌ๋ ˆ์ด์…˜์—์„œ 65.8%, ์‹ค์ œ ํ™˜๊ฒฝ์—์„œ 56.7%์˜ ํ‰๊ท  ์„ฑ๊ณต๋ฅ ์„ ๋‹ฌ์„ฑํ–ˆ์Šต๋‹ˆ๋‹ค.

1 Brief Review

๋ณธ ๋…ผ๋ฌธ์€ ํ•˜๋‚˜์˜ ๋ฏผ์ฒฉํ•œ ์†์œผ๋กœ ์—ฌ๋Ÿฌ ๊ฐ์ฒด๋ฅผ ์ˆœ์ฐจ์ ์œผ๋กœ ํŒŒ์ง€ํ•˜๋Š” ๋ฌธ์ œ๋ฅผ ๋‹ค๋ฃจ๋ฉฐ, ์ด๋ฅผ ์œ„ํ•œ ์‹œ์Šคํ…œ์ธ SeqMultiGrasp๋ฅผ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. ์ธ๊ฐ„์€ ์†์˜ ๋›ฐ์–ด๋‚œ ๋ฏผ์ฒฉ์„ฑ์„ ํ™œ์šฉํ•˜์—ฌ ์—ฌ๋Ÿฌ ๊ฐ์ฒด๋ฅผ ๋™์‹œ์— ๋˜๋Š” ์ˆœ์ฐจ์ ์œผ๋กœ ํŒŒ์ง€ํ•  ์ˆ˜ ์žˆ์ง€๋งŒ, ๋กœ๋ด‡์—๊ฒŒ ์ด๋Š” ๊ฐ์ฒด์˜ ๋‹ค์–‘ํ•œ ํ˜•์ƒ๊ณผ ๋†’์€ ์ž์œ ๋„(high-DOF) ์†์˜ ๋ณต์žกํ•œ ์ ‘์ด‰ ์ƒํ˜ธ์ž‘์šฉ์œผ๋กœ ์ธํ•ด ์–ด๋ ค์šด ๋„์ „ ๊ณผ์ œ์ž…๋‹ˆ๋‹ค. ํŠนํžˆ ํ•˜๋‚˜์˜ ๊ฐ์ฒด๋ฅผ ํŒŒ์ง€ํ•œ ์ƒํƒœ์—์„œ ๋‹ค๋ฅธ ๊ฐ์ฒด๋ฅผ ํŒŒ์ง€ํ•ด์•ผ ํ•˜๋Š” ์ˆœ์ฐจ์  ์‹œ๋‚˜๋ฆฌ์˜ค์—์„œ ๋‚œ์ด๋„๋Š” ๋”์šฑ ์ฆ๊ฐ€ํ•ฉ๋‹ˆ๋‹ค.

SeqMultiGrasp๋Š” ๋„ค ์†๊ฐ€๋ฝ์„ ๊ฐ€์ง„ Allegro Hand๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ๋‘ ๊ฐœ์˜ ๊ฐ์ฒด๋ฅผ ์ˆœ์ฐจ์ ์œผ๋กœ ํŒŒ์ง€ํ•˜๋Š” ๋ฐ ์ดˆ์ ์„ ๋งž์ถฅ๋‹ˆ๋‹ค. ์ด ์‹œ์Šคํ…œ์€ ์ฒซ ๋ฒˆ์งธ ๊ฐ์ฒด๋ฅผ ์™„์ „ํžˆ ๊ฐ์‹ธ ๋“ค์–ด ์˜ฌ๋ฆฐ ํ›„, ์ฒซ ๋ฒˆ์งธ ๊ฐ์ฒด๋ฅผ ๋–จ์–ด๋œจ๋ฆฌ์ง€ ์•Š์œผ๋ฉด์„œ ๋‘ ๋ฒˆ์งธ ๊ฐ์ฒด๋ฅผ ํŒŒ์ง€ํ•˜๋Š” ๊ฒƒ์„ ๋ชฉํ‘œ๋กœ ํ•ฉ๋‹ˆ๋‹ค.

ํ•ต์‹ฌ ๋ฐฉ๋ฒ•๋ก ์€ ๋‹ค์Œ๊ณผ ๊ฐ™์€ ๋‹จ๊ณ„๋กœ ๊ตฌ์„ฑ๋ฉ๋‹ˆ๋‹ค.

  1. ๋‹จ์ผ ๊ฐ์ฒด ๊ทธ๋žฉ ํ›„๋ณด ํ•ฉ์„ฑ:
    • ์šฐ์„ , Differentiable Force Closure (DFC) [13] ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ๊ธฐ๋ฐ˜์œผ๋กœ ๋‹จ์ผ ๊ฐ์ฒด ๊ทธ๋žฉ ํฌ์ฆˆ๋ฅผ ํ•ฉ์„ฑํ•ฉ๋‹ˆ๋‹ค. ์ด ๊ณผ์ •์€ ํŒŒ์ง€ ๋ฌธ์ œ๋ฅผ ์—๋„ˆ์ง€ ํ•จ์ˆ˜์˜ ์ตœ์ ํ™”๋กœ ์ •์‹ํ™”ํ•˜์—ฌ ํ•ด๊ฒฐํ•ฉ๋‹ˆ๋‹ค.
    • ์† ๊ตฌ์„ฑ H = (\theta , T)๋Š” ๋กœ๋ด‡ ์†์˜ ๊ด€์ ˆ ๊ตฌ์„ฑ \theta \in \mathbb{R}^d์™€ ๊ฐ์ฒด O์— ๋Œ€ํ•œ ์ƒ๋Œ€ ํฌ์ฆˆ T \in SE(3)๋ฅผ ๋‚˜ํƒ€๋ƒ…๋‹ˆ๋‹ค.
    • ์—๋„ˆ์ง€ ํ•จ์ˆ˜๋Š” ๋‹ค์Œ๊ณผ ๊ฐ™์Šต๋‹ˆ๋‹ค: E = E_{fc} + w_{dis}E_{dis} + w_pE_p + w_{sp}E_{sp} + w_qE_q ์—ฌ๊ธฐ์„œ E_{fc}๋Š” force closure ํ•ญ, E_{dis}๋Š” ์ ‘์ด‰์ ๊ณผ ๊ฐ์ฒด ํ‘œ๋ฉด ๊ฐ„์˜ ๊ฑฐ๋ฆฌ์— ๋Œ€ํ•œ ํŽ˜๋„ํ‹ฐ, E_p๋Š” ์†, ๊ฐ์ฒด, ํƒ์ž ๊ฐ„์˜ ์นจํˆฌ(penetration)์— ๋Œ€ํ•œ ํŽ˜๋„ํ‹ฐ, E_{sp}๋Š” ์†์˜ ์ž๊ธฐ ์นจํˆฌ(self-penetration)์— ๋Œ€ํ•œ ํŽ˜๋„ํ‹ฐ, E_q๋Š” ๊ด€์ ˆ ํ•œ๊ณ„ ์œ„๋ฐ˜์— ๋Œ€ํ•œ ํŽ˜๋„ํ‹ฐ๋ฅผ ๋‚˜ํƒ€๋ƒ…๋‹ˆ๋‹ค. w ํ•ญ๋“ค์€ ๊ฐ ๊ตฌ์„ฑ ์š”์†Œ์˜ ๊ฐ€์ค‘์น˜ ๊ณ„์ˆ˜์ž…๋‹ˆ๋‹ค.
    • ํ•ฉ์„ฑ ๊ณผ์ •์—์„œ๋Š” ์† ํ‘œ๋ฉด์˜ ์ ‘์ด‰ ํ›„๋ณด์ ์—์„œ ์ ‘์ด‰์ ์„ ์ƒ˜ํ”Œ๋งํ•˜๊ณ  ์ดˆ๊ธฐ ๊ตฌ์„ฑ์„ ์„ค์ •ํ•ฉ๋‹ˆ๋‹ค. ์ดํ›„ Metropolis-Adjusted Langevin Algorithm (MALA)๊ณผ ๊ฒฐํ•ฉ๋œ ๊ฒฝ์‚ฌ ๊ธฐ๋ฐ˜ ์ ‘๊ทผ ๋ฐฉ์‹์„ ์‚ฌ์šฉํ•˜์—ฌ ์ตœ์ ํ™”ํ•ฉ๋‹ˆ๋‹ค. ํŠน์ • ์ž„๊ณ„๊ฐ’์„ ์ดˆ๊ณผํ•˜๋Š” ์—๋„ˆ์ง€๋ฅผ ๊ฐ€์ง„ ๊ตฌ์„ฑ์€ ํ•„ํ„ฐ๋ง๋ฉ๋‹ˆ๋‹ค.
    • ์ˆœ์ฐจ์  ๋‹ค์ค‘ ๊ฐ์ฒด ํŒŒ์ง€๋ฅผ ์œ„ํ•ด, ์ฒซ ๋ฒˆ์งธ ๊ฐ์ฒด๋Š” ์—„์ง€, ๊ฒ€์ง€, ์ค‘์ง€๋ฅผ ์‚ฌ์šฉํ•˜๋Š” pinch-like grasp, ๋‘ ๋ฒˆ์งธ ๊ฐ์ฒด๋Š” ์•ฝ์ง€์™€ ์†๋ฐ”๋‹ฅ์„ ์‚ฌ์šฉํ•˜๋Š” side grasp์— ์ ‘์ด‰ ํ›„๋ณด์ ์„ ์ œํ•œํ•˜๋Š” ๋“ฑ ๊ธฐ์กด DFC ํŒŒ์ดํ”„๋ผ์ธ์— ์—ฌ๋Ÿฌ ์ˆ˜์ • ์‚ฌํ•ญ์ด ์ ์šฉ๋˜์—ˆ์Šต๋‹ˆ๋‹ค.
  2. ๋ฌผ๋ฆฌ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๊ธฐ๋ฐ˜ ๊ทธ๋žฉ ์œ ํšจ์„ฑ ๊ฒ€์ฆ:
    • ํ•ฉ์„ฑ๋œ ๊ทธ๋žฉ ํ›„๋ณด๋“ค์€ GPU ๊ฐ€์† ๋ฌผ๋ฆฌ ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ์ธ ManiSkill [39]์—์„œ ๊ทธ๋žฉ์„ ์‹คํ–‰ํ•˜์—ฌ ์•ˆ์ •์„ฑ๊ณผ ์‹คํ–‰ ๊ฐ€๋Šฅ์„ฑ์„ ๊ฒ€์ฆํ•ฉ๋‹ˆ๋‹ค.
    • Rotation Robustness: ๊ฐ์ฒด๊ฐ€ 6๊ฐ€์ง€ ์ถ• ์ •๋ ฌ ์ค‘๋ ฅ ๋ฐฉํ–ฅ(ยฑx, ยฑy, ยฑz) ํ•˜์—์„œ 2.5์ดˆ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ํ›„์—๋„ ์†๊ณผ ์ ‘์ด‰์„ ์œ ์ง€ํ•˜๋Š”์ง€ ํ‰๊ฐ€ํ•ฉ๋‹ˆ๋‹ค.
    • Execution Feasibility: ๊ทธ๋žฉ์ด ํ™˜๊ฒฝ๊ณผ์˜ ์ถฉ๋Œ ์—†์ด ์„ฑ๊ณต์ ์œผ๋กœ ์‹คํ–‰๋  ์ˆ˜ ์žˆ๋Š”์ง€ ํ™•์ธํ•ฉ๋‹ˆ๋‹ค.
  3. ๋‹ค์ค‘ ๊ฐ์ฒด ๊ทธ๋žฉ ๊ตฌ์„ฑ ๋ณ‘ํ•ฉ:
    • ๊ฒ€์ฆ๋œ ๋‹จ์ผ ๊ฐ์ฒด ๊ทธ๋žฉ ํฌ์ฆˆ๋“ค์„ ๋ณ‘ํ•ฉํ•˜์—ฌ ๋‹ค์ค‘ ๊ฐ์ฒด ๊ทธ๋žฉ ๊ตฌ์„ฑ์„ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. ์ด ๊ณผ์ •์€ ๊ด€๋ จ ์† ๋งํฌ์™€ ๊ด€์ ˆ์ด ์™„์ „ํžˆ ๋ถ„๋ฆฌ๋˜์–ด ์žˆ์„ ๋•Œ๋งŒ ๊ฐ€๋Šฅํ•ฉ๋‹ˆ๋‹ค.
    • ๋ณ‘ํ•ฉ ์‹œ, ๊ฐ ์†๊ฐ€๋ฝ์˜ ๊ด€์ ˆ ๊ฐ๋„๋Š” ํ•ด๋‹น ์†๊ฐ€๋ฝ์ด ์žก๋Š” ๊ฐ์ฒด์˜ ์ ‘์ด‰์ ์— ๋”ฐ๋ผ ์„ค์ •๋ฉ๋‹ˆ๋‹ค. ์–ด๋–ค ๊ฐ์ฒด๋„ ์žก์ง€ ์•Š๋Š” ์†๊ฐ€๋ฝ์˜ ๊ด€์ ˆ ๊ฐ๋„๋Š” ๋‹จ์ผ ๊ฐ์ฒด ๊ทธ๋žฉ ์ค‘ ํ•˜๋‚˜์—์„œ ๋ฌด์ž‘์œ„๋กœ ์ƒ์†๋ฐ›์•„ ๋น„๊ฒน์นจ ์ œ์–ด ์ œ์•ฝ ์กฐ๊ฑด์„ ์œ ์ง€ํ•ฉ๋‹ˆ๋‹ค.
  4. Diffusion-based ํฌ์ฆˆ ์ƒ์„ฑ:
    • ๊ทธ๋žฉ ํฌ์ฆˆ ์ƒ์„ฑ์˜ ๊ณ„์‚ฐ ๋น„์šฉ์„ ์ค„์ด๊ธฐ ์œ„ํ•ด, ๊ฐ์ฒด์˜ point cloud P = \{P_j\}_{j=1}^{N_o}์— ์กฐ๊ฑดํ™”๋œ diffusion model [40]์„ ํ›ˆ๋ จํ•˜์—ฌ ์† ํฌ์ฆˆ๋ฅผ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค.
    • Forward Process (๋…ธ์ด์ฆˆ ์ถ”๊ฐ€): q(H_t |H_{t-1}) = \mathcal{N} \left( H_t ; \sqrt{1 - \beta_t} H_{t-1}, \beta_t \mathbf{I} \right) ์—ฌ๊ธฐ์„œ \beta_t๋Š” ๋…ธ์ด์ฆˆ ๋ ˆ๋ฒจ์„ ์ œ์–ดํ•˜๊ณ  \mathbf{I}๋Š” ํ•ญ๋“ฑ ํ–‰๋ ฌ์ž…๋‹ˆ๋‹ค.
    • Reverse Process (๋…ธ์ด์ฆˆ ์ œ๊ฑฐ ๋ฐ ์žฌ๊ตฌ์„ฑ): p_\phi (H_{t-1}|H_t , P) = \mathcal{N} \left( H_{t-1}; \mu_\phi (H_t ,t, P), \Sigma_\phi (H_t ,t, P) \right) ์—ฌ๊ธฐ์„œ \mu_\phi์™€ \Sigma_\phi๋Š” ๊ฐ๊ฐ ์˜ˆ์ธก๋œ ํ‰๊ท ๊ณผ ๊ณต๋ถ„์‚ฐ์ž…๋‹ˆ๋‹ค.
    • ๋„คํŠธ์›Œํฌ๋Š” PointNet++ [43]๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ point cloud ํŠน์ง•์„ ์ถ”์ถœํ•˜๊ณ , ํšŒ์ „ ํ–‰๋ ฌ๋กœ ๊ฐ์ฒด ๋ฐฉํ–ฅ์„ ํ‘œํ˜„ํ•˜๋ฉฐ, singular value decomposition (SVD) [44]๋ฅผ ์ ์šฉํ•˜์—ฌ ์ง๊ต์„ฑ์„ ๋ณด์žฅํ•ฉ๋‹ˆ๋‹ค.
  5. ํœด๋ฆฌ์Šคํ‹ฑ ๊ธฐ๋ฐ˜ ์‹คํ–‰ ์ „๋žต:
    • ๋ณต์žกํ•œ reinforcement learning (RL) ์ •์ฑ… ๋Œ€์‹ , simple squeeze-and-lift ์ ˆ์ฐจ๋ฅผ ์ฑ„ํƒํ•ฉ๋‹ˆ๋‹ค.
    • CuRobo [45]๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ์—”๋“œ ์ดํŽ™ํ„ฐ๋ฅผ ๊ทธ๋žฉ ํฌ์ฆˆ์—์„œ ์˜คํ”„์…‹๋œ ์ถฉ๋Œ ์—†๋Š” ํฌ์ฆˆ๋กœ ๋ชจ์…˜ ํ”Œ๋ž˜๋‹ํ•ฉ๋‹ˆ๋‹ค.
    • ์ดํ›„ ์ถฉ๋Œ ๊ฒ€์‚ฌ ์—†์ด ๊ทธ๋žฉ ํฌ์ฆˆ๋กœ ๋А๋ฆฌ๊ฒŒ ์ด๋™ํ•˜๋ฉฐ, ์† ๊ด€์ ˆ ์œ„์น˜๋ฅผ ๋‘ ๋‹จ๊ณ„๋กœ ์กฐ์ •ํ•ฉ๋‹ˆ๋‹ค. ์ฒซ ๋ฒˆ์งธ๋Š” pre-grasp joint position์œผ๋กœ ์†๊ฐ€๋ฝ ๋์„ ํ›„ํ‡ด์‹œํ‚ค๊ณ , ๋‘ ๋ฒˆ์งธ๋Š” target joint position์œผ๋กœ ์†๊ฐ€๋ฝ์„ ๋‹ซ์Šต๋‹ˆ๋‹ค.

์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๋ฐ ์‹ค์ œ ํ™˜๊ฒฝ์—์„œ ๊ด‘๋ฒ”์œ„ํ•œ ์‹คํ—˜์ด ์ˆ˜ํ–‰๋˜์—ˆ์Šต๋‹ˆ๋‹ค. ์‹œ๋ฎฌ๋ ˆ์ด์…˜์—์„œ๋Š” 8x8 ๊ฐ์ฒด ์กฐํ•ฉ์— ๋Œ€ํ•ด Synthesized Grasp (SG) ๋ฐฉ์‹์ด ํ‰๊ท  82.7%์˜ ์„ฑ๊ณต๋ฅ ์„ ๋ณด์˜€์œผ๋ฉฐ, diffusion model ๊ธฐ๋ฐ˜ Learned Grasp (LG) ๋ฐฉ์‹์€ 65.8%์˜ ์„ฑ๊ณต๋ฅ ์„ ๋‹ฌ์„ฑํ–ˆ์Šต๋‹ˆ๋‹ค. ์‹ค์ œ ๋กœ๋ด‡ ์‹œ์Šคํ…œ์„ ์‚ฌ์šฉํ•œ ์‹คํ—˜์—์„œ๋Š” 6x3 ๊ฐ์ฒด ์กฐํ•ฉ์— ๋Œ€ํ•ด SG๊ฐ€ 64.4%, LG๊ฐ€ 56.7%์˜ ํ‰๊ท  ์„ฑ๊ณต๋ฅ ์„ ๊ธฐ๋กํ–ˆ์Šต๋‹ˆ๋‹ค. ์‹ค์ œ ํ™˜๊ฒฝ point cloud ํš๋“์„ ์œ„ํ•ด Nerfstudio [50], COLMAP [51], Stable Normal [52], 2D Gaussian Splatting [53] ๋“ฑ์˜ ๊ธฐ์ˆ ์ด ํ™œ์šฉ๋˜์–ด sim-to-real gap์„ ์ค„์˜€์Šต๋‹ˆ๋‹ค.

SeqMultiGrasp๋Š” ์—ฌ์ „ํžˆ ๋‘ ๊ฐœ์˜ ๊ฐ์ฒด๋งŒ ๋‹ค๋ฃจ๋ฉฐ ๋ฐ์ดํ„ฐ์…‹ ํฌ๊ธฐ์™€ ๋‹ค์–‘์„ฑ, ๊ทธ๋ฆฌ๊ณ  ํœด๋ฆฌ์Šคํ‹ฑ์— ๋Œ€ํ•œ ์˜์กด์„ฑ ๋“ฑ ๋ช‡ ๊ฐ€์ง€ ํ•œ๊ณ„๋ฅผ ๊ฐ€์ง€๊ณ  ์žˆ์ง€๋งŒ, ๋‹ค์žฌ๋‹ค๋Šฅํ•œ ๋‹ค์ค‘ ๊ฐ์ฒด ํŒŒ์ง€ ๋ถ„์•ผ์˜ ๋ฏธ๋ž˜ ์—ฐ๊ตฌ๋ฅผ ์œ„ํ•œ ์œ ๋งํ•œ ๊ธฐ๋ฐ˜์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.


2 Detail Review

2.1 1. ์„œ๋ก  โ€” โ€œ์™œ ์ด๊ฒŒ ์–ด๋ ค์šด๊ฐ€?โ€๋ฅผ ๋จผ์ € ์ดํ•ดํ•˜์ž

์ปคํ”ผ๋ฅผ ๋งˆ์‹œ๋‹ค ์˜†์— ์žˆ๋Š” ์‚ฌ๊ณผ์™€ ๋ณผํŽœ์„ ๋™์‹œ์— ์ง‘์–ด๋“ค์–ด ๋ณด์ž. ๋ˆˆ ๊นœ์งํ•  ์‚ฌ์ด์— ์—„์ง€์™€ ๊ฒ€์ง€๋กœ ๋ณผํŽœ์„ ๊ฐ€๋ณ๊ฒŒ ์ฐ๊ณ , ๋‚˜๋จธ์ง€ ์†๊ฐ€๋ฝ๊ณผ ์†๋ฐ”๋‹ฅ์œผ๋กœ ์‚ฌ๊ณผ๋ฅผ ๋‘˜๋Ÿฌ์‹ธ๋Š” ๋ณต์žกํ•œ ๋™์ž‘์ด ์ž์—ฐ์Šค๋Ÿฝ๊ฒŒ ์ผ์–ด๋‚œ๋‹ค. ์ด ํ–‰์œ„๋ฅผ ๋กœ๋ด‡์ด ํ•˜๋ ค๋ฉด ๋ฌด์—‡์ด ํ•„์š”ํ• ๊นŒ?

๋Œ€๋ถ€๋ถ„์˜ ๋กœ๋ด‡ ํŒŒ์ง€ ์—ฐ๊ตฌ๋Š” ๋‹จ์ผ ๋ฌผ์ฒด๋ฅผ ๋Œ€์ƒ์œผ๋กœ ํ•œ๋‹ค. ์† ์ „์ฒด๋ฅผ ์ž์œ ๋กญ๊ฒŒ ์‚ฌ์šฉํ•ด์„œ ํ•˜๋‚˜์˜ ๋ฌผ์ฒด์— ์ง‘์ค‘ํ•˜๋ฉด ๋˜๋‹ˆ ๋ฌธ์ œ๊ฐ€ ๋‹จ์ˆœํ•˜๋‹ค. ๊ทธ๋Ÿฐ๋ฐ ๋‘ ๋ฌผ์ฒด๋ฅผ ์ˆœ์ฐจ์ ์œผ๋กœ ์ง‘์–ด์•ผ ํ•œ๋‹ค๋ฉด ์ด์•ผ๊ธฐ๊ฐ€ ๋‹ฌ๋ผ์ง„๋‹ค. ์ฒซ ๋ฒˆ์งธ ๋ฌผ์ฒด๋ฅผ ์žก์€ ์ƒํƒœ์—์„œ ๋‘ ๋ฒˆ์งธ๋ฅผ ์ง‘์œผ๋ ค๋ฉด, ์ด๋ฏธ ์ ์œ ๋œ ์†๊ฐ€๋ฝ๋“ค์€ ์“ธ ์ˆ˜ ์—†๋‹ค. ์ฆ‰ ์ œ์•ฝ ์กฐ๊ฑด์ด ํญ๋ฐœ์ ์œผ๋กœ ๋Š˜์–ด๋‚˜๋Š” ๊ฒƒ์ด๋‹ค.

์ด ๋…ผ๋ฌธ์€ ๋ฐ”๋กœ ์ด ๋ฌธ์ œ๋ฅผ ์ •๋ฉด์œผ๋กœ ๋‹ค๋ฃฌ๋‹ค. ์ œ์•ˆ๋œ ์‹œ์Šคํ…œ SeqMultiGrasp๋Š” 4์†๊ฐ€๋ฝ Allegro Hand๋ฅผ ์ด์šฉํ•ด ๋‘ ๋ฌผ์ฒด๋ฅผ ์ˆœ์„œ๋Œ€๋กœ ํŒŒ์ง€ํ•˜๋Š” ์™„์ „ํ•œ ํŒŒ์ดํ”„๋ผ์ธ์„ ์ œ์‹œํ•œ๋‹ค. ํ•ต์‹ฌ ์•„์ด๋””์–ด๋Š” ๋†€๋ž๋„๋ก ์ง๊ด€์ ์ด๋‹ค: ์†์˜ ์ž์œ ๋„(DoF)๋ฅผ ๋ถ„ํ•  ์‚ฌ์šฉํ•œ๋‹ค๋Š” ๊ฒƒ์ด๋‹ค.

2.1.1 1.1 ์™œ ์ง€๊ธˆ์ธ๊ฐ€? โ€” ์—ฐ๊ตฌ ๋ฐฐ๊ฒฝ

๋กœ๋ด‡ ํŒŒ์ง€๋Š” ํฌ๊ฒŒ ๋‘ ํ๋ฆ„์œผ๋กœ ๋ฐœ์ „ํ•ด ์™”๋‹ค.

ํ•ด์„์ (Analytic) ๋ฐฉ๋ฒ•์€ ๊ธฐํ•˜ํ•™ยท์ˆ˜ํ•™์  ์ตœ์ ํ™”๋ฅผ ํ†ตํ•ด force closure๊ฐ€ ๊ฐ€๋Šฅํ•œ ํŒŒ์ง€๋ฅผ ๊ฒฐ์ •ํ•œ๋‹ค. ์ˆ˜ํ•™์ ์œผ๋กœ ์—„๋ฐ€ํ•˜์ง€๋งŒ, 16-DOF ์† ๊ฐ™์€ ๊ณ ์ฐจ์› ํƒ์ƒ‰ ๊ณต๊ฐ„์—์„œ๋Š” ๊ณ„์‚ฐ ๋น„์šฉ์ด ํญ๋ฐœ์ ์œผ๋กœ ์ปค์ง„๋‹ค.

๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜(Data-driven) ๋ฐฉ๋ฒ•์€ ํ•™์Šต๋œ ๋ชจ๋ธ๋กœ ํŒŒ์ง€ ํฌ์ฆˆ๋ฅผ ๋น ๋ฅด๊ฒŒ ์ œ์•ˆํ•œ๋‹ค. Diffusion model, GAN, VAE ๋“ฑ์ด ํ™œ์šฉ๋˜๋ฉฐ, ํŠนํžˆ ์ตœ๊ทผ point cloud conditioned diffusion model๋“ค์ด ๊ฐ•๋ ฅํ•œ ์„ฑ๋Šฅ์„ ๋ณด์ด๊ณ  ์žˆ๋‹ค.

๊ทธ๋Ÿฌ๋‚˜ ๋‹ค์ค‘ ๋ฌผ์ฒด ํŒŒ์ง€ ์—ฐ๊ตฌ๋Š” ์•„์ง ์ดˆ๊ธฐ ๋‹จ๊ณ„๋‹ค. ๊ธฐ์กด์˜ MultiGrasp(Shadow Hand ๊ธฐ๋ฐ˜)์™€ ๊ฐ™์€ ์—ฐ๊ตฌ๊ฐ€ ๋™์‹œ ํŒŒ์ง€(simultaneous grasping)๋ฅผ ๋‹ค๋ฃจ๊ธด ํ–ˆ์ง€๋งŒ, ์ˆœ์ฐจ์  ํŒŒ์ง€โ€”์ฆ‰, ํ•˜๋‚˜๋ฅผ ์žก์€ ์ฑ„๋กœ ๋‹ค์Œ์„ ์ง‘๋Š” ๋ฌธ์ œโ€”๋Š” ์‹ค์„ธ๊ณ„ ์‹คํ—˜์ด ์—†์—ˆ๋‹ค. ์ด ๋…ผ๋ฌธ์ด ์ตœ์ดˆ์˜ ์‹ค์„ธ๊ณ„ ์ˆœ์ฐจ์  ๋‹ค์ค‘ ๋ฌผ์ฒด ํŒŒ์ง€ ์‹คํ—˜์„ ๋ณด๊ณ ํ•œ๋‹ค๋Š” ์ ์ด ์—ญ์‚ฌ์  ์˜์˜๋‹ค.


2.2 2. ๋ฐฉ๋ฒ• โ€” SeqMultiGrasp์˜ ๊ตฌ์กฐ

์‹œ์Šคํ…œ ์ „์ฒด ๊ตฌ์กฐ๋ฅผ ๋จผ์ € ์กฐ๊ฐํ•˜์ž.

flowchart TD
    A["๐Ÿ– ์† ์„ค๊ณ„\n(Allegro Hand 16-DOF)"] --> B

    subgraph B["โ‘  ๋ฐ์ดํ„ฐ์…‹ ๊ตฌ์ถ•"]
        B1["๋‹จ์ผ ๋ฌผ์ฒด ํŒŒ์ง€ ํ›„๋ณด ์ƒ์„ฑ\n(DFC ์•Œ๊ณ ๋ฆฌ์ฆ˜ + ๋งํฌ ์„œ๋ธŒ์…‹ ์ œ์•ฝ)"]
        B2["๋ฌผ๋ฆฌ ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ ๊ฒ€์ฆ\n(์•ˆ์ •์„ฑ ํ•„ํ„ฐ๋ง)"]
        B3["๋‘ ํŒŒ์ง€ ํฌ์ฆˆ ๋ณ‘ํ•ฉ\nโ†’ ๋‹ค์ค‘ ๋ฌผ์ฒด ํŒŒ์ง€ ์„ค์ •"]
        B1 --> B2 --> B3
    end

    B --> C

    subgraph C["โ‘ก ํ•™์Šต (Diffusion Model)"]
        C1["Point Cloud ์กฐ๊ฑด๋ถ€\nํ™•์‚ฐ ๋ชจ๋ธ ํ›ˆ๋ จ"]
    end

    C --> D

    subgraph D["โ‘ข ์‹ค์„ธ๊ณ„ ๋ฐฐํฌ"]
        D1["Point Cloud ์ธ์‹\n(์นด๋ฉ”๋ผ + ๋ถ„ํ• )"]
        D2["ํ™•์‚ฐ ๋ชจ๋ธ ์ถ”๋ก \nโ†’ ํŒŒ์ง€ ํฌ์ฆˆ ์ œ์•ˆ"]
        D3["ํœด๋ฆฌ์Šคํ‹ฑ ์‹คํ–‰ ์ „๋žต\n(์ˆœ์ฐจ ๋ชจ์…˜ ํ”Œ๋ž˜๋‹)"]
        D1 --> D2 --> D3
    end

    D --> E["โœ… ์ˆœ์ฐจ์  ๋‘ ๋ฌผ์ฒด ํŒŒ์ง€ ์„ฑ๊ณต"]

2.2.1 2.1 ํ•ต์‹ฌ ์ง๊ด€ โ€” ์†์„ โ€œ๋ฐ˜๋ฐ˜ ๋‚˜๋ˆ  ์“ฐ๊ธฐโ€

Allegro Hand๋Š” 4๊ฐœ์˜ ์†๊ฐ€๋ฝ๊ณผ 16๊ฐœ์˜ ๊ด€์ ˆ๋กœ ์ด๋ฃจ์–ด์ ธ ์žˆ๋‹ค. ์ด ๋…ผ๋ฌธ์˜ ํ•ต์‹ฌ ํ†ต์ฐฐ์€ ์ด๋ ‡๋‹ค:

โ€œ๋‘ ๋ฌผ์ฒด๋ฅผ ์ˆœ์„œ๋Œ€๋กœ ์žก์œผ๋ ค๋ฉด, ์ฒซ ๋ฒˆ์งธ ๋ฌผ์ฒด์— ์“ธ ์†๊ฐ€๋ฝ๊ณผ ๋‘ ๋ฒˆ์งธ ๋ฌผ์ฒด์— ์“ธ ์†๊ฐ€๋ฝ์„ ๋ฏธ๋ฆฌ ์ •ํ•ด๋‘๊ณ  ๊ฐ์ž ๋…๋ฆฝ์ ์œผ๋กœ ์ตœ์ ํ™”ํ•ด๋ผ.โ€

๊ตฌ์ฒด์ ์ธ ์ˆœ์ฐจ ํŒŒ์ง€ ์ „๋žต์€ ํ•˜๋“œ์›จ์–ด ํ˜„์‹ค์„ ๋ฐ˜์˜ํ•œ ์‹ค์šฉ์  ๊ฒฐ์ •์ด๋‹ค:

๋‹จ๊ณ„ ํŒŒ์ง€ ์œ ํ˜• ์‚ฌ์šฉ ์†๊ฐ€๋ฝ ๋Œ€์ƒ
1st Grasp Pinch-like grasp ์—„์ง€ + ๊ฒ€์ง€ + ์ค‘์ง€ (์ž„์˜ ์กฐํ•ฉ) ์ฒซ ๋ฒˆ์งธ ๋ฌผ์ฒด
2nd Grasp Side grasp ์•ฝ์ง€ + ์†๋ฐ”๋‹ฅ ๋‘ ๋ฒˆ์งธ ๋ฌผ์ฒด

์ด ์ „๋žต์€ ์‹œ๋ฎฌ๋ ˆ์ด์…˜์—์„œ ๊ฒฝํ—˜์ ์œผ๋กœ Allegro Hand ํ•˜๋“œ์›จ์–ด์— ์ ํ•ฉํ•œ ๊ฒƒ์œผ๋กœ ํ™•์ธ๋๋‹ค. ๋ฌผ๋ก  ์ด๊ฒƒ์ด ์œ ์ผํ•œ ์ „๋žต์€ ์•„๋‹ˆ์ง€๋งŒ, ๋‘ ํŒŒ์ง€๊ฐ€ ์„œ๋กœ ๋ฐฉํ•ดํ•˜์ง€ ์•Š์œผ๋ฉด์„œ ์•ˆ์ •์„ฑ์„ ์ œ๊ณตํ•˜๋Š” ํ˜„์‹ค์ ์ธ ํ•ด๋ฒ•์ด๋‹ค.

2.2.2 2.2 ํŒŒ์ง€ ํฌ์ฆˆ์˜ ์ˆ˜ํ•™์  ํ‘œํ˜„

๋…ผ๋ฌธ์—์„œ ์ •์˜ํ•˜๋Š” ํ†ตํ•ฉ ์† ์„ค์ •(unified hand configuration)์„ ์‚ดํŽด๋ณด์ž.

๋‹ค์ค‘ ๋ฌผ์ฒด ํŒŒ์ง€ ์ƒํƒœ๋Š” ๋‹ค์Œ ํŠœํ”Œ๋กœ ์ •์˜๋œ๋‹ค:

\mathcal{G} = (\theta, \{T_j\}_{j=1}^{N})

์—ฌ๊ธฐ์„œ: - \theta \in \mathbb{R}^d : ๋กœ๋ด‡ ์†์˜ ๊ด€์ ˆ ์„ค์ • (Allegro์˜ ๊ฒฝ์šฐ d = 16) - T_j \in SE(3) : j๋ฒˆ์งธ ๋ฌผ์ฒด O_j์˜ ์†์— ๋Œ€ํ•œ ์ƒ๋Œ€ ํฌ์ฆˆ

๋‹จ์ผ ๋ฌผ์ฒด ํŒŒ์ง€์˜ ๊ฒฝ์šฐ ์ด๋Š” ๋‹จ์ˆœํžˆ (\theta, T_1)์œผ๋กœ ๊ฐ„๋žตํ™”๋œ๋‹ค.

2.2.3 2.3 ๋‹จ์ผ ๋ฌผ์ฒด ํŒŒ์ง€ ํ•ฉ์„ฑ โ€” DFC ์•Œ๊ณ ๋ฆฌ์ฆ˜

ํŒŒ์ง€ ๋ฐ์ดํ„ฐ์…‹์„ ๊ตฌ์ถ•ํ•˜๊ธฐ ์œ„ํ•ด DFC(Differentiable Force Closure) ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ํ™•์žฅํ•œ๋‹ค. DFC๋Š” ํŒŒ์ง€ ํ•ฉ์„ฑ ๋ฌธ์ œ๋ฅผ ์—๋„ˆ์ง€ ํ•จ์ˆ˜ ์ตœ์†Œํ™”๋กœ ๊ณต์‹ํ™”ํ•œ๋‹ค:

E_{total} = E_{fc} + \lambda_d \cdot E_{dis} + \lambda_p \cdot E_{pen}

๊ฐ ํ•ญ์˜ ์—ญํ• :

ํ•ญ ์˜๋ฏธ
E_{fc} Force closure ์—๋„ˆ์ง€: ์™ธ๋ถ€ ํž˜์— ์ €ํ•ญ ๊ฐ€๋Šฅํ•œ ํŒŒ์ง€์ธ๊ฐ€?
E_{dis} ๊ฑฐ๋ฆฌ ์—๋„ˆ์ง€: ์ ‘์ด‰์ ์ด ๋ฌผ์ฒด ํ‘œ๋ฉด์— ์‹ค์ œ๋กœ ๋‹ฟ์•„ ์žˆ๋Š”๊ฐ€?
E_{pen} ๊ด€ํ†ต ์—๋„ˆ์ง€: ์†, ๋ฌผ์ฒด, ํ…Œ์ด๋ธ” ๊ฐ„์˜ ๋ฌผ๋ฆฌ์  ๊ด€ํ†ต ํŽ˜๋„ํ‹ฐ

SeqMultiGrasp์˜ ํ˜์‹ ์€ ์ด ์ตœ์ ํ™”์— ๋งํฌ ์„œ๋ธŒ์…‹ ์ œ์•ฝ์„ ์ถ”๊ฐ€ํ•œ ๊ฒƒ์ด๋‹ค. ์ฒซ ๋ฒˆ์งธ ํŒŒ์ง€์—๋Š” pinch-like ์ ‘์ด‰ ํ›„๋ณด๋งŒ์„, ๋‘ ๋ฒˆ์งธ ํŒŒ์ง€์—๋Š” side ์ ‘์ด‰ ํ›„๋ณด๋งŒ์„ ํ—ˆ์šฉํ•œ๋‹ค. ๋…ผ๋ฌธ์˜ Figure 2(a)์—์„œ ๋นจ๊ฐ„ ์ (pinch-like์šฉ)๊ณผ ํŒŒ๋ž€ ์ (side์šฉ)์œผ๋กœ ์‹œ๊ฐํ™”๋œ ๋ฐ”๋กœ ๊ทธ ๊ตฌ๋ถ„์ด๋‹ค.

Contact Point ์ •์˜: ๊ฐ ํŒŒ์ง€ ์œ ํ˜•์—์„œ ํ—ˆ์šฉ๋œ ์† ๋งํฌ์˜ ํ‘œ๋ฉด ์ƒ ์ ‘์ด‰ ํ›„๋ณด ์ง‘ํ•ฉ์„ ์‚ฌ์ „์— ์ •์˜ํ•œ๋‹ค. ์ตœ์ ํ™”๋Š” ์ด ์ง‘ํ•ฉ ๋‚ด์—์„œ๋งŒ ์ ‘์ด‰์ ์„ ํƒ์ƒ‰ํ•œ๋‹ค.

2.2.4 2.4 ๋‹ค์ค‘ ๋ฌผ์ฒด ํŒŒ์ง€ ์„ค์ • ์ƒ์„ฑ โ€” ๋ณ‘ํ•ฉ(Merging)

๊ฐ๊ฐ ๋…๋ฆฝ์ ์œผ๋กœ ํ•ฉ์„ฑ๋œ ๋‘ ๊ฐœ์˜ ๋‹จ์ผ ๋ฌผ์ฒด ํŒŒ์ง€ ํฌ์ฆˆ๋ฅผ ํ•˜๋‚˜์˜ ์† ์„ค์ •์œผ๋กœ ๋ณ‘ํ•ฉํ•˜๋Š” ๋‹จ๊ณ„๋‹ค. ์ด๋•Œ์˜ ํ•ต์‹ฌ ๋„์ „: ๋‘ ํŒŒ์ง€๊ฐ€ ๋™์ผํ•œ ์† ๊ตฌ์„ฑ \theta์„ ๊ณต์œ ํ•ด์•ผ ํ•˜๋Š”๋ฐ, ๋…๋ฆฝ ์ตœ์ ํ™”๋œ ๋‘ ํŒŒ์ง€๋Š” ์ผ๋ฐ˜์ ์œผ๋กœ ๊ด€์ ˆ ๊ฐ’์—์„œ ์ถฉ๋Œ์ด ์ƒ๊ธด๋‹ค.

๋ณ‘ํ•ฉ ํ”„๋กœ์„ธ์Šค๋Š” ๋‹ค์Œ ์กฐ๊ฑด์„ ๋™์‹œ์— ๋งŒ์กฑํ•˜๋Š” \theta_{merged}๋ฅผ ์ฐพ๋Š”๋‹ค:

  1. ์ฒซ ๋ฒˆ์งธ ํŒŒ์ง€์˜ force closure ์œ ์ง€
  2. ๋‘ ๋ฒˆ์งธ ํŒŒ์ง€์˜ force closure ์œ ์ง€
  3. ์†-๋ฌผ์ฒด-๋ฌผ์ฒด ๊ฐ„ ๊ด€ํ†ต ์—†์Œ
  4. ๋ฌผ์ฒด ๊ฐ„ ์ถฉ๋Œ ์—†์Œ

์ดํ›„ ๋ฌผ๋ฆฌ ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ์—์„œ ๋ณ‘ํ•ฉ๋œ ํŒŒ์ง€ ์„ค์ •์˜ ์•ˆ์ •์„ฑ์„ ๊ฒ€์ฆํ•˜์—ฌ ๋ฐ์ดํ„ฐ์…‹์„ ๊ตฌ์„ฑํ•œ๋‹ค.

2.2.5 2.5 ํ™•์‚ฐ ๋ชจ๋ธ ๊ธฐ๋ฐ˜ ํŒŒ์ง€ ์ œ์•ˆ โ€” ์‹ค์„ธ๊ณ„ ์ผ๋ฐ˜ํ™”

์˜คํ”„๋ผ์ธ์œผ๋กœ ํ•ฉ์„ฑํ•œ ํŒŒ์ง€ ๋ฐ์ดํ„ฐ์…‹์€ ์ด์ œ point cloud ์กฐ๊ฑด๋ถ€ ํ™•์‚ฐ ๋ชจ๋ธ(diffusion model) ํ•™์Šต์— ์‚ฌ์šฉ๋œ๋‹ค.

flowchart LR
    subgraph ํ›ˆ๋ จ
        T1["ํŒŒ์ง€ ๋ฐ์ดํ„ฐ์…‹\n(์‹œ๋ฎฌ ํ•ฉ์„ฑ)"] --> T2["๋…ธ์ด์ฆˆ ์ฃผ์ž…\n(Forward Diffusion)"]
        T2 --> T3["๋…ธ์ด์ฆˆ ์ œ๊ฑฐ ํ•™์Šต\n(Reverse Diffusion)"]
    end

    subgraph ์ถ”๋ก 
        I1["RGB-D ์นด๋ฉ”๋ผ\nโ†’ Point Cloud"] --> I2["๊ฐ์ฒด ๋ถ„ํ• \n(SAM ๋“ฑ)"]
        I2 --> I3["ํ™•์‚ฐ ๋ชจ๋ธ ์ถ”๋ก \nโ†’ ํŒŒ์ง€ ํฌ์ฆˆ ์ œ์•ˆ"]
        I3 --> I4["๋ฌผ๋ฆฌ์  ํ•„ํ„ฐ๋ง\n+ ํฌ์ฆˆ ์„ ํƒ"]
    end

ํ™•์‚ฐ ๋ชจ๋ธ์€ ๋‘ ๋ฌผ์ฒด์˜ point cloud๋ฅผ ์กฐ๊ฑด์œผ๋กœ ๋ฐ›์•„ (ฮธ, T_1, T_2) ํ˜•ํƒœ์˜ ํŒŒ์ง€ ์„ค์ •์„ ์ง์ ‘ ์ƒ์„ฑํ•œ๋‹ค. ์ด ์ ‘๊ทผ๋ฒ•์˜ ์žฅ์ ์€ ์‹ค์„ธ๊ณ„์˜ noisyํ•œ point cloud ์ž…๋ ฅ์—์„œ๋„ ํ•ฉ๋ฆฌ์ ์ธ ํŒŒ์ง€๋ฅผ ์ œ์•ˆํ•  ์ˆ˜ ์žˆ๋‹ค๋Š” ์ ์ด๋‹ค.

2.2.6 2.6 ์‹คํ–‰ ์ „๋žต โ€” ํœด๋ฆฌ์Šคํ‹ฑ ๊ธฐ๋ฐ˜ ์ˆœ์ฐจ ์‹คํ–‰

ํŒŒ์ง€ ํฌ์ฆˆ๊ฐ€ ๊ฒฐ์ •๋œ ํ›„์˜ ์‹คํ–‰ ๋‹จ๊ณ„๋„ ์ค‘์š”ํ•˜๋‹ค. ๋…ผ๋ฌธ์€ ๋‹ค์Œ๊ณผ ๊ฐ™์€ ํœด๋ฆฌ์Šคํ‹ฑ ๊ธฐ๋ฐ˜ ์ˆœ์ฐจ ์‹คํ–‰ ์ „๋žต์„ ์‚ฌ์šฉํ•œ๋‹ค:

์ˆœ์ฐจ ์‹คํ–‰ ์•Œ๊ณ ๋ฆฌ์ฆ˜:

1. ์†์„ pre-grasp ์œ„์น˜๋กœ ์ด๋™
2. [Phase 1: ์ฒซ ๋ฒˆ์งธ ๋ฌผ์ฒด ํŒŒ์ง€]
   a. ์ฒซ ๋ฒˆ์งธ ํŒŒ์ง€ ์ž์„ธ(pinch-like)๋กœ ์†๊ฐ€๋ฝ ์ด๋™
   b. ์ฒซ ๋ฒˆ์งธ ๋ฌผ์ฒด๋ฅผ ๋‹ซ์•„ ํŒŒ์ง€
   c. ๋ฌผ์ฒด ๋“ค์–ด์˜ฌ๋ฆฌ๊ธฐ
3. [Phase 2: ๋‘ ๋ฒˆ์งธ ๋ฌผ์ฒด ํŒŒ์ง€]
   a. ์† ํšŒ์ „ (side grasp ๊ฐ€๋Šฅ ์œ„์น˜๋กœ)
   b. ๋‘ ๋ฒˆ์งธ ๋ฌผ์ฒด ์œ„์น˜๋กœ ์ด๋™
   c. ์•ฝ์ง€ + ์†๋ฐ”๋‹ฅ์œผ๋กœ ๋‘ ๋ฒˆ์งธ ๋ฌผ์ฒด ํŒŒ์ง€
   d. ๋‘ ๋ฌผ์ฒด ๋ชจ๋‘ ๋“ค์–ด์˜ฌ๋ฆฌ๊ธฐ

ํ•ต์‹ฌ์€ Phase 1์—์„œ ์ฒซ ๋ฒˆ์งธ ๋ฌผ์ฒด๋ฅผ ์žก์€ ์ƒํƒœ๋ฅผ ์œ ์ง€ํ•˜๋ฉด์„œ Phase 2๋ฅผ ์ˆ˜ํ–‰ํ•ด์•ผ ํ•œ๋‹ค๋Š” ๊ฒƒ์ด๋‹ค. ์ด๋ฅผ ์œ„ํ•ด ์† ํšŒ์ „ ์‹œ ์ฒซ ๋ฒˆ์งธ ๋ฌผ์ฒด๋ฅผ ๋–จ์–ด๋œจ๋ฆฌ์ง€ ์•Š๋„๋ก ๋ชจ์…˜์„ ์‹ ์ค‘ํ•˜๊ฒŒ ๊ณ„ํšํ•œ๋‹ค.


2.3 3. ์‹คํ—˜ โ€” ์ˆซ์ž๋กœ ๋ณด๋Š” ์„ฑ๋Šฅ

2.3.1 3.1 ์‹คํ—˜ ์„ค์ •

ํ•˜๋“œ์›จ์–ด: Franka Panda ๋กœ๋ด‡ ํŒ” + 4์†๊ฐ€๋ฝ 16-DOF Allegro Hand

๋ฌผ์ฒด: ์ด 18๊ฐœ ๋ฌผ์ฒด ์Œ ๋Œ€์ƒ ์‹คํ—˜ (์‹œ๋ฎฌ: 8ร—8=64๊ฐœ ์กฐํ•ฉ, ์‹ค์„ธ๊ณ„: 6ร—3=18๊ฐœ ์กฐํ•ฉ)

ํ‰๊ฐ€ ๋ฐฉ์‹: ๋‘ ๊ฐ€์ง€๋กœ ๊ตฌ๋ถ„

ํ‰๊ฐ€ ์œ ํ˜• ์„ค๋ช… ํŠธ๋ผ์ด์–ผ ์ˆ˜
SG (Synthesized Grasp) ์‹œ๋ฎฌ์—์„œ ํ•ฉ์„ฑ๋œ ํŒŒ์ง€ ํฌ์ฆˆ๋ฅผ ์ง์ ‘ ์‹คํ–‰ 90ํšŒ
LG (Learned Grasp) ํ™•์‚ฐ ๋ชจ๋ธ์ด ์ƒ์„ฑํ•œ ํŒŒ์ง€ ํฌ์ฆˆ ์‚ฌ์šฉ 90ํšŒ

์„ฑ๊ณต ๊ธฐ์ค€: ๋‘ ๋ฌผ์ฒด ๋ชจ๋‘ ๋™์‹œ์— ๋“ค์–ด ์˜ฌ๋ฆฌ๊ธฐ

2.3.2 3.2 ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๊ฒฐ๊ณผ

ํ™•์‚ฐ ๋ชจ๋ธ(LG)์€ 1600ํšŒ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ํŠธ๋ผ์ด์–ผ์—์„œ ํ‰๊ท  65.8% ์„ฑ๊ณต๋ฅ ์„ ๊ธฐ๋กํ–ˆ๋‹ค.

xychart-beta
    title "์‹œ๋ฎฌ๋ ˆ์ด์…˜ ์„ฑ๊ณต๋ฅ  (Diffusion Model, LG)"
    x-axis ["์ „์ฒด ํ‰๊ท ", "์‰ฌ์šด ์Œ", "์–ด๋ ค์šด ์Œ"]
    y-axis "์„ฑ๊ณต๋ฅ  (%)" 0 --> 100
    bar [65.8, 78.2, 43.1]

์ฐธ๊ณ : ์œ„ ์ฐจํŠธ๋Š” ๋…ผ๋ฌธ ๊ฒฐ๊ณผ๋ฅผ ๋ฐ”ํƒ•์œผ๋กœ ๋Œ€๋žต์  ์ˆ˜์น˜๋ฅผ ํ‘œํ˜„ํ•œ ๊ฒƒ์ด๋ฉฐ, ์‹ค์ œ ๋…ผ๋ฌธ์˜ ์„ธ๋ถ€ ๋ถ„๋ฅ˜์™€ ๋‹ค๋ฅผ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

2.3.3 3.3 ์‹ค์„ธ๊ณ„ ๊ฒฐ๊ณผ

๋ฐฉ๋ฒ• ์„ฑ๊ณต๋ฅ  ํŠธ๋ผ์ด์–ผ ์ˆ˜
SG (์‹œ๋ฎฌ ํ•ฉ์„ฑ ํŒŒ์ง€ ์ง์ ‘ ์‹คํ–‰) ~์ธก์ •๋จ 90
LG (ํ™•์‚ฐ ๋ชจ๋ธ ํŒŒ์ง€) 56.7% 90

์‹ค์„ธ๊ณ„ ์„ฑ๊ณต๋ฅ  56.7%๋Š” ๋‹จ์ˆœํžˆ โ€œ๋ฐ˜ ๋„˜๊ฒŒ ์„ฑ๊ณตํ–ˆ๋‹คโ€๋Š” ๊ฒƒ ์ด์ƒ์˜ ์˜๋ฏธ๊ฐ€ ์žˆ๋‹ค. ์™œ๋ƒํ•˜๋ฉด:

  1. ์ตœ์ดˆ์˜ ์‹ค์„ธ๊ณ„ ์‹คํ—˜์ด๋ผ๋Š” ์ ์—์„œ ๋น„๊ต ๋ฒ ์ด์Šค๋ผ์ธ ์ž์ฒด๊ฐ€ ์กด์žฌํ•˜์ง€ ์•Š์•˜๋‹ค
  2. Sim-to-real ๊ฐญ์„ ํ™•์‚ฐ ๋ชจ๋ธ์ด ์ƒ๋‹นํžˆ ๊ทน๋ณตํ–ˆ์Œ์„ ๋ณด์—ฌ์ค€๋‹ค
  3. 18๊ฐœ์˜ ๋‹ค์–‘ํ•œ ๋ฌผ์ฒด ์Œ์— ๊ฑธ์ณ ์ผ๋ฐ˜ํ™”๋œ ์„ฑ๋Šฅ์„ ๋ณด์ธ๋‹ค

2.3.4 3.4 ์‹คํŒจ ์‚ฌ๋ก€ ๋ถ„์„

๋…ผ๋ฌธ์ด ๋ช…์‹œํ•œ ๋Œ€ํ‘œ์  ์‹คํŒจ ์‚ฌ๋ก€:

  • ๋ ˆ๋ชฌ + ํŽฉ์‹œ ์บ”: ๋ ˆ๋ชฌ์˜ ๋ถˆ๊ทœ์น™ํ•œ ํ‘œ๋ฉด๊ณผ ๋‘ฅ๊ทผ ํ˜•ํƒœ๋กœ ์ธํ•œ ํŒŒ์ง€ ๋ถˆ์•ˆ์ •
  • ๋ฒ„๋‹ˆ + ์‹ค๋ฆฐ๋”: ๋ณต์žกํ•œ ๊ธฐํ•˜ํ•™์  ํ˜•ํƒœ์—์„œ์˜ ์ ‘์ด‰์  ์˜ˆ์ธก ์˜ค๋ฅ˜

์‹คํŒจ๋Š” ์ฃผ๋กœ ๋‘ ๊ฐ€์ง€ ์›์ธ์—์„œ ๊ธฐ์ธํ•œ๋‹ค: 1. Point cloud์˜ ๋…ธ์ด์ฆˆ/๋ถˆ์™„์ „์„ฑ์œผ๋กœ ์ธํ•œ ํŒŒ์ง€ ํฌ์ฆˆ ์˜ค๋ฅ˜ 2. ์ฒซ ๋ฒˆ์งธ ๋ฌผ์ฒด๋ฅผ ์žก์€ ์ฑ„ ์†์„ ํšŒ์ „ํ•  ๋•Œ์˜ ์Šฌ๋ฆฝ


2.4 4. ๋น„ํŒ์  ๊ณ ์ฐฐ โ€” ๊ฐ•์ ๊ณผ ํ•œ๊ณ„

2.4.1 4.1 ๊ฐ•์ 

โ‘  ๋ฌธ์ œ ์ •์˜์˜ ๋ช…ํ™•์„ฑ

โ€œํ•œ ๋ฌผ์ฒด๋ฅผ ์žก์€ ์ฑ„๋กœ ๋‹ค๋ฅธ ๋ฌผ์ฒด๋ฅผ ์ˆœ์ฐจ์ ์œผ๋กœ ํŒŒ์ง€โ€๋ผ๋Š” ๋ฌธ์ œ๋ฅผ ์ฒ˜์Œ์œผ๋กœ ์‹ค์„ธ๊ณ„์—์„œ ๊ตฌํ˜„ํ•œ ์—ฐ๊ตฌ๋กœ, ๋ฌธ์ œ ์„ค์ • ์ž์ฒด๊ฐ€ ์ปค๋ฎค๋‹ˆํ‹ฐ์— ๊ธฐ์—ฌํ•œ๋‹ค. ๊ธฐ์กด MultiGrasp(๋™์‹œ ํŒŒ์ง€)์™€์˜ ์ฐจ๋ณ„์ ์ด ๋ช…ํ™•ํ•˜๋‹ค.

โ‘ก ์‹ค์šฉ์ ์ธ ํŒŒ์ดํ”„๋ผ์ธ ์™„์„ฑ๋„

  • ํ•ฉ์„ฑ โ†’ ๊ฒ€์ฆ โ†’ ํ•™์Šต โ†’ ๋ฐฐํฌ๋กœ ์ด์–ด์ง€๋Š” ์—”๋“œ-ํˆฌ-์—”๋“œ ํŒŒ์ดํ”„๋ผ์ธ
  • ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๊ฒ€์ฆ + ํ™•์‚ฐ ๋ชจ๋ธ ์กฐํ•ฉ์œผ๋กœ ์‹ค์„ธ๊ณ„ ์ผ๋ฐ˜ํ™” ๋‹ฌ์„ฑ
  • ์‹ค์ œ ๋กœ๋ด‡ ํ•˜๋“œ์›จ์–ด(Franka + Allegro)๋กœ์˜ ์„ฑ๊ณต์  ๋ฐฐํฌ

โ‘ข Allegro Hand ํŠนํ™” ์„ค๊ณ„

๊ธฐ์กด MultiGrasp๋Š” Shadow Hand ๊ธฐ๋ฐ˜์ด์—ˆ๋‹ค. ์ด ์—ฐ๊ตฌ๋Š” Allegro Hand์— ํŠนํ™”๋œ ๋ฐ์ดํ„ฐ์…‹๊ณผ ์ „๋žต์„ ์ œ์‹œํ•จ์œผ๋กœ์จ, ์ƒ๋Œ€์ ์œผ๋กœ ์ €๋ ดํ•œ($16,000) Allegro Hand ์—ฐ๊ตฌ์ž๋“ค์—๊ฒŒ ์ง์ ‘์ ์œผ๋กœ ์œ ์šฉํ•˜๋‹ค.

2.4.2 4.2 ํ•œ๊ณ„์™€ ์•ฝ์ 

โ‘  ๊ณ ์ •๋œ ํŒŒ์ง€ ์ „๋žต

Pinch-like(์ฒซ ๋ฒˆ์งธ) + Side grasp(๋‘ ๋ฒˆ์งธ)๋ผ๋Š” ์กฐํ•ฉ์ด ํ•˜๋“œ์›จ์–ด ๊ฒฝํ—˜์—์„œ ๋„์ถœ๋œ ๊ฒƒ์€ ํ•ฉ๋ฆฌ์ ์ด์ง€๋งŒ, ์ด๊ฒƒ์ด ์ตœ์ ์ธ์ง€ ๋ณด์žฅ์ด ์—†๋‹ค. ๋…ผ๋ฌธ ์ž์ฒด๋„ โ€œ๋‹ค๋ฅธ ์ˆœ์ฐจ ํŒŒ์ง€ ์ „๋žต ํƒ์ƒ‰์€ ํ–ฅํ›„ ์—ฐ๊ตฌ๋กœ ๋‚จ๊ธด๋‹คโ€๊ณ  ๋ช…์‹œํ•œ๋‹ค.

์„œ๋กœ ๋‹ค๋ฅธ ๋ฌผ์ฒด ์Œ์— ๋Œ€ํ•ด ์ตœ์  ์ „๋žต์ด ๋‹ค๋ฅผ ์ˆ˜ ์žˆ์Œ์„ ๊ณ ๋ คํ•˜๋ฉด, ์ ์‘์  ์ „๋žต ์„ ํƒ ๋ฉ”์ปค๋‹ˆ์ฆ˜์ด ์—†๋‹ค๋Š” ์ ์ด ์•„์‰ฝ๋‹ค.

โ‘ก ๋‘ ๋ฌผ์ฒด๋กœ์˜ ์ œํ•œ

ํ˜„์žฌ๋Š” ๋‘ ๋ฌผ์ฒด๋งŒ์„ ๋Œ€์ƒ์œผ๋กœ ํ•œ๋‹ค. ์„ธ ๊ฐœ ์ด์ƒ์˜ ๋ฌผ์ฒด๋กœ ํ™•์žฅํ•  ๋•Œ ์†๊ฐ€๋ฝ ๋ถ„ํ•  ์ „๋žต์ด ์–ด๋–ป๊ฒŒ ์ผ๋ฐ˜ํ™”๋ ์ง€ ๋ถˆ๋ช…ํ™•ํ•˜๋‹ค. ๋น„๊ต ๋Œ€์ƒ์ธ SeqGrasp(arXiv:2503.22370)๋Š” ์ตœ๋Œ€ 4๊ฐœ ๋ฌผ์ฒด๋ฅผ ๋‹ค๋ฃจ๋Š” ๋ฐ˜๋ฉด, ์ด ๋…ผ๋ฌธ์€ 2๊ฐœ์— ๊ตญํ•œ๋œ๋‹ค.

โ‘ข ๊ณ ์ • ๋ฌผ์ฒด ๋ฐฐ์น˜

์‹คํ—˜์—์„œ ๋ฌผ์ฒด๋“ค์ด ๋ฏธ๋ฆฌ ์ •ํ•ด์ง„ ์บ๋…ธ๋‹ˆ์ปฌ ํฌ์ฆˆ๋กœ ๋ฐฐ์น˜๋œ๋‹ค. ์‹ค์ œ ํ™˜๊ฒฝ์—์„œ๋Š” ๋ฌผ์ฒด๊ฐ€ ์ž„์˜์˜ ์œ„์น˜์™€ ์ž์„ธ๋กœ ์žˆ์„ ๊ฒƒ์ด๋ฏ€๋กœ, ์ด ๊ฐ€์ •์€ ํ˜„์‹ค ์ ์šฉ์„ฑ์„ ์•ฝํ™”์‹œํ‚จ๋‹ค.

โ‘ฃ ๋ชจ์…˜ ํ”Œ๋ž˜๋‹์˜ ๋ฏธ์™„์„ฑ

ํŒŒ์ง€ ํฌ์ฆˆ ์ƒ์„ฑ๊ณผ ํŒŒ์ง€ ์‹คํ–‰ ์‚ฌ์ด์˜ ๋ชจ์…˜ ํ”Œ๋ž˜๋‹ ํ†ตํ•ฉ์ด ์ถฉ๋ถ„ํ•˜์ง€ ์•Š๋‹ค. ์ƒ์„ฑ๋œ ํŒŒ์ง€ ํฌ์ฆˆ๋กœ ๋„๋‹ฌํ•˜๋Š” ๊ฒฝ๋กœ๋ฅผ ๊ณ„ํšํ•˜๋Š” ๋ถ€๋ถ„์ด ํœด๋ฆฌ์Šคํ‹ฑ์— ์˜์กดํ•˜๋ฉฐ, ์ถฉ๋Œ ํšŒํ”ผ๋‚˜ ๊ด€์ ˆ ํ•œ๊ณ„ ๊ณ ๋ ค๊ฐ€ ์ œํ•œ์ ์ด๋‹ค.

โ‘ค ๋ฌผ์ฒด ์ƒํ˜ธ์ž‘์šฉ ๋ชจ๋ธ๋ง์˜ ๋ถ€์žฌ

๋‘ ๋ฌผ์ฒด๊ฐ€ ๊ฐ€๊นŒ์ด ์žˆ์„ ๋•Œ ๋ฐœ์ƒํ•˜๋Š” ๋ฌผ์ฒด-๋ฌผ์ฒด ์ ‘์ด‰์„ ํŒŒ์ง€ ํ•ฉ์„ฑ ๋‹จ๊ณ„์—์„œ ์ถฉ๋ถ„ํžˆ ๊ณ ๋ คํ•˜์ง€ ์•Š๋Š”๋‹ค. ํŒŒ์ง€ ์ค‘ ๋‘ ๋ฌผ์ฒด๊ฐ€ ์„œ๋กœ ๋ฐ€๋ฆฌ๊ฑฐ๋‚˜ ํ•˜๋Š” ์ƒํ˜ธ์ž‘์šฉ์ด ์‹คํŒจ๋ฅผ ์œ ๋ฐœํ•  ์ˆ˜ ์žˆ๋‹ค.


2.5 5. ๊ด€๋ จ ์—ฐ๊ตฌ์™€์˜ ๋น„๊ต

2.5.1 5.1 ๋™์‹œ๋Œ€ ์œ ์‚ฌ ์—ฐ๊ตฌ: SeqGrasp (arXiv:2503.22370)

๊ฑฐ์˜ ๋™์‹œ์— ๋“ฑ์žฅํ•œ โ€œGrasping a Handful: Sequential Multi-Object Dexterous Grasp Generationโ€ ๋…ผ๋ฌธ๊ณผ ์ง์ ‘ ๋น„๊ต๊ฐ€ ํฅ๋ฏธ๋กญ๋‹ค.

๋น„๊ต ํ•ญ๋ชฉ SeqMultiGrasp (์ด ๋…ผ๋ฌธ) SeqGrasp / SeqDiffuser
๋Œ€์ƒ ํ•˜๋“œ์›จ์–ด Allegro Hand Allegro Hand (hand-agnostic ์„ค๊ณ„)
์ตœ๋Œ€ ๋ฌผ์ฒด ์ˆ˜ 2๊ฐœ 4๊ฐœ
๋ฐ์ดํ„ฐ์…‹ ๊ทœ๋ชจ ์ž์ฒด ๊ตฌ์ถ• (๊ทœ๋ชจ ๋ฏธ๊ณต๊ฐœ) 870K ํŒŒ์ง€, 509 ๋ฌผ์ฒด
์‹ค์„ธ๊ณ„ ์‹คํ—˜ โœ… 180 ํŠธ๋ผ์ด์–ผ โœ… ์‹ค์„ธ๊ณ„ ๊ฒ€์ฆ
ํŒŒ์ง€ ์ „๋žต ๊ณ ์ •๋œ pinch+side ์กฐํ•ฉ Opposition Space ๊ธฐ๋ฐ˜ ๋™์  ์„ ํƒ
์ถ”๋ก  ์†๋„ ํ™•์‚ฐ ๋ชจ๋ธ (๋น ๋ฆ„) SeqDiffuser: ์ตœ์ ํ™” ๋Œ€๋น„ 750~1250ร— ๋น ๋ฆ„
์„ฑ๊ณต๋ฅ  ๋น„๊ต 56.7% (์‹ค์„ธ๊ณ„) MultiGrasp ๋Œ€๋น„ 8.71~43.33% ํ–ฅ์ƒ

SeqGrasp๊ฐ€ ๋” ํฐ ์Šค์ผ€์ผ๊ณผ ๋” ๋งŽ์€ ๋ฌผ์ฒด๋ฅผ ๋‹ค๋ฃจ๋Š” ๋ฐ˜๋ฉด, SeqMultiGrasp๋Š” ์‹ค์„ธ๊ณ„ ์‹คํ—˜์˜ ๊ตฌ์ฒด์„ฑ๊ณผ ์™„์ „ํ•œ ์‹œ์Šคํ…œ ํŒŒ์ดํ”„๋ผ์ธ์—์„œ ๊ฐ•์ ์„ ๋ณด์ธ๋‹ค.

2.5.2 5.2 ์ด์ „ ์—ฐ๊ตฌ: MultiGrasp (arXiv:2310.15599)

MultiGrasp๋Š” Shadow Hand๋ฅผ ์ด์šฉํ•œ ๋™์‹œ(simultaneous) ๋‹ค์ค‘ ๋ฌผ์ฒด ํŒŒ์ง€๋ฅผ ๋‹ค๋ฃฌ๋‹ค. ํ•ต์‹ฌ ์ฐจ์ด:

  • MultiGrasp: ๋ชจ๋“  ๋ฌผ์ฒด๋ฅผ ํ•œ ๋ฒˆ์— ํŒŒ์ง€ (๋™์‹œ์„ฑ)
  • SeqMultiGrasp: ๋ฌผ์ฒด๋ฅผ ํ•˜๋‚˜์”ฉ ์ˆœ์„œ๋Œ€๋กœ ํŒŒ์ง€ (์ˆœ์ฐจ์„ฑ)

์ˆœ์ฐจ์  ์ ‘๊ทผ์˜ ์žฅ์ ์€ ๊ฐ ํŒŒ์ง€๋ฅผ ๋…๋ฆฝ์ ์œผ๋กœ ์ตœ์ ํ™”ํ•  ์ˆ˜ ์žˆ์–ด ๋ณต์žกํ•œ ์ƒํ˜ธ์ž‘์šฉ์„ ํ”ผํ•  ์ˆ˜ ์žˆ๋‹ค๋Š” ์ ์ด๋‹ค. ๋‹ค๋งŒ ์‹คํ–‰ ์‹œ๊ฐ„์ด ๋” ๊ธธ์–ด์ง„๋‹ค.

2.5.3 5.3 DFC ๊ธฐ๋ฐ˜ ๋ฐฉ๋ฒ•๋ก ๊ณผ์˜ ๊ด€๊ณ„

์ด ๋…ผ๋ฌธ์€ DFC(Differentiable Force Closure) ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•˜๋ฉฐ, ์ด๋Š” Liu et al. (2021)์˜ ์ž‘์—…์—์„œ ๋น„๋กฏ๋œ๋‹ค. DFC์˜ ํ•ต์‹ฌ์€ force closure๋ฅผ ๋ฏธ๋ถ„ ๊ฐ€๋Šฅํ•œ ์—๋„ˆ์ง€ ํ•จ์ˆ˜๋กœ ํ‘œํ˜„ํ•ด ๊ฒฝ์‚ฌ ๊ธฐ๋ฐ˜ ์ตœ์ ํ™”๋ฅผ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•œ๋‹ค๋Š” ์ ์ด๋‹ค. SeqMultiGrasp๋Š” ์—ฌ๊ธฐ์— ๋งํฌ ์„œ๋ธŒ์…‹ ์ œ์•ฝ์„ ์ถ”๊ฐ€ํ•จ์œผ๋กœ์จ ์ˆœ์ฐจ ํŒŒ์ง€ ๋ฌธ์ œ์— ๋งž๊ฒŒ ํ™•์žฅํ•œ๋‹ค.


2.6 6. ์ƒˆ๋กœ์šด ๋กœ๋ด‡ ์† ํ”Œ๋žซํผ์—์„œ SeqMultiGrasp๋ฅผ ์žฌํ˜„ํ•˜๋ ค๋ฉด

SeqMultiGrasp๋Š” Allegro Hand์— ํŠนํ™”๋œ ์„ค๊ณ„๋ฅผ ๊ฐ–๊ณ  ์žˆ์ง€๋งŒ, ๊ทธ ๊ตฌ์กฐ๋Š” ๋‹ค๋ฅธ ๋‹ค์ง€ ๋กœ๋ด‡ ์†์œผ๋กœ๋„ ์ถฉ๋ถ„ํžˆ ์ด์‹ํ•  ์ˆ˜ ์žˆ๋‹ค. LEAP Hand, Shadow Hand, Inspire Hand, ํ˜น์€ ์ž์ฒด ์ œ์ž‘ ์† ํ”Œ๋žซํผ์„ ์‚ฌ์šฉํ•˜๋Š” ์—ฐ๊ตฌ์ž๋ผ๋ฉด ๋‹ค์Œ ๊ณผ์ •์„ ๋”ฐ๋ผ๊ฐ€ ๋ณด์ž.

2.6.1 6.1 Step 1 โ€” ํ•˜๋“œ์›จ์–ด ํŠน์„ฑ ๋ถ„์„๊ณผ ํŒŒ์ง€ ์ „๋žต ์žฌ์„ค๊ณ„

SeqMultiGrasp์˜ โ€œPinch-like + Side graspโ€ ์กฐํ•ฉ์€ Allegro Hand์˜ ์†๊ฐ€๋ฝ ๋ฐฐ์น˜์™€ ๊ด€์ ˆ ๊ฐ€๋™ ๋ฒ”์œ„์—์„œ ๊ฒฝํ—˜์ ์œผ๋กœ ๋„์ถœ๋œ ๊ฒƒ์ด๋‹ค. ์ƒˆ๋กœ์šด ํ”Œ๋žซํผ์—์„œ๋Š” ์ด ์ „๋žต์„ ๊ทธ๋Œ€๋กœ ๊ฐ€์ ธ์˜ค๋ฉด ์•ˆ ๋œ๋‹ค. ๋จผ์ € ๋‹ค์Œ ์งˆ๋ฌธ์— ๋‹ตํ•ด์•ผ ํ•œ๋‹ค.

  • ์†๊ฐ€๋ฝ ์ˆ˜์™€ ๊ฐ ์†๊ฐ€๋ฝ์˜ DOF๋Š” ์–ผ๋งˆ์ธ๊ฐ€?
  • ๊ฐ ์†๊ฐ€๋ฝ์ด ๋…๋ฆฝ์ ์œผ๋กœ ๋ฌผ์ฒด๋ฅผ ์•ˆ์ •์ ์œผ๋กœ ์žก์„ ์ˆ˜ ์žˆ๋Š”๊ฐ€, ์•„๋‹ˆ๋ฉด ๋ฐ˜๋“œ์‹œ ์—ฌ๋Ÿฌ ์†๊ฐ€๋ฝ์ด ํ˜‘๋ ฅํ•ด์•ผ ํ•˜๋Š”๊ฐ€?
  • ์†๋ฐ”๋‹ฅ(palm)์˜ ํ˜•ํƒœ๊ฐ€ ๋ฌผ์ฒด ๋ฐ›์นจ์œผ๋กœ ํ™œ์šฉ ๊ฐ€๋Šฅํ•œ๊ฐ€?
  • ์—„์ง€๊ฐ€ ๋Œ€ํ–ฅ ๊ฐ€๋Šฅํ•œ ๊ตฌ์กฐ์ธ๊ฐ€(opposable thumb)?

์ด ๋ถ„์„ ๊ฒฐ๊ณผ๋ฅผ ๋ฐ”ํƒ•์œผ๋กœ, โ€œ์–ด๋–ค ๋งํฌ ์„œ๋ธŒ์…‹์ด ์ฒซ ๋ฒˆ์งธ ๋ฌผ์ฒด๋ฅผ ์•ˆ์ •์ ์œผ๋กœ ์žก์„ ์ˆ˜ ์žˆ๋Š”๊ฐ€โ€์™€ โ€œ๋‚จ์€ ๋งํฌ๋กœ ๋‘ ๋ฒˆ์งธ ๋ฌผ์ฒด๋ฅผ ์žก์„ ์ˆ˜ ์žˆ๋Š”๊ฐ€โ€๋ฅผ ๋™์‹œ์— ๋งŒ์กฑํ•˜๋Š” ๋ถ„ํ•  ์ „๋žต์„ ์ƒˆ๋กœ ์ •์˜ํ•ด์•ผ ํ•œ๋‹ค.

flowchart TD
    A["์ƒˆ ํ”Œ๋žซํผ URDF / ๊ด€์ ˆ ๊ตฌ์กฐ ๋ถ„์„"] --> B["์†๊ฐ€๋ฝ ์กฐํ•ฉ๋ณ„ ๊ฐ€๋™ ๋ฒ”์œ„ ์‹œ๋ฎฌ๋ ˆ์ด์…˜"]
    B --> C{"๋…๋ฆฝ ํŒŒ์ง€ ๊ฐ€๋Šฅํ•œ\n์„œ๋ธŒ์…‹ ์Œ ์กด์žฌ?"}
    C -- "์˜ˆ" --> D["ํŒŒ์ง€ ์œ ํ˜• ์ •์˜\n(Grasp Type 1 / Type 2)"]
    C -- "์•„๋‹ˆ์˜ค" --> E["์†๊ฐ€๋ฝ ์ˆ˜ ๋ถ€์กฑ\nโ†’ ๋‹จ์ผ ๋ฌผ์ฒด๋กœ ๋ฒ”์œ„ ์ถ•์†Œ ๊ฒ€ํ† "]
    D --> F["๊ฐ ์œ ํ˜•์˜ Contact Candidate\nํฌ์ธํŠธ ๋งต ์ˆ˜๋™ ์ •์˜"]

์˜ˆ๋ฅผ ๋“ค์–ด Shadow Hand(5์†๊ฐ€๋ฝ, 20+ DOF)๋ผ๋ฉด, ์—„์ง€-๊ฒ€์ง€-์ค‘์ง€๋กœ Pinch, ์•ฝ์ง€-์†Œ์ง€๋กœ Power grasp๋ฅผ ๊ตฌ์„ฑํ•˜๋Š” ๋ฐฉ์‹์ด ์ž์—ฐ์Šค๋Ÿฝ๋‹ค. LEAP Hand์ฒ˜๋Ÿผ ์†๊ฐ€๋ฝ 4๊ฐœ์— ๊ด€์ ˆ์ด ์ ์€ ๊ฒฝ์šฐ, ์“ธ ์ˆ˜ ์žˆ๋Š” DoF๊ฐ€ ๋” ์ œํ•œ์ ์ด๋ฏ€๋กœ ์ „๋žต์„ ๋‹จ์ˆœํ™”ํ•ด์•ผ ํ•œ๋‹ค.

2.6.2 6.2 Step 2 โ€” URDF ๋“ฑ๋ก๊ณผ ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ ์„ค์ •

DFC ์•Œ๊ณ ๋ฆฌ์ฆ˜๊ณผ ๋ฌผ๋ฆฌ ๊ฒ€์ฆ์„ ์ƒˆ ํ”Œ๋žซํผ์—์„œ ๋Œ๋ฆฌ๋ ค๋ฉด ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ์— ์† ๋ชจ๋ธ์„ ์˜ฌ๋ฐ”๋ฅด๊ฒŒ ๋“ฑ๋กํ•ด์•ผ ํ•œ๋‹ค.

์ฒดํฌ๋ฆฌ์ŠคํŠธ:

ํ•ญ๋ชฉ ๋‚ด์šฉ
URDF ์ •ํ™•๋„ ๊ด€์ ˆ ํ•œ๊ณ„(joint limits), ๋งํฌ ์งˆ๋Ÿ‰, ์ถฉ๋Œ ๋ฉ”์‹œ(collision mesh) ์ •ํ™•์„ฑ ํ™•์ธ
์ ‘์ด‰ ํ›„๋ณด ๋งต ๊ฐ ๋งํฌ ํ‘œ๋ฉด์—์„œ ์ ‘์ด‰์ ์œผ๋กœ ์‚ฌ์šฉํ•  ํ›„๋ณด ํฌ์ธํŠธ ์ง‘ํ•ฉ ์ •์˜
๋งˆ์ฐฐ ๊ณ„์ˆ˜ ์‹ค์ œ ํ•˜๋“œ์›จ์–ด์˜ ์†๊ฐ€๋ฝ ํ‘œ๋ฉด ์žฌ์งˆ์— ๋งž๋Š” ๋งˆ์ฐฐ ๊ณ„์ˆ˜ ์„ค์ •
ํŒŒ์ง€ ์œ ํ˜• ๋งˆ์Šคํฌ Type 1(์ฒซ ๋ฒˆ์งธ ๋ฌผ์ฒด์šฉ)๊ณผ Type 2(๋‘ ๋ฒˆ์งธ ๋ฌผ์ฒด์šฉ) ๋งํฌ ๋งˆ์Šคํฌ ์ฝ”๋“œ ์ˆ˜์ •

Isaac Gym / Isaac Sim ํ™˜๊ฒฝ์„ ์‚ฌ์šฉํ•˜๋Š” ๊ฒฝ์šฐ, GPU ๋ณ‘๋ ฌํ™”๋ฅผ ์ตœ๋Œ€ํ•œ ํ™œ์šฉํ•ด ๋Œ€๊ทœ๋ชจ ํŒŒ์ง€ ํ›„๋ณด๋ฅผ ๊ฒ€์ฆํ•  ์ˆ˜ ์žˆ๋‹ค. MuJoCo๋ฅผ ์‚ฌ์šฉํ•œ๋‹ค๋ฉด mjcf ํ˜•์‹ ๋ณ€ํ™˜์ด ์„ ํ–‰๋˜์–ด์•ผ ํ•œ๋‹ค.

2.6.3 6.3 Step 3 โ€” ํ•ฉ์„ฑ ๋ฐ์ดํ„ฐ์…‹ ์žฌ๊ตฌ์ถ•

๊ธฐ์กด SeqMultiGrasp ๋ฐ์ดํ„ฐ์…‹์€ Allegro Hand์— ํŠนํ™”๋˜์–ด ์žˆ์œผ๋ฏ€๋กœ ์ƒˆ ํ”Œ๋žซํผ์šฉ ๋ฐ์ดํ„ฐ์…‹์„ ์ฒ˜์Œ๋ถ€ํ„ฐ ๊ตฌ์ถ•ํ•ด์•ผ ํ•œ๋‹ค. ํŒŒ์ดํ”„๋ผ์ธ ์ž์ฒด๋Š” ๋™์ผํ•˜๊ฒŒ ์ ์šฉํ•  ์ˆ˜ ์žˆ๋‹ค.

1. ๋ฌผ์ฒด ๋ฉ”์‹œ ์ˆ˜์ง‘ (YCB, ShapeNet, ๋˜๋Š” ์ž์ฒด ์Šค์บ”)
2. DFC ์—๋„ˆ์ง€ ์ตœ์†Œํ™”๋กœ ๋‹จ์ผ ๋ฌผ์ฒด ํŒŒ์ง€ ํ›„๋ณด ํ•ฉ์„ฑ
   - ๋งํฌ ์„œ๋ธŒ์…‹ ์ œ์•ฝ ์ ์šฉ (์ƒˆ ํ”Œ๋žซํผ์— ๋งž๊ฒŒ)
3. ๋ฌผ๋ฆฌ ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ์—์„œ ์•ˆ์ •์„ฑ ๊ฒ€์ฆ (์‹œ๋ฎฌ ๋“œ๋กญ ํ…Œ์ŠคํŠธ)
4. ๊ฒ€์ฆ๋œ ํŒŒ์ง€ ์Œ์„ ๋ณ‘ํ•ฉ โ†’ ๋‹ค์ค‘ ๋ฌผ์ฒด ํŒŒ์ง€ ์„ค์ • ์ƒ์„ฑ
5. ๋ณ‘ํ•ฉ ํ›„ ์žฌ๊ฒ€์ฆ (๋‘ ๋ฌผ์ฒด ๋™์‹œ ์•ˆ์ •์„ฑ)

์†Œ์š” ์‹œ๊ฐ„ ์˜ˆ์ƒ: GPU ํด๋Ÿฌ์Šคํ„ฐ ์‚ฌ์šฉ ์‹œ ์ˆ˜์ฒœ ๊ฐœ ํŒŒ์ง€ ํ›„๋ณด ํ•ฉ์„ฑ์— ์ˆ˜ ์‹œ๊ฐ„~์ˆ˜์‹ญ ์‹œ๊ฐ„ ์ˆ˜์ค€. hesic73/SeqMultiGrasp ์ฝ”๋“œ๋ฒ ์ด์Šค์—์„œ ์† ๋ชจ๋ธ ๊ด€๋ จ ํด๋ž˜์Šค๋งŒ ๊ต์ฒดํ•˜๋Š” ๋ฐฉ์‹์œผ๋กœ ์‹œ์ž‘ํ•˜๋ฉด ๋น ๋ฅด๋‹ค.

2.6.4 6.4 Step 4 โ€” ํ™•์‚ฐ ๋ชจ๋ธ ์žฌํ•™์Šต

ํ™•์‚ฐ ๋ชจ๋ธ ๊ตฌ์กฐ ์ž์ฒด๋Š” ์ž…๋ ฅ ์กฐ๊ฑด(point cloud) ๊ณผ ์ถœ๋ ฅ ๊ณต๊ฐ„(๊ด€์ ˆ ์„ค์ • + ํฌ์ฆˆ) ์˜ ์ฐจ์›๋งŒ ๋‹ฌ๋ผ์ง€๋ฏ€๋กœ ๋น„๊ต์  ์‰ฝ๊ฒŒ ์ด์‹๋œ๋‹ค.

์ฃผ์˜ํ•  ์ :

  • ์ถœ๋ ฅ ์ฐจ์› ๋ณ€๊ฒฝ: Allegro๋Š” ๊ด€์ ˆ์ด 16๊ฐœ์ง€๋งŒ, ์ƒˆ ํ”Œ๋žซํผ์˜ DOF ์ˆ˜์— ๋งž๊ฒŒ ์ถœ๋ ฅ ํ—ค๋“œ๋ฅผ ์ˆ˜์ •ํ•ด์•ผ ํ•œ๋‹ค
  • ์ •๊ทœํ™” ๋ฒ”์œ„ ์žฌ์„ค์ •: ๊ด€์ ˆ ๊ฐ๋„์˜ ๋ฒ”์œ„๊ฐ€ ๋‹ค๋ฅด๋ฏ€๋กœ ์ •๊ทœํ™” ์Šค์ผ€์ผ ์žฌ์กฐ์ • ํ•„์š”
  • ๊ธฐํ•˜ํ•™ ์†์‹ค ์ถ”๊ฐ€ ์—ฌ๋ถ€: ์ผ๋ถ€ ํ›„์† ์—ฐ๊ตฌ(DexEvolve ๋“ฑ)๋Š” forward kinematics๋ฅผ ํ†ตํ•ด keypoint ์œ„์น˜ ์†์‹ค์„ ์ถ”๊ฐ€ํ•˜๋Š”๋ฐ, ์ƒˆ ํ”Œ๋žซํผ์—์„œ ์ด๋ฅผ ์ ์šฉํ•˜๋ฉด sim-to-real ํ’ˆ์งˆ์„ ๋†’์ผ ์ˆ˜ ์žˆ๋‹ค
# ์˜ˆ: ๊ด€์ ˆ ์ฐจ์› ์ˆ˜์ • ์˜ˆ์‹œ (์˜์‚ฌ์ฝ”๋“œ)
class GraspDiffusionModel(nn.Module):
    def __init__(self, dof: int, ...):
        # Allegro: dof=16
        # Shadow: dof=24
        # LEAP:   dof=16
        self.joint_head = nn.Linear(hidden_dim, dof)
        self.wrist_head = nn.Linear(hidden_dim, 6)  # SE(3) ํ‘œํ˜„

2.6.5 6.5 Step 5 โ€” ์‹ค์„ธ๊ณ„ Sim-to-Real ๊ฐญ ์ค„์ด๊ธฐ

์ƒˆ๋กœ์šด ํ”Œ๋žซํผ์ผ์ˆ˜๋ก sim-to-real ๊ฐญ์ด ํฌ๋‹ค. SeqMultiGrasp๋„ ์‹ค์„ธ๊ณ„ ์„ฑ๊ณต๋ฅ ์ด ์‹œ๋ฎฌ(65.8%)๋ณด๋‹ค ๋‚ฎ์€ 56.7%์˜€๋Š”๋ฐ, ์ƒˆ ํ”Œ๋žซํผ์—์„œ๋Š” ์ดˆ๊ธฐ์— ๋” ๋‚ฎ์„ ์ˆ˜ ์žˆ๋‹ค. ์ด๋ฅผ ์ค„์ด๋Š” ์‹ค์šฉ์  ๋ฐฉ๋ฒ•๋“ค:

์ „๋žต ์„ค๋ช…
๋„๋ฉ”์ธ ๋žœ๋คํ™” ์‹œ๋ฎฌ ํ›ˆ๋ จ ์‹œ ๋งˆ์ฐฐ๊ณ„์ˆ˜, ๋ฌผ์ฒด ์งˆ๋Ÿ‰, ๊ด€์ ˆ ๋Œํ•‘์„ ๋ฌด์ž‘์œ„๋กœ ๋ณ€ํ™”
์‹ค์„ธ๊ณ„ ๋ฏธ์„ธ์กฐ์ • ์†Œ๋Ÿ‰์˜ ์‹ค์„ธ๊ณ„ ํŒŒ์ง€ ์‹œ๋„ ๊ฒฐ๊ณผ๋กœ ํ™•์‚ฐ ๋ชจ๋ธ fine-tuning
์ด‰๊ฐ ํ”ผ๋“œ๋ฐฑ ํ†ตํ•ฉ ์Šฌ๋ฆฝ ๊ฐ์ง€ ์„ผ์„œ๋กœ ํŒŒ์ง€ ์ค‘ ์‹ค์‹œ๊ฐ„ ์กฐ์ • (Tactile sensing)
Point cloud ๋…ธ์ด์ฆˆ ์ฆ๊ฐ• ํ›ˆ๋ จ ์‹œ depth camera ๋…ธ์ด์ฆˆ๋ฅผ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ํ•ด real-world ๊ฐ•๊ฑด์„ฑ ํ–ฅ์ƒ

2.6.6 6.6 ์š”์•ฝ: ์ด์‹ ๋‚œ์ด๋„ ์ฒดํฌ๋ฆฌ์ŠคํŠธ

flowchart LR
    A["๐Ÿ”ต ์‰ฌ์›€\nํ™•์‚ฐ ๋ชจ๋ธ ๊ตฌ์กฐ\n(์ฐจ์›๋งŒ ์ˆ˜์ •)"] --> B
    B["๐ŸŸก ์ค‘๊ฐ„\nํŒŒ์ง€ ์ „๋žต ์žฌ์„ค๊ณ„\n(๋งํฌ ์„œ๋ธŒ์…‹ ์ •์˜)"] --> C
    C["๐Ÿ”ด ์–ด๋ ค์›€\n์ƒˆ ๋ฐ์ดํ„ฐ์…‹ ๊ตฌ์ถ•\n(์‹œ๋ฎฌ ๋Œ€๊ทœ๋ชจ ํ•ฉ์„ฑ)"] --> D
    D["๐Ÿ”ด ์–ด๋ ค์›€\nSim-to-Real ๊ฐญ\n(๋ฐ˜๋ณต ์‹คํ—˜ ํ•„์š”)"]

๊ฒฐ๊ตญ ๊ฐ€์žฅ ํฐ ๋น„์šฉ์€ ์ƒˆ ํ”Œ๋žซํผ์— ๋งž๋Š” ๋ฐ์ดํ„ฐ์…‹ ์žฌ๊ตฌ์ถ•๊ณผ sim-to-real ๊ฐญ ํ•ด์†Œ์— ์žˆ๋‹ค. ๊ตฌ์กฐ ์ดํ•ด์™€ ์ฝ”๋“œ ์ด์‹ ์ž์ฒด๋Š” ๊ณต๊ฐœ๋œ ์ฝ”๋“œ๋ฒ ์ด์Šค๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ์ˆ˜ ์ฃผ ๋‚ด์— ๊ฐ€๋Šฅํ•œ ์ˆ˜์ค€์ด๋‹ค.


2.7 7. ํ–ฅํ›„ ์—ฐ๊ตฌ ๋ฐฉํ–ฅ

์ด ๋…ผ๋ฌธ์ด ์—ด์–ด๋†“์€ ๋ฏธ๋ž˜ ์—ฐ๊ตฌ ๋ฐฉํ–ฅ๋“ค์„ ์ •๋ฆฌํ•˜๋ฉด:

โ‘  ์ ์‘์  ํŒŒ์ง€ ์ „๋žต ์„ ํƒ

๋ฌผ์ฒด ์Œ์˜ ํŠน์„ฑ(ํฌ๊ธฐ, ํ˜•ํƒœ, ๋ฌด๊ฒŒ ๋ถ„ํฌ)์— ๋”ฐ๋ผ ์ตœ์ ์˜ ์†๊ฐ€๋ฝ ๋ถ„ํ•  ์ „๋žต์„ ์ž๋™์œผ๋กœ ์„ ํƒํ•˜๋Š” ๋ฐฉ๋ฒ•. ๊ฐ•ํ™”ํ•™์Šต์ด๋‚˜ LLM ๊ธฐ๋ฐ˜ ์ „๋žต ์„ ํƒ์ด ํ›„๋ณด๋‹ค.

โ‘ก ์„ธ ๊ฐœ ์ด์ƒ ๋ฌผ์ฒด๋กœ ํ™•์žฅ

Allegro Hand์˜ ๋‚จ์€ ์†๊ฐ€๋ฝ ์ž์œ ๋„๋ฅผ ๋” ์„ธ๋ฐ€ํ•˜๊ฒŒ ํ™œ์šฉํ•ด ์„ธ ๋ฒˆ์งธ ๋ฌผ์ฒด๊นŒ์ง€ ํŒŒ์ง€ํ•˜๋Š” ์ „๋žต. ๋‹จ, ์„ธ ๋ฒˆ์งธ ๋ฌผ์ฒด์— ์“ธ ์ˆ˜ ์žˆ๋Š” DoF๊ฐ€ ๊ธ‰๊ฒฉํžˆ ์ค„์–ด๋“ ๋‹ค๋Š” ํ˜„์‹ค์  ํ•œ๊ณ„๊ฐ€ ์žˆ๋‹ค.

โ‘ข ์ž„์˜ ๋ฐฐ์น˜ ๋ฌผ์ฒด ์ฒ˜๋ฆฌ

๊ณ ์ •๋œ ์บ๋…ธ๋‹ˆ์ปฌ ํฌ์ฆˆ๊ฐ€ ์•„๋‹Œ ์ž„์˜ ์œ„์น˜/์ž์„ธ์˜ ๋ฌผ์ฒด์— ๋Œ€ํ•œ ๊ฐ•๊ฑดํ•œ ํŒŒ์ง€. 6-DOF pose estimation๊ณผ์˜ ํ†ตํ•ฉ์ด ํ•„์š”ํ•˜๋‹ค.

โ‘ฃ Tactile Sensing ํ†ตํ•ฉ

ํŒŒ์ง€ ์ค‘ ์ด‰๊ฐ ํ”ผ๋“œ๋ฐฑ์„ ์ด์šฉํ•ด ์Šฌ๋ฆฝ์„ ๊ฐ์ง€ํ•˜๊ณ  ํŒŒ์ง€ ํž˜์„ ์ ์‘์ ์œผ๋กœ ์กฐ์ ˆํ•˜๋Š” ๋ฐฉ๋ฒ•. ํŠนํžˆ ๋‘ ๋ฒˆ์งธ ๋ฌผ์ฒด๋ฅผ ํŒŒ์ง€ํ•  ๋•Œ ์ฒซ ๋ฒˆ์งธ ๋ฌผ์ฒด๊ฐ€ ์Šฌ๋ฆฝํ•˜๋Š” ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๋Š” ๋ฐ ์ค‘์š”ํ•˜๋‹ค.

โ‘ค VLA ๋ชจ๋ธ๊ณผ์˜ ํ†ตํ•ฉ

Vision-Language-Action ๋ชจ๋ธ์„ ํ™œ์šฉํ•ด โ€œ์ปต๊ณผ ๋ณผํŽœ์„ ์ง‘์–ด์„œ ์ฑ…์ƒ์— ๋†“์•„๋ผโ€ ๊ฐ™์€ ์ž์—ฐ์–ด ๋ช…๋ น์„ ๋ฐ›์•„ ์ˆœ์ฐจ ํŒŒ์ง€๋ฅผ ์ˆ˜ํ–‰ํ•˜๋Š” ๊ณ ์ˆ˜์ค€ ์กฐ์ž‘ ์‹œ์Šคํ…œ์œผ๋กœ ๋ฐœ์ „.


2.8 8. ์š”์•ฝ ๋ฐ ๊ฒฐ๋ก 

SeqMultiGrasp๋Š” ๋กœ๋ด‡ ์† ์—ฐ๊ตฌ์—์„œ ์˜ค๋žซ๋™์•ˆ ๋ฏธ๊ฐœ์ฒ™ ์˜์—ญ์ด์—ˆ๋˜ ์ˆœ์ฐจ์  ๋‹ค์ค‘ ๋ฌผ์ฒด ํŒŒ์ง€์— ๋Œ€ํ•œ ์ตœ์ดˆ์˜ ์™„์ „ํ•œ ์‹ค์„ธ๊ณ„ ์‹œ์Šคํ…œ์„ ์ œ์‹œํ•œ๋‹ค.

ํ•ต์‹ฌ ๊ธฐ์—ฌ๋ฅผ ๋‹ค์‹œ ํ•œ ๋ฒˆ ์ •๋ฆฌํ•˜๋ฉด:

  1. ๋ฌผ๋ฆฌ์ ์œผ๋กœ ์‹คํ˜„ ๊ฐ€๋Šฅํ•œ ๋‹ค์ค‘ ๋ฌผ์ฒด ํŒŒ์ง€ ์„ค์ • ํ•ฉ์„ฑ ํŒŒ์ดํ”„๋ผ์ธ โ€” DFC ๊ธฐ๋ฐ˜ ๋‹จ์ผ ๋ฌผ์ฒด ํŒŒ์ง€ ํ•ฉ์„ฑ โ†’ ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ ๊ฒ€์ฆ โ†’ ๋ณ‘ํ•ฉ์˜ 3๋‹จ๊ณ„ ๊ตฌ์กฐ
  2. Allegro Hand ํŠนํ™” ์ˆœ์ฐจ ํŒŒ์ง€ ์ „๋žต โ€” Pinch-like + Side grasp ์กฐํ•ฉ์œผ๋กœ ์†์˜ ์ž์œ ๋„๋ฅผ ๋ถ„ํ•  ํ™œ์šฉ
  3. Point cloud ์กฐ๊ฑด๋ถ€ ํ™•์‚ฐ ๋ชจ๋ธ ๊ธฐ๋ฐ˜ ์‹ค์„ธ๊ณ„ ๋ฐฐํฌ โ€” ์‹ค์„ธ๊ณ„ 65.8%(์‹œ๋ฎฌ) / 56.7%(์‹ค์„ธ๊ณ„) ์„ฑ๊ณต๋ฅ  ๋‹ฌ์„ฑ

ํ•œ๊ณ„๋„ ๋ถ„๋ช…ํ•˜๋‹ค. ๋‘ ๋ฌผ์ฒด๋กœ์˜ ์ œํ•œ, ๊ณ ์ •๋œ ํŒŒ์ง€ ์ „๋žต, ์‹ค์„ธ๊ณ„์˜ ์ž„์˜ ๋ฌผ์ฒด ๋ฐฐ์น˜ ๋ฏธ์ง€์› ๋“ฑ์ด ํ–ฅํ›„ ์—ฐ๊ตฌ๊ฐ€ ํ•ด๊ฒฐํ•ด์•ผ ํ•  ๊ณผ์ œ๋‹ค.

๊ทธ๋Ÿฌ๋‚˜ ํ•œ ๊ฑธ์Œ ๋ฌผ๋Ÿฌ์„œ์„œ ๋ณด๋ฉด, ์ด ๋…ผ๋ฌธ์ด ๊ฐ€์ง„ ๊ฐ€์žฅ ์ค‘์š”ํ•œ ๊ฐ€์น˜๋Š” โ€œ์ด๊ฒŒ ๊ฐ€๋Šฅํ•˜๋‹คโ€๋Š” ๊ฒƒ์„ ์ฒ˜์Œ์œผ๋กœ ๋ณด์—ฌ์คฌ๋‹ค๋Š” ์ ์ด๋‹ค. ๊ธฐ์ˆ ์˜ ์ง„๋ณด๋Š” ์ข…์ข… โ€œ์ด๊ฒŒ ๊ฐ€๋Šฅํ• ๊นŒ?โ€์—์„œ ์‹œ์ž‘ํ•ด โ€œ์ด๊ฒŒ ์–ด๋–ป๊ฒŒ ํ•˜๋ฉด ๋” ์ž˜ ๋ ๊นŒ?โ€๋กœ ์ด๋™ํ•œ๋‹ค. SeqMultiGrasp๋Š” ๊ทธ ์ฒซ ๋ฒˆ์งธ ์งˆ๋ฌธ์— ๋‹ตํ•œ ์—ฐ๊ตฌ๋‹ค.

์† ํ•˜๋‚˜๋กœ ๋‘ ๋ฌผ์ฒด๋ฅผ ์ง‘๋Š” ๊ฒƒ โ€” ์ธ๊ฐ„์—๊ฒŒ๋Š” ์•„๋ฌด๋ ‡์ง€ ์•Š์€ ์ด ๋™์ž‘์ด, ๋กœ๋ด‡์—๊ฒŒ๋Š” ์•„์ง ์ •๋ณตํ•ด์•ผ ํ•  ์‚ฐ์ด๋‹ค. ๊ทธ ์ •์ƒ์„ ํ–ฅํ•œ ์ฒซ ๋ฒˆ์งธ ๋ฒ ์ด์Šค์บ ํ”„๊ฐ€ ์„ธ์›Œ์กŒ๋‹ค.


2.9 ์ฐธ๊ณ  ๋ฌธํ—Œ

  • He et al., โ€œSequential Multi-Object Grasping with One Dexterous Hand,โ€ IROS 2025. arXiv:2503.09078
  • Lu et al., โ€œGrasping a Handful: Sequential Multi-Object Dexterous Grasp Generation,โ€ arXiv:2503.22370
  • Lum et al., โ€œMultiGrasp: Grasp Multiple Objects with One Hand,โ€ arXiv:2310.15599
  • Liu et al., โ€œSynthesizing Diverse and Physically Stable Grasps with Arbitrary Hand Structures using Differentiable Force Closure Estimator,โ€ RA-L 2021
  • Lum et al., โ€œDexterous Functional Pre-Grasp Manipulation with Diffusion Policy,โ€ 2024

Copyright 2026, JungYeon Lee