Curieux.JY
  • JungYeon Lee
  • Post
  • Projects
  • Note

On this page

  • ๐Ÿ” Ping Review
  • ๐Ÿ”” Ring Review
    • ์„œ๋ก : ์™œ ์ด ๋ฌธ์ œ๊ฐ€ ์ค‘์š”ํ•œ๊ฐ€
    • ๋ฐฉ๋ฒ•๋ก : SPIDER์˜ ํ•ต์‹ฌ ์•„์ด๋””์–ด
      • ์ „์ฒด ํŒŒ์ดํ”„๋ผ์ธ ๊ฐœ๊ด€
      • ๋ฌผ๋ฆฌ ๊ธฐ๋ฐ˜ ๋ฆฌํƒ€๊ฒŒํŒ… ๋ฌธ์ œ ์ •์˜
      • ์ƒ˜ํ”Œ๋ง ๊ธฐ๋ฐ˜ ์ตœ์ ํ™”: ์™œ ์ƒ˜ํ”Œ๋ง์ธ๊ฐ€?
      • ๊ฐ€์ƒ ์ ‘์ด‰ ๊ฐ€์ด๋“œ: SPIDER์˜ ๋น„๋ฐ€ ๋ฌด๊ธฐ
      • ๊ถค์  ๊ฐ•๊ฑดํ™”(Robustification)
      • ๋ฌผ๋ฆฌ ๊ธฐ๋ฐ˜ ๋ฐ์ดํ„ฐ ์ฆ๊ฐ•
    • ์‹คํ—˜: ์ˆซ์ž๊ฐ€ ๋งํ•˜๋Š” ๊ฒƒ๋“ค
      • ์‹คํ—˜ ์„ค์ •
      • Ablation Study: ๊ฐ ๊ตฌ์„ฑ ์š”์†Œ์˜ ๊ธฐ์—ฌ
      • ๋Œ€๊ทœ๋ชจ ๋ฆฌํƒ€๊ฒŒํŒ… ๊ฒฐ๊ณผ
      • SOTA ๋น„๊ต: ์†๋„์™€ ํ’ˆ์งˆ์˜ ํŠธ๋ ˆ์ด๋“œ์˜คํ”„
      • ์‹ค์ œ ๋กœ๋ด‡ ๋ฐฐํฌ ๊ฒฐ๊ณผ
      • ํœด๋จธ๋…ธ์ด๋“œ ๋ฆฌํƒ€๊ฒŒํŒ… ๊ฒฐ๊ณผ
      • RL ์ •์ฑ… ํ•™์Šต ๊ฐ€์†
    • ๋น„ํŒ์  ๊ณ ์ฐฐ
      • ๊ฐ•์ 
      • ์•ฝ์  ๋ฐ ํ•œ๊ณ„
    • ๊ด€๋ จ ์—ฐ๊ตฌ์™€์˜ ๋น„๊ต
      • ๋ฆฌํƒ€๊ฒŒํŒ… ๋ฐฉ๋ฒ•๋ก  ์ŠคํŽ™ํŠธ๋Ÿผ์—์„œ์˜ ์œ„์น˜
      • ํ•ต์‹ฌ ๋น„๊ต ๋Œ€์ƒ
    • Allegro Hand ์—ฐ๊ตฌ์ž๋ฅผ ์œ„ํ•œ ์‹œ์‚ฌ์ 
    • ์š”์•ฝ ๋ฐ ๊ฒฐ๋ก 
    • ์ฐธ๊ณ  ์ •๋ณด

๐Ÿ“ƒSPIDER ๋ฆฌ๋ทฐ

retargeting
humanoid
cross-embodiment
Scaling Dexterous Manipulation with Diverse Egocentric Human Data
Published

February 27, 2026

  • Paper Link
  • Project Link
  • Code Link
  1. ๐Ÿค– ๋กœ๋ด‡๋ณ„ ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘์˜ ๋†’์€ ๋น„์šฉ๊ณผ ๋กœ๋ด‡-์ธ๊ฐ„ ๊ฐ„์˜ ์ฒดํ˜• ์ฐจ์ด(embodiment gap)๋กœ ์ธํ•ด ๋Œ€๊ทœ๋ชจ ์ธ๊ฐ„ ๋™์ž‘ ๋ฐ์ดํ„ฐ๋ฅผ ๋กœ๋ด‡ ๋™์ž‘์œผ๋กœ ์ง์ ‘ ๋ณ€ํ™˜ํ•˜๋Š” ๋ฐ ์–ด๋ ค์›€์ด ์žˆ์Šต๋‹ˆ๋‹ค.
  2. โœจ SPIDER๋Š” ๋ฌผ๋ฆฌ ๊ธฐ๋ฐ˜ ์ƒ˜ํ”Œ๋ง๊ณผ ๊ฐ€์ƒ ์ ‘์ด‰ ์•ˆ๋‚ด(virtual contact guidance)๋ฅผ ํ™œ์šฉํ•˜์—ฌ ์šด๋™ํ•™์  ์ธ๊ฐ„ ์‹œ์—ฐ(kinematic human demonstrations)์„ ๋™์ ์œผ๋กœ ์‹คํ–‰ ๊ฐ€๋Šฅํ•œ ๋กœ๋ด‡ ๊ถค์ ์œผ๋กœ ๋Œ€๊ทœ๋ชจ๋กœ ๋ณ€ํ™˜ํ•˜๋Š” ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค.
  3. ๐Ÿš€ SPIDER๋Š” 9๊ฐ€์ง€์˜ ํœด๋จธ๋…ธ์ด๋“œ/์ •๊ตํ•œ ์† ๋กœ๋ด‡๊ณผ 6๊ฐ€์ง€ ๋ฐ์ดํ„ฐ์…‹์— ๊ฑธ์ณ ์ ์šฉ ๊ฐ€๋Šฅํ•˜๋ฉฐ, ์„ฑ๊ณต๋ฅ ์„ 18% ํ–ฅ์ƒ์‹œํ‚ค๊ณ  ๊ธฐ์กด RL(๊ฐ•ํ™” ํ•™์Šต) ๊ธฐ๋ฐ˜ ๋ฐฉ๋ฒ•๋ณด๋‹ค 10๋ฐฐ ๋น ๋ฅด๊ฒŒ 240๋งŒ ํ”„๋ ˆ์ž„ ๊ทœ๋ชจ์˜ ๋กœ๋ด‡ ๋ฐ์ดํ„ฐ์…‹ ์ƒ์„ฑ์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ” Ping Review

๐Ÿ” Ping โ€” A light tap on the surface. Get the gist in seconds.

SPIDER(Scalable Physics-Informed DExterous Retargeting)๋Š” ๋Œ€๊ทœ๋ชจ์˜ ์ธ๊ฐ„ ๋™์ž‘ ์‹œ์—ฐ(demonstration) ๋ฐ์ดํ„ฐ๋ฅผ ๋กœ๋ด‡์ด ์‹คํ–‰ ๊ฐ€๋Šฅํ•œ ๋™์ ์œผ๋กœ ์‹คํ˜„ ๊ฐ€๋Šฅํ•œ(dynamically feasible) ๊ถค์ ์œผ๋กœ ๋ณ€ํ™˜ํ•˜๊ณ  ์ฆ๊ฐ•ํ•˜๊ธฐ ์œ„ํ•œ ๋ฌผ๋ฆฌ ๊ธฐ๋ฐ˜ ๋ฆฌํƒ€๊ฒŸํŒ…(retargeting) ํ”„๋ ˆ์ž„์›Œํฌ์ž…๋‹ˆ๋‹ค. ๋กœ๋ด‡-ํŠน์ •(robot-specific) ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘์˜ ๋†’์€ ๋น„์šฉ๊ณผ ๋ฐฉ๋Œ€ํ•œ ์ธ๊ฐ„ ๋ชจ์…˜ ๋ฐ์ดํ„ฐ์˜ ๊ฐ€์šฉ์„ฑ ์‚ฌ์ด์˜ ๊ฐ„๊ทน์„ ๋ฉ”์šฐ๋Š” ๊ฒƒ์„ ๋ชฉํ‘œ๋กœ ํ•ฉ๋‹ˆ๋‹ค.

๋ฌธ์ œ ์ •์˜:

์ˆ™๋ จ๋˜๊ณ  ๋ฏผ์ฒฉํ•œ(agile) ๋กœ๋ด‡ ์ •์ฑ… ํ•™์Šต์€ ๋Œ€๊ทœ๋ชจ์˜ ๋กœ๋ด‡ ์‹œ์—ฐ์„ ํ•„์š”๋กœ ํ•˜์ง€๋งŒ, ์ด๋ฅผ ์ˆ˜์ง‘ํ•˜๋Š” ๊ฒƒ์€ ๋งค์šฐ ๋น„์šฉ์ด ๋งŽ์ด ๋“ญ๋‹ˆ๋‹ค. ๋ฐ˜๋ฉด, ๋ชจ์…˜ ์บก์ฒ˜, ๋น„๋””์˜ค, ๊ฐ€์ƒ ํ˜„์‹ค์—์„œ ์–ป์„ ์ˆ˜ ์žˆ๋Š” ๋ฐฉ๋Œ€ํ•œ ์ธ๊ฐ„ ๋ชจ์…˜ ๋ฐ์ดํ„ฐ๋Š” ํ’๋ถ€ํ•ฉ๋‹ˆ๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ ๋กœ๋ด‡๊ณผ ์ธ๊ฐ„ ๊ฐ„์˜ ์ฒดํ˜„(embodiment) ์ฐจ์ด(ํ˜•ํƒœํ•™, ๋™์—ญํ•™, ์•ก์ธ„์—์ด์…˜ ๋ถˆ์ผ์น˜)์™€ ํž˜(force) ๋ฐ ํ† ํฌ(torque)์™€ ๊ฐ™์€ ๋™์  ์ •๋ณด์˜ ๋ถ€์กฑ์œผ๋กœ ์ธํ•ด ์ด๋Ÿฌํ•œ ์ธ๊ฐ„ ์‹œ์—ฐ์€ ๋กœ๋ด‡์— ์ง์ ‘ ์‹คํ–‰๋  ์ˆ˜ ์—†์Šต๋‹ˆ๋‹ค. ์ด์— ๋Œ€ํ•œ ํ•ต์‹ฌ ์งˆ๋ฌธ์€ โ€œ์–ด๋–ป๊ฒŒ ํ•˜๋ฉด ์ธ๊ฐ„์˜ ์›€์ง์ž„์„ ๋™์—ญํ•™ ๋ฐ ์ ‘์ด‰์„ ๊ณ ๋ คํ•œ ์‹คํ–‰ ๊ฐ€๋Šฅํ•œ ๋กœ๋ด‡ ๊ถค์ ์œผ๋กœ ํšจ์œจ์ ์ด๊ณ  ์‹ ๋ขฐ์„ฑ ์žˆ๊ฒŒ ๋ณ€ํ™˜ํ•  ์ˆ˜ ์žˆ๋Š”๊ฐ€?โ€์ž…๋‹ˆ๋‹ค.

ํ•ต์‹ฌ ๋ฐฉ๋ฒ•๋ก :

SPIDER๋Š” ์ธ๊ฐ„ ์‹œ์—ฐ์ด ๋†’์€ ์ˆ˜์ค€์˜ ๋กœ๋ด‡ ๋™์ž‘ ๋ฐ ํƒœ์Šคํฌ ๋ช…์„ธ๋ฅผ ์ œ๊ณตํ•˜๊ณ , ์‹œ๋ฎฌ๋ ˆ์ด์…˜์—์„œ์˜ ๋Œ€๊ทœ๋ชจ ๋ฌผ๋ฆฌ ๊ธฐ๋ฐ˜ ์ƒ˜ํ”Œ๋ง(sampling)์ด ๋™์  ์‹คํ˜„ ๊ฐ€๋Šฅ์„ฑ(dynamical feasibility)๊ณผ ์ •ํ™•ํ•œ ์ ‘์ด‰ ์‹œํ€€์Šค(contact sequence)๋ฅผ ๋ณด์žฅํ•˜๋„๋ก ๊ถค์ ์„ ๋‹ค๋“ฌ๋Š”๋‹ค๋Š” ํ•ต์‹ฌ ํ†ต์ฐฐ๋ ฅ์„ ๋ฐ”ํƒ•์œผ๋กœ ํ•ฉ๋‹ˆ๋‹ค.

  1. ๋ฌผ๋ฆฌ ๊ธฐ๋ฐ˜ ๋ฆฌํƒ€๊ฒŸํŒ… ๋ฌธ์ œ ์ •์‹ํ™” (Physics-based Retargeting Problem Formulation): ๋ฆฌํƒ€๊ฒŸํŒ…์€ ์ œ์•ฝ์ด ์žˆ๋Š” ์ตœ์ ํ™” ๋ฌธ์ œ๋กœ ์ •์‹ํ™”๋ฉ๋‹ˆ๋‹ค. ๋กœ๋ด‡ ์ œ์–ด ์‹œํ€€์Šค u_{0:T-1}๋Š” ์ฐธ์กฐ ๊ถค์  x^{ref}_{0:T}์™€์˜ ๊ฑฐ๋ฆฌ์™€ ์ œ์–ด ๋…ธ๋ ฅ(control effort)์„ ์ตœ์†Œํ™”ํ•˜๋„๋ก ์ตœ์ ํ™”๋ฉ๋‹ˆ๋‹ค. ๋ชฉ์  ํ•จ์ˆ˜๋Š” ๋‹ค์Œ๊ณผ ๊ฐ™์Šต๋‹ˆ๋‹ค: \min_{u_{0:T-1}} J(u_{0:T-1}) = \min_{u_{0:T-1}} \left\|x_T - x^{ref}_T\right\|^2_{Q_T} + \sum_{t=0}^{T-1} \left(\left\|x_{t+1} - x^{ref}_{t+1}\right\|^2_{Q_t} + \left\|u_t\right\|^2_{R_t}\right) ์—ฌ๊ธฐ์„œ x^{ref}_t = \{q^{ref}_{robot_t}, q^{ref}_{object_t}\}๋Š” ์ฐธ์กฐ ์ƒํƒœ(์œ„์น˜ q^{ref} ๋ฐ ์†๋„ \dot{q}^{ref})๋ฅผ ๋‚˜ํƒ€๋‚ด๋ฉฐ, Q_t์™€ R_t๋Š” ์ƒํƒœ ๋ฐ ์ œ์–ด ์ž…๋ ฅ ๊ฐ€์ค‘ ํ–‰๋ ฌ(weighting matrices)์ž…๋‹ˆ๋‹ค. ์ œ์•ฝ ์กฐ๊ฑด์€ x_{t+1} = f(x_t, u_t, t)๋กœ, ์ƒํƒœ ์ „์ด ํ•จ์ˆ˜(state transition function)๋ฅผ ๋‚˜ํƒ€๋ƒ…๋‹ˆ๋‹ค.

  2. ์ƒ˜ํ”Œ๋ง ๊ธฐ๋ฐ˜ ์ตœ์ ํ™” (Sampling for Physics-based Retargeting): ์ ‘์ด‰์ด ๋งŽ์€(contact-rich) ๋ฆฌํƒ€๊ฒŸํŒ… ๋ฌธ์ œ์˜ ๋น„๋ณผ๋ก์„ฑ(non-convexity) ๋ฐ ๋น„์—ฐ์†์„ฑ(non-continuity)์„ ๋‹ค๋ฃจ๊ธฐ ์œ„ํ•ด ์ƒ˜ํ”Œ๋ง ๊ธฐ๋ฐ˜ ์ตœ์ ํ™”๊ฐ€ ์‚ฌ์šฉ๋ฉ๋‹ˆ๋‹ค. ์ด๋Š” ์ •์ฑ… ๋„คํŠธ์›Œํฌ๋ฅผ ์—…๋ฐ์ดํŠธํ•˜๋Š” ๋Œ€์‹  ์ œ์–ด ์‹œํ€€์Šค๋ฅผ ์ง์ ‘ ์ตœ์ ํ™”ํ•œ๋‹ค๋Š” ์ ์—์„œ ๊ฐ•ํ™” ํ•™์Šต(RL)๊ณผ ์œ ์‚ฌํ•ฉ๋‹ˆ๋‹ค. Annealed sampling kernel์„ ์‚ฌ์šฉํ•˜์—ฌ ํƒ์ƒ‰-ํ™œ์šฉ(exploration-exploitation) ๊ท ํ˜•์„ ์กฐ์ ˆํ•ฉ๋‹ˆ๋‹ค. ์†”๋ฃจ์…˜ U^i๋Š” ๋‹ค์Œ๊ณผ ๊ฐ™์ด ์—…๋ฐ์ดํŠธ๋ฉ๋‹ˆ๋‹ค: U^{i+1} = U^i + \sum_{j=1}^{N_W} \frac{\exp\left(-\frac{J(U^i + [W]_j)}{\lambda}\right)}{\sum_{k=1}^{N_W} \exp\left(-\frac{J(U^i + [W]_k)}{\lambda}\right)} [W]_j ์ƒ˜ํ”Œ๋ง ๊ณต๋ถ„์‚ฐ(covariance) \Sigma^i_h๋Š” ๋‹ค์Œ๊ณผ ๊ฐ™์ด ์กฐ์ ˆ๋ฉ๋‹ˆ๋‹ค: \Sigma^i_h = \exp\left(-\frac{N-i}{\beta_1 N} - \frac{H-h}{\beta_2 H}\right) I ์—ฌ๊ธฐ์„œ [W]_j \sim \mathcal{N}(0, \Sigma^i_{0:H-1})๋Š” ์ƒ˜ํ”Œ๋ง๋œ ๊ฐ€์šฐ์‹œ์•ˆ ๋…ธ์ด์ฆˆ(Gaussian noise)์ด๊ณ , \beta_1, \beta_2๋Š” Annealing ํŒŒ๋ผ๋ฏธํ„ฐ์ž…๋‹ˆ๋‹ค. ์ดˆ๊ธฐ์—๋Š” ๋„“์€ ํƒ์ƒ‰์„, ํ›„๋ฐ˜๋ถ€์—๋Š” ์œ ๋งํ•œ ๊ถค์  ์ฃผ๋ณ€์˜ ์ •๊ตํ•œ ํ™œ์šฉ์„ ์ˆ˜ํ–‰ํ•ฉ๋‹ˆ๋‹ค.

  3. ๊ฐ€์ƒ ์ ‘์ด‰ ์•ˆ๋‚ด (Virtual Contact Guidance): ํƒœ์Šคํฌ๋ฅผ ์™„๋ฃŒํ•˜๋Š” ์—ฌ๋Ÿฌ ์ ‘์ด‰ ๋ชจ๋“œ(contact modes)๊ฐ€ ์กด์žฌํ•  ์ˆ˜ ์žˆ๋Š” โ€œ์†”๋ฃจ์…˜ ๋ชจํ˜ธ์„ฑ(solution ambiguity)โ€์„ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด ๋„์ž…๋˜์—ˆ์Šต๋‹ˆ๋‹ค. ์ด๋Š” ๋กœ๋ด‡๊ณผ ๊ฐ์ฒด ์‚ฌ์ด์˜ ์˜๋„๋œ ์ ‘์ด‰ ์ง€์ (intended contact points)์— ๊ฐ€์ƒ ํž˜(virtual force)์„ ์ ์šฉํ•˜์—ฌ ์ƒ˜ํ”Œ๋ง์„ ์›ํ•˜๋Š” ์ ‘์ด‰ ๋ชจ๋“œ๋กœ ์œ ๋„ํ•ฉ๋‹ˆ๋‹ค. ๊ฐ€์ƒ ์ œ์•ฝ(virtual constraint)์€ ์ดˆ๊ธฐ ๋‹จ๊ณ„์—์„œ ๊ฐ์ฒด๋ฅผ ๋ชฉํ‘œ ๊ตฌ์„ฑ(target configuration)์— โ€œ๊ณ ์ •โ€์‹œํ‚ค๊ณ , ์ตœ์ ํ™”๊ฐ€ ์ง„ํ–‰๋จ์— ๋”ฐ๋ผ ์ ์ฐจ ์ด ์ œ์•ฝ์„ ์™„ํ™”ํ•ฉ๋‹ˆ๋‹ค. ์ด๋Š” ์ปค๋ฆฌํ˜๋Ÿผ(curriculum) ๋ฐฉ์‹๊ณผ ์œ ์‚ฌํ•ฉ๋‹ˆ๋‹ค. ์ œ์•ฝ ์กฐ๊ฑด์€ ์ ‘์ด‰ ์Œ(contact pair) ๊ฐ„์˜ ์ƒ๋Œ€ ์œ„์น˜๋ฅผ ์œ ์ง€ํ•˜๋Š” ๊ฒƒ์ž…๋‹ˆ๋‹ค: c_{k,t} \left\|^{\text{robot}}p_{object_{k,t}} - ^{\text{robot}}p_{object,ref_{k,t}}\right\|^2_2 \le \eta^i ์—ฌ๊ธฐ์„œ c_{k,t}๋Š” ์ ‘์ด‰ ์ง€์‹œ์ž(contact indicator)์ด๋ฉฐ, \eta^i๋Š” i \to N์ผ ๋•Œ \eta^i \to \infty๋กœ ์ปค์ ธ ์ œ์•ฝ ๊ฐ•๋„(constraint strength)๊ฐ€ ์™„ํ™”๋ฉ๋‹ˆ๋‹ค. ๋ถˆ์™„์ „ํ•œ ์ฐธ์กฐ ์ ‘์ด‰(imperfect reference contact)์— ๋Œ€ํ•œ ๊ฐ•๊ฑด์„ฑ(robustness)์„ ์œ„ํ•ด ์ ‘์ด‰ ํ•„ํ„ฐ(contact filter)๊ฐ€ ๋ถˆ์•ˆ์ •ํ•œ ์ƒํ˜ธ์ž‘์šฉ์„ ๊ฐ์ง€ํ•˜์—ฌ ํ•ด๋‹น ๊ฐ€์ƒ ์ œ์•ฝ์„ ๋น„ํ™œ์„ฑํ™”ํ•ฉ๋‹ˆ๋‹ค.

  4. ๊ถค์  ๊ฐ•๊ฑดํ™” (Trajectory Robustification): ์žฌ๊ตฌ์„ฑ๋œ ์‹œ์—ฐ(reconstructed demonstrations)์—์„œ ๋ฐœ์ƒํ•˜๋Š” ๋…ธ์ด์ฆˆ(noise)์™€ ๋ถˆํ™•์‹คํ•œ ์—ญํ•™(unknown dynamics) (์˜ˆ: ๋งˆ์ฐฐ, ์ ‘์ด‰ ์ค€์ˆ˜)์— ๋Œ€ํ•œ ๊ถค์ ์˜ ๊ฐ•๊ฑด์„ฑ์„ ๋†’์ž…๋‹ˆ๋‹ค. ์ด๋Š” ๊ฒฝ๊ณ„๊ฐ€ ์žˆ๋Š” ๋งค๊ฐœ๋ณ€์ˆ˜ ์ง‘ํ•ฉ \mathcal{D} (์˜ˆ: ์ ‘์ด‰ ์—ฌ์œ , ๋งˆ์ฐฐ ๊ณ„์ˆ˜, ๊ฐ์ฒด ์งˆ๋Ÿ‰)์— ๋Œ€ํ•œ ๋น„๊ด€์ ์ธ(pessimistic) (min-max) ๋ชฉ์  ํ•จ์ˆ˜๋กœ ์ œ์–ด ์‹œํ€€์Šค๋ฅผ ์ตœ์ ํ™”ํ•ฉ๋‹ˆ๋‹ค: J_{rob}(U) = \max_{d \in \mathcal{D}} J(U, d) ์ด๋Š” ๋„๋ฉ”์ธ ๋ฌด์ž‘์œ„ํ™”(domain randomization)์™€ ์œ ์‚ฌํ•˜๊ฒŒ ์ž‘๋™ํ•˜๋ฉฐ, GPU ๋ณ‘๋ ฌํ™”(parallelization)๋ฅผ ํ†ตํ•ด ๋ฐฐ์น˜ ๋กค์•„์›ƒ(batched rollouts)์œผ๋กœ ํšจ์œจ์ ์œผ๋กœ ๊ณ„์‚ฐ๋ฉ๋‹ˆ๋‹ค.

  5. ๋ฌผ๋ฆฌ ๊ธฐ๋ฐ˜ ๋ฐ์ดํ„ฐ ์ฆ๊ฐ• (Physics-based Data Augmentation): ๋‹จ์ผ ์ธ๊ฐ„ ์‹œ์—ฐ์—์„œ ์‹œ์ž‘ํ•˜์—ฌ ๋ฌผ๋ฆฌ์ ์œผ๋กœ ์‹คํ˜„ ๊ฐ€๋Šฅํ•œ ๋‹ค์–‘ํ•œ ๋™์ž‘์„ ์ƒ์„ฑํ•˜์—ฌ ๋ฆฌํƒ€๊ฒŸํŒ…๋œ ๋ฐ์ดํ„ฐ๋ฅผ ์ฒด๊ณ„์ ์œผ๋กœ ์ฆ๊ฐ•ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ์ด๋Š” ๊ธฐํ•˜ํ•™์  ๋ณ€ํ˜•(geometric variations) (๊ฐ์ฒด ๋ฉ”์‰ฌ ๊ต์ฒด, ํฌ๊ธฐ ๋ฐ ์œ„์น˜ ๋ณ€๊ฒฝ, ์ง€ํ˜• ๋ณ€๊ฒฝ) ๋ฐ ๋ฌผ๋ฆฌ์  ๋ณ€ํ˜•(physics variations) (๋กœ๋ด‡์— ์™ธ๋ ฅ ์ ์šฉ)์„ ํ†ตํ•ด ์ด๋ฃจ์–ด์ง‘๋‹ˆ๋‹ค.

์„ฑ๊ณผ ๋ฐ ํ™œ์šฉ:

SPIDER๋Š” 6๊ฐœ์˜ ๋ฐ์ดํ„ฐ์…‹, 9๊ฐœ์˜ ๋กœ๋ด‡ ํ˜•ํƒœ, ๋‘ ๊ฐ€์ง€ ํƒœ์Šคํฌ ๋„๋ฉ”์ธ(์ˆ™๋ จ๋œ ์† ๋ฐ ํœด๋จธ๋…ธ์ด๋“œ)์— ๊ฑธ์ณ ํ™•์žฅ ๊ฐ€๋Šฅํ•œ ์œ ์—ฐํ•˜๊ณ  ์ผ๋ฐ˜์ ์ธ ํ”„๋ ˆ์ž„์›Œํฌ์ž…๋‹ˆ๋‹ค. ํ‘œ์ค€ ์ƒ˜ํ”Œ๋ง์— ๋น„ํ•ด ์„ฑ๊ณต๋ฅ ์„ 18% ํ–ฅ์ƒ์‹œํ‚ค๊ณ , ๊ฐ•ํ™” ํ•™์Šต(RL) ๊ธฐ์ค€์„ ๋ณด๋‹ค 10๋ฐฐ ๋น ๋ฅด๊ฒŒ ๊ถค์ ์„ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. ์ด๋ฅผ ํ†ตํ•ด 2.4M ํ”„๋ ˆ์ž„์˜ ๋Œ€๊ทœ๋ชจ ๋™์  ์‹คํ˜„ ๊ฐ€๋Šฅํ•œ ๋กœ๋ด‡ ๋ฐ์ดํ„ฐ์…‹์„ ์ƒ์„ฑํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ๋ฌผ๋ฆฌ์  ๋กœ๋ด‡์— ์ง์ ‘ ๋ฐฐํฌ ๊ฐ€๋Šฅํ•˜๋ฉฐ, RGB ์นด๋ฉ”๋ผ ์˜์ƒ๊ณผ ๊ฐ™์€ ๋…ธ์ด์ฆˆ๊ฐ€ ๋งŽ์€ ๋ฐ์ดํ„ฐ์—๋„ ๊ฐ•๊ฑดํ•˜๊ฒŒ ์ž‘๋™ํ•ฉ๋‹ˆ๋‹ค. ๋˜ํ•œ, SPIDER์—์„œ ์ƒ์„ฑ๋œ ๋ฐ์ดํ„ฐ๋ฅผ ํ™œ์šฉํ•˜์—ฌ RL ์ •์ฑ… ํ•™์Šต ํ”„๋กœ์„ธ์Šค๋ฅผ ํฌ๊ฒŒ ๊ฐ€์†ํ™”ํ•˜๊ณ , ๋” ๋‚˜์€ ๊ฐ์ฒด ์ถ”์  ์„ฑ๋Šฅ์„ ๋‹ฌ์„ฑํ•  ์ˆ˜ ์žˆ์Œ์„ ๋ณด์—ฌ์ฃผ์—ˆ์Šต๋‹ˆ๋‹ค. ์ด๋Š” RL ์ •์ฑ…์ด SPIDER๊ฐ€ ์ œ๊ณตํ•˜๋Š” ๋ช…๋ชฉ ์ œ์–ด(nominal control)๋กœ๋ถ€ํ„ฐ ์ž”์ฐจ ํ”ผ๋“œ๋ฐฑ(residual feedback)๋งŒ ํ•™์Šตํ•˜๋ฉด ๋˜๊ธฐ ๋•Œ๋ฌธ์ž…๋‹ˆ๋‹ค.


NoteAllegro hand์— ๋Œ€ํ•œ ํ‰๊ฐ€

Allegro hand์— ๋Œ€ํ•œ ํ‰๊ฐ€๋Š” ์ „์ฒด์ ์œผ๋กœ ๋น„๊ต์  ์–‘ํ˜ธํ•ฉ๋‹ˆ๋‹ค. ์„ธ๋ถ€ ๋‚ด์šฉ์€ ๋…ผ๋ฌธ์˜ ํ‘œยท๊ทธ๋ฆผ์„ ๊ทผ๊ฑฐ๋กœ ์ •๋ฆฌํ•˜๋ฉด ๋‹ค์Œ๊ณผ ๊ฐ™์Šต๋‹ˆ๋‹ค.

  • ์ •๋Ÿ‰์  ์„ฑ๋Šฅ
    • Oakink ์ „์ฒด ๋ฐ์ดํ„ฐ์—์„œ SPIDER์˜ Allegro ์„ฑ๊ณต๋ฅ ์€ (45.9%)์ž…๋‹ˆ๋‹ค (Table 2).
    • GigaHands ์ „์ฒด ๋ฐ์ดํ„ฐ์—์„œ๋Š” Allegro ์„ฑ๊ณต๋ฅ ์ด (81.0%)๋กœ ํ›จ์”ฌ ๋†’์Šต๋‹ˆ๋‹ค (Table 2).
    • ์ž‘์€-์˜ˆ์ œ(ablations) ๊ฒฐ๊ณผ์—์„œ Annealed sampling + virtual contact guidance(์™„์ „ํ•œ SPIDER ๊ตฌ์„ฑ)๋Š” Oakink ์˜ˆ์ œ์—์„œ Allegro ์„ฑ๊ณต๋ฅ ์„ (85%)๊นŒ์ง€ ๋Œ์–ด์˜ฌ๋ ธ๊ณ , GigaHands ์˜ˆ์ œ์—์„œ๋Š” (100%) ์„ฑ๊ณต์„ ๋ณด์˜€์Šต๋‹ˆ๋‹ค (Table 1).
  • ๋น„๊ตยทํ•ด์„
    • ๋…ผ๋ฌธ์€ DoF๊ฐ€ ๋งŽ๊ณ  ๊ด€์ ˆ ์ž์œ ๋„๊ฐ€ ๋†’์€ ์†๋“ค(์˜ˆ: Inspire, Allegro)์ด ๋ฆฌํƒ€๊ฒŸํŒ…์—์„œ ๋” ์œ ๋ฆฌํ•˜๋‹ค๊ณ  ๋ณด๊ณ ํ•ฉ๋‹ˆ๋‹ค. Allegro๋Š” ์ด ๋ฒ”์ฃผ์— ๋“ค์–ด๊ฐ€๊ธฐ ๋•Œ๋ฌธ์— ์ „๋ฐ˜์ ์œผ๋กœ ๋†’์€ ์„ฑ๊ณต๋ฅ ์„ ๋ณด์˜€์Šต๋‹ˆ๋‹ค.
    • ๋ฐ์ดํ„ฐ์…‹ ํŠน์„ฑ์— ๋”ฐ๋ผ ๊ฒฐ๊ณผ ์ฐจ์ด๊ฐ€ ํฝ๋‹ˆ๋‹ค. GigaHands๋Š” ํ”ฝยทํ”Œ๋ ˆ์ด์Šค ๊ณ„์—ด ์ž‘์—…์ด ๋งŽ์•„ retargeting์— ์œ ๋ฆฌํ–ˆ๊ณ , Oakink์€ ์‚ฌ์ „ ๊ทธ๋ฆฝ(pre-grasp) ๋“ฑ ์ •๋ฐ€ํ•œ ์ดˆ๊ธฐ ์ ‘์ด‰์„ ์š”๊ตฌํ•ด ๋” ์–ด๋ ค์› ์Šต๋‹ˆ๋‹ค. Allegro๋„ ๋ฐ์ดํ„ฐ ๋‚œ์ด๋„์— ๋”ฐ๋ผ ์„ฑ๋Šฅ ํŽธ์ฐจ๊ฐ€ ๋ฐœ์ƒํ–ˆ์Šต๋‹ˆ๋‹ค.
  • ์‹ค๋ฌด/๋ฐฐํฌ ์ธก๋ฉด
    • ๋…ผ๋ฌธ์€ Allegro hand๋ฅผ Franka Emika Panda ํŒ”๊ณผ ๊ฒฐํ•ฉํ•œ ์‹ค์ œ ์‹œ์Šคํ…œ์—์„œ ๋„ค ๊ฐ€์ง€ ์„ฌ์„ธํ•œ ์กฐ์ž‘(์ „๊ตฌ ํšŒ์ „, ์ž‘์€ ์ˆŸ๊ฐ€๋ฝ ์กฐ์ž‘, ๊ธฐํƒ€ ์—ฐ์ฃผ, ์ถฉ์ „๊ธฐ ๋ถ„๋ฆฌ)์„ ์ง์ ‘ ์‹คํ–‰ํ•ด ์„ฑ๊ณต ์‚ฌ๋ก€๋ฅผ ๋ณด๊ณ ํ•ฉ๋‹ˆ๋‹ค(Deployment ์„น์…˜, Figure 7). ์ด๋Š” ์‹œ๋ฎฌ๋ ˆ์ด์…˜์—์„œ ์ƒ์„ฑ๋œ Allegro ๊ถค์ ์ด ํ˜„์‹ค ํ•˜๋“œ์›จ์–ด๋กœ๋„ ์ด์‹ ๊ฐ€๋Šฅํ•จ์„ ์‹œ์‚ฌํ•ฉ๋‹ˆ๋‹ค.
  • ์†๋„ยทํšจ์œจ
    • ์ „์ฒด SPIDER ํŒŒ์ดํ”„๋ผ์ธ(๊ฐ€์ƒ ์ ‘์ด‰ ์•ˆ๋‚ด ํฌํ•จ)์€ annealed-only๋ณด๋‹ค ๋‹ค์†Œ ๋А๋ฆฌ์ง€๋งŒ(์˜ˆ: ์ „์ฒด ๋ฐฉ๋ฒ•์˜ ๋Œ€ํ‘œ์  FPS๋Š” ๋…ผ๋ฌธ์—์„œ 2.5Hz ๋“ฑ์œผ๋กœ ๋ณด๊ณ ๋จ), RL ๊ธฐ๋ฐ˜ ๋Œ€์•ˆ์— ๋น„ํ•ด ํ›จ์”ฌ ๋น ๋ฅด๊ฒŒ ๋Œ€๊ทœ๋ชจ ๋ฐ์ดํ„ฐ ์ƒ์„ฑ์ด ๊ฐ€๋Šฅํ•ฉ๋‹ˆ๋‹ค. Allegro ์ „์šฉ FPS๋Š” ํ‘œ์— ์ง์ ‘ ์ œ์‹œ๋˜์ง€๋Š” ์•Š์ง€๋งŒ ์ „์ฒด ์†๋“ค ์ค‘์—์„œ๋„ ์‹ค์šฉ์  ์ƒ์„ฑ ์†๋„๋ฅผ ์œ ์ง€ํ•ฉ๋‹ˆ๋‹ค.

๊ถŒ์žฅ๋˜๋Š” ํ›„์† ๋ถ„์„(์—ฐ๊ตฌ/์‹คํ—˜)

  • Allegro์˜ ์‹คํŒจ ์‚ฌ๋ก€ ๋ถ„์„: Oakink์—์„œ ์‹คํŒจ๊ฐ€ ์ง‘์ค‘๋˜๋Š” ํŠน์ • ํ”„๋ฆฌ๊ทธ๋ฆฝ/์ดˆ๊ธฐ ์ ‘์ด‰ ํŒจํ„ด์„ ํŒŒ์•…ํ•˜๋ฉด virtual-contact ์กฐ๊ฑด์ด๋‚˜ ํ•„ํ„ฐ๋ง(tc,min, dc,max)์„ ๊ฐœ์„ ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
  • ์„ผ์„œ ๋ณด๊ฐ• ์‹คํ—˜: Allegro์— ์ด‰๊ฐ(๋˜๋Š” ์˜ˆ์ธก๋œ ์ ‘์ด‰ ํ† ํฌ) ์ •๋ณด๋ฅผ ๊ฒฐํ•ฉํ•˜๋ฉด ์ ‘์ด‰ ์•ˆ์ •์„ฑยท์ด์‹์„ฑ์ด ๋” ์ข‹์•„์งˆ ๊ฐ€๋Šฅ์„ฑ์ด ํฝ๋‹ˆ๋‹ค.
  • sim-to-real ์ฐจ์ด ์ •๋Ÿ‰ํ™”: Allegro ์‹ค์ œ ๋ฐฐํฌ์—์„œ ์‹คํŒจ ์›์ธ์„ ๋งˆ์ฐฐ๊ณ„์ˆ˜ยท๋ฌด๊ฒŒ์ถ” ์˜ค์ฐจยท๋ชจ๋ธ๋ง ํŽธ์ฐจ๋ณ„๋กœ ๋ถ„ํ•ดํ•ด ๋กœ๋ฒ„์ŠคํŠธํ™” ์ง‘ํ•ฉ D๋ฅผ ์žฌ์„ค๊ณ„ํ•˜๋ฉด ๋ฐฐํฌ ์„ฑ๊ณต๋ฅ ์„ ๋†’์ผ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

๐Ÿ”” Ring Review

๐Ÿ”” Ring โ€” An idea that echoes. Grasp the core and its value.

์‚ฌ๋žŒ์˜ ๋ชจ์…˜์บก์ฒ˜ ๋ฐ์ดํ„ฐ๋ฅผ ๋ฌผ๋ฆฌ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ์•ˆ์—์„œ โ€œ๋Œ€๊ทœ๋ชจ ์ƒ˜ํ”Œ๋ง + ๊ฐ€์ƒ ์ ‘์ด‰ ๊ฐ€์ด๋“œโ€๋กœ ์ •์ œํ•˜์—ฌ, 9์ข…์˜ ๋กœ๋ด‡ ร— 6๊ฐœ ๋ฐ์ดํ„ฐ์…‹์— ๊ฑธ์ณ ๋™์—ญํ•™์ ์œผ๋กœ ์‹คํ–‰ ๊ฐ€๋Šฅํ•œ ๊ถค์ ์„ ์ƒ์„ฑํ•˜๋Š” ๋ฒ”์šฉ ๋ฆฌํƒ€๊ฒŒํŒ… ํ”„๋ ˆ์ž„์›Œํฌ.


์„œ๋ก : ์™œ ์ด ๋ฌธ์ œ๊ฐ€ ์ค‘์š”ํ•œ๊ฐ€

๋กœ๋ด‡ ์†์œผ๋กœ ๋ฌผ๊ฑด์„ ์ง‘๊ณ , ๋Œ๋ฆฌ๊ณ , ์˜ฎ๊ธฐ๋Š” ์ผโ€”์ด๋ฅธ๋ฐ” Dexterous Manipulationโ€”์„ ํ•™์Šต์‹œํ‚ค๋ ค๋ฉด ๋Œ€๊ทœ๋ชจ ์‹œ์—ฐ ๋ฐ์ดํ„ฐ๊ฐ€ ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค. ๊ทธ๋Ÿฐ๋ฐ ๋กœ๋ด‡ ํ•˜๋“œ์›จ์–ด๋กœ ์ง์ ‘ ๋ฐ์ดํ„ฐ๋ฅผ ๋ชจ์œผ๋Š” ๊ฑด ๋น„์šฉ์ด ์–ด๋งˆ์–ด๋งˆํ•ฉ๋‹ˆ๋‹ค. ๋ฐ˜๋ฉด ์‚ฌ๋žŒ์˜ ์† ๋™์ž‘ ๋ฐ์ดํ„ฐ๋Š” ๋ชจ์…˜์บก์ฒ˜, ๋น„๋””์˜ค, VR ๋“ฑ์œผ๋กœ ์ด๋ฏธ ๋„˜์ณ๋‚ฉ๋‹ˆ๋‹ค.

๋ฌธ์ œ๋Š” ์ฒดํ˜„ ๊ฒฉ์ฐจ(Embodiment Gap)์ž…๋‹ˆ๋‹ค. ์‚ฌ๋žŒ ์†์—๋Š” 27๊ฐœ์˜ ์ž์œ ๋„(DoF)๊ฐ€ ์žˆ์ง€๋งŒ, Allegro Hand๋Š” 16๊ฐœ, Schunk Hand๋Š” 7๊ฐœ๋ฟ์ด์ฃ . ์†๊ฐ€๋ฝ ๊ธธ์ด๋„, ๊ด€์ ˆ ๋ฐฐ์น˜๋„, ํž˜์„ ๋‚ด๋Š” ๋ฐฉ์‹๋„ ๋‹ค๋ฆ…๋‹ˆ๋‹ค. ๊ทธ๋ž˜์„œ ์‚ฌ๋žŒ์ด ์ปต์„ ์žก๋Š” ๋™์ž‘์„ ๊ทธ๋Œ€๋กœ ๋กœ๋ด‡์— ๋„ฃ์œผ๋ฉด ๋ฌผ๋ฆฌ์ ์œผ๋กœ ๋ถˆ๊ฐ€๋Šฅํ•œ ๋™์ž‘์ด ๋ฉ๋‹ˆ๋‹คโ€”์†๊ฐ€๋ฝ์ด ๋ฌผ์ฒด๋ฅผ ๊ด€ํ†ตํ•˜๊ฑฐ๋‚˜, ์ ‘์ด‰์ด ํ˜•์„ฑ๋˜์ง€ ์•Š๊ฑฐ๋‚˜, ๊ณต์ค‘์—์„œ ๋ฌผ์ฒด๊ฐ€ ๋–  ์žˆ๋Š” โ€œ์œ ๋ น ๊ทธ๋ฆฝโ€ ํ˜„์ƒ์ด ์ƒ๊ธฐ์ฃ .

๊ธฐ์กด ์ ‘๊ทผ๋ฒ•๋“ค์˜ ํ•œ๊ณ„๋ฅผ ์ •๋ฆฌํ•˜๋ฉด ์ด๋ ‡์Šต๋‹ˆ๋‹ค:

์ ‘๊ทผ๋ฒ• ์žฅ์  ํ•œ๊ณ„
์—ญ๊ธฐ๊ตฌํ•™(IK) ๊ธฐ๋ฐ˜ ๋น ๋ฅด๊ณ  ๊ฐ„๋‹จ ๋™์—ญํ•™ ๋ฌด์‹œ, ์ ‘์ด‰ ๋ถ€์ •ํ™•
๊ฐ•ํ™”ํ•™์Šต(RL) ๊ธฐ๋ฐ˜ ๋ฌผ๋ฆฌ์  ํƒ€๋‹น์„ฑ ํ™•๋ณด ๊ถค์ ๋งˆ๋‹ค ํ•™์Šต ํ•„์š”, ๋А๋ฆผ
ํ…”๋ ˆ์˜คํผ๋ ˆ์ด์…˜ ์‹ค์‹œ๊ฐ„, ๋™์—ญํ•™ ๋ฐ˜์˜ ๋…ธ๋™์ง‘์•ฝ์ , ์ฒดํ˜„ ์ข…์†์ 
์‹ ๊ฒฝ๋ง ๊ธฐ๋ฐ˜ ์ถ”๋ก  ๋น ๋ฆ„ OOD ๋ชจ์…˜์— ์ทจ์•ฝ, ์‚ฌ์ „ํ•™์Šต ํ•„์š”

SPIDER๋Š” ์ด ์ŠคํŽ™ํŠธ๋Ÿผ์—์„œ RL์˜ ๋ฌผ๋ฆฌ์  ํƒ€๋‹น์„ฑ๊ณผ ์ตœ์ ํ™” ๊ธฐ๋ฐ˜ ๋ฐฉ๋ฒ•์˜ ํšจ์œจ์„ฑ์„ ๋™์‹œ์— ์žก๊ฒ ๋‹ค๋Š” ์•ผ์‹ฌ์ฐฌ ๋ชฉํ‘œ๋ฅผ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค. ํ•ต์‹ฌ ํ†ต์ฐฐ์€ ๋‹จ์ˆœํ•˜๋ฉด์„œ๋„ ๊ฐ•๋ ฅํ•ฉ๋‹ˆ๋‹ค:

โ€œ์‚ฌ๋žŒ ๋ฐ์ดํ„ฐ๋Š” ๋ฌด์—‡์„ ํ• ์ง€(task structure)๋ฅผ ์•Œ๋ ค์ฃผ๊ณ , ๋ฌผ๋ฆฌ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ์•ˆ์—์„œ์˜ ๋Œ€๊ทœ๋ชจ ์ƒ˜ํ”Œ๋ง์ด ์–ด๋–ป๊ฒŒ ํ• ์ง€(dynamical feasibility)๋ฅผ ์ฐพ์•„์ค€๋‹ค.โ€


๋ฐฉ๋ฒ•๋ก : SPIDER์˜ ํ•ต์‹ฌ ์•„์ด๋””์–ด

์ „์ฒด ํŒŒ์ดํ”„๋ผ์ธ ๊ฐœ๊ด€

SPIDER์˜ ํŒŒ์ดํ”„๋ผ์ธ์€ ํฌ๊ฒŒ ๋„ค ๋‹จ๊ณ„๋กœ ๊ตฌ์„ฑ๋ฉ๋‹ˆ๋‹ค. ์ „์ฒด ํ๋ฆ„์„ ๋จผ์ € ๋„์‹์œผ๋กœ ์‚ดํŽด๋ณด๊ฒ ์Šต๋‹ˆ๋‹ค.

flowchart LR
    A["๐Ÿง‘ ์‚ฌ๋žŒ ๋ชจ์…˜ ๋ฐ์ดํ„ฐ<br>(MoCap / Video / VR)"] --> B["1๏ธโƒฃ ๊ธฐ๊ตฌํ•™ ๋ฆฌํƒ€๊ฒŒํŒ…<br>(Inverse Kinematics)"]
    B --> C["2๏ธโƒฃ ๋ฌผ๋ฆฌ ๊ธฐ๋ฐ˜ ์ƒ˜ํ”Œ๋ง<br>(Annealed Sampling<br>+ Contact Guidance)"]
    C --> D["3๏ธโƒฃ ๊ถค์  ๊ฐ•๊ฑดํ™”<br>(Robustification)"]
    D --> E["4๏ธโƒฃ ๋ฐ์ดํ„ฐ ์ฆ๊ฐ•<br>(Physics-based<br>Augmentation)"]
    E --> F["๐Ÿค– ๋กœ๋ด‡ ์‹คํ–‰ ๊ฐ€๋Šฅ ๊ถค์ <br>/ ์ •์ฑ… ํ•™์Šต ๋ฐ์ดํ„ฐ"]

    style A fill:#e1f5fe
    style F fill:#e8f5e9
    style C fill:#fff3e0

๋…ผ๋ฌธ Figure 2 ์„ค๋ช… (Pipeline Overview): ๋…ผ๋ฌธ์˜ Figure 2๋Š” ์ด ํŒŒ์ดํ”„๋ผ์ธ์„ ์‹œ๊ฐ์ ์œผ๋กœ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค. ์™ผ์ชฝ์—์„œ ์žฌ๊ตฌ์„ฑ๋œ ๋ฌผ์ฒด ๋ฉ”์‹œ์™€ ์ฐธ์กฐ ๋กœ๋ด‡/๋ฌผ์ฒด ๋ชจ์…˜์ด ์ž…๋ ฅ๋˜๊ณ , ๊ฐ€์šด๋ฐ์—์„œ ๋ฌผ๋ฆฌ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๊ธฐ๋ฐ˜ ์ƒ˜ํ”Œ๋ง๊ณผ ๊ฐ€์ƒ ์ ‘์ด‰ ๊ฐ€์ด๋“œ๊ฐ€ ์ ์šฉ๋˜๋ฉฐ, ์˜ค๋ฅธ์ชฝ์—์„œ ๋™์—ญํ•™์ ์œผ๋กœ ์‹คํ–‰ ๊ฐ€๋Šฅํ•œ ๊ถค์ ์ด ์ถœ๋ ฅ๋ฉ๋‹ˆ๋‹ค.

๋ฌผ๋ฆฌ ๊ธฐ๋ฐ˜ ๋ฆฌํƒ€๊ฒŒํŒ… ๋ฌธ์ œ ์ •์˜

SPIDER๋Š” ๋ฆฌํƒ€๊ฒŒํŒ…์„ ์ œ์•ฝ ์ตœ์ ํ™” ๋ฌธ์ œ๋กœ ๊ณต์‹ํ™”ํ•ฉ๋‹ˆ๋‹ค. ์ด ๊ณต์‹ํ™”๋ฅผ ์ง๊ด€์ ์œผ๋กœ ์ดํ•ดํ•ด ๋ด…์‹œ๋‹ค.

๋กœ๋ด‡์˜ ์ œ์–ด ์ž…๋ ฅ ์‹œํ€€์Šค u_{0:T-1}์„ ์ฐพ์•„์„œ, ์ฐธ์กฐ ๊ถค์  x_{0:T}^{\text{ref}}๊ณผ์˜ ๊ฑฐ๋ฆฌ์™€ ์ œ์–ด ๋…ธ๋ ฅ์„ ์ตœ์†Œํ™”ํ•ฉ๋‹ˆ๋‹ค:

\min_{u_{0:T-1}} J(u_{0:T-1}) = \|x_T - x_T^{\text{ref}}\|_{Q_T}^2 + \sum_{t=0}^{T-1}\left(\|x_{t+1} - x_{t+1}^{\text{ref}}\|_{Q_t}^2 + \|u_t\|_{R_t}^2\right)

์—ฌ๊ธฐ์„œ ํ•ต์‹ฌ์ ์ธ ์ œ์•ฝ ์กฐ๊ฑด์€:

x_{t+1} = f(x_t, u_t, t) \quad \text{(๋ฌผ๋ฆฌ ๋ฒ•์น™์— ์˜ํ•œ ์ƒํƒœ ์ „์ด)}

์ด๊ฑธ ์ผ์ƒ ๋น„์œ ๋กœ ํ’€๋ฉด ์ด๋ ‡์Šต๋‹ˆ๋‹ค. ์—ฌ๋Ÿฌ๋ถ„์ด ์ถค ๋™์˜์ƒ์„ ๋ณด๊ณ  ๋”ฐ๋ผ ํ•˜๋ ค ํ•œ๋‹ค๊ณ  ํ•ฉ์‹œ๋‹ค. ๋™์˜์ƒ ์† ๋Œ„์„œ(์‚ฌ๋žŒ ๋ฐ์ดํ„ฐ)์˜ ํฌ์ฆˆ๋ฅผ ์ตœ๋Œ€ํ•œ ๋น„์Šทํ•˜๊ฒŒ ๋”ฐ๋ผ ํ•˜๋˜(Q_t ํ•ญ), ํž˜๋“ค๊ฒŒ ์›€์ง์ด์ง€ ์•Š์œผ๋ฉด์„œ(R_t ํ•ญ), ๋ฌผ๋ฆฌ ๋ฒ•์น™์„ ์–ด๊ธฐ์ง€ ์•Š๋Š” ๋™์ž‘์„ ์ฐพ์•„์•ผ ํ•ฉ๋‹ˆ๋‹ค. ๋ฒฝ์„ ๊ด€ํ†ตํ•˜๊ฑฐ๋‚˜, ๊ณต์ค‘์— ๋– ์„œ๋Š” ์•ˆ ๋˜์ฃ . ์—ฌ๊ธฐ์„œ Q_t๋Š” โ€œ์–ผ๋งˆ๋‚˜ ์›๋ณธ๊ณผ ๋น„์Šทํ•ด์•ผ ํ•˜๋Š”์ง€โ€, R_t๋Š” โ€œ์–ผ๋งˆ๋‚˜ ๋ถ€๋“œ๋Ÿฝ๊ฒŒ ์›€์ง์—ฌ์•ผ ํ•˜๋Š”์ง€โ€๋ฅผ ์กฐ์ ˆํ•˜๋Š” ๊ฐ€์ค‘์น˜์ž…๋‹ˆ๋‹ค.

์ƒํƒœ x_t^{\text{ref}}๋Š” ๋กœ๋ด‡์˜ ๊ด€์ ˆ ์œ„์น˜์™€ ๋ฌผ์ฒด์˜ SE(3) ํฌ์ฆˆ๋ฅผ ๋ชจ๋‘ ํฌํ•จํ•ฉ๋‹ˆ๋‹ค. ์ด๊ฒƒ์ด ์ค‘์š”ํ•œ ์ด์œ ๋Š”, SPIDER๊ฐ€ ๋กœ๋ด‡ ๋™์ž‘๋ฟ ์•„๋‹ˆ๋ผ ๋ฌผ์ฒด์˜ ์›€์ง์ž„๊นŒ์ง€ ๋™์‹œ์— ์ถ”์ ํ•˜๊ธฐ ๋•Œ๋ฌธ์ž…๋‹ˆ๋‹ค.

์ƒ˜ํ”Œ๋ง ๊ธฐ๋ฐ˜ ์ตœ์ ํ™”: ์™œ ์ƒ˜ํ”Œ๋ง์ธ๊ฐ€?

์ ‘์ด‰์ด ํ’๋ถ€ํ•œ(contact-rich) ๋ฌธ์ œ์—์„œ ๋น„์šฉ ํ•จ์ˆ˜ J์˜ ์ง€ํ˜•์€ ๋งค์šฐ ์šธํ‰๋ถˆํ‰ํ•ฉ๋‹ˆ๋‹ค. ๋ฏธ๋ถ„์ด ๋ถˆ์—ฐ์†์ด๊ณ , ์ ‘์ด‰ ๋ชจ๋“œ๊ฐ€ ๋ฐ”๋€Œ๋ฉด ๋น„์šฉ์ด ๊ฐ‘์ž๊ธฐ ์ ํ”„ํ•ฉ๋‹ˆ๋‹ค. ๊ทธ๋ž˜๋””์–ธํŠธ ๊ธฐ๋ฐ˜ ์ตœ์ ํ™”๊ฐ€ ์ œ๋Œ€๋กœ ์ž‘๋™ํ•˜๊ธฐ ์–ด๋ ค์šด ํ™˜๊ฒฝ์ด์ฃ .

SPIDER๋Š” Model Predictive Path Integral (MPPI) ๊ณ„์—ด์˜ ์ƒ˜ํ”Œ๋ง ๊ธฐ๋ฐ˜ ์ตœ์ ํ™”๋ฅผ ์ฑ„ํƒํ•ฉ๋‹ˆ๋‹ค. ์ง๊ด€์ ์œผ๋กœ ์„ค๋ช…ํ•˜๋ฉด:

  1. ํ˜„์žฌ ์ œ์–ด ์‹œํ€€์Šค U^i์— ๊ฐ€์šฐ์‹œ์•ˆ ๋…ธ์ด์ฆˆ๋ฅผ ๋”ํ•ด N_W๊ฐœ์˜ ํ›„๋ณด ๊ถค์ ์„ ๋งŒ๋“ญ๋‹ˆ๋‹ค
  2. ๊ฐ ํ›„๋ณด๋ฅผ ๋ฌผ๋ฆฌ ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ์—์„œ ๋ณ‘๋ ฌ ๋กค์•„์›ƒํ•ฉ๋‹ˆ๋‹ค
  3. ๋น„์šฉ์ด ๋‚ฎ์€ ํ›„๋ณด์— ๋” ๋†’์€ ๊ฐ€์ค‘์น˜๋ฅผ ๋ถ€์—ฌํ•˜์—ฌ ๋‹ค์Œ ์ œ์–ด ์‹œํ€€์Šค๋ฅผ ์—…๋ฐ์ดํŠธํ•ฉ๋‹ˆ๋‹ค

U^{i+1} = U^i + \frac{\sum_{j=1}^{N_W} \exp\left(-\frac{J(U^i + [W]_j)}{\lambda}\right)[W]_j}{\sum_{j=1}^{N_W} \exp\left(-\frac{J(U^i + [W]_j)}{\lambda}\right)}

์—ฌ๊ธฐ์„œ \lambda๋Š” ์˜จ๋„ ํŒŒ๋ผ๋ฏธํ„ฐ๋กœ, ๋‚ฎ์„์ˆ˜๋ก ์ตœ์ € ๋น„์šฉ ์ƒ˜ํ”Œ์— ์ง‘์ค‘ํ•ฉ๋‹ˆ๋‹ค. ์ด๊ฑด RL๊ณผ ๋น„์Šทํ•œ ์ ์ด ์žˆ์Šต๋‹ˆ๋‹คโ€”๋‘˜ ๋‹ค ์‹œ๋ฎฌ๋ ˆ์ด์…˜์—์„œ ์ƒ˜ํ”Œ๋งํ•œ ๊ถค์ ์œผ๋กœ ์˜์‚ฌ๊ฒฐ์ •์„ ํ•ฉ๋‹ˆ๋‹ค. ํ•˜์ง€๋งŒ ๊ฒฐ์ •์  ์ฐจ์ด๊ฐ€ ์žˆ์Šต๋‹ˆ๋‹ค: RL์€ ์ •์ฑ… ๋„คํŠธ์›Œํฌ๋ฅผ ์—…๋ฐ์ดํŠธํ•˜๊ณ , SPIDER๋Š” ์ œ์–ด ์‹œํ€€์Šค๋ฅผ ์ง์ ‘ ์ตœ์ ํ™”ํ•ฉ๋‹ˆ๋‹ค. ๋„คํŠธ์›Œํฌ ํ•™์Šต์ด ํ•„์š” ์—†์œผ๋‹ˆ ํ›จ์”ฌ ๋น ๋ฆ…๋‹ˆ๋‹ค.

์–ด๋‹๋ง(Annealing) ์ „๋žต: ํƒ์ƒ‰์—์„œ ์ฐฉ์ทจ๋กœ

SPIDER์˜ ํ•ต์‹ฌ ๊ธฐ์—ฌ ์ค‘ ํ•˜๋‚˜๋Š” ์–ด๋‹๋ง ์ปค๋„(Annealed Kernel)์ž…๋‹ˆ๋‹ค. ์ƒ˜ํ”Œ๋ง ๋…ธ์ด์ฆˆ์˜ ๊ณต๋ถ„์‚ฐ์„ ์‹œ๊ฐ„์— ๋”ฐ๋ผ ์ค„์—ฌ๊ฐ€๋Š” ์ „๋žต์ž…๋‹ˆ๋‹ค:

\Sigma_h^i = \exp\left(-\frac{N-i}{\beta_1 N} - \frac{H-h}{\beta_2 H}\right) I

๋‘ ๊ฐœ์˜ ์ถ•์œผ๋กœ ์–ด๋‹๋งํ•ฉ๋‹ˆ๋‹ค:

  • \beta_1 (๋ฐ˜๋ณต ์ถ•): ์ตœ์ ํ™” ๋ฐ˜๋ณต์ด ์ง„ํ–‰๋ ์ˆ˜๋ก ํƒ์ƒ‰ ๋ฐ˜๊ฒฝ์„ ์ค„์ž…๋‹ˆ๋‹ค. ์ดˆ๋ฐ˜์—๋Š” ๋„“๊ฒŒ ํƒ์ƒ‰ํ•˜๊ณ , ํ›„๋ฐ˜์—๋Š” ์ข‹์€ ํ•ด ๊ทผ์ฒ˜๋ฅผ ์„ธ๋ฐ€ํ•˜๊ฒŒ ๋‹ค๋“ฌ์Šต๋‹ˆ๋‹ค.
  • \beta_2 (์‹œ๊ฐ„ ์ถ•): ์˜ˆ์ธก ์‹œ๊ฐ„ ์ถ•์„ ๋”ฐ๋ผ ๋…ธ์ด์ฆˆ๋ฅผ ์กฐ์ ˆํ•ฉ๋‹ˆ๋‹ค. ๊ฐ€๊นŒ์šด ๋ฏธ๋ž˜๋Š” ์ •๋ฐ€ํ•˜๊ฒŒ, ๋จผ ๋ฏธ๋ž˜๋Š” ๋„“๊ฒŒ ํƒ์ƒ‰ํ•ฉ๋‹ˆ๋‹ค.

์ด๊ฒƒ์€ ๊ธˆ์†์˜ ์–ด๋‹๋ง(ํ’€๋ฆผ) ๊ณผ์ •๊ณผ ์ •ํ™•ํžˆ ๊ฐ™์€ ์›๋ฆฌ์ž…๋‹ˆ๋‹ค. ๋œจ๊ฑฐ์šธ ๋•Œ(ํฐ ๋…ธ์ด์ฆˆ)๋Š” ์›์ž๋“ค์ด ์ž์œ ๋กญ๊ฒŒ ์›€์ง์ด๋ฉฐ ์ „์ฒด ์—๋„ˆ์ง€ ์ง€ํ˜•์„ ํƒ์ƒ‰ํ•˜๊ณ , ์ฒœ์ฒœํžˆ ์‹ํžˆ๋ฉด(์ž‘์€ ๋…ธ์ด์ฆˆ) ๊ฐ€์žฅ ์•ˆ์ •์ ์ธ ๊ฒฐ์ • ๊ตฌ์กฐ์— ์ˆ˜๋ ดํ•ฉ๋‹ˆ๋‹ค. ํ‘œ์ค€ MPPI๊ฐ€ ๊ณ ์ •๋œ ์˜จ๋„์—์„œ ์ƒ˜ํ”Œ๋งํ•˜๋Š” ๊ฒƒ๊ณผ ๋Œ€๋น„๋ฉ๋‹ˆ๋‹ค.

graph TD
    subgraph "ํ‘œ์ค€ MPPI"
        A1["๊ณ ์ • ํƒ์ƒ‰ ๋ฐ˜๊ฒฝ"] --> B1["๋†’์€ ๋ถ„์‚ฐ"]
        B1 --> C1["์ˆ˜๋ ด ์‹คํŒจ ๊ฐ€๋Šฅ"]
    end
    subgraph "์–ด๋‹๋ง ์ƒ˜ํ”Œ๋ง"
        A2["ํฐ ๋ฐ˜๊ฒฝ์œผ๋กœ ์‹œ์ž‘"] --> B2["์ ์ง„์  ์ถ•์†Œ"]
        B2 --> C2["์ •๋ฐ€ ์ˆ˜๋ ด"]
    end
    subgraph "์–ด๋‹๋ง + ์ ‘์ด‰ ๊ฐ€์ด๋“œ (SPIDER)"
        A3["ํฐ ๋ฐ˜๊ฒฝ + ๊ฐ€์ƒ ํž˜"] --> B3["์‹คํ˜„ ๊ฐ€๋Šฅ ์˜์—ญ ํ™•์žฅ"]
        B3 --> C3["์˜ฌ๋ฐ”๋ฅธ ์ ‘์ด‰ ๋ชจ๋“œ๋กœ ์ˆ˜๋ ด"]
    end

    style C1 fill:#ffcdd2
    style C2 fill:#fff9c4
    style C3 fill:#c8e6c9

๊ฐ€์ƒ ์ ‘์ด‰ ๊ฐ€์ด๋“œ: SPIDER์˜ ๋น„๋ฐ€ ๋ฌด๊ธฐ

์—ฌ๊ธฐ๊ฐ€ ์ด ๋…ผ๋ฌธ์—์„œ ๊ฐ€์žฅ ์ฐฝ์˜์ ์ธ ๋ถ€๋ถ„์ž…๋‹ˆ๋‹ค.

๋ฌธ์ œ: ๊ฐ™์€ ๋ฌผ์ฒด๋ฅผ ์žก๋Š” ๋ฐ์—๋„ ์—ฌ๋Ÿฌ ์ ‘์ด‰ ๋ชจ๋“œ(contact mode)๊ฐ€ ์กด์žฌํ•ฉ๋‹ˆ๋‹ค. ๋…ผ๋ฌธ์˜ Figure 3์—์„œ ๋ง‰๋Œ€๋ฅผ ์žก๋Š” ์˜ˆ์‹œ๋ฅผ ๋ณด๋ฉด, ์—„์ง€-๊ฒ€์ง€ ์‚ฌ์ด๋กœ ์žก์„ ์ˆ˜๋„ ์žˆ๊ณ  ๊ฒ€์ง€-์ค‘์ง€ ์‚ฌ์ด๋กœ ์žก์„ ์ˆ˜๋„ ์žˆ์Šต๋‹ˆ๋‹ค. ๋‘˜ ๋‹ค ๋ฌผ์ฒด๋ฅผ ์›€์ง์ด๋Š” ๋ฐ ์„ฑ๊ณตํ•˜์ง€๋งŒ, ์‚ฌ๋žŒ์˜ ์›๋ž˜ ์˜๋„์™€ ๋‹ค๋ฅธ ์ ‘์ด‰ ๋ชจ๋“œ๋กœ ์ˆ˜๋ ดํ•˜๋ฉด ์ž์—ฐ์Šค๋Ÿฝ์ง€ ์•Š์€ ๋™์ž‘์ด ๋ฉ๋‹ˆ๋‹ค.

ํ•ด๊ฒฐ: SPIDER๋Š” ๊ฐ€์ƒ ํž˜(virtual force)์„ ๋„์ž…ํ•ฉ๋‹ˆ๋‹ค. ๋กœ๋ด‡ ์†๊ฐ€๋ฝ์˜ ์˜๋„๋œ ์ ‘์ด‰์ ๊ณผ ๋ฌผ์ฒด ์‚ฌ์ด์— โ€œ๋ณด์ด์ง€ ์•Š๋Š” ์Šคํ”„๋งโ€์„ ๋‹ฌ์•„์„œ, ์ดˆ๊ธฐ์—๋Š” ๋ฌผ์ฒด๋ฅผ ์›ํ•˜๋Š” ์ ‘์ด‰์ ์— โ€œ๋ถ™์—ฌ๋†“๊ณ โ€, ์ ์ฐจ ์ด ํž˜์„ ํ’€์–ด์ค๋‹ˆ๋‹ค.

์ˆ˜ํ•™์ ์œผ๋กœ, k๋ฒˆ์งธ ์ ‘์ด‰ ์Œ์— ๋Œ€ํ•ด:

c_{k,t} \cdot \|{}^{\text{robot}}p_{k,t}^{\text{object}} - {}^{\text{robot}}p_{k,t}^{\text{object,ref}}\|_2^2 \leq \eta_i

์—ฌ๊ธฐ์„œ:

  • c_{k,t}๋Š” ์ ‘์ด‰ ์ง€์‹œ์ž (์ฐธ์กฐ์—์„œ ์ ‘์ด‰์ด ์ผ์–ด๋‚˜๋Š” ์‹œ์ ์—๋งŒ ํ™œ์„ฑํ™”)
  • {}^{\text{robot}}p_{k,t}^{\text{object}}๋Š” ๋กœ๋ด‡ ์†๊ฐ€๋ฝ๊ณผ ๋ฌผ์ฒด ์‚ฌ์ด์˜ ์ƒ๋Œ€ ์œ„์น˜
  • \eta_i \to \infty (i \to N)๋Š” ์ปค๋ฆฌํ˜๋Ÿผ ์Šคํƒ€์ผ๋กœ ์ œ์•ฝ์„ ์™„ํ™”

์ด๊ฑธ ๋น„์œ ํ•˜์ž๋ฉด ์ด๋ ‡์Šต๋‹ˆ๋‹ค. ์ž์ „๊ฑฐ๋ฅผ ์ฒ˜์Œ ๋ฐฐ์šธ ๋•Œ ๋ณด์กฐ ๋ฐ”ํ€ด๋ฅผ ๋‹ฌ์•„๋‘๊ณ , ๊ท ํ˜• ๊ฐ๊ฐ์ด ์ƒ๊ธฐ๋ฉด ๋ณด์กฐ ๋ฐ”ํ€ด๋ฅผ ๋–ผ๋Š” ๊ฒƒ๊ณผ ๊ฐ™์Šต๋‹ˆ๋‹ค. ์ฒ˜์Œ์—๋Š” โ€œ๋ฌผ์ฒด๊ฐ€ ์ด ์†๊ฐ€๋ฝ์— ๋ถ™์–ด์žˆ์–ด์•ผ ํ•ดโ€๋ผ๋Š” ๊ฐ•ํ•œ ์ œ์•ฝ์„ ๊ฑธ๊ณ , ์ตœ์ ํ™”๊ฐ€ ์˜ฌ๋ฐ”๋ฅธ ์ ‘์ด‰ ๋ชจ๋“œ ๊ทผ์ฒ˜์— ๋„๋‹ฌํ•˜๋ฉด ์ œ์•ฝ์„ ํ’€์–ด์„œ ์ž์—ฐ์Šค๋Ÿฌ์šด ๋ฌผ๋ฆฌ ๊ธฐ๋ฐ˜ ์ ‘์ด‰์œผ๋กœ ์ „ํ™˜ํ•ฉ๋‹ˆ๋‹ค.

๊ธฐ์กด์˜ ์ ‘์ด‰ ๋น„์šฉ(contact cost)๊ณผ์˜ ์ฐจ์ด๊ฐ€ ์ค‘์š”ํ•ฉ๋‹ˆ๋‹ค. ๋‹จ์ˆœํžˆ ๋น„์šฉ์— ์ ‘์ด‰ ํŽ˜๋„ํ‹ฐ๋ฅผ ์ถ”๊ฐ€ํ•˜๋Š” ๊ฒƒ์€ ์‹คํ˜„ ๊ฐ€๋Šฅ ์˜์—ญ(feasible set)์„ ๋ฐ”๊พธ์ง€ ์•Š์Šต๋‹ˆ๋‹คโ€”๊ทธ์ € ๋น„์šฉ ์ง€ํ˜•์˜ ๊ธฐ์šธ๊ธฐ๋ฅผ ๋ฐ”๊ฟ€ ๋ฟ์ด์ฃ . SPIDER์˜ ๊ฐ€์ƒ ์ ‘์ด‰ ๊ฐ€์ด๋“œ๋Š” ๋ฌผ๋ฆฌ์ ์œผ๋กœ ์‹คํ˜„ ๊ฐ€๋Šฅ ์˜์—ญ ์ž์ฒด๋ฅผ ํ™•์žฅํ•ฉ๋‹ˆ๋‹ค. ๊ฐ€์ƒ ํž˜์ด ๋ฌผ์ฒด๋ฅผ ์›ํ•˜๋Š” ์œ„์น˜์— ์žก์•„๋‘๋ฏ€๋กœ, ์˜ฌ๋ฐ”๋ฅธ ์ ‘์ด‰ ๋ชจ๋“œ์˜ โ€œ์œ ์—ญ ๋ถ„์ง€(basin of attraction)โ€๊ฐ€ ์ปค์ง€๋Š” ํšจ๊ณผ๊ฐ€ ์žˆ์Šต๋‹ˆ๋‹ค.

๋ถˆ์™„์ „ํ•œ ์ฐธ์กฐ์— ๋Œ€ํ•œ ๊ฐ•๊ฑด์„ฑ: ํ˜„์‹ค์˜ ๋ชจ์…˜์บก์ฒ˜ ๋ฐ์ดํ„ฐ๋Š” ๋…ธ์ด์ฆˆ๊ฐ€ ๋งŽ์Šต๋‹ˆ๋‹ค. ์ ‘์ด‰์ด ๋„ˆ๋ฌด ์งง๊ฑฐ๋‚˜(< t_{c,\min}) ์ ‘์ด‰์ ์ด ํฌ๊ฒŒ ์ด๋™ํ•˜๋Š” ๊ฒฝ์šฐ(> d_{c,\max}) ๋ถˆ์•ˆ์ •ํ•œ ์ ‘์ด‰์œผ๋กœ ๋ถ„๋ฅ˜ํ•˜์—ฌ ๊ฐ€์ƒ ์ œ์•ฝ์„ ๋น„ํ™œ์„ฑํ™”ํ•ฉ๋‹ˆ๋‹ค. ์ด๋Š” ๋…ธ์ด์ฆˆ๊ฐ€ ์ตœ์ ํ™”๋ฅผ ์˜ค์—ผ์‹œํ‚ค๋Š” ๊ฒƒ์„ ๋ฐฉ์ง€ํ•˜๋Š” ์•ˆ์ „์žฅ์น˜์ž…๋‹ˆ๋‹ค.

๊ถค์  ๊ฐ•๊ฑดํ™”(Robustification)

์‹œ๋ฎฌ๋ ˆ์ด์…˜๊ณผ ํ˜„์‹ค ์‚ฌ์ด์˜ ๊ฒฉ์ฐจ(sim-to-real gap)๋ฅผ ๋‹ค๋ฃจ๊ธฐ ์œ„ํ•ด, SPIDER๋Š” ๋น„๊ด€์ (pessimistic) ์ตœ์ ํ™”๋ฅผ ์ˆ˜ํ–‰ํ•ฉ๋‹ˆ๋‹ค:

J_{\text{rob}}(U) = \max_{d \in \mathcal{D}} J(U, d)

์—ฌ๊ธฐ์„œ \mathcal{D}๋Š” ์ ‘์ด‰ ๋งˆ์ง„, ๋งˆ์ฐฐ ๊ณ„์ˆ˜, ๋ฌผ์ฒด ์งˆ๋Ÿ‰ ๋“ฑ์˜ ๋ฌผ๋ฆฌ ํŒŒ๋ผ๋ฏธํ„ฐ ๋ณ€๋™ ๋ฒ”์œ„์ž…๋‹ˆ๋‹ค. ์ผ๋ฐ˜์ ์ธ ๋„๋ฉ”์ธ ๋žœ๋คํ™”(DR)๊ฐ€ ๊ธฐ๋Œ€๊ฐ’์„ ์ตœ์†Œํ™”ํ•˜๋Š” ๊ฒƒ๊ณผ ๋‹ฌ๋ฆฌ, SPIDER๋Š” ์ตœ์•…์˜ ๊ฒฝ์šฐ๋ฅผ ์ตœ์†Œํ™”ํ•ฉ๋‹ˆ๋‹ค.

์ด๊ฒƒ์„ ์ง๊ด€์ ์œผ๋กœ ์ดํ•ดํ•˜๋ฉด, ๋ณดํ†ต์˜ DR์ด โ€œํ‰๊ท ์ ์œผ๋กœ ์ž˜ ๋˜๋Š” ์ œ์–ดโ€๋ฅผ ์ฐพ๋Š”๋‹ค๋ฉด, SPIDER์˜ ๊ฐ•๊ฑดํ™”๋Š” โ€œ์–ด๋–ค ์ƒํ™ฉ์—์„œ๋„ ์ตœ์†Œํ•œ ์ด ์ •๋„๋Š” ๋˜๋Š” ์ œ์–ดโ€๋ฅผ ์ฐพ์Šต๋‹ˆ๋‹ค. ๋ฏธ๋‹ˆ๋ฐฐ์น˜ d_{1:K}์—์„œ ๊ฐ€์žฅ ๋‚˜์œ ๋น„์šฉ์„ ์‚ฌ์šฉํ•˜์—ฌ ์—…๋ฐ์ดํŠธํ•˜๋ฏ€๋กœ, ๊ฒฐ๊ณผ ๊ถค์ ์ด ๋‹ค์–‘ํ•œ ๋ฌผ๋ฆฌ ์กฐ๊ฑด์—์„œ๋„ ์‹คํ–‰ ๊ฐ€๋Šฅํ•ฉ๋‹ˆ๋‹ค.

๋ฌผ๋ฆฌ ๊ธฐ๋ฐ˜ ๋ฐ์ดํ„ฐ ์ฆ๊ฐ•

SPIDER์˜ ๋˜ ๋‹ค๋ฅธ ์žฅ์ ์€ ๋‹จ์ผ ์‹œ์—ฐ์œผ๋กœ๋ถ€ํ„ฐ ๋‹ค์–‘ํ•œ ๋ฌผ๋ฆฌ์  ๋ณ€ํ˜•์„ ์ƒ์„ฑํ•  ์ˆ˜ ์žˆ๋‹ค๋Š” ์ ์ž…๋‹ˆ๋‹ค:

mindmap
  root((๋ฐ์ดํ„ฐ ์ฆ๊ฐ•))
    ๊ธฐํ•˜ํ•™์  ๋ณ€ํ˜•
      ๋ฌผ์ฒด ํฌ๊ธฐ ๋ณ€๊ฒฝ
      ๋ฌผ์ฒด ๋ฉ”์‹œ ๊ต์ฒด
      ์ดˆ๊ธฐ ํฌ์ฆˆ ๋ณ€๋™
      ์ง€ํ˜• ๋ณ€๊ฒฝ: ํ‰์ง€โ†’๊ณ„๋‹จ
    ๋ฌผ๋ฆฌ์  ๋ณ€ํ˜•
      ์™ธ๋ ฅ ์ธ๊ฐ€: 120N, 240N
      ๋งˆ์ฐฐ ๊ณ„์ˆ˜ ๋ณ€๊ฒฝ
      ๋ฌผ์ฒด ์งˆ๋Ÿ‰ ๋ณ€๊ฒฝ

ํŠนํžˆ ์ธ์ƒ์ ์ธ ์˜ˆ์‹œ๋Š” ๋ฌผ์ฒด ๋ฉ”์‹œ ๊ต์ฒด์ž…๋‹ˆ๋‹ค. ์ฝœ๋ผ๋ณ‘์„ ์žก๋Š” ๋™์ž‘์˜ ๋ฌผ์ฒด๋ฅผ ๊ณ ์–‘์ด ์žฅ๋‚œ๊ฐ์œผ๋กœ ๋ฐ”๊พธ๋ฉด, SPIDER๊ฐ€ ์ž๋™์œผ๋กœ ์ƒˆ๋กœ์šด ๋ฌผ์ฒด์— ๋งž๋Š” ํŒŒ์ง€ ์ „๋žต์„ ์ฐพ์•„๋ƒ…๋‹ˆ๋‹ค. ๋˜ํ•œ ํ‰์ง€ ๋‹ฌ๋ฆฌ๊ธฐ ๋™์ž‘์— ๊ณ„๋‹จ ์ง€ํ˜•์„ ์ถ”๊ฐ€ํ•˜๋ฉด, ์ƒˆ๋กœ์šด ์ ‘์ด‰ ํŒจํ„ด์ด ํ•„์š”ํ•œ ๊ณ„๋‹จ ์˜ค๋ฅด๊ธฐ ๋™์ž‘์œผ๋กœ ๋ณ€ํ™˜๋ฉ๋‹ˆ๋‹คโ€”์ด๋Š” ์ˆœ์ˆ˜ ๊ธฐ๊ตฌํ•™ ๋ฆฌํƒ€๊ฒŒํŒ…์œผ๋กœ๋Š” ๋ถˆ๊ฐ€๋Šฅํ•œ ์ผ์ž…๋‹ˆ๋‹ค.


์‹คํ—˜: ์ˆซ์ž๊ฐ€ ๋งํ•˜๋Š” ๊ฒƒ๋“ค

์‹คํ—˜ ์„ค์ •

SPIDER์˜ ์‹คํ—˜ ๊ทœ๋ชจ๋Š” ์ƒ๋‹นํžˆ ์ธ์ƒ์ ์ž…๋‹ˆ๋‹ค:

๋ฒ”์ฃผ ๊ตฌ์„ฑ
Dexterous Hand Allegro, XHand, Inspire, Ability, Schunk (5์ข…)
Humanoid Unitree G1, H1-2, Fourier N1, Booster T1 (4์ข…)
์† ์กฐ์ž‘ ๋ฐ์ดํ„ฐ์…‹ GigaHands, OakInk, ARCTIC
ํœด๋จธ๋…ธ์ด๋“œ ๋ฐ์ดํ„ฐ์…‹ LAFAN1, AMASS, OMOMO
์ด ๋ฐ์ดํ„ฐ ๊ทœ๋ชจ 1,262 ์—ํ”ผ์†Œ๋“œ, 2.4M ํ”„๋ ˆ์ž„, 103์ข… ๋ฌผ์ฒด

๋…ผ๋ฌธ Figure 5 ์„ค๋ช…: ํ‰๊ฐ€์— ์‚ฌ์šฉ๋œ 9์ข… ๋กœ๋ด‡์˜ ์‚ฌ์–‘์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค. Dexterous hand๋Š” DoF๊ฐ€ 7(Schunk)๋ถ€ํ„ฐ 16(Allegro)๊นŒ์ง€, ์†๊ฐ€๋ฝ ์ˆ˜๊ฐ€ 3๊ฐœ(Schunk)๋ถ€ํ„ฐ 5๊ฐœ(XHand)๊นŒ์ง€ ๋‹ค์–‘ํ•ฉ๋‹ˆ๋‹ค. ์ด ํญ๋„“์€ ๋ณ€์ด๊ฐ€ SPIDER์˜ ๊ต์ฐจ-์ฒดํ˜„(cross-embodiment) ์ผ๋ฐ˜์„ฑ์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.

ํ‰๊ฐ€ ์ง€ํ‘œ:

  • ๋ฌผ์ฒด ํšŒ์ „ ์˜ค์ฐจ E_{\text{rot}}: ์Šคํ…๋ณ„ ํ‰๊ท  ์ฟผํ„ฐ๋‹ˆ์–ธ ์˜ค์ฐจ
  • ๋ฌผ์ฒด ์œ„์น˜ ์˜ค์ฐจ E_{\text{pos}}: ์Šคํ…๋ณ„ ํ‰๊ท  ์œ„์น˜ ์˜ค์ฐจ
  • ์„ฑ๊ณต ๊ธฐ์ค€: E_{\text{rot}} < 0.5 rad ๊ทธ๋ฆฌ๊ณ  E_{\text{pos}} < 3 cm
  • FPS (Frames Per Second): ๊ถค์  ์ƒ์„ฑ ์†๋„

Ablation Study: ๊ฐ ๊ตฌ์„ฑ ์š”์†Œ์˜ ๊ธฐ์—ฌ

๋…ผ๋ฌธ์˜ Table 1์€ ๊ฐ ๊ธฐ๋ฒ•์˜ ์ ์ง„์  ํšจ๊ณผ๋ฅผ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค. ์•„๋ž˜๋Š” Allegro Hand์—์„œ์˜ ๊ฒฐ๊ณผ๋ฅผ ์˜ˆ์‹œ๋กœ ์ •๋ฆฌํ•œ ๊ฒƒ์ž…๋‹ˆ๋‹ค:

๋ฐฉ๋ฒ• OakInk ์„ฑ๊ณต๋ฅ  GigaHands ์„ฑ๊ณต๋ฅ 
๊ธฐ๊ตฌํ•™ ๋ฆฌํƒ€๊ฒŒํŒ… (IK๋งŒ) 0.13 0.00
ํ‘œ์ค€ ์ƒ˜ํ”Œ๋ง (MPPI) 0.40 0.40
์–ด๋‹๋ง ์ƒ˜ํ”Œ๋ง 0.70 0.80
์–ด๋‹๋ง + ์ ‘์ด‰ ๊ฐ€์ด๋“œ (SPIDER) 1.00 1.00

์ด ๊ฒฐ๊ณผ์—์„œ ์ฝ์„ ์ˆ˜ ์žˆ๋Š” ํ•ต์‹ฌ ๋ฉ”์‹œ์ง€๋“ค:

  1. IK๋งŒ์œผ๋กœ๋Š” ๋ถ€์กฑํ•ฉ๋‹ˆ๋‹ค. ํŠนํžˆ GigaHands์—์„œ๋Š” ์„ฑ๊ณต๋ฅ ์ด 0%์ž…๋‹ˆ๋‹ค. ๊ธฐ๊ตฌํ•™์ ์œผ๋กœ ๋งคํ•‘ํ•œ ๊ถค์ ์€ ๋ฌผ๋ฆฌ์ ์œผ๋กœ ์‹คํ–‰ ๋ถˆ๊ฐ€๋Šฅํ•œ ๊ฒฝ์šฐ๊ฐ€ ๋Œ€๋ถ€๋ถ„์ž…๋‹ˆ๋‹ค.

  2. ๋ฌผ๋ฆฌ ์‹œ๋ฎฌ๋ ˆ์ด์…˜์ด ํฐ ํญ์˜ ๊ฐœ์„ ์„ ๊ฐ€์ ธ์˜ต๋‹ˆ๋‹ค. ํ‘œ์ค€ MPPI๋งŒ์œผ๋กœ๋„ IK ๋Œ€๋น„ ํฌ๊ฒŒ ํ–ฅ์ƒ๋ฉ๋‹ˆ๋‹ค.

  3. ์–ด๋‹๋ง์ด ํ•ต์‹ฌ์ ์ธ ์—ญํ• ์„ ํ•ฉ๋‹ˆ๋‹ค. ๊ณ ์ • ํƒ์ƒ‰ ๋ฐ˜๊ฒฝ ๋Œ€๋น„, ์–ด๋‹๋ง ์ „๋žต์ด ์•ฝ 30%p์˜ ์„ฑ๊ณต๋ฅ  ํ–ฅ์ƒ์„ ๊ฐ€์ ธ์˜ต๋‹ˆ๋‹ค.

  4. ์ ‘์ด‰ ๊ฐ€์ด๋“œ๊ฐ€ ๋งˆ์ง€๋ง‰ ํผ์ฆ ์กฐ๊ฐ์ž…๋‹ˆ๋‹ค. ์–ด๋‹๋ง ๊ธฐ๋ฐ˜ ๋Œ€๋น„ ํ‰๊ท  ์•ฝ 18%์˜ ์ถ”๊ฐ€ ๊ฐœ์„ ์ž…๋‹ˆ๋‹ค. ํŠนํžˆ ์ •๋ฐ€ํ•œ ์ดˆ๊ธฐ ์ ‘์ด‰์ด ํ•„์š”ํ•œ OakInk ์ž‘์—…์—์„œ ํšจ๊ณผ๊ฐ€ ๋‘๋“œ๋Ÿฌ์ง‘๋‹ˆ๋‹ค.

์ „์ฒด ๋ฐ์ดํ„ฐ์…‹์— ๊ฑธ์นœ ํŒจํ„ด์„ ๋ณด๋ฉด, ์ ‘์ด‰ ๊ฐ€์ด๋“œ์˜ ํšจ๊ณผ๋Š” ๋กœ๋ด‡๋งˆ๋‹ค ๋‹ค๋ฆ…๋‹ˆ๋‹ค. ์ž์œ ๋„๊ฐ€ ๋†’์€ ์†(Allegro, XHand)์—์„œ๋Š” ์–ด๋‹๋ง๋งŒ์œผ๋กœ๋„ ์ƒ๋‹นํžˆ ์ข‹์€ ์„ฑ๊ณผ๋ฅผ ๋ณด์ด์ง€๋งŒ, ์ž์œ ๋„๊ฐ€ ๋‚ฎ์€ ์†(Ability)์—์„œ๋Š” ์ ‘์ด‰ ๊ฐ€์ด๋“œ์˜ ์ถ”๊ฐ€ ํšจ๊ณผ๊ฐ€ ์ƒ๋Œ€์ ์œผ๋กœ ์ž‘์Šต๋‹ˆ๋‹ค. ์ด๋Š” ์ž์œ ๋„๊ฐ€ ์ ์„์ˆ˜๋ก ์‹คํ˜„ ๊ฐ€๋Šฅํ•œ ์ ‘์ด‰ ๋ชจ๋“œ ์ž์ฒด๊ฐ€ ์ œํ•œ์ ์ด๊ธฐ ๋•Œ๋ฌธ์œผ๋กœ ํ•ด์„๋ฉ๋‹ˆ๋‹ค.

๋Œ€๊ทœ๋ชจ ๋ฆฌํƒ€๊ฒŒํŒ… ๊ฒฐ๊ณผ

์ „์ฒด ๋ฐ์ดํ„ฐ์…‹์— ๋Œ€ํ•œ ์„ฑ๊ณต๋ฅ  (Table 2):

๋ฐ์ดํ„ฐ์…‹ ๊ถค์  ์ˆ˜ Ability Allegro Inspire Schunk XHand
OakInk 1,022 0.413 0.459 0.479 0.431 0.422
GigaHands 756 0.741 0.810 0.879 0.706 0.812

๋ฐ์ดํ„ฐ์…‹ ๊ฐ„ ์ฐจ์ด ํ•ด์„: GigaHands๊ฐ€ OakInk๋ณด๋‹ค ์„ฑ๊ณต๋ฅ ์ด ๋†’์€ ์ด์œ ๋Š” ์ž‘์—… ํŠน์„ฑ ์ฐจ์ด ๋•Œ๋ฌธ์ž…๋‹ˆ๋‹ค. GigaHands๋Š” ์ง‘์–ด-๋†“๊ธฐ(pick-and-place) ๋™์ž‘์ด ์ฃผ๋ฅผ ์ด๋ฃจ์–ด ๋ฆฌํƒ€๊ฒŒํŒ…์— ์œ ๋ฆฌํ•œ ๋ฐ˜๋ฉด, OakInk์€ ๋ฌผ์ฒด๊ฐ€ ์ด๋ฏธ ์žกํžŒ ์ƒํƒœ(pre-grasped)์—์„œ ์‹œ์ž‘ํ•˜๋ฏ€๋กœ ์ •๋ฐ€ํ•œ ์ดˆ๊ธฐ ์ ‘์ด‰ ๊ตฌ์„ฑ์ด ํ•„์ˆ˜์ ์ž…๋‹ˆ๋‹ค.

๋กœ๋ด‡ ๊ฐ„ ์ฐจ์ด ํ•ด์„: Inspire Hand(12 DoF, 5์†๊ฐ€๋ฝ)๊ฐ€ ์ „๋ฐ˜์ ์œผ๋กœ ๊ฐ€์žฅ ๋†’์€ ์„ฑ๊ณต๋ฅ ์„ ๋ณด์ž…๋‹ˆ๋‹ค. ์ž์œ ๋„๊ฐ€ ๋†’์„์ˆ˜๋ก ๋‹ค์–‘ํ•œ ํŒŒ์ง€ ์ „๋žต์„ ๊ตฌ์‚ฌํ•  ์ˆ˜ ์žˆ์–ด ๋ฆฌํƒ€๊ฒŒํŒ…์ด ์ˆ˜์›”ํ•ฉ๋‹ˆ๋‹ค.

SOTA ๋น„๊ต: ์†๋„์™€ ํ’ˆ์งˆ์˜ ํŠธ๋ ˆ์ด๋“œ์˜คํ”„

Table 3์—์„œ RL ๊ธฐ๋ฐ˜ ๋ฐฉ๋ฒ•๊ณผ์˜ ๋น„๊ต๊ฐ€ ํฅ๋ฏธ๋กญ์Šต๋‹ˆ๋‹ค:

๋ฐฉ๋ฒ• ๋ฐ์ดํ„ฐ์…‹ ์„ฑ๊ณต๋ฅ  FPS
SPIDER OakInk 47.9% 2.5
ManipTrans (RL) OakInk 39.5% 0.1
SPIDER ARCTIC 42.0% 1.5
DexMachina (RL) ARCTIC 67.1% 0.05

์ด ๊ฒฐ๊ณผ๋ฅผ ์–ด๋–ป๊ฒŒ ์ฝ์–ด์•ผ ํ• ๊นŒ์š”?

  • OakInk์—์„œ๋Š” SPIDER๊ฐ€ ๋” ๋†’์€ ์„ฑ๊ณต๋ฅ ๊ณผ 25๋ฐฐ ๋น ๋ฅธ ์†๋„๋ฅผ ๋‹ฌ์„ฑํ•ฉ๋‹ˆ๋‹ค. ManipTrans ๋Œ€๋น„ ์„ฑ๊ณต๋ฅ  +8.4%p, ์†๋„ 25ร—.
  • ARCTIC์—์„œ๋Š” DexMachina๊ฐ€ ์„ฑ๊ณต๋ฅ ์€ ๋” ๋†’์ง€๋งŒ, SPIDER๊ฐ€ 30๋ฐฐ ๋น ๋ฆ…๋‹ˆ๋‹ค. ARCTIC์€ ์–‘์† ์กฐ์ž‘๊ณผ ๊ฐ™์€ ๋ณต์žกํ•œ ์ž‘์—…์„ ํฌํ•จํ•˜๋ฏ€๋กœ, RL์˜ ํƒ์ƒ‰ ๋Šฅ๋ ฅ์ด ์œ ๋ฆฌํ•ฉ๋‹ˆ๋‹ค.

ํ•ต์‹ฌ์ ์ธ ํ†ต์ฐฐ์€ ์ด๊ฒƒ์ž…๋‹ˆ๋‹ค: SPIDER๋Š” โ€œ๊ถค์ ๋ณ„ ํ•™์Šตโ€์ด ํ•„์š” ์—†๋Š” ์ง์ ‘ ์ตœ์ ํ™” ๋ฐฉ์‹์ด๋ฏ€๋กœ, ๋Œ€๊ทœ๋ชจ ๋ฐ์ดํ„ฐ์…‹ ์ „์ฒด๋ฅผ ์ฒ˜๋ฆฌํ•˜๋Š” ๋ฐ RL ๋Œ€๋น„ 10๋ฐฐ ์ด์ƒ ๋น ๋ฆ…๋‹ˆ๋‹ค. 1,022๊ฐœ ๊ถค์ ์— ๋Œ€ํ•ด SPIDER๋Š” ํ•ฉ๋ฆฌ์  ์‹œ๊ฐ„ ์•ˆ์— ์ „์ฒด ๋ฆฌํƒ€๊ฒŒํŒ…์ด ๊ฐ€๋Šฅํ•˜์ง€๋งŒ, RL์€ ๊ฐ ๊ถค์ ๋งˆ๋‹ค ์ •์ฑ…์„ ํ•™์Šตํ•ด์•ผ ํ•˜๋ฏ€๋กœ ๊ณ„์‚ฐ ๋น„์šฉ์ด ๊ธฐํ•˜๊ธ‰์ˆ˜์ ์œผ๋กœ ์ฆ๊ฐ€ํ•ฉ๋‹ˆ๋‹ค.

์‹ค์ œ ๋กœ๋ด‡ ๋ฐฐํฌ ๊ฒฐ๊ณผ

๋…ผ๋ฌธ Figure 7 ์„ค๋ช…: Franka Emika Panda + Allegro Hand ์‹œ์Šคํ…œ์—์„œ์˜ ์˜คํ”ˆ๋ฃจํ”„ ์‹คํ–‰ ๊ฒฐ๊ณผ๋ฅผ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค. ์ „๊ตฌ ๋Œ๋ฆฌ๊ธฐ, ์ˆŸ๊ฐ€๋ฝ ์ง‘๊ธฐ, ๊ธฐํƒ€ ์—ฐ์ฃผ, ์ถฉ์ „๊ธฐ ๋ฝ‘๊ธฐ ๋“ฑ ์ •๋ฐ€ํ•œ ์†๊ฐ€๋ฝ ํ˜‘์‘์ด ํ•„์š”ํ•œ ์ž‘์—…์„ ์ถ”๊ฐ€ ์ ์‘ ์—†์ด ์„ฑ๊ณต์ ์œผ๋กœ ์ˆ˜ํ–‰ํ•ฉ๋‹ˆ๋‹ค.

์ด๋Š” ๊ฐ€์ƒ ์ œ์•ฝ์œผ๋กœ ์‹œ๋ฎฌ๋ ˆ์ด์…˜์—์„œ ์ตœ์ ํ™”ํ•œ ๊ถค์ ์ด ๊ฐ•๊ฑดํ™”(robustification) ๋‹จ๊ณ„๋งŒ์œผ๋กœ ์‹ค์ œ ํ•˜๋“œ์›จ์–ด์— ์ง์ ‘ ์ „์ด๋  ์ˆ˜ ์žˆ์Œ์„ ๋ณด์—ฌ์ฃผ๋Š” ์ค‘์š”ํ•œ ๊ฒฐ๊ณผ์ž…๋‹ˆ๋‹ค.

ํœด๋จธ๋…ธ์ด๋“œ ๋ฆฌํƒ€๊ฒŒํŒ… ๊ฒฐ๊ณผ

SPIDER๋Š” Dexterous hand๋ฟ๋งŒ ์•„๋‹ˆ๋ผ ํœด๋จธ๋…ธ์ด๋“œ ์ „์‹  ์ œ์–ด์—๋„ ์ ์šฉ๋ฉ๋‹ˆ๋‹ค:

๋ฐ์ดํ„ฐ์…‹ ๋ฐฉ๋ฒ• ๊ด€์ ˆ ์˜ค์ฐจ(ยฐ) ์œ„์น˜ ์˜ค์ฐจ(cm) ๋ฐฉํ–ฅ ์˜ค์ฐจ(ยฐ) FPS
LAFAN1 GMR 1.08 2.01 2.40 35.2
LAFAN1 SPIDER 0.58 0.11 0.07 23.1
AMASS GMR 6.2 4.1 18.7 37.2
AMASS SPIDER 0.75 0.23 0.08 22.0

SPIDER๊ฐ€ ์ถ”์  ์˜ค์ฐจ์—์„œ ์••๋„์ ์œผ๋กœ ์šฐ์ˆ˜ํ•ฉ๋‹ˆ๋‹ค. ํŠนํžˆ AMASS์—์„œ ๋ฐฉํ–ฅ ์˜ค์ฐจ๊ฐ€ 18.7ยฐ์—์„œ 0.08ยฐ๋กœ, 200๋ฐฐ ์ด์ƒ ๊ฐœ์„ ๋ฉ๋‹ˆ๋‹ค. ์ด๋Š” ๋ฌผ๋ฆฌ ์‹œ๋ฎฌ๋ ˆ์ด์…˜์ด ๋ฐœ ๋ฏธ๋„๋Ÿฌ์ง(foot sliding)์ด๋‚˜ ๋ฐ”๋‹ฅ ๊ด€ํ†ต(floor penetration) ๊ฐ™์€ ๊ธฐ๊ตฌํ•™์  ์•„ํ‹ฐํŒฉํŠธ๋ฅผ ํšจ๊ณผ์ ์œผ๋กœ ์ œ๊ฑฐํ•˜๊ธฐ ๋•Œ๋ฌธ์ž…๋‹ˆ๋‹ค.

๋‹ค๋งŒ FPS์—์„œ๋Š” GMR์ด ๋” ๋น ๋ฆ…๋‹ˆ๋‹ค(35~37 vs 19~23). ์ด๋Š” ๋ฌผ๋ฆฌ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๊ธฐ๋ฐ˜ ์ตœ์ ํ™”์˜ ๋ณธ์งˆ์ ์ธ ๊ณ„์‚ฐ ๋น„์šฉ ๋•Œ๋ฌธ์ด์ง€๋งŒ, SPIDER์˜ FPS๋Š” ์—ฌ์ „ํžˆ ์‹ค์‹œ๊ฐ„์— ๊ฐ€๊น์Šต๋‹ˆ๋‹ค.

RL ์ •์ฑ… ํ•™์Šต ๊ฐ€์†

๋…ผ๋ฌธ Figure 10 ์„ค๋ช…: OMOMO ๋ฐ์ดํ„ฐ์…‹์—์„œ ํœด๋จธ๋…ธ์ด๋“œ๊ฐ€ ์ƒ์ž๋ฅผ ์ง‘์–ด ๋ฐ”๋‹ฅ์— ๋†“๋Š” ์ž‘์—…์˜ RL ํ•™์Šต ๊ณก์„ ์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.

SPIDER๊ฐ€ ์ƒ์„ฑํ•œ ๊ถค์ ์œผ๋กœ RL์„ ํ•™์Šตํ•  ๋•Œ์˜ ํ•ต์‹ฌ์€ ์ž”์ฐจ ํ•™์Šต(residual learning)์ž…๋‹ˆ๋‹ค:

u_t = u_t^{\text{SPIDER}} + \pi_\theta(o_t)

SPIDER๊ฐ€ ๋ช…๋ชฉ(nominal) ์ œ์–ด u_t^{\text{SPIDER}}๋ฅผ ์ œ๊ณตํ•˜๊ณ , RL ์ •์ฑ…์€ ํŽธ์ฐจ๋ฅผ ๋ณด์ •ํ•˜๋Š” ์ž”์ฐจ ํ•ญ \pi_\theta(o_t)๋งŒ ํ•™์Šตํ•˜๋ฉด ๋ฉ๋‹ˆ๋‹ค. ์›๋ณธ ์‚ฌ๋žŒ ๋ชจ์…˜์œผ๋กœ ์ง์ ‘ ํ•™์Šตํ•˜๋ฉด ๋กœ๋ด‡์ด ๋ฌผ์ฒด ์ ‘์ด‰์— ์‹คํŒจํ•˜์—ฌ body tracking๋งŒ ๋‹ฌ์„ฑํ•˜์ง€๋งŒ, SPIDER ๊ถค์ ์œผ๋กœ ํ•™์Šตํ•˜๋ฉด ๋” ๋น ๋ฅด๊ฒŒ ์ˆ˜๋ ดํ•˜๊ณ  ๋ฌผ์ฒด ์ถ”์  ์„ฑ๋Šฅ๋„ ์šฐ์ˆ˜ํ•ฉ๋‹ˆ๋‹ค.


๋น„ํŒ์  ๊ณ ์ฐฐ

๊ฐ•์ 

1. ์ฒดํ˜„ ๋ถˆ๊ฐ€์ง€๋ก ์  ์„ค๊ณ„(Embodiment-Agnostic Design)

9์ข…์˜ ๋กœ๋ด‡(5์ข… ์† + 4์ข… ํœด๋จธ๋…ธ์ด๋“œ)์— ๊ฑธ์ณ ๋™์ผํ•œ ํ”„๋ ˆ์ž„์›Œํฌ๊ฐ€ ์ž‘๋™ํ•ฉ๋‹ˆ๋‹ค. ๋กœ๋ด‡๋ณ„ ๋ณด์ƒ ํ•จ์ˆ˜ ์„ค๊ณ„๋‚˜ ์ปค๋ฆฌํ˜๋Ÿผ ํŠœ๋‹์ด ํ•„์š” ์—†๋‹ค๋Š” ์ ์€ ์‹ค๋ฌด์ ์œผ๋กœ ํฐ ์žฅ์ ์ž…๋‹ˆ๋‹ค. Allegro Hand๋ฅผ ์“ฐ๋‹ค๊ฐ€ XHand๋กœ ๋ฐ”๊ฟ”๋„, SPIDER ํŒŒ์ดํ”„๋ผ์ธ ์ž์ฒด๋ฅผ ์ˆ˜์ •ํ•  ํ•„์š”๊ฐ€ ์—†์Šต๋‹ˆ๋‹ค.

2. ํ™•์žฅ์„ฑ๊ณผ ํšจ์œจ์„ฑ์˜ ๊ท ํ˜•

RL ๋Œ€๋น„ 10๋ฐฐ ๋น ๋ฅธ ๊ถค์  ์ƒ์„ฑ ์†๋„๋Š” ๋Œ€๊ทœ๋ชจ ๋ฐ์ดํ„ฐ์…‹ ์ฒ˜๋ฆฌ๋ฅผ ๊ฐ€๋Šฅ์ผ€ ํ•ฉ๋‹ˆ๋‹ค. 2.4M ํ”„๋ ˆ์ž„ ๊ทœ๋ชจ์˜ ๋ฐ์ดํ„ฐ์…‹ ์ƒ์„ฑ์€ RL ๊ธฐ๋ฐ˜ ๋ฐฉ๋ฒ•์œผ๋กœ๋Š” ํ˜„์‹ค์ ์œผ๋กœ ์–ด๋ ค์› ์„ ๊ฒƒ์ž…๋‹ˆ๋‹ค. MuJoCo Warp์˜ GPU ๋ณ‘๋ ฌ ์‹œ๋ฎฌ๋ ˆ์ด์…˜์„ ํ™œ์šฉํ•˜์—ฌ 10-20๋ฐฐ ๊ฐ€์†์„ ๋‹ฌ์„ฑํ•œ ์ ๋„ ์‹ค์šฉ์„ฑ์„ ๋†’์ž…๋‹ˆ๋‹ค.

3. ๊ฐ€์ƒ ์ ‘์ด‰ ๊ฐ€์ด๋“œ์˜ ์šฐ์•„ํ•จ

์‹คํ˜„ ๊ฐ€๋Šฅ ์˜์—ญ์„ ํ™•์žฅํ•œ๋‹ค๋Š” ์•„์ด๋””์–ด๋Š” ๋‹จ์ˆœํžˆ ๋น„์šฉ์„ ์ถ”๊ฐ€ํ•˜๋Š” ๊ฒƒ๋ณด๋‹ค ๊ทผ๋ณธ์ ์ธ ํ•ด๊ฒฐ์ฑ…์ž…๋‹ˆ๋‹ค. ์ปค๋ฆฌํ˜๋Ÿผ ๋ฐฉ์‹์˜ ์ ์ง„์  ์™„ํ™”๋Š” ์ด๋ก ์ ์œผ๋กœ๋„ ๊น”๋”ํ•˜๊ณ , ์‹คํ—˜์ ์œผ๋กœ๋„ ํšจ๊ณผ์ ์ž…๋‹ˆ๋‹ค.

4. ์™„์ „ํ•œ ํŒŒ์ดํ”„๋ผ์ธ ์ œ๊ณต

๋‹จ์ผ RGB ์นด๋ฉ”๋ผ โ†’ 3D ์žฌ๊ตฌ์„ฑ โ†’ ๋ฆฌํƒ€๊ฒŒํŒ… โ†’ ์‹ค์ œ ๋กœ๋ด‡ ๋ฐฐํฌ๊นŒ์ง€์˜ ์ „์ฒด ํŒŒ์ดํ”„๋ผ์ธ์„ ๋ณด์—ฌ์ค€ ์ , ๊ทธ๋ฆฌ๊ณ  ์ฝ”๋“œ๋ฅผ ๊ณต๊ฐœํ•œ ์ ์€ ์žฌํ˜„์„ฑ๊ณผ ์‹ค์šฉ์„ฑ ๋ฉด์—์„œ ๋†’์ด ํ‰๊ฐ€ํ•  ๋งŒํ•ฉ๋‹ˆ๋‹ค.

์•ฝ์  ๋ฐ ํ•œ๊ณ„

1. ๋ณต์žกํ•œ ์–‘์† ์กฐ์ž‘์—์„œ์˜ ์„ฑ๋Šฅ ๊ฒฉ์ฐจ

ARCTIC ๋ฐ์ดํ„ฐ์…‹์—์„œ DexMachina ๋Œ€๋น„ 25%p ๋‚ฎ์€ ์„ฑ๊ณต๋ฅ (42% vs 67.1%)์€ SPIDER์˜ ํ•œ๊ณ„๋ฅผ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค. ์–‘์† ํ˜‘์‘์ด ํ•„์š”ํ•œ ๋ณต์žกํ•œ ์ž‘์—…์—์„œ๋Š” RL์˜ ํƒ์ƒ‰ ๋Šฅ๋ ฅ์ด ์—ฌ์ „ํžˆ ์šฐ์œ„์ž…๋‹ˆ๋‹ค. SPIDER์˜ ์ƒ˜ํ”Œ๋ง์ด ๊ตญ์†Œ ์ตœ์ ์— ๋น ์ง€๊ธฐ ์‰ฌ์šด ์ƒํ™ฉ์œผ๋กœ ํ•ด์„๋ฉ๋‹ˆ๋‹ค.

2. ์˜คํ”ˆ๋ฃจํ”„ ์‹คํ–‰์˜ ๊ทผ๋ณธ์  ํ•œ๊ณ„

Dexterous hand์—์„œ์˜ ์‹ค๋กœ๋ด‡ ๊ฒฐ๊ณผ๊ฐ€ ์˜คํ”ˆ๋ฃจํ”„๋ผ๋Š” ์ ์€ ์ฃผ์˜๊ฐ€ ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค. ์‹ค์ œ ํ™˜๊ฒฝ์—์„œ๋Š” ๋ฌผ์ฒด์˜ ๋ฏธ๋„๋Ÿฌ์ง, ์˜ˆ์ƒ์น˜ ๋ชปํ•œ ์ ‘์ด‰ ๋“ฑ์— ๋Œ€ํ•œ ํ”ผ๋“œ๋ฐฑ์ด ํ•„์ˆ˜์ ์ž…๋‹ˆ๋‹ค. ๋…ผ๋ฌธ๋„ ์ด๋ฅผ ์ธ์ง€ํ•˜๊ณ  RL ์ž”์ฐจ ํ•™์Šต์„ ์ œ์•ˆํ•˜์ง€๋งŒ, ๊ทธ ๊ฒฐ๊ณผ๋Š” ํœด๋จธ๋…ธ์ด๋“œ์— ํ•œ์ •๋˜์–ด ์žˆ์Šต๋‹ˆ๋‹ค. Dexterous hand์—์„œ์˜ ํด๋กœ์ฆˆ๋“œ ๋ฃจํ”„ ์ •์ฑ… ํ•™์Šต ๊ฒฐ๊ณผ๊ฐ€ ์—†๋Š” ์ ์€ ์•„์‰ฝ์Šต๋‹ˆ๋‹ค.

3. ์ž…๋ ฅ ๋ฐ์ดํ„ฐ ํ’ˆ์งˆ ์˜์กด์„ฑ

SPIDER์˜ ์„ฑ๋Šฅ์€ 3D ์žฌ๊ตฌ์„ฑ ํ’ˆ์งˆ์— ์ƒ๋‹นํžˆ ์˜์กดํ•ฉ๋‹ˆ๋‹ค. ๋…ผ๋ฌธ ์Šค์Šค๋กœ๋„ ์ด๋ฅผ ์ธ์ •ํ•˜๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค. ๋…ธ์ด์ฆˆ๊ฐ€ ์‹ฌํ•œ ๋ฉ”์‹œ๋‚˜ ๋ชจ์…˜์€ ์ตœ์ ํ™”๋ฅผ ์˜ค์—ผ์‹œํ‚ฌ ์ˆ˜ ์žˆ์œผ๋ฉฐ, ์ ‘์ด‰ ํ•„ํ„ฐ๊ฐ€ ์ด๋ฅผ ์™„ํ™”ํ•˜์ง€๋งŒ ์™„์ „ํžˆ ํ•ด๊ฒฐํ•˜์ง€๋Š” ๋ชปํ•ฉ๋‹ˆ๋‹ค.

4. ํ•˜์ดํผํŒŒ๋ผ๋ฏธํ„ฐ ๋ฏผ๊ฐ๋„ ๋ถ„์„ ๋ถ€์žฌ

\beta_1, \beta_2, \lambda, \eta_i, \epsilon_{\text{contact}}, t_{c,\min}, d_{c,\max} ๋“ฑ ๋‹ค์ˆ˜์˜ ํ•˜์ดํผํŒŒ๋ผ๋ฏธํ„ฐ๊ฐ€ ์กด์žฌํ•˜์ง€๋งŒ, ์ด๋“ค์˜ ๋ฏผ๊ฐ๋„ ๋ถ„์„์ด ์ถฉ๋ถ„ํ•˜์ง€ ์•Š์Šต๋‹ˆ๋‹ค. โ€œ๋กœ๋ด‡๋ณ„ ์ถ”๊ฐ€ ํŠœ๋‹ ์—†์ดโ€ ์ž‘๋™ํ•œ๋‹ค๊ณ  ์ฃผ์žฅํ•˜์ง€๋งŒ, ์ตœ์ ์˜ ํŒŒ๋ผ๋ฏธํ„ฐ ์„ค์ •์ด ๋กœ๋ด‡/์ž‘์—…์— ๋”ฐ๋ผ ๋‹ฌ๋ผ์ง€๋Š”์ง€ ์—ฌ๋ถ€๊ฐ€ ๋ถˆ๋ถ„๋ช…ํ•ฉ๋‹ˆ๋‹ค.

5. ์„ฑ๊ณต๋ฅ ์˜ ์ ˆ๋Œ€์  ์ˆ˜์ค€

OakInk ์ „์ฒด ๋ฐ์ดํ„ฐ์…‹์—์„œ ์ตœ๊ณ  ์„ฑ๊ณต๋ฅ ์ด 47.9%(Inspire)๋ผ๋Š” ์ ์€, ์‹ค๋ฌด ์ ์šฉ ๊ด€์ ์—์„œ ์ ˆ๋ฐ˜ ์ด์ƒ์˜ ๊ถค์ ์ด ์‹คํŒจํ•œ๋‹ค๋Š” ์˜๋ฏธ์ž…๋‹ˆ๋‹ค. ๋ฌผ๋ก  ์ด๋Š” ๋Œ€๊ทœ๋ชจ ๋ฐ์ดํ„ฐ์…‹์—์„œ์˜ ๊ฒฐ๊ณผ์ด๊ณ , ์‹คํŒจ ๊ถค์ ์€ ํ•„ํ„ฐ๋งํ•˜์—ฌ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ์ง€๋งŒ, ์—ฌ์ „ํžˆ ๊ฐœ์„ ์˜ ์—ฌ์ง€๊ฐ€ ํฝ๋‹ˆ๋‹ค.


๊ด€๋ จ ์—ฐ๊ตฌ์™€์˜ ๋น„๊ต

๋ฆฌํƒ€๊ฒŒํŒ… ๋ฐฉ๋ฒ•๋ก  ์ŠคํŽ™ํŠธ๋Ÿผ์—์„œ์˜ ์œ„์น˜

quadrantChart
    title ๋ฆฌํƒ€๊ฒŒํŒ… ๋ฐฉ๋ฒ•๋ก  ๋น„๊ต
    x-axis "๋А๋ฆผ" --> "๋น ๋ฆ„"
    y-axis "๋ฌผ๋ฆฌ์  ๋น„ํƒ€๋‹น" --> "๋ฌผ๋ฆฌ์  ํƒ€๋‹น"
    quadrant-1 ๋น ๋ฅด๊ณ  ๋ฌผ๋ฆฌ์  ํƒ€๋‹น
    quadrant-2 ๋А๋ฆฌ์ง€๋งŒ ๋ฌผ๋ฆฌ์  ํƒ€๋‹น
    quadrant-3 ๋А๋ฆฌ๊ณ  ๋ฌผ๋ฆฌ์  ๋น„ํƒ€๋‹น
    quadrant-4 ๋น ๋ฅด์ง€๋งŒ ๋ฌผ๋ฆฌ์  ๋น„ํƒ€๋‹น
    IK ๊ธฐ๋ฐ˜: [0.85, 0.15]
    ์‹ ๊ฒฝ๋ง ๊ธฐ๋ฐ˜: [0.75, 0.35]
    RL ๊ธฐ๋ฐ˜: [0.15, 0.80]
    SPIDER: [0.65, 0.75]

ํ•ต์‹ฌ ๋น„๊ต ๋Œ€์ƒ

ManipTrans (Li et al., 2025): OakInk ๋ฐ์ดํ„ฐ์…‹์—์„œ ์ง์ ‘ ๋น„๊ต๋˜์—ˆ์œผ๋ฉฐ, SPIDER๊ฐ€ ์„ฑ๊ณต๋ฅ ๊ณผ ์†๋„ ๋ชจ๋‘์—์„œ ์šฐ์ˆ˜ํ•ฉ๋‹ˆ๋‹ค. ManipTrans๋Š” ๊ถค์ ๋ณ„ RL ์ •์ฑ…์„ ํ•™์Šตํ•˜๋ฏ€๋กœ ํ™•์žฅ์„ฑ์ด ์ œํ•œ๋ฉ๋‹ˆ๋‹ค.

DexMachina (Mandi et al., 2025): ARCTIC์—์„œ ๋” ๋†’์€ ์„ฑ๊ณต๋ฅ ์„ ๋ณด์ด์ง€๋งŒ, 30๋ฐฐ ๋А๋ฆฝ๋‹ˆ๋‹ค. DexMachina์˜ virtual object constraint์™€ SPIDER์˜ virtual contact guidance๋Š” ์œ ์‚ฌํ•œ ์ฒ ํ•™์„ ๊ณต์œ ํ•˜์ง€๋งŒ, SPIDER๊ฐ€ ์ƒ๋Œ€ ์œ„์น˜ ๊ธฐ๋ฐ˜์œผ๋กœ ๋” ์ผ๋ฐ˜์ ์ธ ๊ณต์‹ํ™”๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.

Dexplore (2025): ํฅ๋ฏธ๋กœ์šด ๋Œ€์กฐ์ ์ž…๋‹ˆ๋‹ค. Dexplore๋Š” ๋ฆฌํƒ€๊ฒŒํŒ…๊ณผ ์ถ”์ ์„ ๋‹จ์ผ ๋ฃจํ”„๋กœ ํ†ตํ•ฉํ•˜์—ฌ ์‚ฌ๋žŒ ์‹œ์—ฐ์„ โ€œsoft referenceโ€๋กœ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค. SPIDER์™€ ์ฒ ํ•™์ด ๋‹ค๋ฅธ๋ฐโ€”SPIDER๋Š” ๋จผ์ € ์ข‹์€ ๊ถค์ ์„ ๋งŒ๋“ค๊ณ  ๋‚˜์ค‘์— ์ •์ฑ…์„ ํ•™์Šตํ•˜๋Š” 2๋‹จ๊ณ„ ์ ‘๊ทผ, Dexplore๋Š” ์ฒ˜์Œ๋ถ€ํ„ฐ ์ •์ฑ…์„ ํ•™์Šตํ•˜๋Š” 1๋‹จ๊ณ„ ์ ‘๊ทผ์ž…๋‹ˆ๋‹ค. ๊ฐ๊ฐ์˜ ์žฅ๋‹จ์ ์ด ์žˆ์œผ๋ฉฐ, ํ–ฅํ›„ ๋‘ ์ ‘๊ทผ์˜ ์œตํ•ฉ๋„ ํฅ๋ฏธ๋กœ์šด ๋ฐฉํ–ฅ์ผ ๊ฒƒ์ž…๋‹ˆ๋‹ค.

DIAL-MPC (Xue et al., 2025): SPIDER์˜ ์–ด๋‹๋ง ์ปค๋„์€ DIAL-MPC์—์„œ ์˜๊ฐ์„ ๋ฐ›์•˜์Šต๋‹ˆ๋‹ค. DIAL-MPC๊ฐ€ ๋ณดํ–‰ ์ œ์–ด์— ์–ด๋‹๋ง ์ƒ˜ํ”Œ๋ง์„ ์ ์šฉํ•œ ๊ฒƒ์„ ๋ฆฌํƒ€๊ฒŒํŒ… ๋ฌธ์ œ๋กœ ํ™•์žฅํ•˜๊ณ , ์ ‘์ด‰ ๊ฐ€์ด๋“œ๋ฅผ ์ถ”๊ฐ€ํ•œ ๊ฒƒ์ด SPIDER์˜ ๊ธฐ์—ฌ์ž…๋‹ˆ๋‹ค.


Allegro Hand ์—ฐ๊ตฌ์ž๋ฅผ ์œ„ํ•œ ์‹œ์‚ฌ์ 

SPIDER๊ฐ€ Allegro Hand๋ฅผ ํฌํ•จํ•œ 5์ข…์˜ ๋กœ๋ด‡ ์†์—์„œ ํ‰๊ฐ€๋ฅผ ์ˆ˜ํ–‰ํ•œ ๋งŒํผ, ๋ช‡ ๊ฐ€์ง€ ์‹ค์งˆ์  ์‹œ์‚ฌ์ ์„ ์งš์–ด๋ณด๊ฒ ์Šต๋‹ˆ๋‹ค:

  1. ๋ฐ์ดํ„ฐ ์ƒ์„ฑ ํŒŒ์ดํ”„๋ผ์ธ์œผ๋กœ ํ™œ์šฉ: Allegro Hand ์—ฐ๊ตฌ์—์„œ ํ•™์Šต ๋ฐ์ดํ„ฐ ๋ถ€์กฑ ๋ฌธ์ œ๋ฅผ ๊ฒช๊ณ  ์žˆ๋‹ค๋ฉด, SPIDER๋ฅผ ํ†ตํ•ด ๊ณต๊ฐœ ์‚ฌ๋žŒ ์† ๋ฐ์ดํ„ฐ์…‹(OakInk, GigaHands)์œผ๋กœ๋ถ€ํ„ฐ Allegro ์ „์šฉ ๊ถค์  ๋ฐ์ดํ„ฐ๋ฅผ ๋Œ€๋Ÿ‰ ์ƒ์„ฑํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

  2. RL ํ•™์Šต ๊ฐ€์†: SPIDER ๊ถค์ ์„ ๋ช…๋ชฉ ์ œ์–ด๋กœ ์‚ฌ์šฉํ•œ ์ž”์ฐจ RL ํ•™์Šต์€ ๋ณด์ƒ ์„ค๊ณ„์™€ ์ปค๋ฆฌํ˜๋Ÿผ ์„ค๊ณ„์˜ ๋ถ€๋‹ด์„ ํฌ๊ฒŒ ์ค„์—ฌ์ค๋‹ˆ๋‹ค.

  3. Sim-to-Real ๊ฐ€๋Šฅ์„ฑ: ๊ฐ•๊ฑดํ™” ๋‹จ๊ณ„๋ฅผ ๊ฑฐ์นœ ๊ถค์ ์ด ์‹ค์ œ ๋กœ๋ด‡์—์„œ ์˜คํ”ˆ๋ฃจํ”„๋กœ ์‹คํ–‰ ๊ฐ€๋Šฅํ•˜๋‹ค๋Š” ๊ฒฐ๊ณผ๋Š”, ์‹ค์ „ ๋ฐฐํฌ๋ฅผ ๊ณ ๋ คํ•˜๋Š” ์—ฐ๊ตฌ์ž์—๊ฒŒ ์œ ์šฉํ•œ ๋ ˆํผ๋Ÿฐ์Šค์ž…๋‹ˆ๋‹ค.

  4. 16 DoF์˜ ์ด์ : Allegro Hand๋Š” 16 DoF๋กœ ํ‰๊ฐ€๋œ ์†๋“ค ์ค‘์—์„œ ๋†’์€ ์ถ•์— ์†ํ•˜๋ฉฐ, ์ด์— ๋”ฐ๋ผ ๋ฆฌํƒ€๊ฒŒํŒ… ์„ฑ๊ณต๋ฅ ๋„ ์–‘ํ˜ธํ•ฉ๋‹ˆ๋‹ค. ์ž์œ ๋„๊ฐ€ ์ ์€ ์†์—์„œ๋Š” ์„ฑ๋Šฅ์ด ์ œํ•œ๋  ์ˆ˜ ์žˆ์Œ์„ ๊ฐ์•ˆํ•ด์•ผ ํ•ฉ๋‹ˆ๋‹ค.


์š”์•ฝ ๋ฐ ๊ฒฐ๋ก 

SPIDER๋Š” โ€œ์‚ฌ๋žŒ์˜ ์†์ง“์„ ๋กœ๋ด‡์˜ ์†์ง“์œผ๋กœ ์–ด๋–ป๊ฒŒ ๋ฐ”๊ฟ€ ๊ฒƒ์ธ๊ฐ€?โ€๋ผ๋Š” ๊ทผ๋ณธ์ ์ธ ์งˆ๋ฌธ์— ๋Œ€ํ•ด, ๋ฌผ๋ฆฌ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๊ธฐ๋ฐ˜ ๋Œ€๊ทœ๋ชจ ์ƒ˜ํ”Œ๋ง์ด๋ผ๋Š” ๊น”๋”ํ•œ ๋‹ต์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.

ํ•ต์‹ฌ ๊ธฐ์—ฌ๋ฅผ ์„ธ ๊ฐ€์ง€๋กœ ์š”์•ฝํ•˜๋ฉด:

  1. ๊ฐ€์ƒ ์ ‘์ด‰ ๊ฐ€์ด๋“œ: ์‹คํ˜„ ๊ฐ€๋Šฅ ์˜์—ญ์„ ํ™•์žฅํ•˜์—ฌ ์˜ฌ๋ฐ”๋ฅธ ์ ‘์ด‰ ๋ชจ๋“œ๋กœ์˜ ์ˆ˜๋ ด์„ ์œ ๋„ํ•˜๋Š” ์ปค๋ฆฌํ˜๋Ÿผ ๊ธฐ๋ฒ•. ์„ฑ๊ณต๋ฅ  18% ํ–ฅ์ƒ.
  2. ํ™•์žฅ์„ฑ: 9์ข… ๋กœ๋ด‡ ร— 6๊ฐœ ๋ฐ์ดํ„ฐ์…‹์— ๊ฑธ์นœ ๋ฒ”์šฉ์„ฑ. RL ๋Œ€๋น„ 10๋ฐฐ ๋น ๋ฅธ ๊ถค์  ์ƒ์„ฑ. 2.4M ํ”„๋ ˆ์ž„ ๋ฐ์ดํ„ฐ์…‹ ์ƒ์„ฑ.
  3. ์‹ค์šฉ์„ฑ: ๋‹จ์ผ RGB ์นด๋ฉ”๋ผ๋ถ€ํ„ฐ ์‹ค์ œ ๋กœ๋ด‡ ๋ฐฐํฌ๊นŒ์ง€์˜ ์™„์ „ํ•œ ํŒŒ์ดํ”„๋ผ์ธ. ์ฝ”๋“œ ๊ณต๊ฐœ.

์—ด๋ฆฐ ์งˆ๋ฌธ๋“ค๋„ ๋‚จ์•„ ์žˆ์Šต๋‹ˆ๋‹ค. ๋ณต์žกํ•œ ์–‘์† ์กฐ์ž‘์—์„œ์˜ ์„ฑ๋Šฅ ๊ฐœ์„ , ํด๋กœ์ฆˆ๋“œ ๋ฃจํ”„ ์ •์ฑ…์œผ๋กœ์˜ ํ™•์žฅ, ์ž…๋ ฅ ๋ฐ์ดํ„ฐ ํ’ˆ์งˆ์— ๋Œ€ํ•œ ๊ฐ•๊ฑด์„ฑ ํ–ฅ์ƒ ๋“ฑ์€ ํ›„์† ์—ฐ๊ตฌ์˜ ๋ฐฉํ–ฅ์ด ๋  ๊ฒƒ์ž…๋‹ˆ๋‹ค. ๊ทธ๋Ÿผ์—๋„ SPIDER๋Š” โ€œ์ธํ„ฐ๋„ท ๊ทœ๋ชจ์˜ ์‚ฌ๋žŒ ๋ฐ์ดํ„ฐ๋ฅผ ๋กœ๋ด‡ ํ•™์Šต์— ํ™œ์šฉํ•˜๊ฒ ๋‹คโ€๋Š” ๋น„์ „์— ํ•œ ๊ฑธ์Œ ๋” ๊ฐ€๊นŒ์ด ๋‹ค๊ฐ€๊ฐ„ ์˜๋ฏธ ์žˆ๋Š” ์—ฐ๊ตฌ์ž…๋‹ˆ๋‹ค.

์‹œ๋ฎฌ๋ ˆ์ด์…˜ ์•ˆ์—์„œ ์ˆ˜์ฒœ ๋ฒˆ์˜ ์‹œ๋„๋ฅผ ํ†ตํ•ด ๋ฌผ๋ฆฌ ๋ฒ•์น™์— ๋งž๋Š” ๋‹ต์„ ์ฐพ์•„๊ฐ€๋Š” SPIDER์˜ ์ ‘๊ทผ์€, ๋งˆ์น˜ ์ž์—ฐ์ด ์ง„ํ™”๋ฅผ ํ†ตํ•ด ์ตœ์ ์˜ ํ•ด๋ฅผ ์ฐพ์•„๊ฐ€๋Š” ๊ณผ์ •๊ณผ ๋‹ฎ์•„ ์žˆ์Šต๋‹ˆ๋‹ค. ๋‹ค๋งŒ ์ž์—ฐ๋ณด๋‹ค ํ›จ์”ฌ ๋น ๋ฅด๊ฒŒ์š”.


์ฐธ๊ณ  ์ •๋ณด

  • ๋…ผ๋ฌธ: Chaoyi Pan, Changhao Wang, Haozhi Qi, Zixi Liu, Homanga Bharadhwaj, Akash Sharma, Tingfan Wu, Guanya Shi, Jitendra Malik, Francois Hogan. โ€œSPIDER: Scalable Physics-Informed Dexterous Retargeting.โ€ arXiv:2511.09484, 2025.
  • ์†Œ์†: FAIR at Meta, Carnegie Mellon University
  • ํ”„๋กœ์ ํŠธ ํŽ˜์ด์ง€: https://jc-bao.github.io/spider-project
  • ์ฝ”๋“œ: https://github.com/facebookresearch/spider
  • ์ง€์› ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ: MuJoCo Warp (๊ธฐ๋ณธ), Genesis
  • ๋‹ค์šด์ŠคํŠธ๋ฆผ ํ†ตํ•ฉ: HDMI (ํœด๋จธ๋…ธ์ด๋“œ), DexMachina (Dexterous hand RL)

Copyright 2026, JungYeon Lee