Curieux.JY
  • JungYeon Lee
  • Post
  • Projects
  • Note

On this page

  • ๐Ÿ” Ping Review
  • ๐Ÿ”” Ring Review
    • ์™œ ์ด ๋…ผ๋ฌธ์ด ์ค‘์š”ํ•œ๊ฐ€ โ€” ๋ฌธ์ œ์˜ ํ•ต์‹ฌ ํŒŒ์•…
    • ๋ฐฐ๊ฒฝ: Force Closure๋ž€ ๋ฌด์—‡์ด๊ณ , ์™œ ์ธก์ •ํ•˜๊ธฐ ์–ด๋ ค์šด๊ฐ€
      • Force Closure์˜ ๋ฌผ๋ฆฌ์  ์ง๊ด€
      • ์™œ ๊ธฐ์กด ๋ฐฉ๋ฒ•๋“ค์ด ๋‹ค์–‘์„ฑ(Diversity)์„ ์žƒ๋Š”๊ฐ€
    • ๋ฐฉ๋ฒ•๋ก : GraspQP์˜ ์•„ํ‚คํ…์ฒ˜์™€ ํ•ต์‹ฌ ๊ธฐ์—ฌ
      • ํ•ต์‹ฌ ๊ธฐ์—ฌ 1: Differentiable Force Closure Energy via QP
      • ํ•ต์‹ฌ ๊ธฐ์—ฌ 2: MALA* ์ตœ์ ํ™” ์•Œ๊ณ ๋ฆฌ์ฆ˜
      • ์ „์ฒด ์—๋„ˆ์ง€ ํ•จ์ˆ˜ ๊ตฌ์„ฑ
    • ๊ทธ๋ž˜์Šคํ”„ ํƒ€์ž…๊ณผ ๋ฐ์ดํ„ฐ์…‹ ๊ตฌ์„ฑ
      • ๋ฐ์ดํ„ฐ์…‹ ๊ทœ๋ชจ
    • ์‹คํ—˜: ์–ด๋–ป๊ฒŒ ํ‰๊ฐ€ํ–ˆ๋Š”๊ฐ€
      • ํ‰๊ฐ€ ์ง€ํ‘œ
      • ๋น„๊ต ๋Œ€์ƒ
      • ์ฃผ์š” ๊ฒฐ๊ณผ
      • Ablation Study
    • ๊ด€๋ จ ์—ฐ๊ตฌ์™€์˜ ๋น„๊ต
      • Liu et al. (RA-L 2021) ๊ณผ์˜ ๋น„๊ต
      • Graspโ€™d / Fast-Graspโ€™d ์™€์˜ ๋น„๊ต
      • DexEvolve์™€์˜ ์‹œ๋„ˆ์ง€
    • ๋น„ํŒ์  ๊ณ ์ฐฐ: ๊ฐ•์ ๊ณผ ํ•œ๊ณ„
      • ๊ฐ•์ 
      • ์•ฝ์  ๋ฐ ํ•œ๊ณ„
    • Allegro Hand ์—ฐ๊ตฌ์ž๋ฅผ ์œ„ํ•œ ํŠน๋ณ„ ์ฃผ๋ชฉ ํฌ์ธํŠธ
    • ์š”์•ฝ ๋ฐ ๊ฒฐ๋ก 
    • ์ฐธ๊ณ  ๋ฌธํ—Œ

๐Ÿ“ƒGraspQP ๋ฆฌ๋ทฐ

qp
grasp
force closure
diff-opt
Differentiable Optimization of Force Closure for Diverse and Robust Dexterous Grasping
Published

March 11, 2026

  • Paper Link
  • Project Link
  • Code Link
  1. ๐Ÿค– ๋ณธ ์—ฐ๊ตฌ๋Š” ๋‹ค์–‘ํ•˜๊ณ  ๋ฌผ๋ฆฌ์ ์œผ๋กœ ํƒ€๋‹นํ•œ ๋ฑ์Šคํ„ฐ๋Ÿฌ์Šค ๊ทธ๋ฆฝ์„ ์ƒ์„ฑํ•˜๊ธฐ ์œ„ํ•ด Quadratic Program (QP)์„ ํ†ตํ•ด ์•”๋ฌต์ ์œผ๋กœ ์ •์˜๋œ ์—„๊ฒฉํ•œ ๋ฏธ๋ถ„ ๊ฐ€๋Šฅํ•œ force closure energy ์ •์‹์„ ๋„์ž…ํ•ฉ๋‹ˆ๋‹ค.
  2. ๐Ÿ’ก ์ œ์•ˆํ•˜๋Š” MALA* ์ตœ์ ํ™” ๋ฐฉ๋ฒ•์€ ์—๋„ˆ์ง€ ๊ฐ’ ๋ถ„ํฌ๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ๊ทธ๋ผ๋””์–ธํŠธ ๋‹จ๊ณ„๋ฅผ ๋™์ ์œผ๋กœ ๊ฑฐ๋ถ€ํ•˜์—ฌ, ์ตœ์ ํ™” ์ค‘ ๋ฐœ์ƒํ•˜๋Š” ๋ชจ๋“œ ๋ถ•๊ดด๋ฅผ ์™„ํ™”ํ•˜๊ณ  ๋ณด๋‹ค ํญ๋„“์€ ๊ทธ๋ฆฝ ๋‹ค์–‘์„ฑ์„ ์ด‰์ง„ํ•ฉ๋‹ˆ๋‹ค.
  3. ๐Ÿš€ GraspQP๋Š” ๊ธฐ์กด ์ ‘๊ทผ ๋ฐฉ์‹๋ณด๋‹ค ๊ทธ๋ฆฝ ๋‹ค์–‘์„ฑ๊ณผ ์˜ˆ์ธก ์•ˆ์ •์„ฑ์—์„œ ํฌ๊ฒŒ ํ–ฅ์ƒ๋œ ์„ฑ๋Šฅ์„ ๋ณด์—ฌ์ฃผ๋ฉฐ, ๋‹ค์–‘ํ•œ ๊ทธ๋ฆฌํผ์™€ ๊ทธ๋ฆฝ ์œ ํ˜•์„ ํฌํ•จํ•˜๋Š” 5,700๊ฐœ ๊ฐ์ฒด์— ๋Œ€ํ•œ ๋Œ€๊ทœ๋ชจ ๊ทธ๋ฆฝ ๋ฐ์ดํ„ฐ์…‹์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ” Ping Review

๐Ÿ” Ping โ€” A light tap on the surface. Get the gist in seconds.

GraspQP ๋…ผ๋ฌธ์€ Dexterous Grasping ๋ถ„์•ผ์—์„œ ๋‹ค์–‘ํ•˜๊ณ  ๊ฒฌ๊ณ ํ•œ ๊ทธ๋žฉ(grasp)์„ ๋Œ€๊ทœ๋ชจ๋กœ ์ƒ์„ฑํ•˜๊ธฐ ์œ„ํ•œ ์ƒˆ๋กœ์šด ๋ฐฉ๋ฒ•๋ก ์„ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. ๊ธฐ์กด์˜ ๊ทธ๋žฉ ๋ฐ์ดํ„ฐ์…‹ ์ƒ์„ฑ ๋ฐฉ๋ฒ•๋“ค์€ ์ƒ˜ํ”Œ๋ง ๊ธฐ๋ฐ˜ ์•Œ๊ณ ๋ฆฌ์ฆ˜์ด๋‚˜ ๋‹จ์ˆœํ™”๋œ Force Closure ๋ถ„์„์— ์˜์กดํ•˜์—ฌ, ์ฃผ๋กœ ํŒŒ์›Œ ๊ทธ๋žฉ(power grasp)์— ์ˆ˜๋ ดํ•˜๊ณ  ๋‹ค์–‘์„ฑ์ด ๋ถ€์กฑํ•˜๋‹ค๋Š” ํ•œ๊ณ„๊ฐ€ ์žˆ์—ˆ์Šต๋‹ˆ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์€ ์ด๋Ÿฌํ•œ ๋ฌธ์ œ์ ์„ ํ•ด๊ฒฐํ•˜๊ณ ์ž, Rigorousํ•˜๊ณ  ๋ฏธ๋ถ„ ๊ฐ€๋Šฅํ•œ(differentiable) Force Closure ์—๋„ˆ์ง€ ํ•จ์ˆ˜์™€ ๊ฐœ์„ ๋œ ์ตœ์ ํ™” ๊ธฐ๋ฒ•์„ ๋„์ž…ํ•˜์—ฌ ์ •๊ตํ•œ ํ•€์น˜(pinch) ๋ฐ ์“ฐ๋ฆฌํ•‘๊ฑฐ ํ”„๋ฆฌ์‹œ์ „(tri-finger precision) ๊ทธ๋žฉ์„ ํฌํ•จํ•˜๋Š” ๋‹ค์–‘ํ•œ ๊ทธ๋žฉ ์ƒ์„ฑ์„ ๋ชฉํ‘œ๋กœ ํ•ฉ๋‹ˆ๋‹ค.

ํ•ต์‹ฌ ๋ฐฉ๋ฒ•๋ก  (Core Methodology)

๋ณธ ๋…ผ๋ฌธ์˜ ํ•ต์‹ฌ์€ ๊ทธ๋žฉ์˜ ์•ˆ์ •์„ฑ๊ณผ ๋‹ค์–‘์„ฑ์„ ๋™์‹œ์— ๊ณ ๋ คํ•˜๋Š” ์ตœ์ ํ™” ํ”„๋ ˆ์ž„์›Œํฌ์— ์žˆ์Šต๋‹ˆ๋‹ค. ๊ทธ๋žฉ ์ตœ์ ํ™”๋Š” ๋‹ค์Œ์˜ ์—๋„ˆ์ง€ ํ•จ์ˆ˜๋ฅผ ์ตœ์†Œํ™”ํ•˜๋Š” ๋ฐฉ์‹์œผ๋กœ ์ง„ํ–‰๋ฉ๋‹ˆ๋‹ค: E = E_{FC} + w_{dis}E_{dis} + w_{reg} E_{reg} ์—ฌ๊ธฐ์„œ E_{FC}๋Š” Force Closure Metric, E_{dis}๋Š” ์ ‘์ด‰์ (contact point)์ด ๊ฐ์ฒด ํ‘œ๋ฉด์— ์–ผ๋งˆ๋‚˜ ๊ทผ์ ‘ํ–ˆ๋Š”์ง€๋ฅผ ๋‚˜ํƒ€๋‚ด๋Š” ๊ฑฐ๋ฆฌ ํ•ญ, ๊ทธ๋ฆฌ๊ณ  E_{reg}๋Š” ๊ฐ์ฒด ๊ด€ํ†ต, ์ž๊ธฐ ๊ฐ„์„ญ, ์กฐ์ธํŠธ(joint) ํ•œ๊ณ„ ๋“ฑ์„ ์ œ์–ดํ•˜๋Š” ์ •๊ทœํ™”(regularization) ํ•ญ์ž…๋‹ˆ๋‹ค.

  1. ์—„๊ฒฉํ•˜๊ณ  ๋ฏธ๋ถ„ ๊ฐ€๋Šฅํ•œ Force Closure Metric (E_{FC}): ๊ธฐ์กด DexGraspNet๊ณผ ๊ฐ™์€ ๋ฐฉ์‹์€ ๋งˆ์ฐฐ์„ ๋ฌด์‹œํ•˜๊ฑฐ๋‚˜, Theorem 3.1-(ii)์˜ \sum \alpha_i v_i = 0 ์กฐ๊ฑด์„ \alpha_i=1๋กœ ๋‹จ์ˆœํ™”ํ•˜์—ฌ Force Closure๊ฐ€ ์•„๋‹Œ Form Closure์— ๊ฐ€๊น๊ฒŒ ๋ชจ๋ธ๋งํ–ˆ์Šต๋‹ˆ๋‹ค. ์ด๋Š” ํŠน์ • \alpha_i ๊ฐ’์ด ์ž„์˜๋กœ ์ž‘์•„์งˆ ๊ฒฝ์šฐ ๊ธฐ์šธ๊ธฐ ์†Œ์‹ค(vanishing gradients)์„ ์œ ๋ฐœํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์€ ์ด๋Ÿฌํ•œ ํ•œ๊ณ„๋ฅผ ๊ทน๋ณตํ•˜๊ธฐ ์œ„ํ•ด Theorem 3.1-(iii)์— ๊ธฐ๋ฐ˜ํ•œ ๋” ์—„๊ฒฉํ•œ Force Closure Metric์„ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. ์ด Metric์€ ๊ฐ ์ ‘์ด‰์ ์—์„œ์˜ ์ƒํ˜ธ์ž‘์šฉ ํž˜์„ ๋‚˜ํƒ€๋‚ด๋Š” ๊ณ„์ˆ˜ \hat{\gamma}_i๋ฅผ ๋„์ž…ํ•˜์—ฌ ํ˜„์‹ค์ ์ธ ํž˜์˜ ๊ฒฝ๊ณ„๋ฅผ ์„ค์ •ํ•ฉ๋‹ˆ๋‹ค: E_{FC} = || \sum_{i \le |C'|}\hat{\gamma}_i w_i ||^2 \text{ s.t. } u \ge \hat{\gamma}_i \ge 1 ์—ฌ๊ธฐ์„œ \hat{\gamma}_i๋Š” i-๋ฒˆ์งธ ์ ‘์ด‰์ ์—์„œ์˜ ์ƒํ˜ธ์ž‘์šฉ ํž˜์„ ๋‚˜ํƒ€๋‚ด๋Š” ๊ณ„์ˆ˜์ด๋ฉฐ, u๋Š” ์ƒํ•œ์„ ์˜๋ฏธํ•˜๊ณ  w_i๋Š” i-๋ฒˆ์งธ ์ ‘์ด‰์ ์˜ Wrench(ํž˜๊ณผ ํ† ํฌ)์ž…๋‹ˆ๋‹ค. ์ด ๊ณต์‹์€ Torque Limits๋ฅผ ๊ฐ€์ง„ ์‹ค์ œ ๋กœ๋ด‡์— ํ•„์š”ํ•œ Bounded Interaction Forces๋ฅผ ๋ณด์žฅํ•ฉ๋‹ˆ๋‹ค.

    ์ด๋Ÿฌํ•œ ์ œ์•ฝ ์กฐ๊ฑด์ด ์žˆ๋Š” ์ตœ์ ํ™” ๋ฌธ์ œ๋ฅผ ๋ฏธ๋ถ„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ๋งŒ๋“ค๊ธฐ ์œ„ํ•ด, ๋ณธ ๋…ผ๋ฌธ์€ ์ด๋ฅผ ์ด์ฐจ ๊ณ„ํš๋ฒ•(Quadratic Program, QP)์œผ๋กœ ๊ณต์‹ํ™”ํ•˜๊ณ  KKT(Karushโ€“Kuhnโ€“Tucker) ์กฐ๊ฑด์„ ๋ฏธ๋ถ„ํ•˜์—ฌ Gradient๋ฅผ ๊ณ„์‚ฐํ•ฉ๋‹ˆ๋‹ค. QP ๊ณต์‹์€ ๋‹ค์Œ๊ณผ ๊ฐ™์Šต๋‹ˆ๋‹ค: \min_z \frac{1}{2} z^T H z + g^T z \text{ s.t. } A z \ge b ์—ฌ๊ธฐ์„œ H = W_{FC}^T W_{FC}, g = 0, b = [1_{N_c}; u \cdot 1_{N_c}], z = [\hat{\gamma}_1, \dots, \hat{\gamma}_{N_c}] ์ด๊ณ , A = \text{diag}(1_{N_c \times N_c}, -1_{N_c \times N_c})๋Š” \hat{\gamma}_i์˜ ํ•˜ํ•œ(\ge 1) ๋ฐ ์ƒํ•œ(\le u) ์ œ์•ฝ์„ ์ธ์ฝ”๋”ฉํ•ฉ๋‹ˆ๋‹ค. W_{FC}๋Š” ๋งˆ์ฐฐ ํŠน์„ฑ์„ ๋ฐ˜์˜ํ•˜๋Š” Contact Wrench Matrix์ž…๋‹ˆ๋‹ค.

    ๋˜ํ•œ, Force Closure๊ฐ€ Wrench Space๊ฐ€ \mathbb{R}^6๋ฅผ ์„ ํ˜•์ ์œผ๋กœ ์ŠคํŒฌ(span)ํ•˜๋Š” ๊ฒƒ์„ ์˜๋ฏธํ•˜๋ฏ€๋กœ, Wrench Matrix W_{FC}์˜ Full Rank๋ฅผ ๋ณด์žฅํ•˜๊ณ  Wrench Space Volume์„ ๊ทน๋Œ€ํ™”ํ•˜๊ธฐ ์œ„ํ•ด Singular Value๋“ค์˜ ํ•ฉ์„ ์ด์šฉํ•œ ํ•ญ e^{-\sum_i \sigma_i(W_{FC})}๋ฅผ E_{FC}์— ๊ณฑํ•ด์ค๋‹ˆ๋‹ค. ์ตœ์ข… E_{FC}๋Š” ๋‹ค์Œ๊ณผ ๊ฐ™์Šต๋‹ˆ๋‹ค: E_{FC} = || \sum_{i \le |C'|}\hat{\gamma}_i w_i ||^2 \cdot e^{-\sum_i \sigma_i(W_{FC})} \text{ s.t. } \hat{\gamma}_i \ge 1

  2. MALA* ์ตœ์ ํ™” ๊ธฐ๋ฒ• (MALA* Optimization Strategy): ๊ธฐ์กด ์ตœ์ ํ™” ๊ณผ์ •์—์„œ ๊ทธ๋žฉ ์ œ์•ˆ๋“ค์ด ๋กœ์ปฌ ๋ฏธ๋‹ˆ๋งˆ(local minima)์— ๊ฐ‡ํžˆ๊ฑฐ๋‚˜ ๋ชจ๋“œ ๋ถ•๊ดด(mode collapse)๋ฅผ ๊ฒช๋Š” ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด, Metropolis-Adjusted Langevin Algorithm (MALA)์— ๊ธฐ๋ฐ˜ํ•œ MALA๋ฅผ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. MALA๋Š” ๊ทธ๋žฉ ๋ถ„ํฌ์˜ ํ˜„์žฌ ์—๋„ˆ์ง€ ๊ฐ’์— ๋Œ€ํ•œ ์ •๋ณด๋ฅผ ํ™œ์šฉํ•˜์—ฌ ์ตœ์ ํ™” ๊ณผ์ •์„ ๋™์ ์œผ๋กœ ์กฐ์ •ํ•ฉ๋‹ˆ๋‹ค.

    • Dynamic Resetting: ํŠน์ • ๊ทธ๋žฉ์ด ์ „์ฒด ๊ทธ๋žฉ ๋ถ„ํฌ ๋Œ€๋น„ ํ˜„์ €ํžˆ ๋‚ฎ์€ ์„ฑ๋Šฅ์„ ๋ณด์ผ ๊ฒฝ์šฐ(์ฆ‰, ์—๋„ˆ์ง€ ๊ฐ’์ด ์—๋„ˆ์ง€ ๋ถ„ํฌ N_E(\mu, \sigma)์˜ ๊ฐ€์žฅ ๋‚ฎ์€ ๋ถ„์œ„์ˆ˜(p_{th})์— ์†ํ•  ๊ฒฝ์šฐ), ํ•ด๋‹น ๊ทธ๋žฉ์˜ ์ตœ์ ํ™” ์ƒํƒœ๋ฅผ ์žฌ์ดˆ๊ธฐํ™”ํ•˜์—ฌ ๋กœ์ปฌ ๋ฏธ๋‹ˆ๋งˆ์—์„œ ๋ฒ—์–ด๋‚˜๋„๋ก ์œ ๋„ํ•ฉ๋‹ˆ๋‹ค.
    • Adaptive Temperature Scaling: Metropolis-Hastings ์ˆ˜์šฉ ์กฐ๊ฑด p \sim e^{-\Delta E / T_i}์—์„œ ์˜จ๋„ T_i๋ฅผ ๋™์ ์œผ๋กœ ์กฐ์ ˆํ•ฉ๋‹ˆ๋‹ค. ๊ทธ๋žฉ i์˜ ์—๋„ˆ์ง€ E_i๊ฐ€ ์ „์ฒด ๋ถ„ํฌ์—์„œ ๋‚˜์ ์ˆ˜๋ก T_i๋ฅผ ๋†’์—ฌ, ์ƒˆ๋กœ์šด ๊ทธ๋ž˜๋””์–ธํŠธ(gradient) ์Šคํ…์„ ์ˆ˜์šฉํ•  ํ™•๋ฅ ์„ ์ฆ๊ฐ€์‹œํ‚ด์œผ๋กœ์จ ํƒ์ƒ‰(exploration)์„ ์žฅ๋ คํ•ฉ๋‹ˆ๋‹ค: T_i = T \cdot (1 + \Phi_E(E_i)), ์—ฌ๊ธฐ์„œ \Phi_E๋Š” ๋ˆ„์  ๋ถ„ํฌ ํ•จ์ˆ˜์ž…๋‹ˆ๋‹ค.

์‹คํ—˜ ๋ฐ ๊ฒฐ๊ณผ (Experiments and Results)

๋ณธ ๋…ผ๋ฌธ์€ Isaac Lab ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ํ™˜๊ฒฝ์—์„œ 5,700๊ฐœ์˜ ๊ฐ์ฒด์™€ 5๊ฐ€์ง€ ๋‹ค๋ฅธ Gripper (Psyonic Ability Hand, Shadow Hand, Allegro Hand, Robotiq2f140, Robotiq3F)๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ์ œ์•ˆ๋œ ๋ฐฉ๋ฒ•๋ก ์„ ํ‰๊ฐ€ํ•ฉ๋‹ˆ๋‹ค. ์ฃผ์š” ํ‰๊ฐ€ ์ง€ํ‘œ๋Š” Unique Grasp Rate (UGR)์™€ Entropy (H)์ž…๋‹ˆ๋‹ค. UGR์€ ์ƒ์„ฑ๋œ ๊ณ ์œ ํ•˜๊ณ  ์•ˆ์ •์ ์ธ ๊ทธ๋žฉ์˜ ๋น„์œจ์„ ๋‚˜ํƒ€๋‚ด๋ฉฐ, H๋Š” ๊ทธ๋žฉ ๊ตฌ์„ฑ์˜ ๋‹ค์–‘์„ฑ์„ ์ธก์ •ํ•ฉ๋‹ˆ๋‹ค. ๊ทธ๋žฉ์˜ ์•ˆ์ •์„ฑ์€ ๊ฐ์ฒด์— 6๊ฐ€์ง€ ๋ฐฉํ–ฅ์œผ๋กœ ํž˜์„ ๊ฐ€ํ–ˆ์„ ๋•Œ ๊ฐ์ฒด์˜ CoM(Center of Mass)์ด 3cm ๋ฐ˜๊ฒฝ ๋‚ด์— ์œ ์ง€๋˜๋Š”์ง€๋ฅผ ๊ธฐ์ค€์œผ๋กœ ํ‰๊ฐ€ํ•ฉ๋‹ˆ๋‹ค.

๊ฒฐ๊ณผ์ ์œผ๋กœ GraspQP๋Š” ๊ธฐ์กด์˜ DexGraspNet, GenDexGrasp, TDG, MultiGripperDataset ๋“ฑ ๋ชจ๋“  Baseline ๋ฐฉ๋ฒ•๋ก ๋“ค์„ UGR๊ณผ H ์ธก๋ฉด์—์„œ ์ผ๊ด€์ ์œผ๋กœ ๋Šฅ๊ฐ€ํ•˜๋Š” ์„ฑ๋Šฅ์„ ๋ณด์˜€์Šต๋‹ˆ๋‹ค. ํŠนํžˆ, ์ ‘์ด‰์  ์ˆ˜๊ฐ€ ๋งŽ์•„์งˆ์ˆ˜๋ก ์„ฑ๋Šฅ ํ–ฅ์ƒ ํญ์ด ๋” ๋‘๋“œ๋Ÿฌ์กŒ์œผ๋ฉฐ, ์ด๋Š” ๋ณธ ๋ฐฉ๋ฒ•๋ก ์ด ๋ณต์žกํ•œ ๋‹ค์ง€(multi-fingered) ํ•ธ๋“œ์— ๋” ํšจ๊ณผ์ ์ž„์„ ์‹œ์‚ฌํ•ฉ๋‹ˆ๋‹ค. MALA* ์ตœ์ ํ™” ๊ธฐ๋ฒ• ๋˜ํ•œ ๊ทธ๋žฉ์˜ UGR๊ณผ H๋ฅผ ํ–ฅ์ƒ์‹œํ‚ค๋Š” ๋ฐ ๊ธฐ์—ฌํ•จ์„ ํ™•์ธํ–ˆ์Šต๋‹ˆ๋‹ค. Ablation ์—ฐ๊ตฌ๋ฅผ ํ†ตํ•ด MALA*์˜ Dynamic Resetting๊ณผ Adaptive Temperature Scaling์ด ๋‹ค์–‘์„ฑ ๋ฐ ์•ˆ์ •์„ฑ ํ–ฅ์ƒ์— ์ค‘์š”ํ•œ ์—ญํ• ์„ ํ•˜๋ฉฐ, ์ œ์•ˆ๋œ QP ๊ธฐ๋ฐ˜์˜ ์—„๊ฒฉํ•œ Force Closure Formulation์ด ์„ฑ๋Šฅ์— ํ•„์ˆ˜์ ์ž„์„ ์ž…์ฆํ–ˆ์Šต๋‹ˆ๋‹ค.

๊ณ„์‚ฐ ๋น„์šฉ ์ธก๋ฉด์—์„œ๋Š” ๊ธฐ์กด Baseline๋ณด๋‹ค 1.5~3๋ฐฐ ๋А๋ฆฌ์ง€๋งŒ (Shadow Hand์˜ ๊ฒฝ์šฐ ๊ทธ๋žฉ๋‹น 3.4์ดˆ vs 1.15์ดˆ), ์ด๋Š” ์˜คํ”„๋ผ์ธ ๋ฐ์ดํ„ฐ์…‹ ์ƒ์„ฑ์—๋Š” ํ—ˆ์šฉ ๊ฐ€๋Šฅํ•œ ์ˆ˜์ค€์œผ๋กœ ๊ฐ„์ฃผ๋ฉ๋‹ˆ๋‹ค. ๋˜ํ•œ, ๋” ์ ์€ ์ดˆ๊ธฐ ์‹œ๋“œ(seed) ์ˆ˜๋กœ๋„ ๋” ๋งŽ์€ ๊ณ ์œ  ๊ทธ๋žฉ์„ ๋‹ฌ์„ฑํ•˜๋ฉฐ ๋‹ค์–‘์„ฑ ์ธก๋ฉด์—์„œ ํšจ์œจ์ ์ž„์„ ๋ณด์—ฌ์ฃผ์—ˆ์Šต๋‹ˆ๋‹ค.

๊ฒฐ๋ก  ๋ฐ ํ•œ๊ณ„ (Conclusion and Limitations)

GraspQP๋Š” ์—„๊ฒฉํ•œ ๋ฏธ๋ถ„ ๊ฐ€๋Šฅํ•œ Force Closure ์—๋„ˆ์ง€ ๊ณต์‹๊ณผ MALA* ์ตœ์ ํ™” ์ „๋žต์„ ํ†ตํ•ด ๋‹ค์–‘ํ•œ ๊ฐ์ฒด์™€ ๋กœ๋ด‡ ๊ทธ๋ฆฌํผ์— ๋Œ€ํ•ด ๋‹ค์–‘ํ•˜๊ณ  ์•ˆ์ •์ ์ธ ๊ทธ๋žฉ์„ ํšจ๊ณผ์ ์œผ๋กœ ์ƒ์„ฑํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค. ์ด ๋ฐฉ๋ฒ•๋ก ์€ ๊ธฐ์กด ์ ‘๊ทผ ๋ฐฉ์‹๋ณด๋‹ค ๊ทธ๋žฉ์˜ ์•ˆ์ •์„ฑ๊ณผ ์—”ํŠธ๋กœํ”ผ(๋‹ค์–‘์„ฑ) ๋ชจ๋‘์—์„œ ์šฐ์ˆ˜ํ•œ ์„ฑ๋Šฅ์„ ๋ณด์—ฌ์ฃผ๋ฉฐ, ๋Œ€๊ทœ๋ชจ์˜ ๊ณ ํ’ˆ์งˆ ๊ทธ๋žฉ ๋ฐ์ดํ„ฐ์…‹ ์ƒ์„ฑ์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•ฉ๋‹ˆ๋‹ค.

ํ•˜์ง€๋งŒ ๋‹ค์Œ๊ณผ ๊ฐ™์€ ํ•œ๊ณ„์ ๋„ ์กด์žฌํ•ฉ๋‹ˆ๋‹ค.

  1. ๊ณ„์‚ฐ ๋ณต์žก์„ฑ: ์ตœ์ ํ™” ๊ธฐ๋ฐ˜ ๊ณต์‹์€ ๊ธฐ์กด ๋ฐฉ๋ฒ•๋ณด๋‹ค ๊ณ„์‚ฐ ๋น„์šฉ์ด ๋†’์•„ ์‹ค์‹œ๊ฐ„ ์• ํ”Œ๋ฆฌ์ผ€์ด์…˜์ด๋‚˜ ๊ฐ•ํ™” ํ•™์Šต(Reinforcement Learning, RL)์˜ ๋ณด์ƒ ํ•จ์ˆ˜๋กœ ์ง์ ‘ ์‚ฌ์šฉํ•˜๊ธฐ์—๋Š” ์ œํ•œ์ ์ผ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. (ํ–ฅํ›„ ADMM Solver๋ฅผ ์ด์šฉํ•œ GPU ๊ฐ€์†ํ™”๊ฐ€ ์ž ์žฌ์  ํ•ด๊ฒฐ์ฑ…์œผ๋กœ ์–ธ๊ธ‰).
  2. ๋ชจ๋“œ ๋ถ•๊ดด ๋ฌธ์ œ: ์—ฌ๋Ÿฌ ๊ทธ๋žฉ์ด ์œ ์‚ฌํ•œ ํ˜•ํƒœ๋กœ ์ˆ˜๋ ดํ•˜๋Š” ๋ชจ๋“œ ๋ถ•๊ดด๊ฐ€ ์—ฌ์ „ํžˆ ๋ฐœ์ƒํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ๋ฐ€๋„ ๊ธฐ๋ฐ˜ ๋ฐ˜๋ฐœ๋ ฅ(density-based repulsion force)๊ณผ ๊ฐ™์€ ์‹œ๋„์—๋„ ๋ถˆ๊ตฌํ•˜๊ณ  ํ•ด๊ฒฐ์ด ๋” ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.
  3. ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ํ™˜๊ฒฝ์˜ ์ œ์•ฝ: Isaac Sim ๋ฌผ๋ฆฌ ์—”์ง„์—์„œ ๊ฐ„ํ—์ ์œผ๋กœ ๋ฐœ์ƒํ•˜๋Š” ์†๊ฐ€๋ฝ ๋๊ณผ ๊ฐ์ฒด ํ‘œ๋ฉด์˜ ๊ด€ํ†ต ํ˜„์ƒ์€ ๊ทธ๋žฉ ํ‰๊ฐ€์˜ False Positive๋ฅผ ์œ ๋ฐœํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

ํ–ฅํ›„ ์—ฐ๊ตฌ๋Š” ์ด ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ๋™์  ์กฐ์ž‘(dynamic manipulation) ์‹œ๋‚˜๋ฆฌ์˜ค๋กœ ํ™•์žฅํ•˜๊ณ , ์ƒ์„ฑ๋œ ๋ฐ์ดํ„ฐ์…‹์„ ๊ธฐ๋ฐ˜์œผ๋กœ ๋”ฅ๋Ÿฌ๋‹ ๋ชจ๋ธ์„ ํ›ˆ๋ จํ•˜๋Š” ๊ฒƒ์„ ํฌํ•จํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ”” Ring Review

๐Ÿ”” Ring โ€” An idea that echoes. Grasp the core and its value.

์™œ ์ด ๋…ผ๋ฌธ์ด ์ค‘์š”ํ•œ๊ฐ€ โ€” ๋ฌธ์ œ์˜ ํ•ต์‹ฌ ํŒŒ์•…

๋กœ๋ด‡ ์†์ด ๋ฌผ๊ฑด์„ ์žก๋Š”๋‹ค๋Š” ๊ฑด ๋‹จ์ˆœํ•ด ๋ณด์ด์ง€๋งŒ, ์‚ฌ์‹ค ๊ทธ ์†์—๋Š” ์ˆ˜์‹ญ ๋…„๊ฐ„ ํ•ด๊ฒฐ๋˜์ง€ ์•Š์€ ๋ฌธ์ œ๋“ค์ด ๋’ค์—‰์ผœ ์žˆ๋‹ค. ํŠนํžˆ ๋‹ค์ง€ ์†(dexterous hand) ์„ ๋‹ค๋ฃจ๋Š” ์—ฐ๊ตฌ์ž๋“ค์ด ๊ณตํ†ต์ ์œผ๋กœ ๋งž๋‹ฅ๋œจ๋ฆฌ๋Š” ๋ฒฝ์ด ์žˆ๋Š”๋ฐ, ๋ฐ”๋กœ โ€œ๋‹ค์–‘ํ•˜๊ณ (diverse) ๋ฌผ๋ฆฌ์ ์œผ๋กœ ์•ˆ์ •์ ์ธ(physically stable) ๊ทธ๋ž˜์Šคํ”„ ๋ฐ์ดํ„ฐ๋ฅผ ์–ด๋–ป๊ฒŒ ๋Œ€๊ทœ๋ชจ๋กœ ์ƒ์„ฑํ•˜๋А๋ƒโ€์˜ ๋ฌธ์ œ๋‹ค.

์ƒ๊ฐํ•ด๋ณด์ž. ๋‹น์‹ ์ด ๋ณผํŽœ์„ ์ง‘์„ ๋•Œ, ์–ด๋–ป๊ฒŒ ์žก์„์ง€๋Š” ์ƒํ™ฉ๋งˆ๋‹ค ๋‹ค๋ฅด๋‹ค. ๊ธ€์„ ์“ธ ๋•Œ๋Š” ์„ธ ์†๊ฐ€๋ฝ์œผ๋กœ ์ •๋ฐ€ํ•˜๊ฒŒ ์ง‘๊ณ , ๋ฉ€๋ฆฌ ๋˜์ง€๋ ค๋ฉด ์ฃผ๋จน์œผ๋กœ ๊ฐ์‹ธ ์ฅ”๋‹ค. ์ด ์ฐจ์ด๊ฐ€ ๋ฐ”๋กœ robotics์—์„œ ๋งํ•˜๋Š” precision grasp(์ •๋ฐ€ ๊ทธ๋ž˜์Šคํ”„) ์™€ power grasp(ํŒŒ์›Œ ๊ทธ๋ž˜์Šคํ”„) ์˜ ์ฐจ์ด๋‹ค. ์ง€๊ธˆ๊นŒ์ง€ ๋Œ€๋ถ€๋ถ„์˜ ๋ฐ์ดํ„ฐ์…‹ ์ƒ์„ฑ ๋ฐฉ๋ฒ•๋“ค์€ ํŒŒ์›Œ ๊ทธ๋ž˜์Šคํ”„์— ํŽธํ–ฅ๋˜์–ด ์žˆ์—ˆ๋‹ค โ€” ์ด์œ ๊ฐ€ ์žˆ๋‹ค. ํŒŒ์›Œ ๊ทธ๋ž˜์Šคํ”„๋Š” ์ฐพ๊ธฐ ์‰ฝ๋‹ค. ๋งŽ์€ ์ ‘์ด‰์ ์ด ๋ฌผ์ฒด๋ฅผ ๊ฐ์‹ธ๋ฉด ๋Œ€์ถฉ ์žก์•„๋„ ์•ˆ์ •์ ์ด๋‹ˆ๊นŒ.

์ด ๋…ผ๋ฌธ, GraspQP (Renรฉ Zurbrรผgg, Andrei Cramariuc, Marco Hutter / ETH Zรผrich, CoRL 2025)๋Š” ์ด ํŽธํ–ฅ ๋ฌธ์ œ๋ฅผ ์ •๋ฉด์œผ๋กœ ๋ŒํŒŒํ•œ๋‹ค. ํ•ต์‹ฌ ์•„์ด๋””์–ด๋ฅผ ํ•œ ๋ฌธ์žฅ์œผ๋กœ ์š”์•ฝํ•˜๋ฉด:

Force closure ์กฐ๊ฑด์„ Quadratic Program(QP)์œผ๋กœ ์ˆ˜์‹ํ™”ํ•˜๊ณ , ์ด QP๋ฅผ ๋ฏธ๋ถ„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ๋งŒ๋“ค์–ด gradient-based ์ตœ์ ํ™”๋กœ ๋‹ค์–‘ํ•˜๊ณ  ๋ฌผ๋ฆฌ์ ์œผ๋กœ ํƒ€๋‹นํ•œ ๊ทธ๋ž˜์Šคํ”„๋ฅผ ์ƒ์„ฑํ•œ๋‹ค.

์ด๊ฒŒ ์™œ ์ƒˆ๋กœ์šด๊ฐ€๋ฅผ ์ดํ•ดํ•˜๋ ค๋ฉด, ๋จผ์ € ๊ธฐ์กด ๋ฐฉ๋ฒ•๋“ค์ด ์™œ ์‹คํŒจํ–ˆ๋Š”์ง€๋ฅผ ์•Œ์•„์•ผ ํ•œ๋‹ค.


๋ฐฐ๊ฒฝ: Force Closure๋ž€ ๋ฌด์—‡์ด๊ณ , ์™œ ์ธก์ •ํ•˜๊ธฐ ์–ด๋ ค์šด๊ฐ€

Force Closure์˜ ๋ฌผ๋ฆฌ์  ์ง๊ด€

๋กœ๋ด‡์ด ๋ฌผ์ฒด๋ฅผ โ€œ์•ˆ์ „ํ•˜๊ฒŒโ€ ์žก์•˜๋‹ค๋Š” ๊ฑด ๋ฌด์Šจ ์˜๋ฏธ์ผ๊นŒ? ๊ฐ€์žฅ ์—„๋ฐ€ํ•œ ์ •์˜๋Š” force closure ๋‹ค โ€” ์™ธ๋ถ€์—์„œ ์–ด๋–ค ๋ฐฉํ–ฅ์œผ๋กœ ํž˜์ด ๊ฐ€ํ•ด์ ธ๋„, ์ ‘์ด‰์ ๋“ค์ด ์ƒ์„ฑํ•˜๋Š” ๋งˆ์ฐฐ๋ ฅ๊ณผ ๋ฒ•์„ ๋ ฅ์˜ ์กฐํ•ฉ์œผ๋กœ ๊ทธ ํž˜์„ ์ƒ์‡„ํ•  ์ˆ˜ ์žˆ๋Š” ์ƒํƒœ๋ฅผ ๋งํ•œ๋‹ค.

์ˆ˜ํ•™์ ์œผ๋กœ๋Š” ๋‹ค์Œ๊ณผ ๊ฐ™์ด ํ‘œํ˜„๋œ๋‹ค:

\text{A grasp is force-closure if} \quad \forall \mathbf{f}_0 \in \mathbb{R}^6, \exists \mathbf{x} \in \mathcal{FC} \text{ s.t. } \mathbf{G}\mathbf{x} = \mathbf{f}_0

์—ฌ๊ธฐ์„œ: - \mathbf{G} \in \mathbb{R}^{6 \times m} ๋Š” Grasp Matrix (๊ฐ ์ ‘์ด‰์ ์˜ wrench๋ฅผ ๋ฌผ์ฒด ์ค‘์‹ฌ ์ขŒํ‘œ๊ณ„๋กœ ๋งคํ•‘)
- \mathbf{x} \in \mathcal{FC} ๋Š” ๋งˆ์ฐฐ ์›๋ฟ”(friction cone) ๋‚ด์˜ ์ ‘์ด‰๋ ฅ ๋ฒกํ„ฐ
- m ์€ ์ „์ฒด ์ ‘์ด‰๋ ฅ ์ž์œ ๋„

์ง๊ด€์ ์œผ๋กœ ๋งํ•˜๋ฉด: grasp matrix์˜ ์—ด๋ฒกํ„ฐ๋“ค์ด ์ƒ์„ฑํ•˜๋Š” wrench space๊ฐ€ \mathbb{R}^6 ์ „์ฒด๋ฅผ ๋ฎ์„ ์ˆ˜ ์žˆ์–ด์•ผ ํ•œ๋‹ค. ์ฆ‰, Wrench Space๊ฐ€ ์›์ ์„ ๋‚ด๋ถ€์— ํฌํ•จํ•˜๋Š” ๋ณผ๋ก ๋‹ค๋ฉด์ฒด๋ฅผ ํ˜•์„ฑํ•ด์•ผ ํ•œ๋‹ค.

Force Closure ์กฐ๊ฑด ์‹œ๊ฐํ™” (2D ๋‹จ์ˆœํ™”):

  ์ ‘์ด‰์  1 (์ขŒ์ธก)          ์ ‘์ด‰์  2 (์šฐ์ธก)
     F1 ->  [   OBJECT   ]  <- F2
            /           \
           /  Wrench      \
          /   Space๊ฐ€      \
         /   ์›์  ํฌํ•จ     \
        
  => ์–ด๋–ค ๋ฐฉํ–ฅ์˜ ์™ธ๋ ฅ๋„ F1, F2์˜ ์กฐํ•ฉ์œผ๋กœ ์ƒ์‡„ ๊ฐ€๋Šฅ
  => Force Closure!

์™œ ๊ธฐ์กด ๋ฐฉ๋ฒ•๋“ค์ด ๋‹ค์–‘์„ฑ(Diversity)์„ ์žƒ๋Š”๊ฐ€

๊ธฐ์กด ์ ‘๊ทผ๋ฒ•๋“ค์˜ ๋ฌธ์ œ๋ฅผ ๋‘ ๊ฐ€์ง€๋กœ ๋ถ„๋ฅ˜ํ•  ์ˆ˜ ์žˆ๋‹ค:

1. ์ƒ˜ํ”Œ๋ง ๊ธฐ๋ฐ˜ ๋ฐฉ๋ฒ• (Sampling-based)
GraspIt! ๊ฐ™์€ ํˆด์ด ๋Œ€ํ‘œ์ ์ด๋‹ค. ๋ฌด์ž‘์œ„๋กœ ์ ‘์ด‰์ ์„ ์ƒ˜ํ”Œ๋งํ•˜๊ณ , force closure๋ฅผ ์ฒดํฌํ•œ๋‹ค. ๋ฌธ์ œ๋Š” ํƒ์ƒ‰ ๊ณต๊ฐ„์ด ์—„์ฒญ๋‚˜๊ฒŒ ๋„“๊ณ , force closure๋ฅผ ๋งŒ์กฑํ•˜๋Š” ๊ตฌ์„ฑ์€ ๋“œ๋ฌผ๊ธฐ ๋•Œ๋ฌธ์— ํŒŒ์›Œ ๊ทธ๋ž˜์Šคํ”„๋กœ ์ˆ˜๋ ดํ•˜๋Š” ๊ฒฝํ–ฅ์ด ์žˆ๋‹ค. ๋งŽ์€ ์†๊ฐ€๋ฝ์ด ๋ฌผ์ฒด๋ฅผ ๊ฐ์‹ธ๋Š” ํŒŒ์›Œ ๊ทธ๋ž˜์Šคํ”„๋Š” ๊ฑฐ์˜ ์–ด๋””์„œ๋‚˜ force closure๋ฅผ ๋งŒ์กฑํ•˜๋ฏ€๋กœ, ์ƒ˜ํ”Œ๋Ÿฌ๊ฐ€ ๊ทธ๊ฒƒ๋งŒ ๊ณ„์† ์ฐพ๊ฒŒ ๋œ๋‹ค.

2. ๋ฏธ๋ถ„ ๊ฐ€๋Šฅํ•œ ๊ทผ์‚ฌ ๋ฐฉ๋ฒ• (Differentiable Approximation)
DexGraspNet, Liu et al. (RA-L 2021) ๊ฐ™์€ ์ตœ๊ทผ ๋ฐฉ๋ฒ•๋“ค์€ force closure์˜ ๋ฏธ๋ถ„ ๊ฐ€๋Šฅํ•œ ๊ทผ์‚ฌ์น˜๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ gradient descent๋กœ ๊ทธ๋ž˜์Šคํ”„๋ฅผ ์ตœ์ ํ™”ํ•œ๋‹ค. ๊ทธ๋Ÿฐ๋ฐ ์—ฌ๊ธฐ์„œ โ€œ๊ทผ์‚ฌโ€๊ฐ€ ํ•ต์‹ฌ ๋ฌธ์ œ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด, wrench space์˜ ๋ถ€ํ”ผ๋ฅผ ์ธก์ •ํ•˜๋Š” Q_1 ๋ฉ”ํŠธ๋ฆญ์„ softmax๋‚˜ convex relaxation์œผ๋กœ ๊ทผ์‚ฌํ•˜๋ฉด ์›๋ž˜ ๋ฌผ๋ฆฌ์  ์กฐ๊ฑด์ด ์•ฝํ™”๋œ๋‹ค. ๊ทธ ๊ฒฐ๊ณผ, ์ตœ์ ํ™”๋Š” ํ˜•์‹์ ์œผ๋กœ๋Š” ํ†ต๊ณผํ•˜์ง€๋งŒ ์‹ค์ œ๋กœ ์žก์œผ๋ฉด ํ”๋“ค๋ฆฌ๋Š” ๊ทธ๋ž˜์Šคํ”„๋ฅผ ์ƒ์„ฑํ•˜๊ฑฐ๋‚˜, ์†๊ฐ€๋ฝ์ด ๋ฌผ์ฒด ์•ˆ์œผ๋กœ ํŒŒ๊ณ ๋“œ๋Š”(penetration) ๊ทธ๋ž˜์Šคํ”„๊ฐ€ ๋‚˜์˜จ๋‹ค.

GraspQP๊ฐ€ ํ•ด๊ฒฐํ•˜๋ ค๋Š” ๊ฒƒ์ด ๋ฐ”๋กœ ์ด๊ฒƒ์ด๋‹ค: ๋ฌผ๋ฆฌ์  ์กฐ๊ฑด์„ ํƒ€ํ˜‘ ์—†์ด ์œ ์ง€ํ•˜๋ฉด์„œ๋„ ๋ฏธ๋ถ„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ๋งŒ๋“ค๊ธฐ.


๋ฐฉ๋ฒ•๋ก : GraspQP์˜ ์•„ํ‚คํ…์ฒ˜์™€ ํ•ต์‹ฌ ๊ธฐ์—ฌ

GraspQP์˜ ๊ตฌ์กฐ๋ฅผ ์ „์ฒด์ ์œผ๋กœ ์กฐ๋งํ•˜๋ฉด ๋‹ค์Œ๊ณผ ๊ฐ™๋‹ค:

flowchart TD
    A[Coarse Initialization<br/>์† ์ž์„ธ ์ดˆ๊ธฐ๊ฐ’ ์ƒ˜ํ”Œ๋ง] --> B[Energy Function ์ •์˜]
    B --> C1[Distance Term<br/>์ ‘์ด‰ ๊ฑฐ๋ฆฌ ์—๋„ˆ์ง€]
    B --> C2[Regularization Term<br/>์† ์ž์„ธ ๊ทœ์ œํ™”]
    B --> C3[Force Closure Term<br/>QP ๊ธฐ๋ฐ˜ ์—๋„ˆ์ง€]
    C3 --> D[Differentiable QP<br/>Implicit Differentiation]
    D --> E[Wrench Matrix ๊ตฌ์„ฑ<br/>G โˆˆ R^{6xm}]
    E --> F[Singular Value Scaling<br/>e^{-Q} ์Šค์ผ€์ผ๋ง]
    C1 --> G[Total Energy E_total]
    C2 --> G
    F --> G
    G --> H[MALA* Optimizer]
    H --> H1[Dynamic Resetting<br/>์—๋„ˆ์ง€ ๋ถ„ํฌ ๊ธฐ๋ฐ˜ ์ดˆ๊ธฐํ™”]
    H --> H2[Adaptive Temperature<br/>Scaling]
    H1 --> I[์ˆ˜๋ ด ๊ทธ๋ž˜์Šคํ”„ ํ›„๋ณด๊ตฐ]
    H2 --> I
    I --> J[๋‹ค์–‘ํ•˜๊ณ  ๋ฌผ๋ฆฌ์ ์œผ๋กœ<br/>์•ˆ์ •์ ์ธ ๊ทธ๋ž˜์Šคํ”„ ๋ฐ์ดํ„ฐ์…‹]

    style C3 fill:#ff9999,stroke:#cc0000
    style D fill:#ff9999,stroke:#cc0000
    style H fill:#99ccff,stroke:#0066cc
    style H1 fill:#99ccff,stroke:#0066cc
    style H2 fill:#99ccff,stroke:#0066cc

ํ•ต์‹ฌ ๊ธฐ์—ฌ 1: Differentiable Force Closure Energy via QP

์ด ๋…ผ๋ฌธ์˜ ๊ฐ€์žฅ ์ค‘์š”ํ•œ ๊ธฐ์—ฌ๋Š” force closure ์กฐ๊ฑด์„ ์•”๋ฌต์ (implicit) QP๋กœ ์ˆ˜์‹ํ™”ํ•˜๊ณ , ์ด๋ฅผ ํ†ตํ•ด ๋ฏธ๋ถ„ ๊ฐ€๋Šฅํ•œ ์—๋„ˆ์ง€ ํ•ญ์„ ์œ ๋„ํ•œ ๊ฒƒ์ด๋‹ค.

Wrench Matrix ๊ตฌ์„ฑ

๋จผ์ € ๊ฐ ์ ‘์ด‰์  i์—์„œ, ๋งˆ์ฐฐ ์›๋ฟ”์„ ๋‹ค๋ฉด์ฒด ๊ทผ์‚ฌ(polyhedral approximation)๋กœ ํ‘œํ˜„ํ•œ๋‹ค. ๋งˆ์ฐฐ ๊ณ„์ˆ˜ \mu๊ฐ€ ์ฃผ์–ด์ง€๋ฉด, ๋งˆ์ฐฐ ์›๋ฟ” ๋‚ด์— K๊ฐœ์˜ ๊ทน์„ (extreme rays)์„ ๋ฐฐ์น˜ํ•˜์—ฌ ๋งˆ์ฐฐ๋ ฅ์„ ํ‘œํ˜„ํ•œ๋‹ค. ์ด๋ฅผ ํ†ตํ•ด ๊ฐ ์ ‘์ด‰์ ๋งˆ๋‹ค K๊ฐœ์˜ wrench ๊ธฐ์ €๋ฒกํ„ฐ๊ฐ€ ์ƒ์„ฑ๋˜๊ณ , ์ด๋ฅผ ์—ด๋กœ ์Œ“์•„ Wrench Matrix \mathbf{G} \in \mathbb{R}^{6 \times m}๋ฅผ ๊ตฌ์„ฑํ•œ๋‹ค.

\mathbf{G} = [\mathbf{w}_1^1, \mathbf{w}_1^2, \ldots, \mathbf{w}_1^K, \mathbf{w}_2^1, \ldots, \mathbf{w}_n^K]

์—ฌ๊ธฐ์„œ \mathbf{w}_i^k๋Š” ์ ‘์ด‰์  i์˜ k๋ฒˆ์งธ ๋งˆ์ฐฐ ์›๋ฟ” ๊ทน์„ ์— ์˜ํ•œ wrench (force + torque).

Force Closure๋ฅผ QP๋กœ ํ‘œํ˜„

Force closure ์กฐ๊ฑด์€ ๋‹ค์Œ QP๊ฐ€ feasibleํ•œ์ง€ ์—ฌ๋ถ€๋กœ ํ™•์ธ๋œ๋‹ค:

\min_{\boldsymbol{\alpha}} \|\mathbf{G}\boldsymbol{\alpha}\|^2 \quad \text{s.t.} \quad \boldsymbol{\alpha} \geq 0, \quad \sum_i \alpha_i = 1

์ด QP๊ฐ€ \mathbf{G}\boldsymbol{\alpha} = \mathbf{0}์˜ ์†”๋ฃจ์…˜์„ ๊ฐ€์ง„๋‹ค๋ฉด(์ฆ‰, \boldsymbol{\alpha} > 0์œผ๋กœ zero resultant wrench๋ฅผ ์ƒ์„ฑํ•  ์ˆ˜ ์žˆ๋‹ค๋ฉด), ๊ทธ ๊ทธ๋ž˜์Šคํ”„๋Š” force closure๋ฅผ ๋งŒ์กฑํ•œ๋‹ค. ํ•ต์‹ฌ ์•„์ด๋””์–ด๋Š” ์ด QP์˜ ์ตœ์ ๊ฐ’(minimum value)์„ ์—๋„ˆ์ง€ ํ•จ์ˆ˜๋กœ ์‚ฌ์šฉํ•˜๋Š” ๊ฒƒ์ด๋‹ค:

E_{FC}(\theta) = \min_{\boldsymbol{\alpha} \geq 0, \sum \alpha_i = 1} \|\mathbf{G}(\theta)\boldsymbol{\alpha}\|^2

  • E_{FC} = 0์ด๋ฉด force closure ๋‹ฌ์„ฑ
  • E_{FC} > 0์ด๋ฉด force closure ๋ฏธ๋‹ฌ์„ฑ โ€” ๊ทธ ํฌ๊ธฐ๊ฐ€ โ€œ์–ผ๋งˆ๋‚˜ ๋ถ€์กฑํ•œ๊ฐ€โ€์˜ ์ฒ™๋„

๋ฏธ๋ถ„ ๊ฐ€๋Šฅ์„ฑ(differentiability): QP์˜ ์ตœ์ ๊ฐ’์€ ํŒŒ๋ผ๋ฏธํ„ฐ \theta (์† ์ž์„ธ, ๊ด€์ ˆ๊ฐ)์— ๋Œ€ํ•ด ๋ฏธ๋ถ„ ๊ฐ€๋Šฅํ•˜๋‹ค. ์ด๋Š” KKT ์กฐ๊ฑด์˜ Implicit Function Theorem์„ ํ†ตํ•ด ์ด๋ฃจ์–ด์ง„๋‹ค. QP๋Š” cvxpylayers ๋˜๋Š” differentiable optimization ํ”„๋ ˆ์ž„์›Œํฌ๋กœ ๊ตฌํ˜„ ๊ฐ€๋Šฅํ•˜๋ฉฐ, ์—ญ์ „ํŒŒ ์‹œ \frac{\partial E_{FC}}{\partial \theta}๋ฅผ ํšจ์œจ์ ์œผ๋กœ ๊ณ„์‚ฐํ•  ์ˆ˜ ์žˆ๋‹ค.

Implicit Differentiation of QP:

  Forward:  theta -> G(theta) -> QP solve -> E_FC
  Backward: dE_FC/d_theta via KKT conditions
            (์ฒด์ธ ๋ฃฐ๋กœ G์˜ Jacobian์„ ํ†ตํ•ด ์ „ํŒŒ)

Wrench Matrix Rank ๋ณด์žฅ: Singular Value Scaling

์—ฌ๊ธฐ์„œ ์ถ”๊ฐ€์ ์ธ ๊ธฐ์ˆ ์  ์„ธ๋ถ€์‚ฌํ•ญ์ด ์žˆ๋‹ค. Force closure๋Š” ๋‹จ์ˆœํžˆ \mathbf{G}๊ฐ€ full row rank (rank 6)์ธ ๊ฒƒ๋งŒ์œผ๋กœ๋Š” ๋ถ€์กฑํ•˜๊ณ , wrenches๊ฐ€ \mathbb{R}^6 ์ „์ฒด๋ฅผ ์–‘์ˆ˜ span ํ•ด์•ผ ํ•œ๋‹ค.

๋…ผ๋ฌธ์€ ์—๋„ˆ์ง€ ํ•ญ์„ Wrench Matrix์˜ ํŠน์ด๊ฐ’์˜ ๊ณฑ์œผ๋กœ ์Šค์ผ€์ผ๋งํ•œ๋‹ค:

E_{scaled} = e^{-Q(\mathbf{G})} \cdot E_{FC}(\theta)

์—ฌ๊ธฐ์„œ Q(\mathbf{G}) = \prod_i \sigma_i(\mathbf{G}) (ํŠน์ด๊ฐ’๋“ค์˜ ๊ณฑ). ์ด ์Šค์ผ€์ผ๋ง์€ ๋‘ ๊ฐ€์ง€ ํšจ๊ณผ๋ฅผ ๋‚ธ๋‹ค:

  1. \mathbf{G}๊ฐ€ rank-deficientํ•  ๋•Œ(์ฆ‰, wrench space๊ฐ€ 6D๋ฅผ spanํ•˜์ง€ ๋ชปํ•  ๋•Œ) Q \approx 0์ด ๋˜์–ด ์—๋„ˆ์ง€๊ฐ€ ์ฆํญ๋˜๊ณ , ์ตœ์ ํ™”๊ฐ€ full-rank ๊ตฌ์„ฑ์œผ๋กœ ์œ ๋„๋œ๋‹ค.
  2. \mathbf{G}๊ฐ€ ์ด๋ฏธ ์ข‹์€ ๊ตฌ์„ฑ์ผ ๋•Œ ์Šค์ผ€์ผ๋ง์ด ์ค„์–ด๋“ค์–ด ์•ˆ์ •์ ์ธ ์ˆ˜๋ ด์„ ์œ ๋„ํ•œ๋‹ค.

์ด๊ฒƒ์€ ๊ต‰์žฅํžˆ ์˜๋ฆฌํ•œ ํŠธ๋ฆญ์ด๋‹ค. ๋งˆ์น˜ ์—ญํ–‰๋ ฌ์ด ์กด์žฌํ•˜์ง€ ์•Š๋Š” ๋ฐฉํ–ฅ์„ โ€œ๋” ํฌ๊ฒŒ ๋ณด์ด๊ฒŒโ€ ๋งŒ๋“ค์–ด์„œ ์ตœ์ ํ™”๊ฐ€ ๊ทธ์ชฝ์„ ํ”ผํ•˜๊ฒŒ ์œ ๋„ํ•˜๋Š” ๊ฒƒ๊ณผ ๊ฐ™๋‹ค.

ํ•ต์‹ฌ ๊ธฐ์—ฌ 2: MALA* ์ตœ์ ํ™” ์•Œ๊ณ ๋ฆฌ์ฆ˜

๋‘ ๋ฒˆ์งธ ๊ธฐ์—ฌ๋Š” MALA* (Modified Metropolis-Adjusted Langevin Algorithm) ๋‹ค.

MALA์˜ ๊ธฐ๋ณธ ์•„์ด๋””์–ด

MALA๋Š” MCMC์™€ gradient descent๋ฅผ ๊ฒฐํ•ฉํ•œ ์•Œ๊ณ ๋ฆฌ์ฆ˜์ด๋‹ค. ์ผ๋ฐ˜์ ์ธ gradient descent๊ฐ€ ์—๋„ˆ์ง€์˜ ๋‚ด๋ฆฌ๋ง‰์„ ๋”ฐ๋ผ ํ™•์ •์ ์œผ๋กœ ๋‚ด๋ ค๊ฐ€๋Š” ๊ฒƒ๊ณผ ๋‹ฌ๋ฆฌ, MALA๋Š” ๋…ธ์ด์ฆˆ๋ฅผ ์ถ”๊ฐ€ํ•œ gradient step์„ ์ œ์•ˆํ•˜๊ณ , ๊ทธ step์„ ๋ฐ›์•„๋“ค์ผ์ง€ ์—ฌ๋ถ€๋ฅผ ํ™•๋ฅ ์ ์œผ๋กœ ๊ฒฐ์ •ํ•œ๋‹ค:

\theta_{t+1} = \theta_t - \eta \nabla E(\theta_t) + \sqrt{2\eta} \boldsymbol{\epsilon}, \quad \boldsymbol{\epsilon} \sim \mathcal{N}(0, I)

์ด ํ™•๋ฅ ์  ์„ฑ์งˆ์ด ๊ทธ๋ž˜์Šคํ”„ ๋‹ค์–‘์„ฑ์˜ ํ•ต์‹ฌ์ด๋‹ค โ€” ์ˆœ์ˆ˜ gradient descent๋Š” local minimum์— ๋น ์ง€๋ฉด ํƒˆ์ถœํ•˜์ง€ ๋ชปํ•˜์ง€๋งŒ, MALA๋Š” ๋…ธ์ด์ฆˆ๋กœ ์ธํ•ด ๋‹ค์–‘ํ•œ ๊ตฌ์„ฑ์„ ํƒ์ƒ‰ํ•  ์ˆ˜ ์žˆ๋‹ค.

MALA*์˜ ๋‘ ๊ฐ€์ง€ ๊ฐœ์„ 

๋ฌธ์ œ๋Š”, ๋ณ‘๋ ฌ๋กœ ๋งŽ์€ ๊ทธ๋ž˜์Šคํ”„ ํ›„๋ณด๋ฅผ ๋™์‹œ์— ์ตœ์ ํ™”ํ•  ๋•Œ ์ผ๋ถ€ ํ›„๋ณด๋“ค์ด ๋‚˜์œ local minimum์— ๊ฐ‡ํ˜€ ๋ฒ„๋ฆฐ๋‹ค๋Š” ๊ฒƒ์ด๋‹ค. ์ด๊ฒƒ๋“ค์€ ๊ท€์ค‘ํ•œ ๊ณ„์‚ฐ ์ž์›์„ ๋‚ญ๋น„ํ•  ๋ฟ ์•„๋‹ˆ๋ผ, ์ตœ์ข… ๋ฐ์ดํ„ฐ์…‹์˜ ๋‹ค์–‘์„ฑ์„ ๋‚ฎ์ถ˜๋‹ค.

GraspQP๋Š” MALA์— ๋‘ ๊ฐ€์ง€๋ฅผ ์ถ”๊ฐ€ํ•œ๋‹ค:

Dynamic Resetting (๋™์  ์ดˆ๊ธฐํ™”)

๋ฐฐ์น˜ ๋‚ด ์ „์ฒด ์ƒ˜ํ”Œ๋“ค์˜ ์—๋„ˆ์ง€ ๋ถ„ํฌ๋ฅผ ๋ณด๊ณ , ํŠน์ • ์ž„๊ณ„๊ฐ’์„ ์ดˆ๊ณผํ•˜๋Š” ํ›„๋ณด๋“ค์„ ์ƒˆ๋กœ์šด ์œ„์น˜์—์„œ ๋‹ค์‹œ ์‹œ์ž‘์‹œํ‚จ๋‹ค:

Algorithm: Dynamic Resetting in MALA*

FOR each optimization step t:
    Compute E_i for all grasp candidates i = 1...N
    Compute mu = mean(E_i), sigma = std(E_i)
    
    FOR each candidate i:
        IF E_i > mu + k * sigma:  // k is a hyperparameter
            Reset theta_i ~ p_init  // reinitialize from prior
        ELSE:
            theta_i <- MALA update

ํ•ต์‹ฌ์€ ์ „์ฒด ๋ฐฐ์น˜์˜ ์—๋„ˆ์ง€ ๋ถ„ํฌ๋ฅผ ๊ธฐ์ค€์œผ๋กœ ํ•œ๋‹ค๋Š” ๊ฒƒ์ด๋‹ค. ์ ˆ๋Œ€์ ์ธ ์—๋„ˆ์ง€๊ฐ’์ด ์•„๋‹ˆ๋ผ ์ƒ๋Œ€์ ์ธ ์œ„์น˜๋กœ ํŒ๋‹จํ•˜๊ธฐ ๋•Œ๋ฌธ์—, ์ „์ฒด ๋ฐฐ์น˜๊ฐ€ ์ข‹์•„์ง€๋ฉด ๊ธฐ์ค€๋„ ๋†’์•„์ง„๋‹ค.

Adaptive Temperature Scaling (์ ์‘์  ์˜จ๋„ ์Šค์ผ€์ผ๋ง)

MALA์˜ acceptance probability๋ฅผ ์ƒ˜ํ”Œ์˜ ์ƒ๋Œ€์  ์—๋„ˆ์ง€ ์„ฑ๋Šฅ์— ๋”ฐ๋ผ ์กฐ์ ˆํ•œ๋‹ค. ์—๋„ˆ์ง€๊ฐ€ ๋‚ฎ์€(์ข‹์€) ์ƒ˜ํ”Œ์€ step์„ ์ ๊ทน์ ์œผ๋กœ ๋ฐ›์•„๋“ค์ด๊ณ , ์—๋„ˆ์ง€๊ฐ€ ๋†’์€(๋‚˜์œ) ์ƒ˜ํ”Œ์€ ๋” ํฐ ๋…ธ์ด์ฆˆ๋กœ ํƒ์ƒ‰์„ ์ด‰์ง„ํ•œ๋‹ค:

T_i = T_{\text{base}} \cdot f\left(\frac{E_i - \mu_E}{\sigma_E}\right)

์ด ๋‘ ๋ฉ”์ปค๋‹ˆ์ฆ˜์˜ ๊ฒฐํ•ฉ์€ ์ง๊ด€์ ์ด๋‹ค: ์ข‹์€ ํ›„๋ณด๋Š” ์ •๋ฐ€ํ•˜๊ฒŒ ์ˆ˜๋ ด์‹œํ‚ค๊ณ , ๋‚˜์œ ํ›„๋ณด๋Š” ๊ณผ๊ฐํ•˜๊ฒŒ ์ดˆ๊ธฐํ™”ํ•˜์—ฌ ๋‹ค์‹œ ํƒ์ƒ‰ํ•œ๋‹ค. ๋งˆ์น˜ ์—ฐ๊ตฌํŒ€์—์„œ ์ž˜ ๋˜๋Š” ์•„์ด๋””์–ด๋Š” ๊นŠ์ด ํŒŒ๊ณ ๋“ค๊ณ , ๋ง‰ํžŒ ์•„์ด๋””์–ด๋Š” ์™„์ „ํžˆ ์ƒˆ๋กœ ์‹œ์ž‘ํ•˜๋Š” ๊ฒƒ๊ณผ ๊ฐ™๋‹ค.

์ „์ฒด ์—๋„ˆ์ง€ ํ•จ์ˆ˜ ๊ตฌ์„ฑ

์ตœ์ข… ์—๋„ˆ์ง€ ํ•จ์ˆ˜๋Š” ์„ธ ํ•ญ์˜ ํ•ฉ์ด๋‹ค:

E_{\text{total}} = \lambda_d E_{\text{dist}} + \lambda_r E_{\text{reg}} + \lambda_{fc} E_{\text{FC}}

ํ•ญ ์˜๋ฏธ ์—ญํ• 
E_{\text{dist}} ์†๊ฐ€๋ฝ ๋๊ณผ ๋ฌผ์ฒด ํ‘œ๋ฉด ์‚ฌ์ด์˜ ๊ฑฐ๋ฆฌ ์ ‘์ด‰ ํ˜•์„ฑ ์œ ๋„
E_{\text{reg}} ์† ์ž์„ธ ์ •๊ทœํ™” (๊ด€์ ˆ ํ•œ๊ณ„, ์ถฉ๋Œ ๋ฐฉ์ง€ ๋“ฑ) ๋ฌผ๋ฆฌ์  ํƒ€๋‹น์„ฑ ์œ ์ง€
E_{\text{FC}} QP ๊ธฐ๋ฐ˜ force closure ์—๋„ˆ์ง€ ์•ˆ์ •์„ฑ ๋ณด์žฅ

\lambda_d, \lambda_r, \lambda_{fc}๋Š” ๊ฐ ํ•ญ์˜ ๊ฐ€์ค‘์น˜๋กœ, ablation study๋ฅผ ํ†ตํ•ด ๊ฒฐ์ •๋œ๋‹ค.


๊ทธ๋ž˜์Šคํ”„ ํƒ€์ž…๊ณผ ๋ฐ์ดํ„ฐ์…‹ ๊ตฌ์„ฑ

GraspQP์˜ ๋˜ ๋‹ค๋ฅธ ๊ฐ•์ ์€ ์„ธ ๊ฐ€์ง€ ๊ทธ๋ž˜์Šคํ”„ ํƒ€์ž…์„ ๋ชจ๋‘ ์ง€์›ํ•œ๋‹ค๋Š” ๊ฒƒ์ด๋‹ค:

graph LR
    A[GraspQP Grasp Types] --> B[Power Grasp\nํŒŒ์›Œ ๊ทธ๋ž˜์Šคํ”„\n์ „์ฒด ์†์œผ๋กœ ๊ฐ์‹ธ๊ธฐ]
    A --> C[Pinch Grasp\nํ•€์น˜ ๊ทธ๋ž˜์Šคํ”„\n์—„์ง€+๊ฒ€์ง€ ์ง‘๊ธฐ]
    A --> D[Tri-finger Grasp\n์‚ผ์ง€ ์ •๋ฐ€ ๊ทธ๋ž˜์Šคํ”„\n์„ธ ์†๊ฐ€๋ฝ ์ •๋ฐ€]
    
    B --> E[์•ˆ์ •์„ฑ ๋†’์Œ\n๋‹ค์–‘์„ฑ ๋‚ฎ์Œ]
    C --> F[์„ธ๋ฐ€ํ•œ ์กฐ์ž‘\n๋†’์€ ๋‹ค์–‘์„ฑ ์š”๊ตฌ]
    D --> G[๋„๊ตฌ ์‚ฌ์šฉ ๋“ฑ\nํƒœ์Šคํฌ ํŠนํ™”]
    
    style B fill:#ffcc99
    style C fill:#99ffcc
    style D fill:#99ccff

ํ•ต์‹ฌ์€ pinch์™€ tri-finger grasp์„ ์ƒ์„ฑํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š” ์ ์€ ์ˆ˜์˜ ์ ‘์ด‰์ ์œผ๋กœ๋„ force closure๋ฅผ ๋‹ฌ์„ฑํ•ด์•ผ ํ•œ๋‹ค๋Š” ๊ฒƒ์ด๋‹ค. ํŒŒ์›Œ ๊ทธ๋ž˜์Šคํ”„์ฒ˜๋Ÿผ โ€œ๋งŽ์ด ๊ฐ์‹ธ์„œ ์•ˆ์ „ํ•˜๊ฒŒโ€๊ฐ€ ์•„๋‹ˆ๋ผ, โ€œ์ •ํ™•ํ•œ ์œ„์น˜์— ์ •ํ™•ํ•œ ๋ฐฉํ–ฅ์œผ๋กœ ํž˜์„ ๊ฐ€ํ•ด์„œ ์•ˆ์ „ํ•˜๊ฒŒโ€ ์žก์•„์•ผ ํ•œ๋‹ค. ์ด๊ฒƒ์ด ๋ฐ”๋กœ ์™„์ „ํ•œ force closure ์ˆ˜์‹ํ™”๊ฐ€ ํ•„์š”ํ•œ ์ด์œ ๋‹ค.

๋ฐ์ดํ„ฐ์…‹ ๊ทœ๋ชจ

๋…ผ๋ฌธ์—์„œ ์ œ๊ณตํ•˜๋Š” ๋ฐ์ดํ„ฐ์…‹์€:

ํ•ญ๋ชฉ ๋‚ด์šฉ
๋Œ€์ƒ ๋ฌผ์ฒด DexGraspNet์—์„œ 5,700๊ฐœ
๊ทธ๋ฆฌํผ ์ข…๋ฅ˜ 5์ข… (Psyonic Ability Hand, Shadow Hand, Allegro Hand, Robotiq 2f140, Robotiq 3F)
๊ทธ๋ž˜์Šคํ”„ ํƒ€์ž… 3์ข… (Power, Pinch, Tri-finger)
์ด ๊ทธ๋ž˜์Šคํ”„ ์ˆ˜ ๋‹ค์ˆ˜ (๋ฌผ์ฒด๋‹น ๋‹ค์ˆ˜์˜ ๋‹ค์–‘ํ•œ ๊ทธ๋ž˜์Šคํ”„)

ํŠนํžˆ Allegro Hand๊ฐ€ ํฌํ•จ๋˜์–ด ์žˆ์–ด Wonik Robotics ์—ฐ๊ตฌ ๋งฅ๋ฝ์—์„œ ์ง์ ‘ ํ™œ์šฉ ๊ฐ€๋Šฅ์„ฑ์ด ๋†’๋‹ค.


์‹คํ—˜: ์–ด๋–ป๊ฒŒ ํ‰๊ฐ€ํ–ˆ๋Š”๊ฐ€

ํ‰๊ฐ€ ์ง€ํ‘œ

๋…ผ๋ฌธ์€ ๋‘ ๊ฐ€์ง€ ํ•ต์‹ฌ ์ง€ํ‘œ๋ฅผ ์‚ฌ์šฉํ•œ๋‹ค:

1. UGR (Unique Grasp Rate / Successful Unique Grasp Rate)
Isaac Lab ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ์—์„œ 5N์˜ ์™ธ๋ž€๋ ฅ(disturbance force)์„ ์ ์šฉํ–ˆ์„ ๋•Œ ์„ฑ๊ณต์ ์œผ๋กœ ๋ฌผ์ฒด๋ฅผ ์œ ์ง€ํ•˜๋Š” ๊ทธ๋ž˜์Šคํ”„์˜ ์ˆ˜. ์—ฌ๊ธฐ์„œ โ€œUniqueโ€๊ฐ€ ์ค‘์š”ํ•˜๋‹ค โ€” ์„œ๋กœ ๋‹ค๋ฅธ ๊ตฌ์„ฑ์˜ ๊ทธ๋ž˜์Šคํ”„๊ฐ€ ์–ผ๋งˆ๋‚˜ ๋˜๋Š”์ง€๋ฅผ ๋ณธ๋‹ค. ๊ฐ™์€ ๊ทธ๋ž˜์Šคํ”„๋ฅผ 100๋ฒˆ ์ƒ์„ฑํ•˜๋Š” ๊ฑด ์˜๋ฏธ ์—†๋‹ค.

2. H (Entropy)
์ƒ์„ฑ๋œ ๊ทธ๋ž˜์Šคํ”„๋“ค์˜ ์† ์ž์„ธ ๊ณต๊ฐ„์—์„œ์˜ ์—”ํŠธ๋กœํ”ผ. ๋†’์„์ˆ˜๋ก ๋‹ค์–‘ํ•œ ๊ทธ๋ž˜์Šคํ”„๊ฐ€ ์ƒ์„ฑ๋˜์—ˆ๋‹ค๋Š” ์˜๋ฏธ.

์ด ๋‘ ์ง€ํ‘œ์˜ ๋™์‹œ ํ–ฅ์ƒ์ด GraspQP์˜ ๋ชฉํ‘œ๋‹ค โ€” ์•ˆ์ •์„ฑ๊ณผ ๋‹ค์–‘์„ฑ์˜ trade-off๋ฅผ ๊ทน๋ณตํ•˜๋Š” ๊ฒƒ.

๋น„๊ต ๋Œ€์ƒ

์ฃผ์š” ๋น„๊ต baseline:

๋ฐฉ๋ฒ• ํŠน์ง•
Liu et al. (RA-L 2021) Differentiable force closure ๊ทผ์‚ฌ, ํ˜„์žฌ state-of-the-art
Chen et al. (DexGraspNet) Sampling-based, ๋Œ€๊ทœ๋ชจ ๋ฐ์ดํ„ฐ์…‹ ์ƒ์„ฑ
GraspQP (ours) QP ๊ธฐ๋ฐ˜ ์—„๋ฐ€ํ•œ force closure

์ฃผ์š” ๊ฒฐ๊ณผ

๋…ผ๋ฌธ์˜ Figure 4๋Š” ํ•ต์‹ฌ ๊ฒฐ๊ณผ๋ฅผ ์ž˜ ๋ณด์—ฌ์ค€๋‹ค:

์‹œ๋“œ(seed) ์ˆ˜ ๋Œ€๋น„ Unique Successful Grasps ๊ณก์„ :

  • ๊ธฐ์กด baseline (Liu et al.): 512๊ฐœ์˜ ์‹œ๋“œ๋ฅผ ์‚ฌ์šฉํ•ด๋„ ~60๊ฐœ์˜ unique successful grasp์—์„œ ํฌํ™”(saturation) ๋œ๋‹ค.
  • GraspQP: 128๊ฐœ์˜ ์‹œ๋“œ๋งŒ์œผ๋กœ ~80๊ฐœ์˜ unique successful grasp ๋‹ฌ์„ฑ. ์ฆ‰, ๋” ์ ์€ ๊ณ„์‚ฐ์œผ๋กœ ๋” ๋งŽ์€ ๋‹ค์–‘์„ฑ์„ ์–ป๋Š”๋‹ค.

์†๋„ ์ธก๋ฉด์—์„œ๋Š” GraspQP๊ฐ€ ๋А๋ฆฌ๋‹ค โ€” 24-DoF Shadow Hand ๊ธฐ์ค€ grasp๋‹น 3.4์ดˆ vs ๊ธฐ์กด 1.15์ดˆ. ํ•˜์ง€๋งŒ ์ด ๋…ผ๋ฌธ์€ ์˜คํ”„๋ผ์ธ ๋ฐ์ดํ„ฐ์…‹ ์ƒ์„ฑ์„ ๋ชฉ์ ์œผ๋กœ ํ•˜๊ธฐ ๋•Œ๋ฌธ์—, ์†๋„๋ณด๋‹ค ํ’ˆ์งˆ์ด ์ค‘์š”ํ•˜๋‹ค.

Ablation Study

๋…ผ๋ฌธ์€ ๋‹ค์–‘ํ•œ ablation์„ ์ˆ˜ํ–‰ํ•œ๋‹ค:

flowchart LR
    A[Full GraspQP] --> B[w/o MALA*\nStandard MALA๋งŒ ์‚ฌ์šฉ]
    A --> C[w/o Singular Value Scaling\nE_FC๋งŒ ์‚ฌ์šฉ]
    A --> D[Softmax ๊ทผ์‚ฌ ์‚ฌ์šฉ\nstrict QP ๋Œ€์‹ ]
    A --> E[Form Closure Only\nFriction ๋ฌด์‹œ]
    
    B --> F[๋‹ค์–‘์„ฑ ๊ฐ์†Œ\nLocal minima ๋ฌธ์ œ]
    C --> G[Rank ๋ถˆ์•ˆ์ •\n์ผ๋ถ€ degenerate grasp]
    D --> H[๋ฌผ๋ฆฌ์  ํƒ€๋‹น์„ฑ ์•ฝํ™”\n์‹ค์ œ ์•ˆ์ •์„ฑ ๊ฐ์†Œ]
    E --> I[Pinch/Precision grasp ๋ถˆ๊ฐ€\nํŒŒ์›Œ ๊ทธ๋ž˜์Šคํ”„ ํŽธํ–ฅ]

ํŠนํžˆ ์ค‘์š”ํ•œ ablation์€ form closure vs force closure ๋น„๊ต๋‹ค. Form closure๋Š” ๋งˆ์ฐฐ์„ ๊ณ ๋ คํ•˜์ง€ ์•Š๊ณ  ๊ธฐํ•˜ํ•™์  ๊ตฌ์†๋งŒ์œผ๋กœ ๋ฌผ์ฒด๋ฅผ ๊ณ ์ •ํ•˜๋Š” ๊ฐœ๋…์ด๋‹ค. ๋…ผ๋ฌธ์˜ Theorem 3.1์€ ์—„๋ฐ€ํ•œ force closure(condition iii)์™€ ์™„ํ™”๋œ ํ˜•ํƒœ๋“ค(condition ii, i) ์‚ฌ์ด์˜ ๊ณ„์ธต ๊ตฌ์กฐ๋ฅผ ์ œ์‹œํ•˜๊ณ , ์‹คํ—˜์ ์œผ๋กœ ์—„๋ฐ€ํ•œ ์กฐ๊ฑด์ด ๋” ์ข‹์€ ๊ฒฐ๊ณผ๋ฅผ ๋‚ธ๋‹ค๋Š” ๊ฒƒ์„ ๋ณด์ธ๋‹ค.


๊ด€๋ จ ์—ฐ๊ตฌ์™€์˜ ๋น„๊ต

GraspQP๋ฅผ ์ „์ฒด landscape์—์„œ ์ดํ•ดํ•˜๋ ค๋ฉด ๊ด€๋ จ ์—ฐ๊ตฌ๋“ค๊ณผ์˜ ๊ด€๊ณ„๋ฅผ ํŒŒ์•…ํ•ด์•ผ ํ•œ๋‹ค:

graph TD
    A[Grasp Dataset Generation Methods] --> B[Sampling-based]
    A --> C[Optimization-based]
    A --> D[Learning-based]
    
    B --> B1[GraspIt!\nMiller & Allen 2004]
    B --> B2[DexGraspNet\nChen et al. 2023]
    
    C --> C1[Liu et al. RA-L 2021\nSoft FC Approximation]
    C --> C2[Grasp'd\nTurpin et al. ECCV 2022]
    C --> C3[Fast-Grasp'd\nTurpin et al. 2023]
    C --> C4[BODex\nBilevel Optimization]
    C --> C5[GraspQP\nThis Paper - QP-based FC]
    
    D --> D1[UniDexGrasp\nWan et al. 2023]
    D --> D2[Grasp Prediction Models\nPoint Cloud based]
    
    style C5 fill:#ff9999,stroke:#cc0000,stroke-width:3px

Liu et al. (RA-L 2021) ๊ณผ์˜ ๋น„๊ต

๊ฐ€์žฅ ์ง์ ‘์ ์ธ ๋น„๊ต ๋Œ€์ƒ์ด๋‹ค. Liu et al.๋„ differentiable force closure๋ฅผ ์‚ฌ์šฉํ•˜์ง€๋งŒ, softmax relaxation์„ ํ†ตํ•ด QP๋ฅผ ๊ทผ์‚ฌํ•œ๋‹ค:

\alpha_i = \frac{e^{-E_i/T}}{\sum_j e^{-E_j/T}}

์ด ๊ทผ์‚ฌ๋Š” ๊ณ„์‚ฐ์ด ๋น ๋ฅด์ง€๋งŒ force closure์˜ ํ•ต์‹ฌ ์กฐ๊ฑด์ธ \boldsymbol{\alpha} \geq 0, \sum \alpha_i = 1 ์ œ์•ฝ์„ ์—ฐ์†์ ์œผ๋กœ ์™„ํ™”ํ•œ๋‹ค. ์˜จ๋„ T๊ฐ€ ์ž‘์œผ๋ฉด ๊ทผ์‚ฌ๊ฐ€ ์ข‹์•„์ง€์ง€๋งŒ gradient vanishing ๋ฌธ์ œ๊ฐ€ ์ƒ๊ธฐ๊ณ , T๊ฐ€ ํฌ๋ฉด ๋ฌผ๋ฆฌ์  ์กฐ๊ฑด์ด ํ๋ ค์ง„๋‹ค.

GraspQP๋Š” ์ด trade-off๋ฅผ ๊ทผ๋ณธ์ ์œผ๋กœ ํ•ด๊ฒฐํ•œ๋‹ค: QP๋ฅผ ์ง์ ‘ ํ’€๊ณ , implicit differentiation์œผ๋กœ ์ •ํ™•ํ•œ gradient๋ฅผ ์–ป๋Š”๋‹ค.

Graspโ€™d / Fast-Graspโ€™d ์™€์˜ ๋น„๊ต

Turpin et al.์˜ Graspโ€™d ์‹œ๋ฆฌ์ฆˆ๋Š” differentiable simulation (Warp, Isaac)์„ ์‚ฌ์šฉํ•˜์—ฌ contact-rich ๊ทธ๋ž˜์Šคํ”„๋ฅผ ์ตœ์ ํ™”ํ•œ๋‹ค. ์ด ์ ‘๊ทผ์€ ์‹ค์ œ ๋ฌผ๋ฆฌ ์‹œ๋ฎฌ๋ ˆ์ด์…˜์˜ gradient๋ฅผ ์‚ฌ์šฉํ•˜๊ธฐ ๋•Œ๋ฌธ์— ๋” ํ˜„์‹ค์ ์ด์ง€๋งŒ:

  • ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ ์˜์กด์„ฑ์ด ๊ฐ•ํ•จ (์ด์‹์„ฑ ์ œํ•œ)
  • ๊ณ„์‚ฐ ๋น„์šฉ์ด ๋” ๋†’์Œ
  • ๋งˆ์ฐฌ๊ฐ€์ง€๋กœ ๋‹ค์–‘์„ฑ ๋ฌธ์ œ ์กด์žฌ

GraspQP๋Š” ๋ถ„์„์ (analytical) force closure ์ˆ˜์‹ํ™”๋ฅผ ์‚ฌ์šฉํ•˜๋ฏ€๋กœ ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ ๋…๋ฆฝ์ ์ด๋ฉฐ, ์›ํ•˜๋Š” ํ”Œ๋žซํผ์—์„œ ๋ฐ์ดํ„ฐ๋ฅผ ์ƒ์„ฑํ•  ์ˆ˜ ์žˆ๋‹ค.

DexEvolve์™€์˜ ์‹œ๋„ˆ์ง€

์ตœ๊ทผ ๋‚˜์˜จ DexEvolve (arXiv:2602.15201)๋Š” GraspQP๋ฅผ ์‹œ๋“œ ์ƒ์„ฑ๊ธฐ(seed generator) ๋กœ ํ™œ์šฉํ•˜์—ฌ, ๊ทธ ์œ„์— evolutionary optimization + Isaac Sim์„ ์–น์–ด ๋” ์ข‹์€ ๊ฒฐ๊ณผ๋ฅผ ๋‚ธ๋‹ค. GraspQP๋กœ ์ƒ์„ฑํ•œ ๋ถ„์„์  ๊ทธ๋ž˜์Šคํ”„ 32๊ฐœ๋ฅผ ์‹œ๋“œ๋กœ ์ฃผ์—ˆ์„ ๋•Œ, evolutionary refinement๋กœ ~115๊ฐœ์˜ unique grasp์„ ๋‹ฌ์„ฑํ•œ๋‹ค. ์ด๋Š” GraspQP๊ฐ€ ํŒŒ์ดํ”„๋ผ์ธ์˜ ์ฒซ ๋‹จ๊ณ„๋กœ์„œ ์šฐ์ˆ˜ํ•œ ์ดˆ๊ธฐ๊ฐ’์„ ์ œ๊ณตํ•œ๋‹ค๋Š” ๊ฒƒ์„ ๋ณด์—ฌ์ค€๋‹ค.


๋น„ํŒ์  ๊ณ ์ฐฐ: ๊ฐ•์ ๊ณผ ํ•œ๊ณ„

๊ฐ•์ 

1. ๋ฌผ๋ฆฌ์  ์—„๋ฐ€์„ฑ์˜ ํšŒ๋ณต
๊ธฐ์กด ์ ‘๊ทผ๋“ค์ด force closure๋ฅผ โ€œ๊ทผ์‚ฌโ€๋กœ ํ‰์ณค๋‹ค๋ฉด, GraspQP๋Š” KKT ์กฐ๊ฑด์„ ํ†ตํ•œ ์ •ํ™•ํ•œ gradient๋ฅผ ์‚ฌ์šฉํ•œ๋‹ค. ์ด๋Š” ์ƒ์„ฑ๋œ ๊ทธ๋ž˜์Šคํ”„๊ฐ€ ์‹ค์ œ๋กœ force closure๋ฅผ ๋งŒ์กฑํ•  ๊ฐ€๋Šฅ์„ฑ์„ ๋†’์ธ๋‹ค.

2. ๋‹ค์–‘์„ฑ๊ณผ ์•ˆ์ •์„ฑ์˜ ๋™์‹œ ํ–ฅ์ƒ
MALA*์˜ Dynamic Resetting์ด local minimum ํƒˆ์ถœ์„ ๋•๊ณ , ์ด๊ฒƒ์ด ๋‹ค์–‘์„ฑ์œผ๋กœ ์ง๊ฒฐ๋œ๋‹ค. ๊ธฐ์กด์—๋Š” ๋‹ค์–‘์„ฑ์„ ๋†’์ด๋ ค๋ฉด seed ์ˆ˜๋ฅผ ๋Š˜๋ ค์•ผ ํ–ˆ์ง€๋งŒ(๋น„์šฉ ์ฆ๊ฐ€), GraspQP๋Š” ๋™์ผํ•œ seed ์ˆ˜๋กœ ๋” ๋„“์€ ๊ตฌ์„ฑ ๊ณต๊ฐ„์„ ํƒ์ƒ‰ํ•œ๋‹ค.

3. ๋ฉ€ํ‹ฐ-๊ทธ๋ฆฌํผ / ๋ฉ€ํ‹ฐ-ํƒ€์ž… ์ง€์›
5์ข… ๊ทธ๋ฆฌํผ, 3์ข… ๊ทธ๋ž˜์Šคํ”„ ํƒ€์ž…์„ ํ†ต์ผ๋œ ํ”„๋ ˆ์ž„์›Œํฌ๋กœ ๋‹ค๋ฃฌ๋‹ค. ๊ทธ๋ฆฌํผ๋งˆ๋‹ค ๋‹ค๋ฅธ ์ˆ˜์‹ํ™”๊ฐ€ ํ•„์š” ์—†๋‹ค.

4. ์˜คํ”ˆ์†Œ์Šค ๋ฐ์ดํ„ฐ์…‹ ๊ธฐ์—ฌ
5,700 ๋ฌผ์ฒด ร— 5 ๊ทธ๋ฆฌํผ ร— 3 ํƒ€์ž…์˜ ๋Œ€๊ทœ๋ชจ ๋ฐ์ดํ„ฐ์…‹์€ downstream learning ์—ฐ๊ตฌ์— ์ฆ‰์‹œ ํ™œ์šฉ ๊ฐ€๋Šฅํ•˜๋‹ค.

์•ฝ์  ๋ฐ ํ•œ๊ณ„

1. ์—ฐ์‚ฐ ์†๋„
grasp๋‹น 3.4์ดˆ(Shadow Hand, 24-DoF)๋Š” ์˜คํ”„๋ผ์ธ ์ƒ์„ฑ์—๋Š” ํ—ˆ์šฉ ๊ฐ€๋Šฅํ•˜์ง€๋งŒ, ์˜จ๋ผ์ธ ์‹ค์‹œ๊ฐ„ ์‘์šฉ์ด๋‚˜ ๋งค์šฐ ๋Œ€๊ทœ๋ชจ ์ƒ์„ฑ์—๋Š” ๋ณ‘๋ชฉ์ด ๋œ๋‹ค. QP๋Š” ๊ธฐ๋ณธ์ ์œผ๋กœ iterative solver๋ฅผ ํ•„์š”๋กœ ํ•˜๋ฏ€๋กœ, ์ด ๋ถ€๋ถ„์˜ ์ตœ์ ํ™”๊ฐ€ ๊ณผ์ œ๋กœ ๋‚จ๋Š”๋‹ค.

2. Point Cloud Input ๋ถ€์žฌ
GraspQP๋Š” 3D mesh ๋ชจ๋ธ์ด ์žˆ๋Š” ๊ฐ์ฒด์— ๋Œ€ํ•ด์„œ๋งŒ ๋™์ž‘ํ•œ๋‹ค. ์‹ค์ œ ๋ฐฐํฌ ํ™˜๊ฒฝ์—์„œ๋Š” RGB-D ์นด๋ฉ”๋ผ๋กœ ์–ป์€ partial point cloud๋งŒ ์žˆ๋Š” ๊ฒฝ์šฐ๊ฐ€ ๋งŽ๋‹ค. ๋…ผ๋ฌธ์€ ๋‹ค์šด์ŠคํŠธ๋ฆผ grasp prediction ๋ชจ๋ธ(point cloud โ†’ grasp pose)์˜ ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ๋ฅผ ์ƒ์„ฑํ•˜๋Š” ๊ฒƒ์ด ๋ชฉ์ ์ด์ง€๋งŒ, ์ด ๊ฐญ ์ž์ฒด๋Š” ํ•ด์†Œ๋˜์ง€ ์•Š๋Š”๋‹ค.

3. Sim-to-Real ๊ฒ€์ฆ ๋ถ€์žฌ
์‹คํ—˜์ด Isaac Lab ์‹œ๋ฎฌ๋ ˆ์ด์…˜์—์„œ๋งŒ ์ˆ˜ํ–‰๋œ๋‹ค. ์‹ค์ œ ๋กœ๋ด‡์—์„œ์˜ ๋ฌผ๋ฆฌ์  ๊ฒ€์ฆ์ด ์—†์–ด, ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ์™€ ํ˜„์‹ค ์„ธ๊ณ„ ์‚ฌ์ด์˜ gap์ด ์–ผ๋งˆ๋‚˜ ๋˜๋Š”์ง€ ์•Œ ์ˆ˜ ์—†๋‹ค. ํŠนํžˆ ๋งˆ์ฐฐ ๊ณ„์ˆ˜๊ฐ€ ์‹œ๋ฎฌ๋ ˆ์ด์…˜๊ณผ ๋‹ค๋ฅผ ๋•Œ force closure๊ฐ€ ์–ผ๋งˆ๋‚˜ ์œ ์ง€๋˜๋Š”์ง€๋Š” ๋ฏธ์ง€์ˆ˜๋‹ค.

4. Contact Point ์‚ฌ์ „ ์ •์˜ ์˜์กด์„ฑ
๋…ผ๋ฌธ์—์„œ๋Š” ๊ฐ ๊ทธ๋ฆฌํผ์— ๋Œ€ํ•ด ์ˆ˜๋™์œผ๋กœ ์ •์˜๋œ contact mesh(๋…น์ƒ‰ ์ ์œผ๋กœ ํ‘œ์‹œ)๋ฅผ ์‚ฌ์šฉํ•œ๋‹ค. ์ด contact region์ด ๊ฒฐ๊ณผ์— ํฐ ์˜ํ–ฅ์„ ๋ฏธ์น  ๊ฒƒ์œผ๋กœ ๋ณด์ด์ง€๋งŒ, ๊ทธ ๋ฏผ๊ฐ๋„ ๋ถ„์„์€ ์ถฉ๋ถ„ํ•˜์ง€ ์•Š๋‹ค.

5. ํƒœ์Šคํฌ-ํŠนํ™” ๊ณ ๋ ค ์—†์Œ
์ƒ์„ฑ๋œ ๊ทธ๋ž˜์Šคํ”„๋Š” ๋ฌผ๋ฆฌ์ ์œผ๋กœ ์•ˆ์ •์ ์ด์ง€๋งŒ, ํŠน์ • ํƒœ์Šคํฌ(์˜ˆ: ๋‚˜์‚ฌ ๋Œ๋ฆฌ๊ธฐ, ์กฐ์‹ฌ์Šค๋Ÿฝ๊ฒŒ ๋‹ค๋ฃจ๊ธฐ)์— ์ ํ•ฉํ•œ์ง€๋Š” ๋ณด์žฅํ•˜์ง€ ์•Š๋Š”๋‹ค. ํƒœ์Šคํฌ์™€ ์—ฐ๊ด€๋œ grasp selection์€ ๋ณ„๋„ ๋ ˆ์ด์–ด๊ฐ€ ํ•„์š”ํ•˜๋‹ค.


Allegro Hand ์—ฐ๊ตฌ์ž๋ฅผ ์œ„ํ•œ ํŠน๋ณ„ ์ฃผ๋ชฉ ํฌ์ธํŠธ

Wonik Robotics์˜ Allegro Hand๋ฅผ ์—ฐ๊ตฌํ•˜๋Š” ์ž…์žฅ์—์„œ ์ด ๋…ผ๋ฌธ์ด ํŠนํžˆ ํฅ๋ฏธ๋กœ์šด ์ด์œ :

1. ์ง์ ‘์ ์ธ ๋ฐ์ดํ„ฐ ํ™œ์šฉ: ๋…ผ๋ฌธ ๋ฐ์ดํ„ฐ์…‹์— Allegro Hand๊ฐ€ ํฌํ•จ๋˜์–ด ์žˆ์–ด, grasp prediction ๋ชจ๋ธ ํ›ˆ๋ จ์— ์ฆ‰์‹œ ์‚ฌ์šฉ ๊ฐ€๋Šฅํ•˜๋‹ค.

2. ํŒŒ์ดํ”„๋ผ์ธ ์—ฐ๊ฒฐ ๊ฐ€๋Šฅ์„ฑ:

GraspQP (grasp ์ƒ์„ฑ)
  --> Point Cloud-based Grasp Prediction (ํ•™์Šต)
  --> GeoRT / Teleoperation (์‹ค์ œ ์† ์ œ์–ด)
  --> Real Allegro Hand Execution

์ด ํŒŒ์ดํ”„๋ผ์ธ์—์„œ GraspQP๋Š” ํ•™์Šต ๋ฐ์ดํ„ฐ์˜ ํ’ˆ์งˆ์„ ๊ฒฐ์ •ํ•˜๋Š” ํ•ต์‹ฌ ๋‹จ๊ณ„๋‹ค.

3. HORA / In-hand Manipulation๊ณผ์˜ ๊ด€๊ณ„: Power grasp๋ฟ ์•„๋‹ˆ๋ผ precision grasp ๋ฐ์ดํ„ฐ๋ฅผ ์ƒ์„ฑํ•  ์ˆ˜ ์žˆ๋‹ค๋Š” ์ ์€, in-hand manipulation ์ค‘ ๊ทธ๋ž˜์Šคํ”„ ์žฌ๊ตฌ์„ฑ(grasp regrasp)์„ ํ•™์Šตํ•  ๋•Œ ๋‹ค์–‘ํ•œ ์ดˆ๊ธฐ ๊ตฌ์„ฑ์„ ์ œ๊ณตํ•  ์ˆ˜ ์žˆ๋‹ค๋Š” ์˜๋ฏธ๋‹ค.


์š”์•ฝ ๋ฐ ๊ฒฐ๋ก 

GraspQP๋Š” dexterous grasp ๋ฐ์ดํ„ฐ์…‹ ์ƒ์„ฑ ๋ถ„์•ผ์—์„œ ์ค‘์š”ํ•œ ๋ฐœ์ „์„ ์ด๋ฃฉํ–ˆ๋‹ค. ํ•ต์‹ฌ์„ ์„ธ ๊ฐ€์ง€๋กœ ์••์ถ•ํ•˜๋ฉด:

1. โ€œForce closure๋ฅผ ์ œ๋Œ€๋กœ ์ธก์ •ํ•˜์žโ€ โ€” QP๋ฅผ ํ†ตํ•œ ์—„๋ฐ€ํ•˜๊ณ  ๋ฏธ๋ถ„ ๊ฐ€๋Šฅํ•œ ์ˆ˜์‹ํ™”
2. โ€œ๋‚˜์œ ํ›„๋ณด๋Š” ๊ณผ๊ฐํžˆ ๋ฒ„๋ฆฌ์žโ€ โ€” MALA*์˜ Dynamic Resetting์œผ๋กœ ๋‹ค์–‘์„ฑ ํ™•๋ณด
3. โ€œpinch๋„ ๋ฐ์ดํ„ฐ๋กœ ๋งŒ๋“ค์žโ€ โ€” ํŒŒ์›Œ ๊ทธ๋ž˜์Šคํ”„ ํŽธํ–ฅ์—์„œ ํƒˆํ”ผํ•œ ์ •๋ฐ€ ๊ทธ๋ž˜์Šคํ”„ ์ƒ์„ฑ

์ด ๋…ผ๋ฌธ์ด ์ œ์‹œํ•˜๋Š” ๋” ํฐ ๋ฉ”์‹œ์ง€๋Š”: ๋ฌผ๋ฆฌ ๋ฒ•์น™์„ ๊ทผ์‚ฌ๋กœ ํ‰์น˜๋ฉด ๊ฒฐ๊ตญ ํ˜„์‹ค์—์„œ ๋™์ž‘ํ•˜์ง€ ์•Š๋Š”๋‹ค๋Š” ๊ฒƒ์ด๋‹ค. ๋งˆ์ฐฐ์ด ์žˆ๋Š” ์ ‘์ด‰, force closure์˜ ์—„๋ฐ€ํ•œ ์กฐ๊ฑด โ€” ์ด๊ฒƒ๋“ค์„ ์ œ๋Œ€๋กœ ๋ชจ๋ธ๋งํ•ด์•ผ๋งŒ ์‹ค์ œ๋กœ ์žก์„ ์ˆ˜ ์žˆ๋Š” ๊ทธ๋ž˜์Šคํ”„๋ฅผ ์ƒ์„ฑํ•  ์ˆ˜ ์žˆ๋‹ค.

ํ–ฅํ›„ ์—ฐ๊ตฌ ๋ฐฉํ–ฅ์œผ๋กœ๋Š” (1) ์‹ค์ œ ๋กœ๋ด‡์—์„œ์˜ sim-to-real ๊ฒ€์ฆ, (2) partial observation(point cloud input) ํ™˜๊ฒฝ์—์„œ์˜ ์ ์šฉ, (3) ํƒœ์Šคํฌ-ํŠนํ™” grasp synthesis์™€์˜ ๊ฒฐํ•ฉ, (4) ์‹ค์‹œ๊ฐ„ ์ ์šฉ์„ ์œ„ํ•œ QP solver ๊ฐ€์†์ด ์ž์—ฐ์Šค๋Ÿฌ์šด ๋‹ค์Œ ๋‹จ๊ณ„๊ฐ€ ๋  ๊ฒƒ์ด๋‹ค.

DexEvolve์ฒ˜๋Ÿผ GraspQP๋ฅผ ์‹œ๋“œ๋กœ ์‚ฌ์šฉํ•˜๋Š” ํ›„์† ์—ฐ๊ตฌ๋“ค์ด ๋“ฑ์žฅํ•˜๋Š” ๊ฒƒ์„ ๋ณด๋ฉด, ์ด ๋…ผ๋ฌธ์ด ๋‹จ์ˆœํ•œ end-to-end ์†”๋ฃจ์…˜์„ ๋„˜์–ด ๋” ํฐ ํŒŒ์ดํ”„๋ผ์ธ์˜ ํ•ต์‹ฌ ๋ถ€ํ’ˆ์œผ๋กœ ์ž๋ฆฌ์žก์•„ ๊ฐ€๊ณ  ์žˆ์Œ์ด ๋ถ„๋ช…ํ•˜๋‹ค.


์ฐธ๊ณ  ๋ฌธํ—Œ

  • Zurbrรผgg, R., Cramariuc, A., & Hutter, M. (2025). GraspQP: Differentiable Optimization of Force Closure for Diverse and Robust Dexterous Grasping. CoRL 2025. arXiv:2508.15002
  • Liu, T., et al. (2021). Synthesizing diverse and physically stable grasps with arbitrary hand structures using differentiable force closure estimator. RA-L.
  • Chen, et al. (2023). DexGraspNet: A Large-Scale Robotic Dexterous Grasp Dataset. CVPR.
  • Turpin, D., et al. (2022). Graspโ€™d: Differentiable Contact-Rich Grasp Synthesis. ECCV.
  • Lum, T.G.W., et al. (2024). DextrAH-G: Pixels-to-Action Dexterous Arm-Hand Grasping with Geometric Fabrics.
  • Project Page: https://graspqp.github.io/

Copyright 2026, JungYeon Lee