Curieux.JY
  • JungYeon Lee
  • Post
  • Projects
  • Note

On this page

  • ๐Ÿ” Ping Review
  • ๐Ÿ”” Ring Review

๐Ÿ“ƒContact Trust Region ๋ฆฌ๋ทฐ

mpc
contact-rich
dexterity
Dexterous Contact-Rich Manipulation via the Contact Trust Region
Published

March 8, 2026

  • Paper Link
  • Project Link
  • Code Link
  1. โœจ ์ ‘์ด‰์ด ํ’๋ถ€ํ•œ ๋กœ๋ด‡ ์กฐ์ž‘์—๋Š” ์‹ ๋ขฐํ•  ์ˆ˜ ์žˆ๋Š” ๊ตญ์†Œ ์ ‘์ด‰ ๋™์—ญํ•™ ๋ฌ˜์‚ฌ๊ฐ€ ์ค‘์š”ํ•˜๋ฉฐ, ๋ณธ ๋…ผ๋ฌธ์€ ์ ‘์ด‰์˜ ๋‹จ์ธก์„ฑ์„ ํฌ์ฐฉํ•˜๋Š” ์ปจํƒํŠธ ํŠธ๋Ÿฌ์ŠคํŠธ ๋ฆฌ์ „(CTR)์„ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค.
  2. ๐Ÿค– CTR์„ ํ™œ์šฉํ•˜์—ฌ ๊ตญ์†Œ ์ ‘์ด‰ ์กฐ์ž‘ ๊ณ„ํš ๋ฐ ์•ˆ์ •ํ™”์— ํšจ์œจ์ ์ธ ๋ชจ๋ธ ์˜ˆ์ธก ์ œ์–ด(MPC) ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ๊ฐœ๋ฐœํ•˜๊ณ , ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๋ฐ ํ•˜๋“œ์›จ์–ด ์‹คํ—˜์œผ๋กœ ์„ฑ๋Šฅ์„ ์ž…์ฆํ–ˆ์Šต๋‹ˆ๋‹ค.
  3. ๐Ÿ—บ๏ธ MPC ๊ธฐ๋ฐ˜์˜ ๊ตญ์†Œ ๊ณ„ํš์„ ์—ฐ๊ฒฐํ•˜๋Š” ๋กœ๋“œ๋งต ๋ฐฉ์‹์„ ํ†ตํ•ด ์žฅ๊ฑฐ๋ฆฌ ๋ฑ์Šคํ„ฐ๋Ÿฌ์Šค ์กฐ์ž‘์„ ์œ„ํ•œ ์ „์—ญ ๊ณ„ํš์ด ๊ฐ€๋Šฅํ•˜๋ฉฐ, ๊ธฐ์กด ๋ฐฉ๋ฒ•๋ณด๋‹ค ๊ณ„์‚ฐ ์‹œ๊ฐ„์„ ํฌ๊ฒŒ ๋‹จ์ถ•ํ–ˆ์Šต๋‹ˆ๋‹ค.

๐Ÿ” Ping Review

๐Ÿ” Ping โ€” A light tap on the surface. Get the gist in seconds.

๋ณธ ๋…ผ๋ฌธ โ€œDexterous Contact-Rich Manipulation via the Contact Trust Regionโ€์€ ๋กœ๋ด‡์˜ ๋Šฅ์ˆ™ํ•˜๊ณ  ์ ‘์ด‰์ด ๋งŽ์€ ์กฐ์ž‘(dexterous contact-rich manipulation)์„ ์œ„ํ•œ ํšจ์œจ์ ์ธ ์ง€์—ญ์  ๋™์—ญํ•™ ๋ชจ๋ธ๊ณผ ๊ทธ ์‹ ๋ขฐ ์˜์—ญ(trust region)์„ ์ •์˜ํ•˜๋Š” ๋ฌธ์ œ๋ฅผ ๋‹ค๋ฃฌ๋‹ค. ๊ธฐ์กด์˜ ๋งŽ์€ ์ ‘๊ทผ ๋ฐฉ์‹์€ ๋™์—ญํ•™์˜ Taylor ๊ทผ์‚ฌ์™€ ํƒ€์›ํ˜• trust region์— ์˜์กดํ•˜์ง€๋งŒ, ๋ณธ ๋…ผ๋ฌธ์€ ์ด๋Ÿฌํ•œ ๋ฐฉ์‹์ด ์ ‘์ด‰์˜ ๋น„๋Œ€์นญ์„ฑ(unilateral nature)๊ณผ ๊ทผ๋ณธ์ ์œผ๋กœ ์ผ๊ด€๋˜์ง€ ์•Š๋‹ค๊ณ  ์ฃผ์žฅํ•œ๋‹ค.

์ด๋Ÿฌํ•œ ๋ฌธ์ œ์ ์„ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด, ๋ณธ ๋…ผ๋ฌธ์€ ์ ‘์ด‰์˜ ๋น„๋Œ€์นญ์„ฑ์„ ํฌ์ฐฉํ•˜๋ฉด์„œ๋„ ๊ณ„์‚ฐ ํšจ์œจ์„ฑ์„ ์œ ์ง€ํ•˜๋Š” Contact Trust Region(CTR)์„ ์ œ์•ˆํ•œ๋‹ค. CTR์„ ๊ธฐ๋ฐ˜์œผ๋กœ, ๋จผ์ € ์ง€์—ญ์ ์ธ ์ ‘์ด‰์ด ๋งŽ์€ ๊ณ„ํš์„ ํ•ฉ์„ฑํ•  ์ˆ˜ ์žˆ๋Š” Model-Predictive Control(MPC) ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ๊ฐœ๋ฐœํ•œ๋‹ค. ๊ทธ ํ›„, ์ด ๊ธฐ๋Šฅ์„ ํ™•์žฅํ•˜์—ฌ ์ง€์—ญ MPC ๊ณ„ํš๋“ค์„ ์—ฐ๊ฒฐํ•จ์œผ๋กœ์จ ์ „์—ญ์ ์œผ๋กœ ๊ณ„ํšํ•˜๊ณ  ํšจ์œจ์ ์ด๋ฉฐ ๋Šฅ์ˆ™ํ•œ ์ ‘์ด‰์ด ๋งŽ์€ ์กฐ์ž‘์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•œ๋‹ค.

๋ณธ ๋…ผ๋ฌธ์˜ ์ฃผ์š” ๊ธฐ์—ฌ๋Š” ์„ธ ๊ฐ€์ง€์ด๋‹ค. ์ฒซ์งธ, ์ ‘์ด‰ ์—ญํ•™์„ ํšจ์œจ์ ์œผ๋กœ ๊ทผ์‚ฌํ•˜๋Š” Contact Trust Region(CTR)์ด๋‹ค. ๋‘˜์งธ, ์ง€์—ญ์ ์ธ ์ ‘์ด‰์ด ๋งŽ์€ ์กฐ์ž‘์— ํŠนํ™”๋œ ๋งค์šฐ ํšจ์œจ์ ์ธ ๊ธฐ์šธ๊ธฐ ๊ธฐ๋ฐ˜ MPC ์ปจํŠธ๋กค๋Ÿฌ์ด๋‹ค. ์…‹์งธ, ์ง€์—ญ ๊ถค์ ๋“ค์„ ์—ฐ๊ฒฐํ•˜๋Š” ์ „์—ญ ํ”Œ๋ž˜๋„ˆ์ด๋‹ค.

ํ•ต์‹ฌ ๋ฐฉ๋ฒ•๋ก : Contact Trust Region (CTR)

๋ณธ ๋…ผ๋ฌธ์€ ์ ‘์ด‰ ๋™์—ญํ•™์„ Convex Quasidynamic Differentiable Contact(CQDC) ๋ชจ๋ธ๋กœ ํ‘œํ˜„ํ•œ๋‹ค. ์ด๋Š” ์ ‘์ด‰ ์‹œ๋ฎฌ๋ ˆ์ด์…˜์„ ๋‹ค์Œ ํ˜•ํƒœ์˜ Second-Order Cone Program(SOCP)์œผ๋กœ ์ •์‹ํ™”ํ•œ๋‹ค: \begin{aligned} \min_{q_+} & \quad \frac{1}{2} q_+^\top P(q)q_+ + b(q, u)^\top q_+, \\ \text{subject to} & \quad J_i(q)q_+ + c_i(q) \in K_i, \quad \forall i \in I_c. \end{aligned} ์—ฌ๊ธฐ์„œ q๋Š” ์‹œ์Šคํ…œ ์„ค์ •(configuration), u๋Š” ๋กœ๋ด‡์˜ ์ œ์–ด ์ž…๋ ฅ(actuated configuration command), P, b, J_i, c_i๋Š” q, u์— ์˜์กดํ•˜๋Š” ํ–‰๋ ฌ/๋ฒกํ„ฐ, I_c๋Š” ์ ‘์ด‰ ์Œ ์ธ๋ฑ์Šค ์ง‘ํ•ฉ, K_i๋Š” ๊ฐ€๋Šฅํ•œ ์†๋„(velocity)์˜ feasible cone์ด๋‹ค. ์ด SOCP์˜ KKT ์กฐ๊ฑด์€ ์ค€๋™์ (quasi-dynamic) ์šด๋™ ๋ฐฉ์ •์‹, ๋น„๊ด€ํ†ต(non-penetration), ๋งˆ์ฐฐ ์›๋ฟ”(friction cone), ์ƒ๋ณด์„ฑ(complementarity) ์ œ์•ฝ์„ ๋งŒ์กฑํ•œ๋‹ค.

์ด ๋ชจ๋ธ์˜ ์ง์ ‘์ ์ธ ๋ฏธ๋ถ„์€ ์ ‘์ด‰ ๋ชจ๋“œ ์ „ํ™˜์œผ๋กœ ์ธํ•ด ๊ธฐ์šธ๊ธฐ๊ฐ€ ๋ถˆ์—ฐ์†์ ์ด๋‹ค. ์ด๋ฅผ ์™„ํ™”ํ•˜๊ธฐ ์œ„ํ•ด ๋ณธ ๋…ผ๋ฌธ์€ ๋กœ๊ทธ ๋ฐฐ๋ฆฌ์–ด(log-barrier) ์Šค๋ฌด๋”ฉ์„ ์ ์šฉํ•œ ์™„ํ™”๋œ ๋™์—ญํ•™ f_\kappa(q,u)๋ฅผ ์‚ฌ์šฉํ•œ๋‹ค. ์ด ์™„ํ™”๋œ ๋™์—ญํ•™์€ ์Šค๋ฌด๋”ฉ ํŒŒ๋ผ๋ฏธํ„ฐ \kappa์— ์˜์กดํ•˜๋ฉฐ, ์ ‘์ด‰์ด ์—†๋Š” ๊ฐ์ฒด ์‚ฌ์ด์—๋„ ํž˜์„ ๋ฐœ์ƒ์‹œํ‚จ๋‹ค. ์Šค๋ฌด๋”ฉ๋œ ๋™์—ญํ•™์˜ ๊ธฐ์šธ๊ธฐ๋Š” ๋ฏผ๊ฐ๋„ ๋ถ„์„(sensitivity analysis)์„ ํ†ตํ•ด ์–ป์„ ์ˆ˜ ์žˆ๋‹ค.

๋ณธ ๋…ผ๋ฌธ์€ smoothed dynamics์˜ Taylor ๊ทผ์‚ฌ๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ๋‹ค์Œ ์ƒํƒœ \hat{q}_+์™€ ์ ‘์ด‰๋ ฅ \hat{\lambda}_{+,i}์— ๋Œ€ํ•œ ์„ ํ˜• ๋ชจ๋ธ์„ ๊ตฌ์ถ•ํ•œ๋‹ค: \begin{aligned} \hat{q}_+ &= A_\kappa \delta q + B_\kappa \delta u + f_\kappa(\bar{q}, \bar{u}), \\ \hat{\lambda}_{+,i} &= C_{\kappa,i} \delta q + D_{\kappa,i} \delta u + \lambda_{\kappa,i}(\bar{q}, \bar{u}). \end{aligned} ์—ฌ๊ธฐ์„œ (\bar{q}, \bar{u})๋Š” ํ˜„์žฌ nominal point์ด๊ณ  (\delta q, \delta u)๋Š” perturbation์ด๋‹ค.

Ellipsoidal Trust Region (ETR)์€ (\delta q, \delta u)์— ๋Œ€ํ•ด \delta z^\top \Sigma \delta z \leq 1 ํ˜•ํƒœ์˜ ์ œ์•ฝ์„ ๊ฐ€ํ•œ๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ ์ด๋Š” ์ ‘์ด‰์˜ ๋น„๋Œ€์นญ์„ฑ์„ ํฌ์ฐฉํ•˜์ง€ ๋ชปํ•œ๋‹ค.

Contact Trust Region (CTR)์€ ETR ์ œ์•ฝ์— ๋”ํ•˜์—ฌ, ์œ„ ์„ ํ˜• ๋ชจ๋ธ๋กœ ์˜ˆ์ธก๋œ ๋‹ค์Œ ์ƒํƒœ \hat{q}_+์™€ ์ ‘์ด‰๋ ฅ \hat{\lambda}_{+,i}๊ฐ€ ์›๋ž˜ ๋น„์™„ํ™”๋œ SOCP ๋™์—ญํ•™์˜ primal ๋ฐ dual feasibility constraint๋ฅผ ๋งŒ์กฑํ•ด์•ผ ํ•œ๋‹ค๋Š” ์ œ์•ฝ์„ ์ถ”๊ฐ€ํ•œ๋‹ค: \begin{aligned} J_i \hat{q}_+ + c_i &\in K_i, \\ \hat{\lambda}_{+,i} &\in K_i^*. \end{aligned} ์ด๋Ÿฌํ•œ ์ œ์•ฝ์€ ์„ ํ˜•ํ™”๋œ ๋ณ€์ˆ˜์— ๋Œ€ํ•ด ๋ถ€๊ณผ๋˜๋ฏ€๋กœ, CTR์€ ์—ฌ์ „ํžˆ ๋ณผ๋ก ์ง‘ํ•ฉ(convex set)์ด๋‹ค (๊ตฌ์ฒด์ ์œผ๋กœ, ์—ฌ๋Ÿฌ ๊ฐœ์˜ second-order cone constraints์˜ ๊ต์ง‘ํ•ฉ). Example 1๊ณผ 2๋ฅผ ํ†ตํ•ด, primal feasibility ์ œ์•ฝ(J_i \hat{q}_+ + c_i \in K_i)์ด ๋•Œ๋•Œ๋กœ ์‹ค์ œ ๋„๋‹ฌ ๊ฐ€๋Šฅํ•œ ์˜์—ญ๋ณด๋‹ค trust region์„ ์ง€๋‚˜์น˜๊ฒŒ ๋ณด์ˆ˜์ ์œผ๋กœ ์ œํ•œํ•จ์„ ๋ณด์—ฌ์ค€๋‹ค.

๋”ฐ๋ผ์„œ ๋ณธ ๋…ผ๋ฌธ์€ primal feasibility ์ œ์•ฝ์„ ์™„ํ™”ํ•œ Relaxed Contact Trust Region (R-CTR)์„ ์ œ์•ˆํ•œ๋‹ค. R-CTR์€ ETR ์ œ์•ฝ๊ณผ dual feasibility ์ œ์•ฝ(\hat{\lambda}_{+,i} \in K_i^*)๋งŒ์„ ํฌํ•จํ•œ๋‹ค. Example 3์€ R-CTR์„ ์‚ฌ์šฉํ•œ Motion Set(์„ ํ˜•ํ™”๋œ primal solution map์— ์˜ํ•œ RA-CTR์˜ ์ด๋ฏธ์ง€)์ด ๊ฐ์ฒด ์›€์ง์ž„์˜ ์ง€์—ญ์  ๋„๋‹ฌ ๊ฐ€๋Šฅ์„ฑ์„ ๋” ์ž˜ ํฌ์ฐฉํ•จ์„ ๋ณด์—ฌ์ค€๋‹ค. ๋˜ํ•œ, RA-CTR๊ณผ ๊ทธ์— ๋”ฐ๋ฅธ Wrench Set, Motion Set ๊ฐœ๋…์€ ๊ณ ์ „์ ์ธ ์ ‘์ด‰ ์—ญํ•™ ๊ฐœ๋…๊ณผ ์—ฐ๊ฒฐ๋  ์ˆ˜ ์žˆ์Œ์„ ์ด๋ก ์ ์œผ๋กœ ๋ณด์ธ๋‹ค (Lemma 2).

์ง€์—ญ ๊ณ„ํš ๋ฐ ์ œ์–ด (Local Planning and Control)

์ œ์•ˆ๋œ R-CTR์€ ์ง€์—ญ ๊ถค์  ์ตœ์ ํ™”(trajectory optimization) ๋ฐ MPC์— ํ™œ์šฉ๋œ๋‹ค. Algorithm 1์€ R-CTR ์ œ์•ฝ์„ ํฌํ•จํ•˜๋Š” SOCP subproblem์„ ๋ฐ˜๋ณต์ ์œผ๋กœ ํ•ด๊ฒฐํ•˜์—ฌ nominal trajectory๋ฅผ ๊ฐœ์„ ํ•˜๋Š” ๋ฐฉ์‹์ด๋‹ค. ์ด ๋ฐฉ๋ฒ•์€ smoothed dynamics์˜ ์„ ํ˜• ๊ทผ์‚ฌ๋ฅผ ์‚ฌ์šฉํ•˜์ง€๋งŒ, R-CTR์„ ํ†ตํ•ด ์ง€์—ญ์ ์œผ๋กœ ์œ ํšจํ•œ ์˜์—ญ ๋‚ด์—์„œ ๊ณ„ํš์ด ์ด๋ฃจ์–ด์ง€๋„๋ก ํ•œ๋‹ค. ํŠนํžˆ, ์ ‘์ด‰์ด ์—†๋Š” ์ดˆ๊ธฐ ์ƒํƒœ์—์„œ ์‹œ์ž‘ํ•  ๊ฒฝ์šฐ, ๋กœ๋ด‡์ด ๊ฐ์ฒด์— ์ ‘์ด‰ํ•˜๋„๋ก ์œ ๋„ํ•˜๋Š” ์ดˆ๊ธฐ ์ถ”์ธก ํœด๋ฆฌ์Šคํ‹ฑ์„ ์ ์šฉํ•˜์—ฌ ๊ณ„ํš์˜ ํšจ์œจ์„ฑ์„ ๋†’์ธ๋‹ค. Example 4์™€ 5๋Š” ์ด ๋ฐฉ๋ฒ•์ด ์ ‘์ด‰ ๋ชจ๋“œ ์ „ํ™˜์„ ํƒ์ƒ‰ํ•˜๊ณ  ๊ณ„ํš์— ์œ ๋ฆฌํ•œ ๋ฐฉํ–ฅ์œผ๋กœ ๋‚˜์•„๊ฐ€๋Š” ๊ณผ์ •์„ ๋ณด์—ฌ์ค€๋‹ค.

Algorithm 2๋Š” Algorithm 1์„ MPC ํ”„๋ ˆ์ž„์›Œํฌ์— ์ ์šฉํ•œ ๊ฒƒ์ด๋‹ค. ํ˜„์žฌ ์ƒํƒœ์—์„œ ๋ฏธ๋ž˜ ์ƒํƒœ๊นŒ์ง€์˜ ๊ถค์ ์„ ๊ณ„ํšํ•˜๊ณ , ๊ณ„ํš๋œ ์ฒซ ๋ฒˆ์งธ ์ œ์–ด ์ž…๋ ฅ์„ ์‹ค์ œ ์‹œ์Šคํ…œ์— ์ ์šฉํ•œ ํ›„, ๋‹ค์Œ ์ƒํƒœ๋ฅผ ๊ด€์ฐฐํ•˜์—ฌ ๋‹ค์‹œ ๊ณ„ํš์„ ์ˆ˜ํ–‰ํ•œ๋‹ค (re-planning).

์‹คํ—˜ ๊ฒฐ๊ณผ (Experiments)

๋ณธ ๋…ผ๋ฌธ์€ IiwaBimanual (planar, 29 collision geometries) ๋ฐ AllegroHand (3D in-hand, 39 collision geometries) ๋‘ ๊ฐ€์ง€ ์ ‘์ด‰์ด ๋งŽ์€ ๋กœ๋ด‡ ์‹œ์Šคํ…œ์—์„œ ์ œ์•ˆ๋œ ๋ฐฉ๋ฒ•๋ก ์„ ํฌ๊ด„์ ์œผ๋กœ ํ‰๊ฐ€ํ•œ๋‹ค.

  1. CQDC ๋™์—ญํ•™์—์„œ์˜ ์ง€์—ญ MPC ์„ฑ๋Šฅ (Section 5):
    • R-CTR, CTR, ETR์„ ์‚ฌ์šฉํ•˜๋Š” MPC์˜ ๋ชฉํ‘œ ๋„๋‹ฌ ์„ฑ๋Šฅ(์ตœ์ข… ๊ฐ์ฒด ์œ„์น˜/ํšŒ์ „ ์˜ค๋ฅ˜) ๋น„๊ต.
    • ์ƒ์„ฑ๋œ ๋ชฉํ‘œ๋Š” ์ง€์—ญ์ ์œผ๋กœ ๋„๋‹ฌ ๊ฐ€๋Šฅํ•˜๋‚˜ MPC์— ๋„์ „์ ์ธ ๋ชฉํ‘œ๋“ค์ด๋‹ค (Figure 9).
    • ๊ฒฐ๊ณผ(Figure 9, Table 2): R-CTR์ด ๋‘ ์‹œ์Šคํ…œ ๋ชจ๋‘์—์„œ ํ‰๊ท  ์˜ค๋ฅ˜ ๋ฐ ๋ถ„์‚ฐ ์ธก๋ฉด์—์„œ ๊ฐ€์žฅ ์ข‹์€ ์„ฑ๋Šฅ์„ ๋ณด์˜€๋‹ค. ํŠนํžˆ IiwaBimanual์—์„œ CTR ๋ฐ ETR๋ณด๋‹ค ์œ ์˜๋ฏธํ•˜๊ฒŒ ์šฐ์ˆ˜ํ–ˆ๋‹ค. AllegroHand์—์„œ๋Š” ์ฐจ์ด๊ฐ€ ๋น„๊ต์  ์ž‘์•˜๋Š”๋ฐ, ์ด๋Š” ์‹œ์Šคํ…œ ํŠน์„ฑ์ƒ bilateral contact regime์ด ๋” ์ž์ฃผ ํ™œ์„ฑํ™”๋  ์ˆ˜ ์žˆ๊ธฐ ๋•Œ๋ฌธ์œผ๋กœ ์ถ”์ธก๋œ๋‹ค.
    • Trust region radius(r)์™€ MPC rollout horizon(H)์— ๋Œ€ํ•œ ์‹คํ—˜(Figure 10): ์ ์ ˆํ•œ r๊ณผ H์—์„œ ์„ฑ๋Šฅ์ด ์ตœ์ ํ™”๋˜๋ฉฐ, ๋„ˆ๋ฌด ์ž‘์€ r์€ ๋„๋‹ฌ ๊ฐ€๋Šฅ์„ฑ์„ ์ œํ•œํ•˜๊ณ  ๋„ˆ๋ฌด ํฐ r์€ ์„ ํ˜• ๊ทผ์‚ฌ์˜ ๋ถ€์ •ํ™•์„ฑ์œผ๋กœ ์ธํ•ด ์„ฑ๋Šฅ ์ €ํ•˜๋ฅผ ์•ผ๊ธฐํ•œ๋‹ค.
  2. 2์ฐจ ๋™์—ญํ•™ ํ•˜์—์„œ์˜ ์•ˆ์ •ํ™” ์„ฑ๋Šฅ (Section 6):
    • CQDC ๋™์—ญํ•™ ๋ชจ๋ธ๊ณผ ์‹ค์ œ ๋ฌผ๋ฆฌ(Drake ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๋ฐ ํ•˜๋“œ์›จ์–ด) ๊ฐ„์˜ ์ฐจ์ด(ํŠนํžˆ hydroplaning)๋ฅผ ๊ณ ๋ คํ•œ ์•ˆ์ •ํ™” ์„ฑ๋Šฅ ํ‰๊ฐ€.
    • Algorithm 3์„ ์ œ์•ˆ: MPC ๊ณ„ํš์„ ์—ฌ๋Ÿฌ ๋ฌผ๋ฆฌ ์Šคํ…์— ๊ฑธ์ณ ์‹คํ–‰ํ•˜๊ณ , ์žฌ๊ณ„ํš ์‹œ ํ˜„์žฌ ๋กœ๋ด‡ ์ƒํƒœ์— ๋Œ€ํ•ด ์ดˆ๊ธฐ ์ถ”์ธก ํœด๋ฆฌ์Šคํ‹ฑ์„ ๋‹ค์‹œ ์ ์šฉํ•˜์—ฌ ์ ‘์ด‰ ์œ ์ง€๋ฅผ ๊ฐ•ํ™” (MPCProj).
    • Open-loop, No Heuristics, Closed-loop ์„ธ ๊ฐ€์ง€ ์•Œ๊ณ ๋ฆฌ์ฆ˜ ๋ณ€ํ˜• ํ‰๊ฐ€.
    • ๊ฒฐ๊ณผ(Figure 11, Table 4):
      • Closed-loop MPC๋Š” Open-loop๋ณด๋‹ค ํ›จ์”ฌ ์šฐ์ˆ˜ํ•œ ์„ฑ๋Šฅ์„ ๋ณด์ด๋ฉฐ, ์ ‘์ด‰ ๋™์—ญํ•™ ๋ชจ๋ธ์˜ ๋ถ€์ •ํ™•์„ฑ์—๋„ ๋ถˆ๊ตฌํ•˜๊ณ  ํ”ผ๋“œ๋ฐฑ์ด ์ค‘์š”ํ•จ์„ ์‹œ์‚ฌํ•œ๋‹ค.
      • ์ดˆ๊ธฐ ์ถ”์ธก ํœด๋ฆฌ์Šคํ‹ฑ ์ ์šฉ(Closed-loop vs. No Heuristics): ํ‰๊ท  ์˜ค๋ฅ˜ ๊ฐ์†Œ ํšจ๊ณผ๋Š” ์ž‘์ง€๋งŒ, ์ ‘์ด‰ ์†์‹ค๋กœ ์ธํ•œ ํฐ ์˜ค๋ฅ˜ ๋ฐœ์ƒ ๋นˆ๋„๋ฅผ ์œ ์˜๋ฏธํ•˜๊ฒŒ ์ค„์˜€๋‹ค (Figure 11 histogram). ํœด๋ฆฌ์Šคํ‹ฑ ์ ์šฉ์€ ๋กœ๋ด‡ ๊ฒฝ๋กœ ๊ธธ์ด๋ฅผ ๋‹จ์ถ•์‹œํ‚ค๋Š” ํšจ๊ณผ๋„ ์žˆ์—ˆ๋‹ค (Figure 12).
      • IiwaBimanual๊ณผ AllegroHand ๋น„๊ต: AllegroHand ํƒœ์Šคํฌ(in-hand manipulation)์˜ ๋ณธ์งˆ์ ์ธ ์–ด๋ ค์›€(๋ฏธ๋„๋Ÿฌ์ง)์œผ๋กœ ์ธํ•ด IiwaBimanual๋ณด๋‹ค ํ‰๊ท  ์˜ค๋ฅ˜๊ฐ€ ์ปธ๋‹ค.
      • ํ•˜๋“œ์›จ์–ด ์‹คํ—˜: ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๊ฒฐ๊ณผ์™€ ์œ ์‚ฌํ•œ ์„ฑ๋Šฅ์„ ๋ณด์˜€๋‹ค (Table 4).

์ „์—ญ ๊ณ„ํš (Global Planning)

์ง€์—ญ MPC๋Š” ๋น„ํƒ์š•์  ์›€์ง์ž„์ด ํ•„์š”ํ•œ ์ „์—ญ ๋ชฉํ‘œ ๋‹ฌ์„ฑ์— ํ•œ๊ณ„๊ฐ€ ์žˆ๋‹ค. ์ด๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด ๋ณธ ๋…ผ๋ฌธ์€ ์ง€์—ญ MPC์˜ ์žฅ์ ์„ ํ™œ์šฉํ•˜๋Š” ๋กœ๋“œ๋งต(Roadmap) ๊ธฐ๋ฐ˜ ์ „์—ญ ๊ณ„ํš ๋ฐฉ๋ฒ•์„ ์ œ์•ˆํ•œ๋‹ค.

  1. ๋ชฉํ‘œ ์ƒํƒœ ๊ธฐ๋ฐ˜ ์ ‘์ด‰ ์„ค์ • ์ƒ์„ฑ (Section 7):
    • ์ฃผ์–ด์ง„ ๊ฐ์ฒด ์ƒํƒœ(q_o)์™€ ๋ชฉํ‘œ(q_{og})์— ๋Œ€ํ•ด, ์ง€์—ญ MPC๊ฐ€ ํšจ์œจ์ ์œผ๋กœ ๋ชฉํ‘œ์— ๋„๋‹ฌํ•˜๋„๋ก ์œ ๋ฆฌํ•œ ๋กœ๋ด‡ ์„ค์ •(q_a)์„ ์ฐพ๋Š” ๋ฌธ์ œ ์ •์˜.
    • ์ตœ์ ํ™” ๋ฌธ์ œ์˜ ๋น„์šฉ ํ•จ์ˆ˜๋Š” ์ง€์—ญ MPC์˜ ์œ ํ•œ ์‹œ๊ฐ„ ๊ฐ€์น˜ ํ•จ์ˆ˜(V)์™€ ๊ฐ•๊ฑด์„ฑ(robustness) regularizer(r)๋ฅผ ์กฐํ•ฉํ•œ๋‹ค. r์€ RA-CTR ๊ธฐ๋ฐ˜ wrench set์˜ ์ตœ๋Œ€ ๋‚ด์ ‘๊ตฌ ๋ฐ˜๊ฒฝ์œผ๋กœ ์ •์˜๋˜๋ฉฐ, ์ด ์„ค์ •์—์„œ ๋กœ๋ด‡์ด ๊ฐ์ฒด์— ์–ผ๋งˆ๋‚˜ ํฐ ์™ธ๋ž€์„ ๊ฒฌ๋”œ ์ˆ˜ ์žˆ๋Š”์ง€๋ฅผ ๋‚˜ํƒ€๋‚ธ๋‹ค. ๋น„์šฉ ํ•จ์ˆ˜๋Š” C(q_a; q_o, q_{og}) = V(q_a; q_o, q_{og}) - \alpha r(q_a; q_o)^2 ํ˜•ํƒœ์ด๋‹ค.
    • ์ด ๋ฌธ์ œ๋Š” ๋น„๋ณผ๋กํ•˜๋ฉฐ ๊ธฐ์šธ๊ธฐ ๊ณ„์‚ฐ์ด ์–ด๋ ต๊ธฐ ๋•Œ๋ฌธ์— ์ƒ˜ํ”Œ๋ง ๊ธฐ๋ฐ˜ ์ตœ์ ํ™” ํœด๋ฆฌ์Šคํ‹ฑ์œผ๋กœ ํ•ด๊ฒฐํ•œ๋‹ค. AllegroHand์™€ ๊ฐ™์€ ๊ณ ์ฐจ์› ๋กœ๋ด‡์˜ ๊ฒฝ์šฐ, reduced-order model (4๊ฐœ์˜ sphere)์„ ์‚ฌ์šฉํ•˜๊ณ  ๊ทธ ํ•ด๋ฅผ ์—ญ๊ธฐ๊ตฌํ•™(IK)์œผ๋กœ ๋กœ๋ด‡ ์„ค์ •์— ๋งคํ•‘ํ•˜๋Š” ํœด๋ฆฌ์Šคํ‹ฑ์„ ๋„์ž…ํ•œ๋‹ค.
    • ๊ฒฐ๊ณผ(Figure 18, Table 6): AllegroHand์—์„œ ์ง๊ด€์ ์ด๊ณ  ๋ชฉํ‘œ ๋‹ฌ์„ฑ์— ํšจ๊ณผ์ ์ธ ์ดˆ๊ธฐ ๋กœ๋ด‡ ์„ค์ •๋“ค์„ ์ฐพ์•˜์œผ๋ฉฐ, MPC ๋กค์•„์›ƒ ๊ฒฐ๊ณผ 10mm ์ด๋‚ด์˜ ์œ„์น˜ ์˜ค๋ฅ˜์™€ 30mrad ์ด๋‚ด์˜ ํšŒ์ „ ์˜ค๋ฅ˜๋ฅผ ๋‹ฌ์„ฑํ–ˆ๋‹ค.
  2. ๋กœ๋“œ๋งต ๊ธฐ๋ฐ˜ ์ „์—ญ ๊ณ„ํš (Section 8):
    • ์˜คํ”„๋ผ์ธ ๋‹จ๊ณ„(Algorithm 4): ์ž‘์—… ๊ณต๊ฐ„์„ ์ถฉ๋ถ„ํžˆ ์ปค๋ฒ„ํ•˜๋Š” ์•ˆ์ •์ ์ธ ๊ฐ์ฒด ์„ค์ •๋“ค์— ํ•ด๋‹นํ•˜๋Š” ์ ‘์ด‰ ์„ค์ •๋“ค์„ ๋กœ๋“œ๋งต์˜ ์ •์ (vertices)์œผ๋กœ ์ƒ์„ฑํ•œ๋‹ค. ๊ฐ ์ •์  ์Œ์— ๋Œ€ํ•ด ์ง€์—ญ MPC(๊ฐ์ฒด ๋ชฉํ‘œ ๋„๋‹ฌ)์™€ ์ถฉ๋Œ ํšŒํ”ผ ๊ณ„ํš(๋กœ๋ด‡ ์žฌ๋ฐฐ์น˜)์„ ์ˆœ์ฐจ์ ์œผ๋กœ ์ ์šฉํ•˜์—ฌ ์ „์ด๊ฐ€ ์„ฑ๊ณตํ•˜๋ฉด ์—์ง€(edge)๋ฅผ ์ถ”๊ฐ€ํ•œ๋‹ค (Figure 19). AllegroHand์˜ ๊ฒฝ์šฐ ๊ฐ์ฒด์˜ ๋Œ€์นญ์„ฑ์„ ํ™œ์šฉํ•˜์—ฌ ๋กœ๋“œ๋งต ๊ตฌ์ถ•์„ ํšจ์œจํ™”ํ–ˆ์œผ๋ฉฐ, ํ‘œ์ค€ ๋…ธํŠธ๋ถ CPU๋งŒ์œผ๋กœ 10๋ถ„ ์ด๋‚ด์— ๋กœ๋“œ๋งต ๊ตฌ์ถ•์ด ๊ฐ€๋Šฅํ•˜๋‹ค. ํ•˜๋“œ์›จ์–ด์—์„œ 150ํšŒ ์—ฐ์† ์—์ง€ ์ „์ด์— ์„ฑ๊ณตํ•˜๋ฉฐ ๋กœ๋“œ๋งต์˜ ๊ฐ•๊ฑด์„ฑ์„ ํ™•์ธํ–ˆ๋‹ค.
    • ์˜จ๋ผ์ธ ๋‹จ๊ณ„: ์ž„์˜์˜ ์‹œ์ž‘ ์„ค์ •์—์„œ ์ž„์˜์˜ ๋ชฉํ‘œ ๊ฐ์ฒด ์„ค์ •๊นŒ์ง€์˜ ๊ณ„ํš์€, ์‹œ์ž‘/๋ชฉํ‘œ๋ฅผ ๋กœ๋“œ๋งต์˜ ๊ฐ€์žฅ ๊ฐ€๊นŒ์šด ์ •์ ์— ์—ฐ๊ฒฐํ•œ ํ›„ ๊ทธ๋ž˜ํ”„ ์ƒ์—์„œ ์ตœ๋‹จ ๊ฒฝ๋กœ๋ฅผ ํƒ์ƒ‰ํ•˜๋Š” ๋ฐฉ์‹์œผ๋กœ ์ˆ˜ํ–‰๋œ๋‹ค (Figure 20).

๊ฒฐ๋ก  (Conclusion)

๋ณธ ๋…ผ๋ฌธ์€ Contact Trust Region(CTR) ๊ฐœ๋…์„ ํ†ตํ•ด ์ ‘์ด‰์˜ ๋น„๋Œ€์นญ์„ฑ์„ ๊ณ ๋ คํ•œ ์ง€์—ญ์  ๋™์—ญํ•™ ๊ทผ์‚ฌ๋ฅผ ์ œ๊ณตํ•˜๊ณ , ์ด๋ฅผ ํ™œ์šฉํ•˜์—ฌ ํšจ์œจ์ ์ธ MPC ๊ธฐ๋ฐ˜ ์ง€์—ญ ๊ณ„ํš ๋ฐ ์ œ์–ด ๋ฐฉ๋ฒ•์„ ๊ฐœ๋ฐœํ•œ๋‹ค. ๋˜ํ•œ, ์ ‘์ด‰ ์„ค์ • ์ƒ์„ฑ ๋ฐ ๋กœ๋“œ๋งต ๊ธฐ๋ฒ•์„ ํ†ตํ•ด ์ „์—ญ์ ์ธ ์ ‘์ด‰์ด ๋งŽ์€ ์กฐ์ž‘ ๊ณ„ํš ๋Šฅ๋ ฅ์„ ๊ตฌํ˜„ํ–ˆ๋‹ค. ์ œ์•ˆ๋œ ๋ฐฉ๋ฒ•์€ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๋ฐ ์‹ค์ œ ํ•˜๋“œ์›จ์–ด ์‹คํ—˜์„ ํ†ตํ•ด ๊ทธ ์„ฑ๋Šฅ๊ณผ ๊ณ„์‚ฐ ํšจ์œจ์„ฑ์„ ์ž…์ฆํ–ˆ๋‹ค. ํŠนํžˆ ์‹ฌ์ธต ๊ฐ•ํ™” ํ•™์Šต(deep RL) ๊ธฐ๋ฐ˜ ์ ‘๊ทผ ๋ฐฉ์‹์— ๋น„ํ•ด ํ˜„์ €ํžˆ ๋‚ฎ์€ ๊ณ„์‚ฐ ์‹œ๊ฐ„์œผ๋กœ ๋ชฉํ‘œ ๋‹ฌ์„ฑ์ด ๊ฐ€๋Šฅํ•จ์„ ๋ณด์—ฌ์ค€๋‹ค.

ํ•˜์ง€๋งŒ ์—ฌ์ „ํžˆ ํ•ด๊ฒฐํ•ด์•ผ ํ•  ๊ณผ์ œ๋“ค์ด ๋‚จ์•„์žˆ๋‹ค. ํŠน์ • ๊ณ„ํš ์‹คํŒจ์˜ ์›์ธ, IiwaBimanual๊ณผ AllegroHand ๊ฐ„ feasibility constraint์˜ ์—ญํ•  ์ฐจ์ด์— ๋Œ€ํ•œ ๊นŠ์€ ์ดํ•ด, ๊ทธ๋ฆฌ๊ณ  CQDC์˜ hydroplaning๊ณผ ๊ฐ™์€ ๋ชจ๋ธ-ํ˜„์‹ค ๋ฌผ๋ฆฌ ๊ฐ„์˜ ์ฐจ์ด๋ฅผ ๊ทน๋ณตํ•˜๊ณ  ์ ‘์ด‰์„ ๊ฐ•๊ฑดํ•˜๊ฒŒ ์œ ์ง€ํ•˜๋Š” ๋ฌธ์ œ ๋“ฑ์€ ํ–ฅํ›„ ์—ฐ๊ตฌ๊ฐ€ ํ•„์š”ํ•˜๋‹ค. ๊ทธ๋Ÿผ์—๋„ ๋ถˆ๊ตฌํ•˜๊ณ  ๋ณธ ๋…ผ๋ฌธ์—์„œ ์ œ์‹œ๋œ CTR, MPC, ์ ‘์ด‰ ์„ค์ • ์ƒ์„ฑ, ๋กœ๋“œ๋งต ๊ธฐ๋ฒ•์€ ์ ‘์ด‰์ด ๋งŽ์€ ๋กœ๋ด‡ ์กฐ์ž‘ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•œ ์ƒˆ๋กœ์šด ๊ฐ•๋ ฅํ•œ ๋„๊ตฌ๋“ค์„ ์ œ๊ณตํ•œ๋‹ค.

๐Ÿ”” Ring Review

๐Ÿ”” Ring โ€” An idea that echoes. Grasp the core and its value.

Copyright 2026, JungYeon Lee