Curieux.JY
  • JungYeon Lee
  • Post
  • Note

On this page

  • ๐Ÿ” Ping Review

๐Ÿ“ƒManipulation by Feel ๋ฆฌ๋ทฐ

tactile
mpc
Touch-Based Control with Deep Predictive Models
Published

March 14, 2026

  • Paper Link
  • Project Link
  • Code Link
    • Refactored Version

์™€ ๊ด€๋ จ์žˆ๋Š” ๋…ผ๋ฌธ. Digit 1๊ฐœ๋กœ๋งŒ ์‹คํ—˜ํ–ˆ์—ˆ์ง€๋งŒ ์ดํ›„ DIGIT ๋…ผ๋ฌธ์—์„œ๋Š” 2๊ฐœ๋กœ ๊ตฌ์Šฌ์„ pinching ํ•˜๋Š” ์‹คํ—˜์„ ์ง„ํ–‰

  1. ๐Ÿค– ๋ณธ ๋…ผ๋ฌธ์€ ๊ณ ํ•ด์ƒ๋„ GelSight ์ด‰๊ฐ ์„ผ์„œ์™€ ๋”ฅ ์˜ˆ์ธก ๋ชจ๋ธ์„ ๊ฒฐํ•ฉํ•˜์—ฌ ์‹œ๊ฐ์ด ๊ฐ€๋ ค์ง„ ์ƒํ™ฉ์—์„œ๋„ ๋กœ๋ด‡์˜ ์—ฐ์†์ ์ธ non-prehensile ์กฐ์ž‘์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•˜๋Š” deep tactile MPC ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค.
  2. ๐Ÿง  ์ด ๋ฐฉ๋ฒ•์€ ๋กœ๋ด‡์ด ๋น„์ง€๋„ ์ž์œจ ์ƒํ˜ธ์ž‘์šฉ์„ ํ†ตํ•ด ์›์‹œ ์ด‰๊ฐ ์„ผ์„œ ์ž…๋ ฅ์—์„œ ๋”ฅ ์˜ˆ์ธก ๋ชจ๋ธ์„ ํ•™์Šตํ•˜๊ณ , ์ด๋ฅผ Model Predictive Control (MPC)์— ํ™œ์šฉํ•˜์—ฌ ์‚ฌ์šฉ์ž๊ฐ€ ์ง€์ •ํ•œ ์ด‰๊ฐ ๋ชฉํ‘œ๋ฅผ ์ง์ ‘ ๋‹ฌ์„ฑํ•ฉ๋‹ˆ๋‹ค.
  3. โœจ ์ œ์•ˆ๋œ deep tactile MPC๋Š” ๊ณต ์žฌ๋ฐฐ์น˜, ์•„๋‚ ๋กœ๊ทธ ์Šคํ‹ฑ ์กฐ์ž‘, 20๋ฉด์ฒด ์ฃผ์‚ฌ์œ„ ๊ตด๋ฆฌ๊ธฐ์™€ ๊ฐ™์€ ์‹ค์ œ ์กฐ์ž‘ ์ž‘์—…์—์„œ ๊ธฐ์กด์˜ ์ˆ˜๋™ ์„ค๊ณ„ baseline๋ณด๋‹ค ๋›ฐ์–ด๋‚œ ์„ฑ๋Šฅ์„ ๋ณด์—ฌ, ์ด‰๊ฐ ๊ธฐ๋ฐ˜ ๋กœ๋ด‡ ์ œ์–ด์˜ ๋†’์€ ์ž ์žฌ๋ ฅ์„ ์ž…์ฆํ•ฉ๋‹ˆ๋‹ค.


๐Ÿ” Ping Review

๐Ÿ” Ping โ€” A light tap on the surface. Get the gist in seconds.

โ€œManipulation by Feel: Touch-Based Control with Deep Predictive Modelsโ€ ๋…ผ๋ฌธ์€ ๊ณ ํ•ด์ƒ๋„ ์ด‰๊ฐ ์„ผ์„œ(GelSight)์™€ ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜ ์˜ˆ์ธก ๋ชจ๋ธ์„ ํ™œ์šฉํ•˜์—ฌ ๋กœ๋ด‡์ด ์ด‰๊ฐ ์ •๋ณด๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ๋ฌผ์ฒด๋ฅผ ์กฐ์ž‘ํ•˜๋Š” ๋ฌธ์ œ๋ฅผ ๋‹ค๋ฃน๋‹ˆ๋‹ค. ํŠนํžˆ ์—ฐ์†์ ์ด๊ณ  ๋น„ํŒŒ์•…(non-prehensile)์ ์ธ ์กฐ์ž‘์—์„œ ์ด‰๊ฐ ์„ผ์‹ฑ์„ ํšจ๊ณผ์ ์œผ๋กœ ํ™œ์šฉํ•˜๋Š” ์–ด๋ ค์›€์„ ํ•ด๊ฒฐํ•˜๊ณ ์ž ํ•ฉ๋‹ˆ๋‹ค.

์ด ์—ฐ๊ตฌ๋Š” ์ธ๊ฐ„์ด ์กฐ์ž‘ ์‹œ ์ด‰๊ฐ์— ํฌ๊ฒŒ ์˜์กดํ•œ๋‹ค๋Š” ์ (์˜ˆ: ์†์ด ๋งˆ๋น„๋˜๋ฉด ์„ฑ๋ƒฅ์„ ์ผœ๊ธฐ ์–ด๋ ค์šด ์‹คํ—˜)์„ ๊ฐ•์กฐํ•˜๋ฉฐ, ๋กœ๋ด‡ ์กฐ์ž‘์—์„œ ์‹œ๊ฐ ์„ผ์‹ฑ์ด ๊ฐ€๋ ค์ง€๋Š” ์ ‘์ด‰ ์ง€์ ์—์„œ ์ด‰๊ฐ์˜ ์ค‘์š”์„ฑ์„ ์—ญ์„คํ•ฉ๋‹ˆ๋‹ค. ๊ธฐ์กด ๋กœ๋ด‡ ์ด‰๊ฐ ์„ผ์‹ฑ์˜ ํ•œ๊ณ„์ (๋‚ฎ์€ ํ•ด์ƒ๋„, ๋ฌผ๋ฆฌ ๋ชจ๋ธ์˜ ์–ด๋ ค์›€, ์ด‰๊ฐ ๋ชฉํ‘œ ์„ค์ •์˜ ๋ชจํ˜ธํ•จ)์„ ์ง€์ ํ•˜๋ฉฐ, GelSight์™€ ๊ฐ™์€ ์ตœ์‹  ์ด‰๊ฐ ์„ผ์‹ฑ ๊ธฐ์ˆ ๊ณผ ๋”ฅ๋Ÿฌ๋‹์˜ ๋ฐœ์ „์ด ์ด ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•  ์ˆ˜ ์žˆ์Œ์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.

ํ•ต์‹ฌ ๋ฐฉ๋ฒ•๋ก ์€ โ€œDeep Tactile MPCโ€ (Model Predictive Control) ํ”„๋ ˆ์ž„์›Œํฌ์ž…๋‹ˆ๋‹ค. ์ด ํ”„๋ ˆ์ž„์›Œํฌ๋Š” ๋กœ๋ด‡์˜ ์ž์œจ์ ์ธ ์ƒํ˜ธ์ž‘์šฉ์„ ํ†ตํ•ด ์ˆ˜์ง‘๋œ ๋ฐ์ดํ„ฐ๋กœ๋ถ€ํ„ฐ ์ด‰๊ฐ ์˜ˆ์ธก ๋ชจ๋ธ์„ ํ•™์Šตํ•ฉ๋‹ˆ๋‹ค.

  1. ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘ (Autonomous Data Collection): ๋กœ๋ด‡์€ ball, analog stick, 20-sided die์™€ ๊ฐ™์€ ๋ฌผ์ฒด์™€ ๋ฌด์ž‘์œ„ ์›€์ง์ž„์„ ํ†ตํ•ด ์ž์œจ์ ์œผ๋กœ ์ƒํ˜ธ์ž‘์šฉํ•˜๋ฉฐ ๋Œ€๋Ÿ‰์˜ ์ด‰๊ฐ ์ด๋ฏธ์ง€ ์‹œํ€€์Šค์™€ ํ•ด๋‹น ์•ก์…˜ ๋ฐ์ดํ„ฐ๋ฅผ ์ˆ˜์ง‘ํ•ฉ๋‹ˆ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด ball repositioning task์—์„œ๋Š” 7400๊ฐœ์˜ ๊ถค์ , analog stick task์—์„œ๋Š” 3000๊ฐœ, die rolling task์—์„œ๋Š” 4500๊ฐœ์˜ ๊ถค์ ์ด ์ˆ˜์ง‘๋˜์—ˆ์Šต๋‹ˆ๋‹ค. ๊ฐ ๊ถค์ ์€ 15~18 ํƒ€์ž„์Šคํ…์œผ๋กœ ๊ตฌ์„ฑ๋˜๋ฉฐ, ์„ผ์„œ์˜ x, y, z ๋ฐฉํ–ฅ์œผ๋กœ ยฑ6.0 mm์˜ ๋ฌด์ž‘์œ„ ์›€์ง์ž„์ด ์ ์šฉ๋ฉ๋‹ˆ๋‹ค. ์•ก์…˜์€ 3 ํƒ€์ž„์Šคํ… ๋™์•ˆ ๋ฐ˜๋ณต๋˜์ง€๋งŒ, ์ด๋ฏธ์ง€๋Š” ๋งค ํƒ€์ž„์Šคํ…๋งˆ๋‹ค ๊ธฐ๋ก๋˜์–ด ๋ถˆ์—ฐ์†์ ์ธ ๋™์—ญํ•™ ํ™˜๊ฒฝ์—์„œ ๋ชจ๋ธ ์˜ˆ์ธก ์ •ํ™•๋„๋ฅผ ๋†’์ž…๋‹ˆ๋‹ค.
  2. ๋”ฅ ์˜ˆ์ธก ๋ชจ๋ธ ํ•™์Šต (Deep Predictive Model Learning): ์ˆ˜์ง‘๋œ ๋ฐ์ดํ„ฐ๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ๋”ฅ ์ˆœํ™˜ ํ•ฉ์„ฑ๊ณฑ ์‹ ๊ฒฝ๋ง(deep recurrent convolutional network)์ด ํ•™์Šต๋ฉ๋‹ˆ๋‹ค. ์ด ๋ชจ๋ธ์€ ํ˜„์žฌ GelSight ์„ผ์„œ ์ด๋ฏธ์ง€(I_0)์™€ ๋ฏธ๋ž˜ ์•ก์…˜ ์‹œํ€€์Šค(a_{1:T})๋ฅผ ์ž…๋ ฅ๋ฐ›์•„ ๋ฏธ๋ž˜์˜ GelSight ์„ผ์„œ ๊ด€์ธก์น˜ ์‹œํ€€์Šค(\hat{I}_{1:T})๋ฅผ ์˜ˆ์ธกํ•ฉ๋‹ˆ๋‹ค. ๋ชจ๋ธ์˜ ๊ตฌ์กฐ๋Š” ๊ธฐ์กด์˜ ๋น„๋””์˜ค ์˜ˆ์ธก ๋ชจ๋ธ(์˜ˆ: [12], [13]์—์„œ ์ œ์•ˆ๋œ ์•„ํ‚คํ…์ฒ˜)์„ ๋”ฐ๋ฅด๋ฉฐ, ์ด๋Š” ์ด‰๊ฐ ๋„๋ฉ”์ธ์—์„œ์˜ ๋™์—ญํ•™ \hat{I}_{1:T} = g(a_{1:T}, I_0)์„ ํ•™์Šตํ•ฉ๋‹ˆ๋‹ค. ์ด ๋ชจ๋ธ์€ raw tactile observation์œผ๋กœ๋ถ€ํ„ฐ ๋ฌผ์ฒด์˜ ์ ‘์ด‰ ํŒจํ„ด ๋ณ€ํ™”๋ฅผ ์ •ํ™•ํ•˜๊ฒŒ ์˜ˆ์ธกํ•ฉ๋‹ˆ๋‹ค.
  3. ์ด‰๊ฐ MPC ์ œ์–ด (Tactile MPC Control): ํ•™์Šต๋œ ์˜ˆ์ธก ๋ชจ๋ธ์€ MPC ํ”„๋ ˆ์ž„์›Œํฌ ๋‚ด์—์„œ ์ œ์–ด์— ํ™œ์šฉ๋ฉ๋‹ˆ๋‹ค. ์‚ฌ์šฉ์ž๋Š” ์›ํ•˜๋Š” ์ด‰๊ฐ ์ƒํƒœ๋ฅผ ๋‚˜ํƒ€๋‚ด๋Š” ๋ชฉํ‘œ ์ด‰๊ฐ ์ด๋ฏธ์ง€(I_g)๋ฅผ ์ง์ ‘ ์ œ๊ณตํ•˜์—ฌ ๋ชฉํ‘œ๋ฅผ ์ง€์ •ํ•ฉ๋‹ˆ๋‹ค.
    • ๊ณ„ํš (Planning): ๊ฐ ์ œ์–ด ์Šคํ…์—์„œ, ์ตœ์ ์˜ ๋ฏธ๋ž˜ ์•ก์…˜ ์‹œํ€€์Šค(a_{1:T})๋ฅผ ์ฐพ๊ธฐ ์œ„ํ•ด ์ตœ์ ํ™” ๋ฌธ์ œ๊ฐ€ ํ•ด๊ฒฐ๋ฉ๋‹ˆ๋‹ค. ์ด ๋ฌธ์ œ์˜ ๋ชฉ์ ์€ ์˜ˆ์ธก๋œ ์ด‰๊ฐ ์ด๋ฏธ์ง€(\hat{I}_t)์™€ ๋ชฉํ‘œ ์ด๋ฏธ์ง€(I_g) ์‚ฌ์ด์˜ ์ฐจ์ด๋ฅผ ์ตœ์†Œํ™”ํ•˜๋Š” ๋น„์šฉ ํ•จ์ˆ˜(c_t(I_g, \hat{I}_t))๋ฅผ ์ค„์ด๋Š” ๊ฒƒ์ž…๋‹ˆ๋‹ค. ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ์ด ๋น„์šฉ ํ•จ์ˆ˜๋กœ ํ”ฝ์…€ ๊ณต๊ฐ„์—์„œ์˜ ํ‰๊ท  ์ œ๊ณฑ ์˜ค์ฐจ(Mean Squared Error, MSE)๋ฅผ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค. ์ตœ์ ํ™” ๊ณต์‹์€ ๋‹ค์Œ๊ณผ ๊ฐ™์Šต๋‹ˆ๋‹ค: a_{1:T} = \arg \min_{a_{1:T}} \sum_{t=1,...,T} c_t(I_g, \hat{I}_t)
    • ์ตœ์ ํ™” ๊ธฐ๋ฒ• (Optimization Method): ์ƒ˜ํ”Œ๋ง ๊ธฐ๋ฐ˜์˜ ์ตœ์ ํ™” ๊ธฐ๋ฒ•์ธ Cross-Entropy Method (CEM) [29]์ด ์‚ฌ์šฉ๋ฉ๋‹ˆ๋‹ค. CEM์€ ๋ฐ˜๋ณต์ ์œผ๋กœ ์•ก์…˜ ์‹œํ€€์Šค๋ฅผ ์ƒ˜ํ”Œ๋งํ•˜๊ณ , ์˜ˆ์ธก ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•˜์—ฌ ๋น„์šฉ์„ ํ‰๊ฐ€ํ•˜๋ฉฐ, ๋” ๋‚˜์€ ์‹œํ€€์Šค๋ฅผ ์ฐพ๊ธฐ ์œ„ํ•ด ํƒ์ƒ‰ ๋ถ„ํฌ๋ฅผ ์ •์ œํ•ฉ๋‹ˆ๋‹ค. ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” CEM์„ 3ํšŒ ๋ฐ˜๋ณตํ•˜๊ณ  ๊ฐ ๋ฐ˜๋ณต์—์„œ 100๊ฐœ์˜ ์ƒ˜ํ”Œ์„ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค. ์˜ˆ์ธก horizon์€ 15~18 ํƒ€์ž„์Šคํ…์ž…๋‹ˆ๋‹ค.
    • ์‹คํ–‰ (Execution): CEM์œผ๋กœ ์ฐพ์€ ์ตœ์ ์˜ ์•ก์…˜ ์‹œํ€€์Šค ์ค‘ ์ฒซ ๋ฒˆ์งธ ์•ก์…˜๋งŒ ๋กœ๋ด‡์— ์ ์šฉ๋ฉ๋‹ˆ๋‹ค.
    • ์žฌ๊ณ„์‚ฐ (Receding Horizon Control): ๋ชจ๋ธ์˜ ๋ถ€์ •ํ™•์„ฑ๊ณผ ํ™˜๊ฒฝ ๋ณ€ํ™”์— ๋Œ€์‘ํ•˜๊ธฐ ์œ„ํ•ด, ์ด ๊ณ„ํš ๊ณผ์ •์€ ๋งค ํƒ€์ž„์Šคํ…๋งˆ๋‹ค ๋ฐ˜๋ณต๋˜์–ด (receding horizon control) ์ œ์–ด์˜ ๊ฐ•๊ฑด์„ฑ์„ ๋†’์ž…๋‹ˆ๋‹ค.

์‹คํ—˜์€ ball repositioning, analog stick deflection, die rolling ์„ธ ๊ฐ€์ง€ ์‹ค์ œ ์ด‰๊ฐ ์กฐ์ž‘ ํƒœ์Šคํฌ์—์„œ ์ˆ˜ํ–‰๋˜์—ˆ์Šต๋‹ˆ๋‹ค. ํ‰๊ฐ€ ์ง€ํ‘œ๋กœ๋Š” ์ตœ์ข… ์ด‰๊ฐ ์ด๋ฏธ์ง€์™€ ๋ชฉํ‘œ ์ด๋ฏธ์ง€ ๊ฐ„์˜ MSE, ๊ทธ๋ฆฌ๊ณ  ์ˆ˜๋™์œผ๋กœ ์ฃผ์„ ์ฒ˜๋ฆฌ๋œ ๋ฌผ์ฒด(pressure centroid)์˜ ํ”ฝ์…€ ๊ณต๊ฐ„ ๊ฑฐ๋ฆฌ(๋ฌผ๋ฆฌ์  ์œ„์น˜๋ฅผ ๋ฐ˜์˜)๊ฐ€ ์‚ฌ์šฉ๋˜์—ˆ์Šต๋‹ˆ๋‹ค. Die rolling ํƒœ์Šคํฌ์˜ ๊ฒฝ์šฐ ์›ํ•˜๋Š” ๋ฉด์ด ์œ„๋กœ ํ–ฅํ•˜๋Š” ์„ฑ๊ณต๋ฅ ๋„ ์ธก์ •๋˜์—ˆ์Šต๋‹ˆ๋‹ค.

์ด ๋ฐฉ๋ฒ•์€ ์ˆ˜๋™์œผ๋กœ ์„ค๊ณ„๋œ ๋ฒ ์ด์Šค๋ผ์ธ(์••๋ ฅ ์ค‘์‹ฌ์„ ๊ฐ์ง€ํ•˜๊ณ  ๋ชฉํ‘œ๋ฅผ ํ–ฅํ•ด ์ง์„ ์œผ๋กœ ์ด๋™ํ•˜๋Š” ๋ฐฉ์‹)๊ณผ ๋น„๊ต๋˜์—ˆ์„ ๋•Œ ๋ชจ๋“  ํƒœ์Šคํฌ์—์„œ ์ƒ๋‹นํ•œ ์„ฑ๋Šฅ ํ–ฅ์ƒ์„ ๋ณด์˜€์Šต๋‹ˆ๋‹ค. Ball repositioning ํƒœ์Šคํฌ์—์„œ ์ด‰๊ฐ MPC๋Š” 2.10mm์˜ ์ค‘์•™๊ฐ’ L2 ๊ฑฐ๋ฆฌ(centroid)๋ฅผ ๋‹ฌ์„ฑํ•œ ๋ฐ˜๋ฉด ๋ฒ ์ด์Šค๋ผ์ธ์€ 2.97mm๋ฅผ ๊ธฐ๋กํ–ˆ์Šต๋‹ˆ๋‹ค. Analog stick ํƒœ์Šคํฌ์—์„œ๋Š” ๊ฐ๊ฐ 5.31mm ๋Œ€ 8.86mm๋ฅผ ๋ณด์˜€์Šต๋‹ˆ๋‹ค. Die rolling ํƒœ์Šคํฌ์—์„œ๋Š” ์ด‰๊ฐ MPC๊ฐ€ 86.6%์˜ ์„ฑ๊ณต๋ฅ ์„ ๋ณด์ด๋ฉฐ ๋ฒ ์ด์Šค๋ผ์ธ์˜ 46.6%๋ฅผ ํฌ๊ฒŒ ์•ž์„ฐ์Šต๋‹ˆ๋‹ค. ์ด๋Š” die rolling๊ณผ ๊ฐ™์ด ๋ณต์žกํ•œ ๋™์—ญํ•™(๋ฏธ๋„๋Ÿฌ์ง, ๊ตฌ๋ฆ„ ๋“ฑ)์„ ํฌํ•จํ•˜๋Š” ํƒœ์Šคํฌ์—์„œ ์‹ฌ์ธต ๋™์—ญํ•™ ๋ชจ๋ธ์˜ ๊ฐ•์ ์„ ์ž…์ฆํ•ฉ๋‹ˆ๋‹ค.

๋…ผ๋ฌธ์€ ์ด ๋ฐฉ๋ฒ•์ด ์ด‰๊ฐ๋งŒ์œผ๋กœ ๋‹ค์–‘ํ•œ ์กฐ์ž‘ ํƒœ์Šคํฌ๋ฅผ ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ์Œ์„ ๋ณด์—ฌ์ฃผ์ง€๋งŒ, ๋‹จ๊ธฐ horizon ์ œ์–ด์— ํ•œ์ •๋œ๋‹ค๋Š” ์ ๊ณผ ๋‹จ์ผ ์†๊ฐ€๋ฝ ์กฐ์ž‘์˜ ํ•œ๊ณ„๋ฅผ ์–ธ๊ธ‰ํ•ฉ๋‹ˆ๋‹ค. ํ–ฅํ›„ ์—ฐ๊ตฌ ๋ฐฉํ–ฅ์œผ๋กœ๋Š” ๋น„๋””์˜ค ์˜ˆ์ธก ๋ชจ๋ธ์˜ ๋ฐœ์ „๊ณผ ํ•จ๊ป˜ ๋‹ค์ง€(multi-fingered) ๋กœ๋ด‡ ํ•ธ๋“œ์— ์ด‰๊ฐ ์„ผ์„œ๋ฅผ ํ†ตํ•ฉํ•˜์—ฌ ๋” ๋ณต์žกํ•œ ์ธ-ํ•ธ๋“œ(in-hand) ์กฐ์ž‘ ๋ฐ ์กฐ๋ฆฝ ๊ธฐ์ˆ ์„ ๊ตฌํ˜„ํ•˜๋Š” ๊ฒƒ์„ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค.

Copyright 2026, JungYeon Lee