Curieux.JY
  • JungYeon Lee
  • Post
  • Note

On this page

  • ๐Ÿ” Ping Review
  • ๐Ÿ”” Ring Review

๐Ÿ“ƒDIGIT ๋ฆฌ๋ทฐ

tactile
visuo-tactile
sim-to-real
A Novel Design for a Low-Cost Compact High-Resolution Tactile Sensor with Application to In-Hand Manipulation
Published

March 15, 2026

  • Paper Link
  • Project Link
  1. ๐Ÿค– DIGIT์€ ๊ธฐ์กด vision-based tactile sensor์˜ ์ œ์•ฝ์„ ๊ฐœ์„ ํ•˜์—ฌ ์†Œํ˜•ํ™”, ๋‚ด๊ตฌ์„ฑ ๊ฐ•ํ™”, ์ €๋น„์šฉ ๋Œ€๋Ÿ‰ ์ƒ์‚ฐ์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•œ ์ƒˆ๋กœ์šด ๊ณ ํ•ด์ƒ๋„ ์ด‰๊ฐ ์„ผ์„œ์ž…๋‹ˆ๋‹ค.
  2. ๐Ÿฆพ ์ด ๋…ผ๋ฌธ์€ DIGIT ์„ผ์„œ๋ฅผ Allegro hand์— ์žฅ์ฐฉํ•˜์—ฌ ๋ณต์žกํ•œ in-hand marble manipulation ์ž‘์—…์„ ์ˆ˜ํ–‰ํ•˜๋ฉฐ, tactile-MPC์™€ ํšจ์œจ์ ์ธ Struct-NN ๊ธฐ๋ฐ˜์˜ ๋™์—ญํ•™ ๋ชจ๋ธ ํ•™์Šต์„ ํ†ตํ•ด ์ด๋ฅผ ์ œ์–ดํ•ฉ๋‹ˆ๋‹ค.
  3. ๐Ÿš€ ํ•™์Šต๋œ ๋ชจ๋ธ ๊ธฐ๋ฐ˜ ์ œ์–ด๊ธฐ๋Š” ์ˆ˜๋™์œผ๋กœ ํŠœ๋‹๋œ ์ปจํŠธ๋กค๋Ÿฌ๋ณด๋‹ค ์šฐ์ˆ˜ํ•œ ์„ฑ๋Šฅ์„ ๋ณด์—ฌ์ฃผ๋ฉฐ, DIGIT์˜ ์˜คํ”ˆ ์†Œ์Šค ๋””์ž์ธ์€ ๋กœ๋ด‡ ์ปค๋ฎค๋‹ˆํ‹ฐ์—์„œ ์ด‰๊ฐ ์„ผ์„œ์˜ ๊ด‘๋ฒ”์œ„ํ•œ ์ฑ„ํƒ์„ ์ด‰์ง„ํ•  ๊ฒƒ์œผ๋กœ ๊ธฐ๋Œ€๋ฉ๋‹ˆ๋‹ค.

๐Ÿ” Ping Review

๐Ÿ” Ping โ€” A light tap on the surface. Get the gist in seconds.

์ด ๋…ผ๋ฌธ์€ ๋กœ๋ด‡ ๊ณตํ•™์—์„œ In-Hand Manipulation์˜ ์˜ค๋žœ ๋‚œ์ œ ์ค‘ ํ•˜๋‚˜์ธ ์ •๋ฐ€ํ•œ ์ ‘์ด‰๋ ฅ ๊ฐ์ง€๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด ์ €๋น„์šฉ, ์†Œํ˜•, ๊ณ ํ•ด์ƒ๋„ ์ด‰๊ฐ ์„ผ์„œ์ธ DIGIT๋ฅผ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. DIGIT๋Š” ๊ธฐ์กด vision-based tactile sensor์˜ ์žฅ์ ์„ ์œ ์ง€ํ•˜๋ฉด์„œ ์†Œํ˜•ํ™”, ์ œ์กฐ ๊ณต์ • ๊ฐ„์†Œํ™”, ์‹ ๋ขฐ์„ฑ ํ–ฅ์ƒ์„ ํ†ตํ•ด ๋‹จ์ ์„ ๊ฐœ์„ ํ–ˆ์Šต๋‹ˆ๋‹ค.

I. DIGIT ์„ผ์„œ ๋””์ž์ธ

DIGIT๋Š” 20mm x 27mm x 18mm ํฌ๊ธฐ์™€ ์•ฝ 20g์˜ ๋ฌด๊ฒŒ๋ฅผ ๊ฐ€์ง„ ์†Œํ˜• ์„ผ์„œ๋กœ, ๋‹ค์ง€ํ˜• ๋กœ๋ด‡ ํ•ธ๋“œ(์˜ˆ: Allegro hand)์— ์žฅ์ฐฉํ•˜๊ธฐ ์ ํ•ฉํ•ฉ๋‹ˆ๋‹ค. ์ฃผ์š” ๊ฐœ์„  ์‚ฌํ•ญ์€ ๋‹ค์Œ๊ณผ ๊ฐ™์Šต๋‹ˆ๋‹ค.

  1. ์†Œํ˜•ํ™” ๋ฐ ๋ชจ๋“ˆํ™”: ๊ธฐ์กด GelSight ์„ผ์„œ์— ๋น„ํ•ด ํฌ๊ธฐ๊ฐ€ ๋Œ€ํญ ์ค„์—ˆ์œผ๋ฉฐ, โ€˜press fitโ€™ ์—ฐ๊ฒฐ ๋ฐฉ์‹์„ ์‚ฌ์šฉํ•˜์—ฌ elastomer, housing, camera ๋“ฑ ๊ฐœ๋ณ„ ๊ตฌ์„ฑ ์š”์†Œ๋ฅผ ์‰ฝ๊ฒŒ ๊ต์ฒดํ•  ์ˆ˜ ์žˆ๋Š” modular ๋””์ž์ธ์„ ์ฑ„ํƒํ–ˆ์Šต๋‹ˆ๋‹ค. ์ด๋Š” ์†์ƒ๋œ ๋ถ€ํ’ˆ ๊ต์ฒด ๋ฐ ๋‹ค์–‘ํ•œ task-specific elastomer ์‚ฌ์šฉ์„ ์šฉ์ดํ•˜๊ฒŒ ํ•ฉ๋‹ˆ๋‹ค.
  2. ์ €๋น„์šฉ ๋ฐ ์ œ์กฐ ์šฉ์ด์„ฑ: 3D ํ”„๋ฆฐํŒ… ๋˜๋Š” injection mold์— ์ ํ•ฉํ•œ ํ”Œ๋ผ์Šคํ‹ฑ multi-body housing ๋””์ž์ธ์„ ์‚ฌ์šฉํ•˜๋ฉฐ, ์ƒ์—…์šฉ ๋ถ€ํ’ˆ(commercial off-the-shelf components)์„ ์ ๊ทน ํ™œ์šฉํ•˜์—ฌ ๋Œ€๋Ÿ‰ ์ƒ์‚ฐ ์‹œ ๊ฐœ๋‹น ์•ฝ $15์˜ ๋‚ฎ์€ ์ œ์กฐ ๋น„์šฉ์„ ๋‹ฌ์„ฑํ–ˆ์Šต๋‹ˆ๋‹ค.
  3. ํ–ฅ์ƒ๋œ ๊ธฐ๊ณ„์  ์‹ ๋ขฐ์„ฑ ๋ฐ ๋‚ด๊ตฌ์„ฑ: ์ ‘์ด‰๋ฉด์— ์‚ฌ์šฉ๋˜๋Š” elastomer์˜ ๋‚ด๊ตฌ์„ฑ์„ ๊ฐœ์„ ํ–ˆ์Šต๋‹ˆ๋‹ค. ์ƒˆ๋กœ์šด ์ œ์กฐ ๊ณต์ •๊ณผ Smooth-On Solaris ์‹ค๋ฆฌ์ฝ˜์„ ์‚ฌ์šฉํ•˜์—ฌ image transfer layer์˜ ๋งˆ๋ชจ๋ฅผ ์ค„์˜€์Šต๋‹ˆ๋‹ค. abrasion test ๊ฒฐ๊ณผ, DIGIT์˜ elastomer๋Š” ๋‹ค๋ฅธ GelSight ๊ณ„์—ด elastomer์— ๋น„ํ•ด ํ›จ์”ฌ ๋‚ฎ์€ ๋งˆ๋ชจ๋„๋ฅผ ๋ณด์—ฌ์ฃผ๋ฉฐ, ์ด๋Š” ์„ผ์„œ์˜ ์ˆ˜๋ช…๊ณผ ์‹ ๋ขฐ์„ฑ์„ ํฌ๊ฒŒ ํ–ฅ์ƒ์‹œํ‚ต๋‹ˆ๋‹ค.
  4. ๋งž์ถคํ˜• ์ „์ž ํšŒ๋กœ: ์นด๋ฉ”๋ผ ํŠน์„ฑ, ์กฐ๋ช… ๋ฐ ๋น„๋””์˜ค ์บก์ฒ˜๋ฅผ ์ œ์–ดํ•˜๊ธฐ ์œ„ํ•ด custom-designed electronics๋ฅผ ์‚ฌ์šฉํ–ˆ์Šต๋‹ˆ๋‹ค. Omnivision OVM7692 CMOS ์นด๋ฉ”๋ผ(60fps)์™€ SuperSpeed USB 3.0 ํ—ˆ๋ธŒ๋ฅผ ์—ฐ๊ฒฐํ•˜๋Š” custom PCB๋ฅผ ํฌํ•จํ•˜๋ฉฐ, RGB LEDs๋ฅผ ํ†ตํ•ด ์กฐ๋ช… ๊ฐ•๋„๋ฅผ ์กฐ์ ˆํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

II. In-Hand Manipulation ์‘์šฉ ๋ฐ ํ•™์Šต ๋ฐฉ๋ฒ•๋ก 

DIGIT ์„ผ์„œ์˜ ์„ฑ๋Šฅ์„ ์ž…์ฆํ•˜๊ธฐ ์œ„ํ•ด Allegro hand์— DIGIT๋ฅผ ์žฅ์ฐฉํ•˜์—ฌ ์œ ๋ฆฌ ๊ตฌ์Šฌ(marble)์„ ์ •๋ฐ€ํ•˜๊ฒŒ in-hand manipulationํ•˜๋Š” ์ž‘์—…์„ ์ˆ˜ํ–‰ํ–ˆ์Šต๋‹ˆ๋‹ค. ์ด๋Š” ๊ตฌ์Šฌ์˜ ๋ฏธ๋„๋Ÿฌ์ง(slipping)๊ณผ ํšŒ์ „(rolling) ์—ญํ•™์„ ์„ฌ์„ธํ•˜๊ฒŒ ์ œ์–ดํ•ด์•ผ ํ•˜๋Š” ๊ณ ๋‚œ์ด๋„ task์ž…๋‹ˆ๋‹ค.

  1. ์ž์œจ ํ•™์Šต ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘: Allegro hand๊ฐ€ ๊ตฌ์Šฌ์„ ์ง‘์€ ํ›„, ์†๊ฐ€๋ฝ์„ ๋ฌด์ž‘์œ„๋กœ ์›€์ง์—ฌ ์•ฝ 10์ดˆ ๋™์•ˆ 4800๊ฐœ์˜ trial ๋ฐ์ดํ„ฐ๋ฅผ ์ž์œจ์ ์œผ๋กœ ์ˆ˜์ง‘ํ–ˆ์Šต๋‹ˆ๋‹ค. ์ด ๊ณผ์ •์—์„œ ๋‘ DIGIT ์„ผ์„œ์˜ ๋น„๋””์˜ค, 8๊ฐœ์˜ joint servo์˜ ๊ฐ๋„ ์œ„์น˜ (j), ๊ทธ๋ฆฌ๊ณ  joint angular displacement command (a)๊ฐ€ ๊ธฐ๋ก๋˜์—ˆ์Šต๋‹ˆ๋‹ค.
  2. Tactile Predictive Model (์ด‰๊ฐ ์˜ˆ์ธก ๋ชจ๋ธ): ๊ณ ํ•ด์ƒ๋„ ์ด‰๊ฐ ์ด๋ฏธ์ง€๋ผ๋Š” ๊ณ ์ฐจ์› ์ž…๋ ฅ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด Structural VRNN ์•„ํ‚คํ…์ฒ˜์—์„œ ์˜๊ฐ์„ ๋ฐ›์€ compactํ•˜๊ณ  ์ €์ฐจ์›์ ์ธ โ€˜keypointโ€™ ํ‘œํ˜„์„ ํ•™์Šตํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ์‚ฌ์šฉํ–ˆ์Šต๋‹ˆ๋‹ค.
    • Keypoint Autoencoder: ์ž…๋ ฅ ์ด๋ฏธ์ง€๋ฅผ (x, y, i) ํ˜•ํƒœ์˜ keypoint๋กœ ์ธ์ฝ”๋”ฉํ•˜๊ณ  ์ด๋ฅผ ๋‹ค์‹œ ์ด๋ฏธ์ง€๋กœ ์žฌ๊ตฌ์„ฑํ•˜๋Š” autoencoder๋ฅผ ํ›ˆ๋ จ์‹œ์ผฐ์Šต๋‹ˆ๋‹ค. ์—ฌ๊ธฐ์„œ (x, y)๋Š” ๊ตฌ์Šฌ์˜ 2D ์œ„์น˜๋ฅผ ๋‚˜ํƒ€๋‚ด๊ณ , (i)๋Š” ๊ตฌ์Šฌ์ด elastomer์— ๋ˆŒ๋ฆฌ๋Š” ๊นŠ์ด, ์ฆ‰ ์••๋ ฅ์„ ๋‚˜ํƒ€๋ƒ…๋‹ˆ๋‹ค. ์ด autoencoder๋Š” ResNet-18์„ backbone network๋กœ ์‚ฌ์šฉํ•˜๋ฉฐ L2 image reconstruction error๋ฅผ ํ†ตํ•ด self-supervised ๋ฐฉ์‹์œผ๋กœ ํ•™์Šต๋ฉ๋‹ˆ๋‹ค. ์ด ๊ณผ์ •์„ ํ†ตํ•ด 64x64 raw image๋ฅผ 14์ฐจ์›์˜ compactํ•œ ์ƒํƒœ ํ‘œํ˜„ s = [k_l, k_r, j] (์ขŒ์šฐ DIGIT์˜ keypoint ๋ฐ joint ๊ฐ๋„)๋กœ ์ค„์ผ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
    • Dynamics Model: ํ•™์Šต๋œ keypoint ํ‘œํ˜„์„ ์‚ฌ์šฉํ•˜์—ฌ ๋‹ค์Œ ์ƒํƒœ๋ฅผ ์˜ˆ์ธกํ•˜๋Š” ์‹ ๊ฒฝ๋ง ๋™์—ญํ•™ ๋ชจ๋ธ s' = f(s, a)๋ฅผ ํ›ˆ๋ จ์‹œ์ผฐ์Šต๋‹ˆ๋‹ค. ์ด ๋ชจ๋ธ์€ ๊ฐ„๋‹จํ•œ Multi-Layer Perceptron (MLP)์œผ๋กœ ๊ตฌ์„ฑ๋˜์–ด ์žˆ์Šต๋‹ˆ๋‹ค.
  3. Model Predictive Control (MPC): ํ•™์Šต๋œ ๋™์—ญํ•™ ๋ชจ๋ธ์„ ๊ธฐ๋ฐ˜์œผ๋กœ Cross-Entropy Method (CEM)๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ MPC๋ฅผ ์ˆ˜ํ–‰ํ–ˆ์Šต๋‹ˆ๋‹ค.
    • ๊ณ„์‚ฐ ํšจ์œจ์„ฑ ์ตœ์ ํ™”: ๊ธฐ์กด ์—ฐ๊ตฌ์™€ ๋‹ฌ๋ฆฌ, ๊ณ„ํš(planning)์„ ์ด๋ฏธ์ง€ ๊ณต๊ฐ„์ด ์•„๋‹Œ 14์ฐจ์›์˜ keypoint ๊ณต๊ฐ„์—์„œ ์ง์ ‘ ์ˆ˜ํ–‰ํ•ฉ๋‹ˆ๋‹ค. ์—”์ฝ”๋” ๋„คํŠธ์›Œํฌ(๋ชจ๋ธ์—์„œ ๊ฐ€์žฅ ๊ณ„์‚ฐ ๋น„์šฉ์ด ๋งŽ์ด ๋“œ๋Š” ๋ถ€๋ถ„)๋Š” MPC ๋‹จ๊ณ„ ์‹œ์ž‘ ์‹œ ํ•œ ๋ฒˆ๋งŒ ํ˜ธ์ถœ๋˜๋ฏ€๋กœ, ํ•œ MPC ๋‹จ๊ณ„๋‹น ๊ณ„์‚ฐ ์‹œ๊ฐ„์„ 1.4์ดˆ๋กœ ๋‹จ์ถ•ํ•˜์—ฌ ์‹ค์‹œ๊ฐ„ ์ œ์–ด๋ฅผ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•ฉ๋‹ˆ๋‹ค (CDNA ๋ชจ๋ธ์€ 69์ดˆ ์†Œ์š”).
    • Cost function: ๊ณ„ํš ๊ณผ์ •์—์„œ ๊ฐ action sequence์˜ ๋น„์šฉ์€ ํ˜„์žฌ keypoint ์œ„์น˜์™€ ๋ชฉํ‘œ keypoint ์œ„์น˜ ๊ฐ„์˜ Euclidean distance๋กœ ์ •์˜๋ฉ๋‹ˆ๋‹ค. ์ด๋Š” ๊ตฌ์Šฌ์„ ์›ํ•˜๋Š” (x, y) ์œ„์น˜๋กœ ์ด๋™์‹œํ‚ค๊ณ , ๋„ˆ๋ฌด ์„ธ๊ฒŒ ๋ˆ„๋ฅด๊ฑฐ๋‚˜ ๋–จ์–ด๋œจ๋ฆฌ๋Š” ๊ฒƒ์„ ๋ฐฉ์ง€ํ•ฉ๋‹ˆ๋‹ค.

III. ์‹คํ—˜ ๊ฒฐ๊ณผ

  1. Video Predictive Model ํ‰๊ฐ€: Struct-NN ๋ชจ๋ธ์€ ์ •์„ฑ์ ์œผ๋กœ ์ข‹์€ ์˜ˆ์ธก ์„ฑ๋Šฅ์„ ๋ณด์ด๋ฉฐ (Fig. 7), CDNA ๋ชจ๋ธ์— ๋น„ํ•ด RMSE๋Š” ์•ฝ๊ฐ„ ๋†’์ง€๋งŒ (Table III), ํ›จ์”ฌ ์ ์€ ํŒŒ๋ผ๋ฏธํ„ฐ ์ˆ˜์™€ ์••๋„์ ์œผ๋กœ ๋น ๋ฅธ inference ๋ฐ MPC ๊ณ„์‚ฐ ์†๋„๋ฅผ ๋ณด์—ฌ์ฃผ์—ˆ์Šต๋‹ˆ๋‹ค. ํŠนํžˆ MPC ๋‹จ๊ณ„์—์„œ ์†๋„ ์ฐจ์ด๋Š” ์•ฝ 50๋ฐฐ ์ด์ƒ์œผ๋กœ, ์‹ค์‹œ๊ฐ„ ๋‹ค์ง€ํ˜• ์ œ์–ด์— Struct-NN์˜ ํšจ์œจ์„ฑ์ด ํ•„์ˆ˜์ ์ž„์„ ์ž…์ฆํ–ˆ์Šต๋‹ˆ๋‹ค.
  2. Marble Manipulation ๊ฒฐ๊ณผ: ํ•™์Šต๋œ ๋™์—ญํ•™ ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•œ MPC ์ปจํŠธ๋กค๋Ÿฌ๋Š” ์ˆ˜๋™์œผ๋กœ ํŠœ๋‹๋œ ์„ ํ˜• P controller์— ๋น„ํ•ด ๋ชฉํ‘œ ์œ„์น˜๊นŒ์ง€์˜ Euclidean distance๋ฅผ ์ง€์†์ ์œผ๋กœ ๊ฐ์†Œ์‹œ์ผœ ๋” ๋‚˜์€ ์„ฑ๋Šฅ์„ ๋ณด์˜€์Šต๋‹ˆ๋‹ค (Fig. 8 ์ƒ๋‹จ). ์ด๋Š” learned model์ด complexํ•˜๊ณ  non-linearํ•œ ๊ตฌ์Šฌ์˜ ๋™์—ญํ•™์„ ํšจ๊ณผ์ ์œผ๋กœ ์ œ์–ดํ•จ์„ ๋‚˜ํƒ€๋ƒ…๋‹ˆ๋‹ค. ๋น„๋ก ์•ฝ 25%์˜ trial์—์„œ ๊ตฌ์Šฌ์ด ๋–จ์–ด์ง€๋Š” ํ•œ๊ณ„๊ฐ€ ์žˆ์—ˆ์œผ๋‚˜ (Fig. 8 ํ•˜๋‹จ), ์ด๋Š” task์˜ ๋‚œ์ด๋„์™€ actuation noise ๋ฐ ๊ณ„ํš์˜ ๋ถ€์ •ํ™•์„ฑ ๋•Œ๋ฌธ์œผ๋กœ ๋ถ„์„๋ฉ๋‹ˆ๋‹ค.

๊ฒฐ๋ก : DIGIT๋Š” ๊ณ ํ•ด์ƒ๋„ ์ด‰๊ฐ ๊ฐ์ง€๋ฅผ ์ œ๊ณตํ•˜๋ฉด์„œ ์†Œํ˜•ํ™”, ๋‚ด๊ตฌ์„ฑ, ์ €๋น„์šฉ์„ ๋ชจ๋‘ ๋งŒ์กฑํ•˜๋Š” ํ˜์‹ ์ ์ธ tactile sensor์ž…๋‹ˆ๋‹ค. ์ด ์„ผ์„œ๋ฅผ ํ™œ์šฉํ•˜์—ฌ deep model predictive control ๊ธฐ๋ฐ˜์œผ๋กœ ๋ณต์žกํ•œ in-hand marble manipulation ์ž‘์—…์„ ์„ฑ๊ณต์ ์œผ๋กœ ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ์Œ์„ ์ž…์ฆํ–ˆ์Šต๋‹ˆ๋‹ค. ๋…ผ๋ฌธ ์ €์ž๋“ค์€ DIGIT์˜ ๋””์ž์ธ๊ณผ ์ œ์กฐ ๊ณต์ •์„ www.digit.ml์— open-sourceํ™”ํ•˜์—ฌ ๋กœ๋ด‡ ๊ณตํ•™ ์ปค๋ฎค๋‹ˆํ‹ฐ์˜ ๊ด‘๋ฒ”์œ„ํ•œ ์ฑ„ํƒ์„ ์žฅ๋ คํ•˜๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค. ํ–ฅํ›„ ์—ฐ๊ตฌ๋Š” ์„ผ์„œ์˜ ์ถ”๊ฐ€์ ์ธ ์†Œํ˜•ํ™” ๋ฐ curved, omni-directional sensing field๋ฅผ ๊ฐ€์ง„ ์„ผ์„œ ๋””์ž์ธ์— ์ดˆ์ ์„ ๋งž์ถœ ๊ฒƒ์ž…๋‹ˆ๋‹ค.

๐Ÿ”” Ring Review

๐Ÿ”” Ring โ€” An idea that echoes. Grasp the core and its value.

Copyright 2026, JungYeon Lee