Curieux.JY
  • JungYeon Lee
  • Post
  • Lecture
  • Note

On this page

  • ๐Ÿ” Ping Review
  • ๐Ÿ”” Ring Review

๐Ÿ“ƒDenseTact

tactile
vision-based
fisheye
shape-reconstruction
Optical Tactile Sensor for Dense Shape Reconstruction
Published

May 27, 2026

  • Paper
  1. DenseTact๋Š” ์ €๋ ดํ•˜๊ณ  ํ™•์žฅ ๊ฐ€๋Šฅํ•˜๋ฉฐ ์ปดํŒฉํŠธํ•œ ๋น„์ „ ๊ธฐ๋ฐ˜ ์ด‰๊ฐ ์„ผ์„œ๋กœ, ๋‚ด๋ถ€ ์นด๋ฉ”๋ผ ์ด๋ฏธ์ง€๋กœ๋ถ€ํ„ฐ ๊ตฌํ˜• ์—˜๋ผ์Šคํ† ๋จธ ํ‘œ๋ฉด์˜ ๊ณ ํ•ด์ƒ๋„ 3D ํ˜•์ƒ ์žฌ๊ตฌ์„ฑ์„ ๋ชฉํ‘œ๋กœ ํ•ฉ๋‹ˆ๋‹ค.
  2. ์ด ์„ผ์„œ๋Š” ๋”ฅ ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜์˜ ์ธ์ฝ”๋”-๋””์ฝ”๋” ๋„คํŠธ์›Œํฌ๋ฅผ ํ™œ์šฉํ•˜์—ฌ ํ‘œ๋ฉด ๋ณ€ํ˜•์„ ์‹ค์‹œ๊ฐ„(18ms)์œผ๋กœ ์ถ”์ •ํ•˜๋ฉฐ, 3D ํ”„๋ฆฐํŒ…๋œ ๊ต์ • ์˜ค๋ธŒ์ ํŠธ์™€ Ray Casting ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ํ†ตํ•ด ์ •๋ฐ€ํ•œ ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ๋ฅผ ๊ตฌ์ถ•ํ•ฉ๋‹ˆ๋‹ค.
  3. DenseTact๋Š” ํ‰๊ท  0.28mm์˜ ๊นŠ์ด ์ถ”์ • ์˜ค์ฐจ์™€ ์šฐ์ˆ˜ํ•œ ๋‚ด๊ตฌ์„ฑ์„ ํ†ตํ•ด ๋กœ๋ด‡์˜ ๋ฌผ์ฒด ์กฐ์ž‘ ๋ฐ ์ธํ•ธ๋“œ(in-hand) ์œ„์น˜ ์ถ”์ •์— ํ•„์š”ํ•œ ๊ณ ํ•ด์ƒ๋„ ์ด‰๊ฐ ์ •๋ณด๋ฅผ ํšจ์œจ์ ์œผ๋กœ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ” Ping Review

๐Ÿ” Ping โ€” A light tap on the surface. Get the gist in seconds.

DenseTact๋Š” ๋กœ๋ด‡ ์กฐ์ž‘ ์ž‘์—…์—์„œ ์ด‰๊ฐ ๊ฐ์ง€์˜ ์„ฑ๋Šฅ์„ ํ–ฅ์ƒ์‹œํ‚ค๊ธฐ ์œ„ํ•ด ๊ณ ํ•ด์ƒ๋„ ํ‘œ๋ฉด ์žฌ๊ตฌ์„ฑ์„ ์œ„ํ•œ ๊ด‘ํ•™ ์ด‰๊ฐ ์„ผ์„œ์ž…๋‹ˆ๋‹ค. ์ด ์„ผ์„œ๋Š” ์ €๋ ดํ•˜๊ณ , ํ™•์žฅ ๊ฐ€๋Šฅํ•˜๋ฉฐ, ์ฝคํŒฉํŠธํ•œ ๋””์ž์ธ์„ ๊ฐ€์ง€๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค. ๊ธฐ์กด์˜ ๋น„์ „ ๊ธฐ๋ฐ˜ ์ด‰๊ฐ ์„ผ์„œ๋Š” ๊ณ ํ•ด์ƒ๋„๋ฅผ ์ œ๊ณตํ•˜์ง€๋งŒ, ๋‚ฎ์€ ์ •ํ™•๋„, ๋†’์€ ๋น„์šฉ, 2D ํ˜•์ƒ ์ œํ•œ ๋“ฑ์˜ ํ•œ๊ณ„๊ฐ€ ์žˆ์—ˆ์Šต๋‹ˆ๋‹ค. DenseTact๋Š” ์ด๋Ÿฌํ•œ ๋ฌธ์ œ์ ์„ ํ•ด๊ฒฐํ•˜๋ฉฐ, ํŠนํžˆ 3D ํ˜•์ƒ ์„ผ์„œ๋กœ์„œ ์ •๋ฐ€ํ•œ ์ธํ•ธ๋“œ ์กฐ์ž‘(in-hand manipulation)์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•ฉ๋‹ˆ๋‹ค.

II. ๊ด€๋ จ ์—ฐ๊ตฌ (Related Works)

๊ธฐ์กด ์ด‰๊ฐ ์„ผ์„œ๋“ค์€ ์••์ „(piezoelectric), ๊ด‘ํ•™(optics), ์ €ํ•ญ(resistance), ์šฉ๋Ÿ‰(capacity) ๋“ฑ ๋‹ค์–‘ํ•œ ์ ‘๊ทผ ๋ฐฉ์‹์„ ์‚ฌ์šฉํ•ด์™”์Šต๋‹ˆ๋‹ค. ์ตœ๊ทผ์—๋Š” ๊ณ ํ•ด์ƒ๋„ ํŠน์„ฑ ๋•Œ๋ฌธ์— ๋น„์ „ ๊ธฐ๋ฐ˜ ์ด‰๊ฐ ์„ผ์„œ๊ฐ€ ์ธ๊ธฐ๋ฅผ ์–ป๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค. Gelsight, Gelslim, DIGIT์™€ ๊ฐ™์€ ์„ผ์„œ๋“ค์€ ๊ณ ํ•ด์ƒ๋„๋ฅผ ์ œ๊ณตํ•˜์ง€๋งŒ, ํ‰ํ‰ํ•œ ํ‘œ๋ฉด์œผ๋กœ ์ธํ•ด ์กฐ์ž‘ ์ž‘์—…์— ์ œ์•ฝ์ด ์žˆ์Šต๋‹ˆ๋‹ค. Omnitact์™€ ๊ฐ™์€ 3D ๊ณก๋ฉด ์„ผ์„œ๋Š” ๋‹ค๋ฐฉํ–ฅ ๊ฐ์ง€๊ฐ€ ๊ฐ€๋Šฅํ•˜์ง€๋งŒ ๋น„์šฉ์ด ๋งŽ์ด ๋“ญ๋‹ˆ๋‹ค. DenseTact๋Š” ์ด๋Ÿฌํ•œ ํ•œ๊ณ„๋ฅผ ๊ทน๋ณตํ•˜๊ธฐ ์œ„ํ•ด ๋น„์šฉ ํšจ์œจ์ ์ด๊ณ  3D ํ˜•์ƒ์„ ๊ฐ€์ง€๋ฉฐ ๊ณ ํ•ด์ƒ๋„ ๊ฐ์ง€ ํ”ผ๋“œ๋ฐฑ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.

III. DenseTact ์„ผ์„œ ์„ค๊ณ„ (DenseTact Sensor Design)

DenseTact๋Š” ์ธํ•ธ๋“œ ์†Œํ˜• ๊ฐ์ฒด ์กฐ์ž‘์— ์œ ์šฉํ•œ ์ž‘์€ ์„ผ์„œ ํฌ๊ธฐ, ๋‹ค๋ชฉ์  ์กฐ์ž‘์„ ์œ„ํ•œ ๋งค์šฐ ๋ถ€๋“œ๋Ÿฌ์šด 3D ๊ณก๋ฉด, ๊ทธ๋ฆฌ๊ณ  ์ ‘์ด‰ ๊ฐ์ง€๋ฅผ ์œ„ํ•œ ๊ณ ํ•ด์ƒ๋„ ํ‘œ๋ฉด ๋ณ€ํ˜• ๋ชจ๋ธ๋ง(ํ˜•์ƒ ์žฌ๊ตฌ์„ฑ)์„ ๋ชฉํ‘œ๋กœ ์„ค๊ณ„๋˜์—ˆ์Šต๋‹ˆ๋‹ค.

  1. ์—˜๋ผ์Šคํ† ๋จธ ์ œ์ž‘ (Elastomer Fabrication):
    • ๋ฐ˜๊ตฌํ˜•(hemispherical) ํˆฌ๋ช… ์—˜๋ผ์Šคํ† ๋จธ(Silicone Inc. P-565 Platinum Clear Silicone, 20:1 ๋น„์œจ, 6.5 Shore A ๊ฒฝ๋„)๋ฅผ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค. ์ด ๊ฒฝ๋„๋Š” ์‚ฌ๋žŒ ํ”ผ๋ถ€์™€ ์œ ์‚ฌํ•˜์—ฌ ์ž‘์€ ์ „๋‹จ๋ ฅ์—๋„ ํฐ ํ‘œ๋ฉด ๋ณ€ํ˜•์„ ํ—ˆ์šฉํ•ฉ๋‹ˆ๋‹ค.
    • ์—˜๋ผ์Šคํ† ๋จธ์˜ ์ ‘์ด‰ ๊ฒฝ๊ณ„๋ฉด์—๋Š” ๋ฐ˜์‚ฌ ์ฝ”ํŒ…(reflective coating)์ด ๋˜์–ด ์žˆ์–ด, ๋‹จ์ผ ์นด๋ฉ”๋ผ๋กœ ๋‚ด๋ถ€ ๋ณ€ํ˜•์„ ๊ด€์ฐฐํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
    • Inhibit Xโ„ข๋ฅผ ์ ‘์ฐฉ์ œ๋กœ ์‚ฌ์šฉํ•˜์—ฌ ๋ฐ˜์‚ฌ์„ฑ ๊ธˆ์† ์ž‰ํฌ(reflective metallic ink)์™€ ์‹ค๋ฆฌ์ฝ˜ ํ˜ผํ•ฉ๋ฌผ(Smooth-on Psycho Paintโ„ข)์„ ์—์–ด๋ธŒ๋Ÿฌ์‹ฑํ•˜์—ฌ ํ‘œ๋ฉด์„ ์ฝ”ํŒ…ํ•ฉ๋‹ˆ๋‹ค.
  2. ์นด๋ฉ”๋ผ ๋ฐ ์กฐ๋ช… ์‹œ์Šคํ…œ (Camera and Illumination system):
    • ์†Œ๋‹ˆ IMX179 ์ด๋ฏธ์ง€ ์„ผ์„œ(8MP, 30fps)๋ฅผ ํƒ‘์žฌํ•œ ์ €๋น„์šฉ ์นด๋ฉ”๋ผ๋ฅผ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค.
    • 185ยฐ FoV(์‹œ์•ผ๊ฐ)์˜ ์–ด์•ˆ ๋ Œ์ฆˆ(fisheye lens)๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ๋ฐ˜๊ตฌํ˜• ์—˜๋ผ์Šคํ† ๋จธ ์ „์ฒด๋ฅผ ์ปค๋ฒ„ํ•ฉ๋‹ˆ๋‹ค.
    • ์œ ์—ฐํ•œ PCB(flexible PCB)์— 24๊ฐœ์˜ RGB LED๊ฐ€ ์žฅ์ฐฉ๋œ LED ์ŠคํŠธ๋ฆฝ์ด ์„ผ์„œ ๋‚ด๋ถ€์— ์›ํ†ตํ˜•์œผ๋กœ ๋ฐฐ์น˜๋˜์–ด ์กฐ๋ช…์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค. ์ด ์กฐ๋ช… ์ „๋žต์€ ํ‘œ๋ฉด์˜ ์˜ค๋ชฉํ•œ ๋ถ€๋ถ„์ด ์ƒ‰์ƒ ํŒจํ„ด์„ ๋ฐฉ์ถœํ•˜๋„๋ก ํ•˜์—ฌ ํ‘œ๋ฉด ํ˜•์ƒ๊ณผ ์ƒ‰์ƒ ์ฑ„๋„ ๋ฐ˜์‚ฌ์œจ ๊ฐ„์˜ ์ƒ๊ด€๊ด€๊ณ„๋ฅผ ๋‚˜ํƒ€๋ƒ…๋‹ˆ๋‹ค.
  3. ์„ผ์„œ ์กฐ๋ฆฝ (Sensor Assembly):
    • ์กฐ๋ฆฝ ์‹œ ์นด๋ฉ”๋ผ ๋ Œ์ฆˆ์˜ ์ค‘์‹ฌ์ด ๋ฐ˜๊ตฌํ˜• ์—˜๋ผ์Šคํ† ๋จธ์˜ ์ค‘์‹ฌ๊ณผ ์ผ์น˜ํ•˜๋„๋ก ์ •๋ ฌ๋ฉ๋‹ˆ๋‹ค.
    • LED ์ŠคํŠธ๋ฆฝ์€ ์—˜๋ผ์Šคํ† ๋จธ ๋ฐ”๋กœ ์•„๋ž˜์— ์œ„์น˜ํ•ฉ๋‹ˆ๋‹ค.
    • 3D ํ”„๋ฆฐํŒ…๋œ ์นด๋ฉ”๋ผ ๋งˆ์šดํŠธ๊ฐ€ ์นด๋ฉ”๋ผ์™€ ์—˜๋ผ์Šคํ† ๋จธ๋ฅผ ๊ณ ์ •ํ•˜๋ฉฐ, LED ์ŠคํŠธ๋ฆฝ์€ ์ด ๋งˆ์šดํŠธ ๋‚ด๋ถ€์— ์žฅ์ฐฉ๋ฉ๋‹ˆ๋‹ค.
    • ๋†’์ด๋Š” 35mm, ๋ฐ˜๊ตฌํ˜• ์—˜๋ผ์Šคํ† ๋จธ์˜ ๋ฐ˜๊ฒฝ์€ 25mm์ž…๋‹ˆ๋‹ค.
    • ์ „์ฒด ์„ผ์„œ ๋น„์šฉ์€ 80๋‹ฌ๋Ÿฌ ๋ฏธ๋งŒ์œผ๋กœ, ์นด๋ฉ”๋ผ ์‹œ์Šคํ…œ(70๋‹ฌ๋Ÿฌ)์ด ๋Œ€๋ถ€๋ถ„์„ ์ฐจ์ง€ํ•ฉ๋‹ˆ๋‹ค.

IV. ํ˜•์ƒ ์žฌ๊ตฌ์„ฑ (Shape Reconstruction)

DenseTact๋Š” ๋‹จ์ผ ์ด๋ฏธ์ง€๋กœ๋ถ€ํ„ฐ ์„ผ์„œ ํ‘œ๋ฉด์˜ ๊ณ ํ•ด์ƒ๋„ ํ‘œํ˜„์„ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. ๊ธฐ์กด GelSight์™€ ์œ ์‚ฌํ•œ ์„ผ์„œ๋“ค์€ ๋žจ๋ฒ„์‹œ์•ˆ ํ‘œ๋ฉด(Lambertian surface) ๊ฐ€์ •์„ ํ†ตํ•ด ๊ฐ ํ”ฝ์…€์˜ ๊ฐ•๋„(intensity)๋ฅผ ํ‘œ๋ฉด ๋ฒ•์„ (surface normal)๊ณผ ์—ฐ๊ด€์‹œ์ผฐ์Šต๋‹ˆ๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ DenseTact์™€ ๊ฐ™์ด 3D ํ˜•์ƒ์ด๋ฉฐ ๋น„๋žจ๋ฒ„์‹œ์•ˆ(non-Lambertian) ํ‘œ๋ฉด์„ ๊ฐ€์ง„ ์„ผ์„œ์—๋Š” ์ด ๋ฐฉ๋ฒ•์ด ์ ์šฉ๋˜์ง€ ์•Š์Šต๋‹ˆ๋‹ค.

์ด ๊ฒฝ์šฐ, ๊ฐ ํ”ฝ์…€ (u,v)์˜ ๊ฐ•๋„ I(u,v)๋Š” ํ‘œ๋ฉด ๋ฒ•์„  \frac{\partial f}{\partial u}(u,v), \frac{\partial f}{\partial v}(u,v) ๋ฟ๋งŒ ์•„๋‹ˆ๋ผ ํ”ฝ์…€์˜ ์œ„์น˜ (u,v)์—๋„ ์˜์กดํ•˜๋Š” ๋น„์„ ํ˜• ํ•จ์ˆ˜ R์ด ๋ฉ๋‹ˆ๋‹ค. I(u, v) = R(\frac{\partial f}{\partial u} (u, v), \frac{\partial f}{\partial v} (u, v), u, v) (1)

๋”ฐ๋ผ์„œ ๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜ ์ ‘๊ทผ ๋ฐฉ์‹์„ ํ†ตํ•ด ์ด๋ฏธ์ง€ ํ”ฝ์…€์˜ RGB ๊ฐ’ I_{rgb}(u,v)๋กœ๋ถ€ํ„ฐ ์„ผ์„œ ํ‘œ๋ฉด์˜ ํ•ด๋‹น ๊ตฌํ˜• ์ขŒํ‘œ (R, \theta, \psi)๋ฅผ ์ถ”์ •ํ•˜๋Š” ๋น„์„ ํ˜• ํ•จ์ˆ˜ M์„ ํ•™์Šตํ•ด์•ผ ํ•ฉ๋‹ˆ๋‹ค: (R, \theta, \psi) = M(I_{rgb}(u, v)) (2)

A. ๊นŠ์ด ๋ฐ์ดํ„ฐ ์ƒ์„ฑ (Depth Data Generation):

๋ชจ๋ธ ํ•™์Šต์„ ์œ„ํ•ด ์ •ํ™•ํ•œ ๊ณ ํ•ด์ƒ๋„ Ground-Truth ํ‘œ๋ฉด ์ •๋ณด๊ฐ€ ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค. ์ƒ์šฉ ๊ฑฐ๋ฆฌ ์ธก์ • ์„ผ์„œ(range-finding sensors)๋Š” ๋ฐ€๋ฆฌ๋ฏธํ„ฐ ๊ทœ๋ชจ์˜ ์˜ค์ฐจ๋ฅผ ๊ฐ€์ง€๋ฏ€๋กœ, 3D ํ”„๋ฆฐํŒ…๋œ ์•Œ๋ ค์ง„ ์ปดํ“จํ„ฐ ์ƒ์„ฑ ํ‘œ๋ฉด ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•˜์—ฌ Ground-Truth ๋ฐ์ดํ„ฐ๋ฅผ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค.

  • Ground-Truth ์ƒ์„ฑ ๋ฐฉ๋ฒ•: Ultimaker S5 3D ํ”„๋ฆฐํ„ฐ๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ๋‹ค์–‘ํ•œ ํ˜•ํƒœ์˜ โ€œ์ธ๋””์ผ€์ดํ„ฐ(indicator)โ€์™€ โ€œ์ธ๋ดํ„ฐ(indenter)โ€๋ฅผ ํ”„๋ฆฐํŒ…ํ•ฉ๋‹ˆ๋‹ค. ์ธ๋””์ผ€์ดํ„ฐ๋Š” ๊ตฌ๋ฉ์ด ๋šซ๋ฆฐ ๋ฐ˜๊ตฌํ˜• ๋ชจ์–‘์ด๋ฉฐ, ์ธ๋ดํ„ฐ๋Š” ์ด ๊ตฌ๋ฉ์— ์‚ฝ์ž…๋˜์–ด ์„ผ์„œ๋ฅผ ๋ณ€ํ˜•์‹œํ‚ต๋‹ˆ๋‹ค.
    • 37๊ฐœ์˜ ๋‹ค๋ฅธ ์ธ๋””์ผ€์ดํ„ฐ์™€ 25๊ฐœ์˜ ๋‹ค๋ฅธ ์ธ๋ดํ„ฐ๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ๋‹ค์–‘ํ•œ ์ ‘์ด‰ ๊ตฌ์„ฑ์„ ๋งŒ๋“ญ๋‹ˆ๋‹ค.
    • CNC ๋จธ์‹ ์„ ์‚ฌ์šฉํ•˜์—ฌ ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘ ์‹œ B์ถ•(์„ธ๋กœ์ถ•)์œผ๋กœ 0.9ยฐ์”ฉ ์ž๋™ ํšŒ์ „์‹œํ‚ค๊ณ , A, C์ถ•(๊ฐ€๋กœ์ถ•)์€ 45ยฐ์”ฉ ์ˆ˜๋™์œผ๋กœ ์กฐ์ •ํ•˜์—ฌ ๋ฐ์ดํ„ฐ ๋‹ค์–‘์„ฑ์„ ๋†’์ž…๋‹ˆ๋‹ค.
    • CNC ๋จธ์‹ ์€ Z-๋ฐฉํ–ฅ์œผ๋กœ ์„ผ์„œ๋ฅผ ๋ˆ„๋ฅด๋„๋ก ์ •๋ฐ€ํ•˜๊ฒŒ ์ œ์–ด๋ฉ๋‹ˆ๋‹ค.
  • ๋ฐ์ดํ„ฐ ์ •๊ทœํ™”: STL ํŒŒ์ผ๋กœ๋ถ€ํ„ฐ ๋ ˆ์ด ์บ์ŠคํŒ…(ray casting) ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์‚ฌ์šฉํ•˜์—ฌ ๊นŠ์ด ๊ฐ’(radial value)์„ ์–ป์Šต๋‹ˆ๋‹ค. ์ด ๊นŠ์ด ๊ฐ’์„ 8๋น„ํŠธ ์ •์ˆ˜(0-255)๋กœ ์ •๊ทœํ™”ํ•˜์—ฌ ์ถœ๋ ฅ ๊ฐ’์˜ ํฌ๊ธฐ๋ฅผ ์ค„์ž…๋‹ˆ๋‹ค. ์ตœ๋Œ€ ๊นŠ์ด ๋ณ€ํ˜•(9.4mm)์„ ์‚ฌ์šฉํ•˜์—ฌ ์ •๊ทœํ™”ํ•˜๋ฉฐ, 1ํ”ฝ์…€ ๊ฐ•๋„(intensity)๋Š” ์‹ค์ œ ๊นŠ์ด ๊ฐ’์—์„œ 0.0354mm ์ฆ๊ฐ€์— ํ•ด๋‹นํ•ฉ๋‹ˆ๋‹ค.
  • ์ด 30,200๊ฐœ์˜ ๋‹ค๋ฅธ ์ ‘์ด‰ ๊ตฌ์„ฑ(29,200๊ฐœ ํ•™์Šต, 1,000๊ฐœ ํ…Œ์ŠคํŠธ)์„ ์ƒ์„ฑํ•˜๋ฉฐ, ํ…Œ์ŠคํŠธ ์„ธํŠธ์—๋Š” ํ•™์Šต ์„ธํŠธ์™€ ๋‹ค๋ฅธ ์ธ๋””์ผ€์ดํ„ฐ์™€ ์ธ๋ดํ„ฐ ์กฐํ•ฉ์ด ์‚ฌ์šฉ๋ฉ๋‹ˆ๋‹ค. ์ „์ฒด ๋ฐ์ดํ„ฐ์…‹ ํฌ๊ธฐ๋Š” 3.6 GB์ž…๋‹ˆ๋‹ค.

B. ์นด๋ฉ”๋ผ ์ด๋ฏธ์ง€๋กœ๋ถ€ํ„ฐ 3D ๋Œ€์‘์  ์ฐพ๊ธฐ (3D correspondence from camera image):

์–ด์•ˆ ๋ Œ์ฆˆ์˜ ์™œ๊ณก๊ณผ 3D ํ˜•์ƒ ์„ผ์„œ ํ‘œ๋ฉด๊ณผ์˜ ๋Œ€์‘์ (correspondence)์„ ์ฐพ๊ธฐ ์œ„ํ•ด ์ƒˆ๋กœ์šด ์บ˜๋ฆฌ๋ธŒ๋ ˆ์ด์…˜ ๋ฐฉ๋ฒ•์ด ๊ฐœ๋ฐœ๋˜์—ˆ์Šต๋‹ˆ๋‹ค.

  • ๊ต์ • ๊ณผ์ •:
    1. ์•Œ๋ ค์ง„ ํฌ๊ธฐ์˜ 3D ํ”„๋ฆฐํŒ…๋œ ํ†ฑ๋‹ˆ ๋ชจ์–‘(saw-tooth shape) ์ธ๋””์ผ€์ดํ„ฐ๋ฅผ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค.
    2. ์ธ๋””์ผ€์ดํ„ฐ๋ฅผ ์„ผ์„œ์— ๋ฐ€์–ด ๋„ฃ์€ ํ›„, Canny ์—ฃ์ง€ ๊ฒ€์ถœ ๋ฐฉ๋ฒ•์„ ์‚ฌ์šฉํ•˜์—ฌ ์„ผ์„œ ์ด๋ฏธ์ง€์—์„œ ํ†ฑ๋‹ˆ ์—ฃ์ง€๋ฅผ ๊ฐ์ง€ํ•ฉ๋‹ˆ๋‹ค.
    3. ์ด๋ฏธ์ง€์—์„œ ๊ฐ์ง€๋œ ์—ฃ์ง€ ์œ„์น˜๋ฅผ ์„ผ์„œ ํ‘œ๋ฉด์˜ ์—ฃ์ง€ ์œ„์น˜์™€ ๋งค์นญํ•ฉ๋‹ˆ๋‹ค.
  • ๊ฐ€์šฐ์‹œ์•ˆ ํ”„๋กœ์„ธ์Šค(Gaussian Process, GP) ํšŒ๊ท€ ๋ชจ๋ธ: ์ด๋ฏธ์ง€์˜ ์ค‘์‹ฌ์œผ๋กœ๋ถ€ํ„ฐ์˜ ๋ฐ˜์ง€๋ฆ„ r๊ณผ ๋ฐ˜๊ตฌํ˜• ์„ผ์„œ ํ‘œ๋ฉด์˜ ๋ฐ˜์ง€๋ฆ„ R \sin(\theta) ๊ฐ„์˜ ๋Œ€์‘ ๊ด€๊ณ„๋ฅผ ๋ชจ๋ธ๋งํ•ฉ๋‹ˆ๋‹ค. R \sin(\theta) = f_{GP}(r(u, v))
    1. ์ด ๋Œ€์‘ ๊ด€๊ณ„๋ฅผ ํ†ตํ•ด ๊ฐ ์ด๋ฏธ์ง€ ํ”ฝ์…€ (u,v)๋Š” ๊ตฌํ˜• ์ขŒํ‘œ (\theta, \phi)๋กœ ๋ณ€ํ™˜๋ฉ๋‹ˆ๋‹ค. ์—ฌ๊ธฐ์„œ (u_c, v_c)๋Š” ์ด๋ฏธ์ง€ ํ‰๋ฉด์˜ ์ค‘์‹ฌ์ž…๋‹ˆ๋‹ค. (\theta, \phi) = (\sin^{-1}(\frac{f_{GP}(r)}{R}), \tan^{-1}(\frac{v - v_c}{u - u_c}))
  • ๋ ˆ์ด ์บ์ŠคํŒ… ์•Œ๊ณ ๋ฆฌ์ฆ˜ (Ray Casting Algorithm): ๊ฐ ํ”ฝ์…€์— ๋Œ€ํ•œ (\theta, \phi)๊ฐ€ ๊ฒฐ์ •๋˜๋ฉด, STL ํŒŒ์ผ์˜ ์‚ผ๊ฐํ˜• ๋ฉ”์‰ฌ(triangular mesh)์—์„œ ๊ฐ€์žฅ ๊ฐ€๊นŒ์šด ์ ์„ ๊ณ„์‚ฐํ•˜๊ธฐ ์œ„ํ•ด ๋ ˆ์ด ์บ์ŠคํŒ… ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค. R_{ray}(u, v) = f_{raycast}(\text{Mesh}_{stl}, \theta(u, v), \psi(u, v))
    1. ์ด ๊ณผ์ •์„ ํ†ตํ•ด ์ž…๋ ฅ ์ด๋ฏธ์ง€์˜ ๊ฐ ํ”ฝ์…€๊ณผ Ground-Truth ๊นŠ์ด ๊ฐ’ ๊ฐ„์˜ 1:1 ๋Œ€์‘ ๊ด€๊ณ„๊ฐ€ ์„ค์ •๋ฉ๋‹ˆ๋‹ค.

C. ๋ชจ๋ธ๋ง (Modeling):

์ด ๋ฌธ์ œ๋Š” ๋‹จ์ผ ์ด๋ฏธ์ง€ ๊นŠ์ด ์ถ”์ • ๋ฌธ์ œ๋กœ ๊ฐ„์ฃผ๋  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ์ผ๋ฐ˜์ ์ธ ๊นŠ์ด ์ถ”์ • ๋ฌธ์ œ์™€ ๋‹ฌ๋ฆฌ, DenseTact ๋ฐ์ดํ„ฐ์…‹์€ ์ „์—ญ์ ์ธ ์ •๋ณด(global information)๋ณด๋‹ค ๊ตญ๋ถ€์ ์ธ ๋ณ€ํ˜• ์ •๋ณด(local deformation information)์— ๋” ์ค‘์ ์„ ๋‘ก๋‹ˆ๋‹ค.

  • ๋„คํŠธ์›Œํฌ ์•„ํ‚คํ…์ฒ˜: ๊ฐ„๋‹จํ•œ ์ธ์ฝ”๋”-๋””์ฝ”๋”(encoder-decoder) ๊ตฌ์กฐ๋ฅผ ์‚ฌ์šฉํ•˜๋ฉฐ, ์Šคํ‚ต ์ปค๋„ฅ์…˜(skip connection)์„ ํฌํ•จํ•ฉ๋‹ˆ๋‹ค.
    • ์ธ์ฝ”๋”: ์‚ฌ์ „ ํ•™์Šต๋œ DenseNet-161 ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค.
    • ๋””์ฝ”๋”: ์ธ์ฝ”๋”์˜ ๋™์ผํ•œ ํฌ๊ธฐ ๋ธ”๋ก๊ณผ ์ด์ „ ์—…์ƒ˜ํ”Œ๋ง๋œ ๋ธ”๋ก์„ ์—ฐ๊ฒฐํ•˜์—ฌ ๊ตญ๋ถ€ ์ •๋ณด๋ฅผ ํ•™์Šตํ•  ์ˆ˜ ์žˆ๋„๋ก ํ•ฉ๋‹ˆ๋‹ค.
  • ์†์‹ค ํ•จ์ˆ˜ (Loss Function): ๋‹ค์Œ ์„ธ ๊ฐ€์ง€ ์†์‹ค์˜ ์กฐํ•ฉ์„ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค.
    1. ๊นŠ์ด ๊ฐ’์— ๋Œ€ํ•œ ์ ๋Œ€์  L_1 ์†์‹ค (point-wise L_1 loss)
    2. ๊นŠ์ด ์ด๋ฏธ์ง€์˜ ๊ธฐ์šธ๊ธฐ(gradient)์— ๋Œ€ํ•œ L_1 ์†์‹ค
    3. ๊ตฌ์กฐ์  ์œ ์‚ฌ์„ฑ ์†์‹ค (structural similarity loss, SSIM)
  • ํ•™์Šต: ์ด๋ฏธ์ง€๋Š” 570x570x3์—์„œ 640x480x3์œผ๋กœ ํฌ๊ธฐ๊ฐ€ ์กฐ์ •๋œ ํ›„ ๋„คํŠธ์›Œํฌ์— ์ „๋‹ฌ๋˜๋ฉฐ, ์ถœ๋ ฅ ๊ฒฐ๊ณผ๋Š” 320x240์—์„œ 570x570์œผ๋กœ ํฌ๊ธฐ๊ฐ€ ์กฐ์ •๋ฉ๋‹ˆ๋‹ค. ๊นŠ์ด ๊ฐ’(0-255)์€ ํ•™์Šต ํ’ˆ์งˆ์„ ๊ทน๋Œ€ํ™”ํ•˜๊ธฐ ์œ„ํ•ด (10, 1000) ๋ฒ”์œ„๋กœ ์žฌ์กฐ์ •๋ฉ๋‹ˆ๋‹ค. ๋„คํŠธ์›Œํฌ๋Š” NVIDIA P100 16GB GPU์—์„œ 16 ์—ํฌํฌ(460K ์ดํ„ฐ๋ ˆ์ด์…˜), ๋ฐฐ์น˜ ํฌ๊ธฐ 4๋กœ ํ•™์Šต๋˜์—ˆ์Šต๋‹ˆ๋‹ค.

V. ๊ฒฐ๊ณผ ๋ฐ ํ† ๋ก  (Results and Discussion)

  • ์ •์„ฑ์  ๊ฒฐ๊ณผ (Qualitative Results): ๋ชจ๋ธ์€ ๋‹จ์ผ ์ด๋ฏธ์ง€๋กœ๋ถ€ํ„ฐ ์„ผ์„œ ํ˜•์ƒ์„ ์ƒ๋‹นํžˆ ์ž˜ ์žฌ๊ตฌ์„ฑํ•˜๋Š” ๊ฒƒ์œผ๋กœ ๋‚˜ํƒ€๋‚ฌ์Šต๋‹ˆ๋‹ค.
  • ์„ฑ๋Šฅ: ์„ผ์„œ๋Š” ๋‹จ์ผ ์ด๋ฏธ์ง€๋กœ๋ถ€ํ„ฐ ๊นŠ์ด ๋ทฐ๋ฅผ ์˜ˆ์ธกํ•˜๋Š” ๋ฐ ํ‰๊ท  18.17ms๊ฐ€ ์†Œ์š”๋˜์–ด, 30fps์˜ ์‹ค์‹œ๊ฐ„ ์กฐ์ž‘ ์ž‘์—…์„ ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ์˜ˆ์ธก๋œ ๊นŠ์ด ๊ฐ’์„ ์‚ฌ์šฉํ•˜์—ฌ 3D ํฌ์ธํŠธ ํด๋ผ์šฐ๋“œ๊ฐ€ ์žฌ๊ตฌ์„ฑ๋ฉ๋‹ˆ๋‹ค.
  • ์ •๋Ÿ‰์  ๊ฒฐ๊ณผ (Quantitative Results): ํ•™์Šต ์„ธํŠธ์™€ ํ…Œ์ŠคํŠธ ์„ธํŠธ ๊ฐ„์˜ ํฌ์ธํŠธ๋ณ„ L1 ์†์‹ค ๋ฐ L2 ์†์‹ค์ด ํ‰๊ฐ€๋˜์—ˆ์Šต๋‹ˆ๋‹ค.
    • Ground-Truth ์˜ค์ฐจ๋Š” 3D ํ”„๋ฆฐํ„ฐ์˜ ์ •๋ฐ€๋„ ์˜ค์ฐจ๋กœ ์ธํ•ด 109.6 ๋งˆ์ดํฌ๋ก ์ž…๋‹ˆ๋‹ค.
    • ํ•™์Šต ์„ธํŠธ์˜ ํ‰๊ท  L1 ์†์‹ค์€ 0.2381mm, ํ…Œ์ŠคํŠธ ์„ธํŠธ์˜ ํ‰๊ท  L1 ์†์‹ค์€ 0.2811mm์˜€์Šต๋‹ˆ๋‹ค.
    • ํ•™์Šต ์„ธํŠธ์˜ ํ‰๊ท  L2 ์†์‹ค์€ 0.0306mm, ํ…Œ์ŠคํŠธ ์„ธํŠธ์˜ ํ‰๊ท  L2 ์†์‹ค์€ 0.03208mm์˜€์Šต๋‹ˆ๋‹ค.
    • ์ฆ‰, DenseTact ์„ผ์„œ๋Š” ํ‰๊ท  0.28mm์˜ ์ ˆ๋Œ€ ์˜ค์ฐจ๋กœ ํ˜•์ƒ ์žฌ๊ตฌ์„ฑ์„ ์ˆ˜ํ–‰ํ•ฉ๋‹ˆ๋‹ค.
  • ์ž์„ธ ์ถ”์ (Pose Tracking) ํ‰๊ฐ€: ๋‘ ๊ฐœ์˜ DenseTact ์„ผ์„œ๊ฐ€ ์žฅ์ฐฉ๋œ Allegro ํ•ธ๋“œ๋กœ ์•Œ๋ ค์ง„ ๊ตฌํ˜• ๋ฌผ์ฒด๋ฅผ ์ง‘๊ณ  ICP(Iterative Closest Point) ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์‚ฌ์šฉํ•˜์—ฌ ๋ฌผ์ฒด์˜ ์ž์„ธ๋ฅผ ์ถ”์ ํ•˜์—ฌ ์„ผ์„œ๋ฅผ ํ‰๊ฐ€ํ–ˆ์Šต๋‹ˆ๋‹ค. 23๋ฒˆ์˜ ํŒŒ์ง€(grasping) ์‹œ๋„ ํ›„, ํ‰๊ท  ํ”ผํŠธ๋‹ˆ์Šค ์ ์ˆ˜(fitness score)๋Š” 0.597(\sigma = 0.238), ํ‰๊ท  RMS ์˜ค์ฐจ๋Š” 0.037184(\sigma = 0.00276)์˜€์Šต๋‹ˆ๋‹ค. (200ํšŒ ICP ๋ฐ˜๋ณต ํ›„ ํ‰๊ท  RMS ์˜ค์ฐจ๋Š” 0.0211).
  • ๋‚ด๊ตฌ์„ฑ: ์„ผ์„œ๋Š” 30,000ํšŒ ์ด์ƒ์˜ ํ‘ธ์‹œ ๋ฐ ์ธก์ • ํ›„์—๋„ ๋ˆˆ์— ๋„๋Š” ๋ณ€ํ™” ์—†์ด ๋‚ด๊ตฌ์„ฑ์ด ๋›ฐ์–ด๋‚จ์„ ์ž…์ฆํ–ˆ์Šต๋‹ˆ๋‹ค.

VI. ๊ฒฐ๋ก  (Conclusion)

DenseTact๋Š” ์ธ์ฒด๊ณตํ•™์  ๋ฐ˜๊ตฌํ˜• ์„ผ์„œ๋กœ, ์ „์ฒด ์„ผ์„œ ํ‘œ๋ฉด์„ ์žฌ๊ตฌ์„ฑํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ์ด ์„ผ์„œ๋Š” ๋‚ด๊ตฌ์„ฑ์ด ๋›ฐ์–ด๋‚˜๋ฉฐ, ๊ณ ํ•ด์ƒ๋„ ์ ‘์ด‰ ์บ˜๋ฆฌ๋ธŒ๋ ˆ์ด์…˜์„ ํ†ตํ•ด Ground-Truth์˜ ์ƒ๋Œ€์  ์ •ํ™•๋„์™€ ๋ถˆํ™•์‹ค์„ฑ์„ ๊ณ ๋ คํ–ˆ์Šต๋‹ˆ๋‹ค. ์‹ฌ์ธต ์‹ ๊ฒฝ๋ง(deep neural network)์€ ์ž…๋ ฅ ์ด๋ฏธ์ง€๋กœ๋ถ€ํ„ฐ ๊นŠ์ด ๋งต(depth map)์„ ๋ชจ๋ธ๋งํ•˜๋Š” ๋ฐ ํ™œ์šฉ๋˜์—ˆ์œผ๋ฉฐ, ์‚ฌ์ „ ํ•™์Šต๋œ ์ธ์ฝ”๋”-๋””์ฝ”๋” ๊ธฐ๋ฐ˜ ๋„คํŠธ์›Œํฌ๊ฐ€ ํ•™์Šต ๋ฐ์ดํ„ฐ์…‹์„ ํ†ตํ•ด ์ •ํ™•ํ•œ ๊นŠ์ด ์žฌ๊ตฌ์„ฑ์„ ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ์Œ์„ ๋ณด์—ฌ์ฃผ์—ˆ์Šต๋‹ˆ๋‹ค. ์„ผ์„œ๋Š” ํ…Œ์ŠคํŠธ ์„ธํŠธ์—์„œ ํ‰๊ท  0.28mm์˜ ๊นŠ์ด ์ฐจ์ด๋ฅผ ๋‹ฌ์„ฑํ–ˆ์Šต๋‹ˆ๋‹ค. ํ–ฅํ›„ ์—ฐ๊ตฌ์—๋Š” ๋‹ค์–‘ํ•œ LED ๊ตฌ์„ฑ ํ™œ์šฉ์„ ํ†ตํ•œ ์ •ํ™•๋„ ํ–ฅ์ƒ, ํฌ๊ธฐ ํ™•์žฅ์„ฑ ๋ฐ ๋‹ค์–‘ํ•œ ์„ผ์„œ ํ˜•์ƒ์— ๋Œ€ํ•œ ์ ์‘์„ฑ, DenseTact ์„ผ์„œ์— ํŠนํ™”๋œ ๋„คํŠธ์›Œํฌ ๊ตฌ์กฐ ๊ตฌํ˜„, ๊ทธ๋ฆฌ๊ณ  ์žฌ๊ตฌ์„ฑ๋œ ํ˜•์ƒ๊ณผ ํ•จ๊ป˜ ์—˜๋ผ์Šคํ† ๋จธ ๋ณ€ํ˜• ๊ธฐ๋ฐ˜์˜ ์ ‘์ด‰๋ ฅ ๋ถ„ํฌ ์ถ”์ถœ ๋“ฑ์ด ํฌํ•จ๋  ๊ฒƒ์ž…๋‹ˆ๋‹ค.

๐Ÿ”” Ring Review

๐Ÿ”” Ring โ€” An idea that echoes. Grasp the core and its value.

Copyright 2026, JungYeon Lee