Curieux.JY
  • JungYeon Lee
  • Post
  • ๐Ÿ•ธ๏ธ Graph
  • Lecture
  • Note

On this page

  • ๐Ÿ” Ping Review
  • ๐Ÿ”” Ring Review
    • ์„œ๋ก : ์™œ โ€œ์‹œ๊ฐ ์—†๋Š” ์กฐ์ž‘โ€์ธ๊ฐ€
    • ์‹œ์Šคํ…œ ๊ตฌ์„ฑ
    • ๋ฐฉ๋ฒ•: ๊ฐ•ํ™”ํ•™์Šต ํŒŒ์ดํ”„๋ผ์ธ
      • ์„ธ ๊ฐ€์ง€ ์ผ์ƒํ˜• ๊ณผ์ œ
      • 2๋‹จ๊ณ„ ๋ณด์ƒ ํ•จ์ˆ˜
      • ๋น„๋Œ€์นญ ๊ด€์ธก PPO
    • ์‹คํ—˜
      • ์ ˆ์ œ ์‹คํ—˜ (Ablation)
      • ๊ด€์ฐฐ
    • ๋น„ํŒ์  ๊ณ ์ฐฐ
    • ์š”์•ฝ ๋ฐ ๊ฒฐ๋ก 
    • ์ฐธ๊ณ  ์ •๋ณด

๐Ÿ“ƒDexTouch ๋ฆฌ๋ทฐ

tactile
dexterity
rl
sim2real
DexTouch: Learning to Seek and Manipulate Objects with Tactile Dexterity
Published

May 17, 2026

  • Paper Link
  • Project Page
  • ๊ฒŒ์žฌ: IEEE Robotics and Automation Letters (RA-L), 2024 โ€” DOI 10.1109/LRA.2024.3478571
  1. ๐Ÿค– DexTouch๋Š” ์‹œ๊ฐ ์ •๋ณด ์—†์ด ์ด‰๊ฐ๋งŒ์œผ๋กœ ์–ด๋‘  ์†์— ๋ฌด์ž‘์œ„๋กœ ๋†“์ธ ๋ฌผ์ฒด๋ฅผ ํƒ์ƒ‰ํ•˜๊ณ  ์กฐ์ž‘ํ•˜๋Š” ๋‹ค์ง€(multi-finger) ๋กœ๋ด‡ ์‹œ์Šคํ…œ์„ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค.
  2. โœจ UR5e ํŒ” + AllegroHand(์ด 22-DoF)์— 16๊ฐœ์˜ ์ด์ง„(binary) ์ ‘์ด‰ ์„ผ์„œ๋ฅผ ๋ถ€์ฐฉํ•˜๊ณ , ์‹œ๋ฎฌ๋ ˆ์ด์…˜์—์„œ ๊ฐ•ํ™”ํ•™์Šต์œผ๋กœ ํ•™์Šตํ•œ ์ •์ฑ…์„ ๋ฏธ์„ธ์กฐ์ • ์—†์ด ์‹ค์ œ ๋กœ๋ด‡์œผ๋กœ ์ „์ดํ•ฉ๋‹ˆ๋‹ค.
  3. ๐Ÿš€ ๋ฌผ์ฒด ์žก๊ธฐยท๋ฌธ ์—ด๊ธฐยท๋ฐธ๋ธŒ ๋Œ๋ฆฌ๊ธฐ ์„ธ ๊ฐ€์ง€ ์ผ์ƒํ˜• ๊ณผ์ œ์—์„œ, ์ด‰๊ฐ ์„ผ์„œ์˜ ๋ฏผ๊ฐ๋„ยท๋ถ€์ฐฉ ์œ„์น˜๊ฐ€ ์‹œ๊ฐ ์—†๋Š” ์กฐ์ž‘ ์„ฑ๋Šฅ์— ๊ฒฐ์ •์ ์ž„์„ ์‹คํ—˜์œผ๋กœ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.

๐Ÿ” Ping Review

๐Ÿ” Ping โ€” A light tap on the surface. Get the gist in seconds.

DexTouch๋Š” ์‹œ๊ฐ ์—†๋Š” ์กฐ์ž‘(blind manipulation) ๋ฌธ์ œ๋ฅผ ์ •๋ฉด์œผ๋กœ ๋‹ค๋ฃน๋‹ˆ๋‹ค. ์‚ฌ๋žŒ์€ ์–ด๋‘์šด ๋ฐฉ์—์„œ๋„ ์†์„ ๋”๋“ฌ์–ด ๋ฌผ๊ฑด์„ ์ฐพ๊ณ  ๋‹ค๋ฃฐ ์ˆ˜ ์žˆ๋Š”๋ฐ, ์ด๋Š” ์ด‰๊ฐ์ด ์‹œ๊ฐ์˜ ๊ณต๋ฐฑ์„ ๋ฉ”์šฐ๊ธฐ ๋•Œ๋ฌธ์ž…๋‹ˆ๋‹ค. ์ €์ž๋“ค์€ ์ด ๋Šฅ๋ ฅ์„ ๋กœ๋ด‡์— ๋ถ€์—ฌํ•˜๊ธฐ ์œ„ํ•ด, 6-DoF UR5e + 16-DoF AllegroHand๋กœ ๊ตฌ์„ฑ๋œ 22-DoF ์‹œ์Šคํ…œ์— ์†๊ฐ€๋ฝ๋งˆ๋‹ค 3๊ฐœ์”ฉยท์†๋ฐ”๋‹ฅ์— 4๊ฐœ, ์ด 16๊ฐœ์˜ FSR(force-sensitive resistor) ์ ‘์ด‰ ์„ผ์„œ๋ฅผ ๋ถ€์ฐฉํ–ˆ์Šต๋‹ˆ๋‹ค.

๋ฌธ์ œ ์ •์˜: ์‹œ๊ฐ์ด ์ œํ•œ๋˜๋Š” ์ƒํ™ฉ(์–ด๋‘ , ๊ฐ€๋ฆผ, ์„ผ์„œ ๊ณ ์žฅ)์—์„œ๋„ ๋กœ๋ด‡์ด ๋ฌด์ž‘์œ„๋กœ ๋ฐฐ์น˜๋œ ๋ฌผ์ฒด๋ฅผ ์ด‰๊ฐ๋งŒ์œผ๋กœ ํƒ์ƒ‰ยท์กฐ์ž‘ํ•  ์ˆ˜ ์žˆ๋Š”๊ฐ€? ๋‹จ, ์ž‘์—…์˜ ๋Œ€๋žต์ ์ธ ์‚ฌ์ „ ์ •๋ณด(์˜ˆ: ํ…Œ์ด๋ธ” ์œ„์— ๋ฌผ์ฒด๊ฐ€ ์žˆ๋‹ค, ๋ฌธ ์†์žก์ด์˜ ๋Œ€๋žต์  ์œ„์น˜)๋Š” ์ฃผ์–ด์ง‘๋‹ˆ๋‹ค.

ํ•ต์‹ฌ ๋ฐฉ๋ฒ•:

  • ์ด์ง„ ์ ‘์ด‰ ์‹ ํ˜ธ. ์‹œ๋ฎฌ๋ ˆ์ด์…˜์—์„œ ๊ฐ ์„ผ์„œ์˜ ์ ‘์ด‰๋ ฅ \|F\|๋ฅผ ๊ณ„์‚ฐํ•˜๊ณ  ์ž„๊ณ„๊ฐ’ \tilde{\theta}_{th}=0.01\,\mathrm{N}๋กœ ์ด์ง„ํ™”ํ•˜์—ฌ, ์‹ค์ œ FSR ์„ผ์„œ์˜ on/off ์‹ ํ˜ธ์™€ ๋™์ผํ•œ ํ˜•ํƒœ๋กœ ๋งŒ๋“ญ๋‹ˆ๋‹ค. ์ด ๋‹จ์ˆœํ•œ ํ‘œํ˜„ ๋•๋ถ„์— sim-to-real ๊ฒฉ์ฐจ๊ฐ€ ์ค„์–ด๋“ญ๋‹ˆ๋‹ค.
  • 2๋‹จ๊ณ„ ๋ณด์ƒ ์„ค๊ณ„. ๋ณด์ƒ์„ ๋„๋‹ฌ(reach) ๊ณผ ์‹คํ–‰(execute) ๋‘ ๊ตญ๋ฉด์œผ๋กœ ๋ถ„ํ•ดํ•ฉ๋‹ˆ๋‹ค. ์†๋๊ณผ ๋ชฉํ‘œ ๋ฌผ์ฒด ์‚ฌ์ด ๊ฑฐ๋ฆฌ์— ๊ธฐ๋ฐ˜ํ•œ r_{reach}๋กœ ๋จผ์ € ๋ฌผ์ฒด๋ฅผ ์ฐพ์•„๊ฐ€๊ฒŒ ํ•˜๊ณ , ๊ณผ์ œ๋ณ„ ์กฐ์ž‘ ํ–‰๋™์— ๋Œ€ํ•œ r_{execute}๋กœ ์‹ค์ œ ์ž‘์—…์„ ์ˆ˜ํ–‰ํ•˜๊ฒŒ ํ•ฉ๋‹ˆ๋‹ค. ๋˜ํ•œ ๋ชฉํ‘œ๋ฅผ ํ–ฅํ•œ ์›€์ง์ž„์—๋งŒ ๋ณด์ƒ์„ ์ฃผ์–ด ๋น„ํšจ์œจ์  ํƒ์ƒ‰์„ ์–ต์ œํ•ฉ๋‹ˆ๋‹ค.
  • ๋น„๋Œ€์นญ actor-critic. PPO๋กœ ํ•™์Šตํ•˜๋˜, ๊ฐ€์น˜(value) ๋„คํŠธ์›Œํฌ์—๋งŒ ํŠน๊ถŒ ์ •๋ณด(privileged information, ์˜ˆ: ๋ฌผ์ฒด์˜ ์‹ค์ œ ์œ„์น˜)๋ฅผ ์ œ๊ณตํ•˜๋Š” ๋น„๋Œ€์นญ ๊ด€์ธก(asymmetric observation) ์„ ์ ์šฉํ•ด ํ•™์Šต ํšจ์œจ์„ ๋†’์ž…๋‹ˆ๋‹ค. ์ •์ฑ…(policy)์€ ์ด‰๊ฐยท๊ณ ์œ ์ˆ˜์šฉ ์ •๋ณด๋งŒ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค.
  • IsaacGym ์‹œ๋ฎฌ๋ ˆ์ด์…˜ โ†’ ์‹ค์ œ ์ „์ด. 10 Hz ์ œ์–ด ์ฃผ๊ธฐ๋กœ ํ•™์Šตํ•œ ์ •์ฑ…์„ ์ถ”๊ฐ€ ํ•™์Šต ์—†์ด ์‹ค์ œ ๋กœ๋ด‡์— ๋ฐฐํฌํ•ฉ๋‹ˆ๋‹ค.

์„ฑ๊ณผ: ๋ฌผ์ฒด ์žก๊ธฐ, ๋ฌธ ์—ด๊ธฐ, ๋ฐธ๋ธŒ ๋Œ๋ฆฌ๊ธฐ ์„ธ ๊ณผ์ œ ๋ชจ๋‘์—์„œ ์‹œ๊ฐ ์—†์ด ์กฐ์ž‘์— ์„ฑ๊ณตํ–ˆ์œผ๋ฉฐ, ์ด‰๊ฐ ์„ผ์„œ์˜ ๋ฏผ๊ฐ๋„์™€ ๋ถ€์ฐฉ ์œ„์น˜๊ฐ€ ์„ฑ๋Šฅ์„ ์ขŒ์šฐํ•จ์„, ๊ทธ๋ฆฌ๊ณ  ์†๋ชฉ ์žฅ์ฐฉ F/T ์„ผ์„œ๋ณด๋‹ค ๋ถ„์‚ฐ๋œ ์ด‰๊ฐ ์„ผ์„œ๊ฐ€ ์‹œ๊ฐ ์—†๋Š” ์กฐ์ž‘์— ์œ ๋ฆฌํ•จ์„ ์ ˆ์ œ ์‹คํ—˜์œผ๋กœ ์ž…์ฆํ–ˆ์Šต๋‹ˆ๋‹ค.


๐Ÿ”” Ring Review

๐Ÿ”” Ring โ€” An idea that echoes. Grasp the core and its value.

์„œ๋ก : ์™œ โ€œ์‹œ๊ฐ ์—†๋Š” ์กฐ์ž‘โ€์ธ๊ฐ€

๋กœ๋ด‡ ์กฐ์ž‘ ์—ฐ๊ตฌ์˜ ๋Œ€๋ถ€๋ถ„์€ ์นด๋ฉ”๋ผ(์‹œ๊ฐ)์— ์˜์กดํ•ฉ๋‹ˆ๋‹ค. ํ•˜์ง€๋งŒ ํ˜„์‹ค์—์„œ ์‹œ๊ฐ์€ ์ž์ฃผ ๋ฌด๋ ฅํ•ด์ง‘๋‹ˆ๋‹ค โ€” ์กฐ๋ช…์ด ๊บผ์ง„ ๋ฐฉ, ๋ฌผ์ฒด๊ฐ€ ๊ฐ€๋ ค์ง„ ์„œ๋ž ์†, ๋จผ์ง€๋‚˜ ์—ฐ๊ธฐ๊ฐ€ ๊ฐ€๋“ํ•œ ํ™˜๊ฒฝ, ํ˜น์€ ์นด๋ฉ”๋ผ ์ž์ฒด๊ฐ€ ๊ณ ์žฅ ๋‚œ ์ƒํ™ฉ. ์‚ฌ๋žŒ์€ ์ด๋Ÿฐ ์ƒํ™ฉ์—์„œ๋„ ์†์˜ ๊ฐ๊ฐ ์œผ๋กœ ์–ด๋ ต์ง€ ์•Š๊ฒŒ ๋ฌผ๊ฑด์„ ์ฐพ์•„ ๋‹ค๋ฃน๋‹ˆ๋‹ค. ์ฃผ๋จธ๋‹ˆ ์† ์—ด์‡ ๋ฅผ ๋ณด์ง€ ์•Š๊ณ  ๊บผ๋‚ด๋Š” ๊ฒƒ์ด ๋Œ€ํ‘œ์ ์ž…๋‹ˆ๋‹ค.

DexTouch์˜ ์งˆ๋ฌธ์€ ๋ช…ํ™•ํ•ฉ๋‹ˆ๋‹ค. โ€œ๋Œ€๋žต์ ์ธ ์‚ฌ์ „ ์ •๋ณด๋งŒ ์ฃผ์–ด์ง„ ์ƒํƒœ์—์„œ, ๋กœ๋ด‡์ด ์˜ค์ง ์ด‰๊ฐ๋งŒ์œผ๋กœ ๋ฌด์ž‘์œ„๋กœ ๋†“์ธ ๋ฌผ์ฒด๋ฅผ ํƒ์ƒ‰ํ•˜๊ณ  ์กฐ์ž‘ํ•  ์ˆ˜ ์žˆ๋Š”๊ฐ€?โ€ ํ•ต์‹ฌ์€ ์‹œ๊ฐ์ด ์ œ๊ณตํ•˜๋˜ ์ •๋ณด(๋ฌผ์ฒด์˜ ์ •ํ™•ํ•œ ์œ„์น˜ยท์ž์„ธ)๋ฅผ ์ด‰๊ฐ์ด ๋Šฅ๋™์  ํƒ์ƒ‰์œผ๋กœ ๋ฉ”์šฐ๋Š” ๊ฒƒ์ž…๋‹ˆ๋‹ค.

์‹œ์Šคํ…œ ๊ตฌ์„ฑ

DexTouch์˜ ํ•˜๋“œ์›จ์–ด๋Š” ๋‹ค์Œ๊ณผ ๊ฐ™์Šต๋‹ˆ๋‹ค.

  • ๋กœ๋ด‡: 6-DoF UR5e ํŒ” + 16-DoF AllegroHand = ์ด 22-DoF
  • ์„ผ์„œ: ์†๊ฐ€๋ฝ๋‹น 3๊ฐœ ร— 4์†๊ฐ€๋ฝ + ์†๋ฐ”๋‹ฅ 4๊ฐœ = 16๊ฐœ FSR ์ ‘์ด‰ ์„ผ์„œ
  • ์‹ ํ˜ธ: ๊ฐ ์„ผ์„œ๋Š” ์ ‘์ด‰ ์—ฌ๋ถ€๋งŒ ์•Œ๋ ค์ฃผ๋Š” ์ด์ง„(on/off) ์‹ ํ˜ธ โ€” ํ’๋ถ€ํ•œ ์ด๋ฏธ์ง€ํ˜• ์ด‰๊ฐ์ด ์•„๋‹ˆ๋ผ ๋‹จ์ˆœ ์ ‘์ด‰ ๋น„ํŠธ

์ด โ€œ๋‹จ์ˆœํ•œ ์ด‰๊ฐโ€์ด๋ผ๋Š” ์„ ํƒ์ด ์ค‘์š”ํ•ฉ๋‹ˆ๋‹ค. GelSight๋ฅ˜ ๊ณ ํ•ด์ƒ๋„ ์ด‰๊ฐ๊ณผ ๋‹ฌ๋ฆฌ, ์ด์ง„ ์ ‘์ด‰ ์„ผ์„œ๋Š” ์ €๋ ดํ•˜๊ณ  sim-to-real ๋ชจ๋ธ๋ง์ด ์‰ฝ์Šต๋‹ˆ๋‹ค. ์‹œ๋ฎฌ๋ ˆ์ด์…˜์—์„œ๋Š” 16๊ฐœ์˜ ๊ฐ€์ƒ ์ ‘์ด‰ ์„ผ์„œ๋กœ net contact force F=[F_x, F_y, F_z]์˜ ๋…ธ๋ฆ„ \|F\|์„ ๊ณ„์‚ฐํ•˜๊ณ , ์ž„๊ณ„๊ฐ’ \tilde{\theta}_{th}=0.01\,\mathrm{N}๋กœ ์ด์ง„ํ™”ํ•˜์—ฌ ์‹ค์ œ FSR๊ณผ ๋™์ผํ•œ ์‹ ํ˜ธ๋ฅผ ๋งŒ๋“ญ๋‹ˆ๋‹ค.

๋ฐฉ๋ฒ•: ๊ฐ•ํ™”ํ•™์Šต ํŒŒ์ดํ”„๋ผ์ธ

์„ธ ๊ฐ€์ง€ ์ผ์ƒํ˜• ๊ณผ์ œ

์ €์ž๋“ค์€ ์ผ์ƒ์ƒํ™œ๊ณผ ๊ด€๋ จ๋œ ์„ธ ๊ฐ€์ง€ ๋ฒค์น˜๋งˆํฌ ๊ณผ์ œ๋ฅผ ์ •์˜ํ•ฉ๋‹ˆ๋‹ค.

  1. ๋ฌผ์ฒด ์žก๊ธฐ (Grasping Object) โ€” ํ…Œ์ด๋ธ” ์œ„ ๋ฌด์ž‘์œ„ ์œ„์น˜์˜ ๋ฌผ์ฒด๋ฅผ ๋”๋“ฌ์–ด ์ฐพ์•„ ์ง‘์–ด ์˜ฌ๋ฆฝ๋‹ˆ๋‹ค. YCB ๋ฐ์ดํ„ฐ์…‹์—์„œ 15๊ฐœ ๋ฌผ์ฒด๋ฅผ ์„ ์ •ํ•˜๊ณ , ๊ทธ์ค‘ 7๊ฐœ๋กœ ์‹ค์ œ ์‹คํ—˜์„ ์ˆ˜ํ–‰ํ–ˆ์œผ๋ฉฐ 3๊ฐœ๋Š” ํ•™์Šต์— ์“ฐ์ด์ง€ ์•Š์€ unseen ๋ฌผ์ฒด์ž…๋‹ˆ๋‹ค.
  2. ๋ฌธ ์—ด๊ธฐ (Door Opening) โ€” ์†์žก์ด์˜ ๋Œ€๋žต์  ์œ„์น˜๋งŒ ์•Œ๊ณ  ๋ฌธ์„ ์—ฝ๋‹ˆ๋‹ค.
  3. ๋ฐธ๋ธŒ ๋Œ๋ฆฌ๊ธฐ (Valve Rotating) โ€” ๋ฐธ๋ธŒ ์ค‘์‹ฌ ๋ถ€๊ทผ์„ ๋”๋“ฌ์–ด ํšŒ์ „์‹œํ‚ต๋‹ˆ๋‹ค.

2๋‹จ๊ณ„ ๋ณด์ƒ ํ•จ์ˆ˜

์ ‘์ด‰์ด ์—†์œผ๋ฉด ๋ณด์ƒ ์‹ ํ˜ธ๋„ ์—†๋Š” ํฌ์†Œ(sparse) ๋ฌธ์ œ๋ฅผ ํ’€๊ธฐ ์œ„ํ•ด, ๋ณด์ƒ์„ ๋„๋‹ฌ๊ณผ ์‹คํ–‰ ๋‘ ๊ตญ๋ฉด์œผ๋กœ ๋‚˜๋ˆ•๋‹ˆ๋‹ค.

  • r_{reach}: ๊ฐ ์†๋๊ณผ ๋ชฉํ‘œ ๋ฌผ์ฒด(์žก๊ธฐ=ํ…Œ์ด๋ธ” ์œ„ ๋ฌผ์ฒด, ๋ฌธ ์—ด๊ธฐ=์†์žก์ด, ๋ฐธ๋ธŒ=๋ฐธ๋ธŒ ์ค‘์‹ฌ) ์‚ฌ์ด ๊ฑฐ๋ฆฌ d์— ๊ธฐ๋ฐ˜. ์†์ด ๋ชฉํ‘œ์— ๊ฐ€๊นŒ์›Œ์ง€๋„๋ก ์œ ๋„ํ•ฉ๋‹ˆ๋‹ค.
  • r_{execute}: ๊ณผ์ œ ๋ชฉ์ ์„ ๋‹ฌ์„ฑํ•˜๋Š” ์กฐ์ž‘ ํ–‰๋™(๋“ค์–ด ์˜ฌ๋ฆผ, ํšŒ์ „ ๋“ฑ)์— ๋Œ€ํ•œ ๋ณด์ƒ.

์ „์ฒด ๋ณด์ƒ์€ ๋‘ ํ•ญ์˜ ํ•ฉ์ด๋ฉฐ, ๋ชฉํ‘œ๋ฅผ ํ–ฅํ•œ ์›€์ง์ž„์—๋งŒ ๋ณด์ƒ์„ ์ฃผ๋„๋ก ์„ค๊ณ„ํ•˜์—ฌ ๋ฌด์˜๋ฏธํ•œ ํƒ์ƒ‰์„ ์ค„์ž…๋‹ˆ๋‹ค.

๋น„๋Œ€์นญ ๊ด€์ธก PPO

ํ•™์Šต์€ PPO(clipping \epsilon=0.2, \gamma=0.99, KL ์ž„๊ณ„๊ฐ’ 0.016 ๋“ฑ)๋กœ ์ง„ํ–‰๋ฉ๋‹ˆ๋‹ค. ํ•ต์‹ฌ ๊ธฐ๋ฒ•์€ ๋น„๋Œ€์นญ ๊ด€์ธก(asymmetric actor-critic) ์ž…๋‹ˆ๋‹ค.

  • ์ •์ฑ… ๋„คํŠธ์›Œํฌ(policy): ์ด‰๊ฐ ์ด์ง„ ์‹ ํ˜ธ + ๊ณ ์œ ์ˆ˜์šฉ(proprioception) ์ •๋ณด๋งŒ ์ž…๋ ฅ โ†’ ์‹ค์ œ ๋ฐฐํฌ ์‹œ ์‚ฌ์šฉ ๊ฐ€๋Šฅํ•œ ์ •๋ณด๋งŒ.
  • ๊ฐ€์น˜ ๋„คํŠธ์›Œํฌ(value): ์—ฌ๊ธฐ์— ๋”ํ•ด ๋ฌผ์ฒด์˜ ์‹ค์ œ ์œ„์น˜ ๊ฐ™์€ ํŠน๊ถŒ ์ •๋ณด๋ฅผ ์ถ”๊ฐ€ ์ž…๋ ฅ.

๊ฐ€์น˜ ์ถ”์ •์ด ์ •ํ™•ํ•ด์ง€๋ฏ€๋กœ ํ•™์Šต ํšจ์œจ์ด ์˜ค๋ฅด๊ณ , ํŠน๊ถŒ ์ •๋ณด๋Š” ์ถ”๋ก (๋ฐฐํฌ) ์‹œ์—๋Š” ์“ฐ์ด์ง€ ์•Š์œผ๋ฏ€๋กœ ์‹œ๊ฐ ์—†๋Š” ์‹ค์ „ ์กฐ๊ฑด์„ ๊ทธ๋Œ€๋กœ ๋งŒ์กฑํ•ฉ๋‹ˆ๋‹ค.

์‹คํ—˜

IsaacGym์—์„œ 10 Hz ์ œ์–ด ์ฃผ๊ธฐ๋กœ ํ•™์Šตํ•œ ์ •์ฑ…์„, ๋ฏธ์„ธ์กฐ์ • ์—†์ด ์‹ค์ œ UR5e + AllegroHand ์‹œ์Šคํ…œ์œผ๋กœ ์ „์ดํ•˜์—ฌ ํ‰๊ฐ€ํ–ˆ์Šต๋‹ˆ๋‹ค. ์‹œ๋ฎฌ๋ ˆ์ด์…˜๊ณผ ๋™์ผํ•˜๊ฒŒ, ๋ฌด์ž‘์œ„ ๋ฒ”์œ„ ์•ˆ์— ์ž„์˜๋กœ ๋†“์ธ ๋ฌผ์ฒด์— ๋Œ€ํ•œ ๊ฐ ๊ณผ์ œ์˜ ์„ฑ๊ณต๋ฅ ์„ ์ธก์ •ํ•ฉ๋‹ˆ๋‹ค.

์ ˆ์ œ ์‹คํ—˜ (Ablation)

DexTouch์˜ ๊ฐ€์žฅ ์ค‘์š”ํ•œ ๊ธฐ์—ฌ๋Š” โ€œ์ด‰๊ฐ์ด ์™œยท์–ด๋–ป๊ฒŒ ์ค‘์š”ํ•œ๊ฐ€โ€๋ฅผ ๋ถ„๋ฆฌํ•ด์„œ ๋ณด์—ฌ์ค€ ์ ˆ์ œ ์‹คํ—˜์ž…๋‹ˆ๋‹ค.

  • ์ด‰๊ฐ ์„ผ์„œ์˜ ๋ฏผ๊ฐ๋„ (Fig. 4a). ๋‘”๊ฐํ™”ํ•œ ์„ผ์„œ ๊ทธ๋ฃน(DA-Sensor)์€ ์„ฑ๊ณต๋ฅ ์ด ๊ฐ€์žฅ ๋‚ฎ์•˜์Šต๋‹ˆ๋‹ค. ๋‘”๊ฐํ•œ ์ด‰๊ฐ์—์„œ๋Š” ๋ฌผ์ฒด๋ฅผ ๋–จ์–ด๋œจ๋ฆฌ๋Š” ๋“ฑ ๊ณผ๋„ํ•œ ์กฐ์ž‘์ด ๊ด€์ฐฐ๋˜์—ˆ์Šต๋‹ˆ๋‹ค โ€” ๋ฏผ๊ฐํ•œ ์ด‰๊ฐ์ด ์กฐ์ž‘์— ํ•ต์‹ฌ์ž„์„ ์‹œ์‚ฌํ•ฉ๋‹ˆ๋‹ค.
  • ์„ผ์„œ์˜ ๋ถ€์ฐฉ ์œ„์น˜ (Fig. 4b). ์†๋๋งŒ ํ™œ์„ฑํ™”ํ•œ ๊ทธ๋ฃน(Fingertips)๊ณผ ์†๋ฐ”๋‹ฅ๋งŒ ํ™œ์„ฑํ™”ํ•œ ๊ทธ๋ฃน(Palm)์„ ๊ฐ๊ฐ 4๊ฐœ ์„ผ์„œ๋กœ ๋น„๊ตํ–ˆ์Šต๋‹ˆ๋‹ค. ์†๋ ๊ทธ๋ฃน์€ ์†๋์„ ์ฃผ๋กœ ์“ฐ๋Š” ๋ฐธ๋ธŒ ๋Œ๋ฆฌ๊ธฐ์—์„œ ์šฐ๋ฆฌ ์ •์ฑ…๊ณผ ์œ ์‚ฌํ•œ ์„ฑ๊ณต๋ฅ ์„ ๋‹ฌ์„ฑํ–ˆ์ง€๋งŒ, ๊ณผ์ œ๋ณ„๋กœ ์œ ๋ฆฌํ•œ ์œ„์น˜๊ฐ€ ๋‹ฌ๋ž์Šต๋‹ˆ๋‹ค โ€” ๋ถ„์‚ฐ ๋ฐฐ์น˜๋œ ์ด‰๊ฐ์˜ ์ด์ ์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
  • F/T ์„ผ์„œ vs ์ด‰๊ฐ. ์†๋ชฉ ์žฅ์ฐฉ 6์ถ• ํž˜/ํ† ํฌ(F/T) ์„ผ์„œ๋ฅผ ์“ฐ๋Š” ๊ทธ๋ฃน๊ณผ ๋น„๊ตํ–ˆ์„ ๋•Œ, ๋ถ„์‚ฐ๋œ ์ ‘์ด‰ ์ด‰๊ฐ์ด ์‹œ๊ฐ ์—†๋Š” ์กฐ์ž‘์— ๋” ์œ ๋ฆฌํ–ˆ์Šต๋‹ˆ๋‹ค. F/T ์„ผ์„œ๋Š” ์ ‘์ด‰ ์œ„์น˜๋ฅผ ์•Œ ์ˆ˜ ์—†์–ด ๋Šฅ๋™์  ํƒ์ƒ‰์— ํ•œ๊ณ„๊ฐ€ ์žˆ์Šต๋‹ˆ๋‹ค.

๊ด€์ฐฐ

  • ๋น„๋Œ€์นญ ๊ด€์ธก์ด ํ•™์Šต ํšจ์œจ์„ ๋†’์ธ๋‹ค๋Š” ์ผ๋ฐ˜์  ๊ฒฐ๊ณผ๋ฅผ ์žฌํ™•์ธํ–ˆ์Šต๋‹ˆ๋‹ค.
  • unseen ๋ฌผ์ฒด ์ค‘ ํ…€๋ธ”๋Ÿฌ(tumbler) ์˜ ์„ฑ๊ณต๋ฅ ์ด ๊ฐ€์žฅ ๋‚ฎ์•˜๋Š”๋ฐ, ์ด๋Š” ๋ฌด๊ฒ๊ณ  ํ‘œ๋ฉด์ด ๋ฏธ๋„๋Ÿฝ๊ธฐ ๋•Œ๋ฌธ์œผ๋กœ ๋ถ„์„๋ฉ๋‹ˆ๋‹ค โ€” ๋ฌด๊ฒŒยท๋งˆ์ฐฐ ํŠน์„ฑ์ด ์ด‰๊ฐ ์กฐ์ž‘ ๋‚œ์ด๋„๋ฅผ ๊ฒฐ์ •ํ•จ์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.

๋น„ํŒ์  ๊ณ ์ฐฐ

๊ฐ•์ 

  • ๋ช…ํ™•ํ•œ ๋ฌธ์ œ์˜์‹. โ€œ์‹œ๊ฐ์ด ์—†์„ ๋•Œโ€๋ผ๋Š” ํ˜„์‹ค์  ์ œ์•ฝ์„ ์ •๋ฉด์œผ๋กœ ๋‹ค๋ฃจ๊ณ , ์ด‰๊ฐ์˜ ์—ญํ• ์„ ์ ˆ์ œ ์‹คํ—˜์œผ๋กœ ๊น”๋”ํ•˜๊ฒŒ ๋ถ„๋ฆฌํ–ˆ์Šต๋‹ˆ๋‹ค.
  • ๋‹จ์ˆœํ•œ ์ด์ง„ ์ด‰๊ฐ์˜ sim-to-real. ๊ณ ํ•ด์ƒ๋„ ์ด‰๊ฐ ์—†์ด ์ด์ง„ ์ ‘์ด‰ ์‹ ํ˜ธ + ์ž„๊ณ„๊ฐ’ ์ด์ง„ํ™”๋งŒ์œผ๋กœ ๋ฏธ์„ธ์กฐ์ • ์—†๋Š” ์ „์ด๋ฅผ ๋‹ฌ์„ฑํ•œ ์ ์€ ์‹ค์šฉ์ ์ž…๋‹ˆ๋‹ค.
  • 2๋‹จ๊ณ„ ๋ณด์ƒ + ๋น„๋Œ€์นญ ๊ด€์ธก. ํฌ์†Œ ์ ‘์ด‰ ๋ณด์ƒ ๋ฌธ์ œ๋ฅผ ๋„๋‹ฌ/์‹คํ–‰ ๋ถ„ํ•ด์™€ ํŠน๊ถŒ ์ •๋ณด ํ™œ์šฉ์œผ๋กœ ์•ˆ์ •์ ์œผ๋กœ ํ•ด๊ฒฐํ–ˆ์Šต๋‹ˆ๋‹ค.

์•ฝ์ ๊ณผ ํ•œ๊ณ„

  • ์‚ฌ์ „ ์ •๋ณด ์˜์กด. ์™„์ „ํ•œ blind๊ฐ€ ์•„๋‹ˆ๋ผ โ€œ๋Œ€๋žต์  ์œ„์น˜โ€๋ผ๋Š” ์‚ฌ์ „ ์ •๋ณด๊ฐ€ ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค. ์ •๋ณด๊ฐ€ ์ „ํ˜€ ์—†๋Š” ํƒ์ƒ‰์œผ๋กœ์˜ ํ™•์žฅ์€ ์—ด๋ฆฐ ๋ฌธ์ œ์ž…๋‹ˆ๋‹ค.
  • ๊ณผ์ œยท๋ฌผ์ฒด ๋‹ค์–‘์„ฑ. ์„ธ ๊ณผ์ œยท์†Œ์ˆ˜ YCB ๋ฌผ์ฒด์— ํ•œ์ •๋˜์–ด, ๋” ๋ณต์žกํ•˜๊ฑฐ๋‚˜ ๋ณ€ํ˜• ๊ฐ€๋Šฅํ•œ ๋ฌผ์ฒด๋กœ์˜ ์ผ๋ฐ˜ํ™”๋Š” ์ถ”๊ฐ€ ๊ฒ€์ฆ์ด ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.
  • ์ด์ง„ ์ด‰๊ฐ์˜ ์ •๋ณด๋Ÿ‰. on/off ์‹ ํ˜ธ๋Š” ๋ชจ๋ธ๋ง์ด ์‰ฌ์šด ๋Œ€์‹  ๋ฏธ๋„๋Ÿฌ์งยท์งˆ๊ฐ ๊ฐ™์€ ํ’๋ถ€ํ•œ ์ •๋ณด๋ฅผ ๋‹ด์ง€ ๋ชปํ•ด, ํ…€๋ธ”๋Ÿฌ์ฒ˜๋Ÿผ ๋ฏธ๋„๋Ÿฌ์šด ๋ฌผ์ฒด์—์„œ ํ•œ๊ณ„๋ฅผ ๋ณด์ž…๋‹ˆ๋‹ค.

์š”์•ฝ ๋ฐ ๊ฒฐ๋ก 

DexTouch๋Š” ์‹œ๊ฐ ์—†๋Š” ์กฐ์ž‘์„, 22-DoF ๋‹ค์ง€ ๋กœ๋ด‡์— ๋ถ„์‚ฐ ๋ฐฐ์น˜ํ•œ 16๊ฐœ์˜ ์ด์ง„ ์ด‰๊ฐ ์„ผ์„œ์™€ ๊ฐ•ํ™”ํ•™์Šต์œผ๋กœ ํ’€์–ด๋‚ธ ์—ฐ๊ตฌ์ž…๋‹ˆ๋‹ค. 2๋‹จ๊ณ„ ๋ณด์ƒ๊ณผ ๋น„๋Œ€์นญ ๊ด€์ธก PPO๋กœ ์‹œ๋ฎฌ๋ ˆ์ด์…˜์—์„œ ์ •์ฑ…์„ ํ•™์Šตํ•˜๊ณ , ๋ฏธ์„ธ์กฐ์ • ์—†์ด ์‹ค์ œ ๋กœ๋ด‡์œผ๋กœ ์ „์ดํ•˜์—ฌ ๋ฌผ์ฒด ์žก๊ธฐยท๋ฌธ ์—ด๊ธฐยท๋ฐธ๋ธŒ ๋Œ๋ฆฌ๊ธฐ๋ฅผ ์„ฑ๊ณต์‹œ์ผฐ์Šต๋‹ˆ๋‹ค. ๋ฌด์—‡๋ณด๋‹ค ์ด‰๊ฐ์˜ ๋ฏผ๊ฐ๋„์™€ ๋ถ„์‚ฐ ๋ฐฐ์น˜๊ฐ€ ์‹œ๊ฐ ์—†๋Š” ์กฐ์ž‘์— ๊ฒฐ์ •์ ์ด๋ฉฐ, ์†๋ชฉ F/T ์„ผ์„œ๋ณด๋‹ค ๋ถ„์‚ฐ ์ ‘์ด‰ ์ด‰๊ฐ์ด ๋Šฅ๋™์  ํƒ์ƒ‰์— ์œ ๋ฆฌํ•˜๋‹ค๋Š” ์ ์„ ์ ˆ์ œ ์‹คํ—˜์œผ๋กœ ๋ถ„๋ช…ํžˆ ๋ณด์˜€์Šต๋‹ˆ๋‹ค. โ€œ๋ณด์ง€ ์•Š๊ณ  ๋”๋“ฌ์–ด ๋‹ค๋ฃจ๋Š”โ€ ๋Šฅ๋ ฅ์„ ๋กœ๋ด‡์— ๋ถ€์—ฌํ•˜๋ ค๋Š” ๋ฐฉํ–ฅ์—์„œ ์˜๋ฏธ ์žˆ๋Š” ์ถœ๋ฐœ์ ์ž…๋‹ˆ๋‹ค.

์ฐธ๊ณ  ์ •๋ณด

  • ๋…ผ๋ฌธ: Kang-Won Lee, Yuzhe Qin, Xiaolong Wang, Soo-Chul Lim. โ€œDexTouch: Learning to Seek and Manipulate Objects with Tactile Dexterity.โ€ IEEE RA-L, 2024. arXiv:2401.12496.
  • ์†Œ์†: Dongguk University (Seoul) ยท University of California San Diego
  • ํ”„๋กœ์ ํŠธ ํŽ˜์ด์ง€: https://lee-kangwon.github.io/dextouch/
  • ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ: IsaacGym

Copyright 2026, JungYeon Lee