Curieux.JY
  • JungYeon Lee
  • Post
  • Lecture
  • Note

On this page

  • Brief Review
  • Detail Review
    • 1. ์—ฐ๊ตฌ ๊ฐœ์š” ๋ฐ ๊ธฐ์—ฌ
    • 2. ์‹œ์Šคํ…œ ๊ตฌ์„ฑ ๋ฐ ์† ์ถ”์  ๋ฐฉ๋ฒ•
    • 3. ์ธ๊ฐ„-๋กœ๋ด‡ ์† ๋งคํ•‘ ์ „๋žต ๋ฐ ์ˆ˜ํ•™์  ๋ชจ๋ธ๋ง
    • 4. ์† ๋ฆฌํƒ€๊ฒŸํŒ… ๋ชจ๋“ˆ ๋™์ž‘ ์›๋ฆฌ์™€ ์ œ์•ฝ์กฐ๊ฑด
    • 5. ์‹คํ—˜ ์„ค์ • ๋ฐ ์„ฑ๋Šฅ ํ‰๊ฐ€
    • 6. ๊ธฐ์กด ๋ฐฉ๋ฒ•๊ณผ ๋น„๊ต ๋ฐ ๊ธฐ์ˆ ์  ํ•œ๊ณ„
  • Additional Review
    • ์‹œ์ž‘ํ•˜๋ฉฐ: โ€œ์žฅ๊ฐ‘ ์—†์ด๋„ ์†์„ ๋นŒ๋ ค์ค„ ์ˆ˜ ์žˆ์„๊นŒ?โ€
    • ๋ฌธ์ œ์˜ ๋ณธ์งˆ: ์™œ ์† ํ…”๋ ˆ์˜คํผ๋ ˆ์ด์…˜์€ ์–ด๋ ค์šด๊ฐ€
      • ์ฒซ์งธ, ์ž์œ ๋„ ๋ถˆ์ผ์น˜
      • ๋‘˜์งธ, ์‹œ๊ฐ๋งŒ์œผ๋กœ ์†์„ ์ถ”์ ํ•˜๋Š” ์ผ
      • ์…‹์งธ, ์ •๋ฐ€ ๊ทธ๋ฆฝ์˜ ๋น„๋Œ€์นญ์„ฑ
    • ์‹œ์Šคํ…œ ํ•œ๋ˆˆ์— ๋ณด๊ธฐ
    • ์‹œ๊ฐ ํŒŒ์ดํ”„๋ผ์ธ: ๋ชจ๋ธ ํ”„๋ฆฌ์™€ ๋ชจ๋ธ ๊ธฐ๋ฐ˜์˜ ๊ฒฐํ˜ผ
      • 1๋‹จ๊ณ„: PointNet++๋กœ ๊ฑฐ์นœ ์ž์„ธ ์žก๊ธฐ
      • 2๋‹จ๊ณ„: DART๋กœ ์ •๋ฐ€ ์ถ”์ 
      • ๋ณด์กฐ: GloveNet์œผ๋กœ ์†๋ 2D ์œ„์น˜ ๋ณด๊ฐ•
    • ์šด๋™ํ•™์  ๋ฆฌํƒ€๊ฒŒํŒ…: ์ด ๋…ผ๋ฌธ์˜ ์ง„์งœ ํ•ต์‹ฌ
      • ์ž˜๋ชป๋œ ๋ฐฉ๋ฒ•๋“ค
      • ์˜ณ์€ ์งˆ๋ฌธ: โ€œ๊ทธ๋ฆฝ์˜ ๊ธฐํ•˜ํ•™์—์„œ ๋ฌด์—‡์ด ์ค‘์š”ํ•œ๊ฐ€?โ€
      • ๋น„์šฉ ํ•จ์ˆ˜ ๋“ค์—ฌ๋‹ค๋ณด๊ธฐ
      • Projection Scheme: ์ •๋ฐ€ ๊ทธ๋ฆฝ์˜ ๋น„๋Œ€์นญ์„ ๊ฐ๋‹นํ•˜๋Š” ๋ฒ•
      • ์˜์‚ฌ์ฝ”๋“œ๋กœ ์ •๋ฆฌํ•˜๋ฉด
    • ํŒ” ์ œ์–ด: Riemannian Motion Policies
    • ์‹คํ—˜: 15๊ฐœ ๊ณผ์ œ, ๋‘ ๋ช…์˜ ํŒŒ์ผ๋Ÿฟ, ๋‹ค์„ฏ ๋ฒˆ์”ฉ
      • ํ‰๊ฐ€ ์„ค๊ณ„
      • ๋Œ€ํ‘œ ๊ณผ์ œ์™€ ๊ฒฐ๊ณผ
      • ๊ฒฐ๊ณผ์˜ ์ง„์งœ ์˜๋ฏธ
    • ๋น„ํŒ์  ๊ณ ์ฐฐ
      • ์ž˜ ๋œ ๊ฒƒ
      • ํ•œ๊ณ„์™€ ๋น„ํŒ์ 
    • ๊ด€๋ จ ์—ฐ๊ตฌ ์ง€ํ˜• ์†์—์„œ
      • ์ง์ „ ์—ฐ๊ตฌ์™€์˜ ์ฐจ๋ณ„์ 
      • DexPilot ์ดํ›„์˜ ํ๋ฆ„
    • ๋‹ค์‹œ ์‚ดํŽด๋ณด๋Š” ํ•ต์‹ฌ ํ†ต์ฐฐ
      • 1. ๋น„์šฉ ํ•จ์ˆ˜ ์„ค๊ณ„๊ฐ€ ๊ณง ๋ฌธ์ œ ์ •์˜๋‹ค
      • 2. ๋ชจ๋ธ ํ”„๋ฆฌ์™€ ๋ชจ๋ธ ๊ธฐ๋ฐ˜์€ ์ ์ด ์•„๋‹ˆ๋‹ค
      • 3. ์‹œ์—ฐ ์ˆ˜์ง‘์€ ์ถ”์ •๋ณด๋‹ค ๋น„์‹ผ ์ž์›์ด๋‹ค
    • ๊ฒฐ๋ก 
    • ์ฐธ๊ณ 

๐Ÿ“ƒDexPilot ๋ฆฌ๋ทฐ

retargeting
vision
Kinematic Motion Retargeting for Contact-Rich Anthropomorphic Manipulations
Published

August 19, 2025

  • Paper Link
  • Project Link
  1. DexPilot์€ ์ €๋น„์šฉ ๋น„์ „ ๊ธฐ๋ฐ˜ ์›๊ฒฉ ์กฐ์ž‘ ์‹œ์Šคํ…œ์œผ๋กœ, ์žฅ๊ฐ‘์ด๋‚˜ ๋งˆ์ปค ์—†์ด ๋งจ์† ์›€์ง์ž„์„ ์‚ฌ์šฉํ•˜์—ฌ 23 DoA์˜ ๋กœ๋ด‡ ํŒ”/์† ์‹œ์Šคํ…œ์„ ์™„๋ฒฝํ•˜๊ฒŒ ์ œ์–ดํ•ฉ๋‹ˆ๋‹ค.
  2. ์‹œ์Šคํ…œ์€ ๋”ฅ๋Ÿฌ๋‹๊ณผ ๋ชจ๋ธ ๊ธฐ๋ฐ˜ ์ถ”์ (DART)์„ ๊ฒฐํ•ฉํ•˜์—ฌ ์ธ๊ฐ„ ์†์˜ ํฌ์ฆˆ์™€ ๊ด€์ ˆ ๊ฐ๋„๋ฅผ ์ถ”์ •ํ•˜๊ณ , ๋น„์„ ํ˜• ์ตœ์ ํ™” ๊ธฐ๋ฐ˜์˜ ํ‚ค๋„ค๋งˆํ‹ฑ ๋ฆฌํƒ€๊ฒŸํŒ…์„ ํ†ตํ•ด Allegro hand์˜ ๋™์ž‘์œผ๋กœ ๋ณ€ํ™˜ํ•ฉ๋‹ˆ๋‹ค.
  3. DexPilot์€ ๋‹ค์–‘ํ•œ ๋ณต์žกํ•œ ์กฐ์ž‘ ์ž‘์—…์—์„œ ์ธ๊ฐ„ ์‹œ์—ฐ์ž๋ฅผ ํ†ตํ•ด ๋†’์€ ์„ฑ๊ณต๋ฅ ์„ ๋‹ฌ์„ฑํ•˜๋ฉฐ, ์ด๋ฅผ ํ†ตํ•ด ์ˆ™๋ จ๋œ ๋™์ž‘ ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘ ๋ฐ ๋ฏธ๋ž˜์˜ ์ž์œจ ์ •์ฑ… ํ•™์Šต ๊ฐ€๋Šฅ์„ฑ์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.

Brief Review

๋ณธ ๋…ผ๋ฌธ์€ ๊ณ ์ž์œ ๋„(high degree-of-actuation, DoA) ๋กœ๋ด‡ ์†-ํŒ” ์‹œ์Šคํ…œ(Allegro Hand๊ฐ€ ์žฅ์ฐฉ๋œ KUKA LBR iiwa)์„ ์œ„ํ•œ ์ €๋น„์šฉ, Vision Based Teleoperation ์‹œ์Šคํ…œ์ธ DexPilot์„ ์†Œ๊ฐœํ•œ๋‹ค. DexPilot์€ ํŠน๋ณ„ํ•œ ์žฅ๋น„(markerless, glove-free) ์—†์ด ๋งจ์†(bare human hand)์˜ ์›€์ง์ž„์„ ๊ด€์ฐฐํ•˜์—ฌ ๋กœ๋ด‡์„ ์ง์ ‘ ๋ชจ๋ฐฉ ์ œ์–ด(direct imitation)ํ•œ๋‹ค. ์ด ์‹œ์Šคํ…œ์€ ์ •๊ตํ•œ ํŒŒ์ง€(precision grasp), ๋‹ค์ง€ ์กฐ์ž‘(multi-fingered manipulation), ์ธ-ํ•ธ๋“œ ์กฐ์ž‘(in-hand manipulation) ๋“ฑ ๋‹ค์–‘ํ•œ ๋ณต์žกํ•œ Task๋ฅผ ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ๊ฒŒ ํ•˜๋ฉฐ, ์ด๋ฅผ ํ†ตํ•ด ํ•™์Šต์„ ์œ„ํ•œ ๊ณ ์ฐจ์› ์„ผ์„œ ๋ฐ์ดํ„ฐ ๋ฐ ํ–‰๋™ ๋ฐ์ดํ„ฐ(sensorimotor state-action data)๋ฅผ ์ˆ˜์ง‘ํ•˜๋Š” ๋ฐ ํ™œ์šฉ๋  ์ˆ˜ ์žˆ๋‹ค.

์‹œ์Šคํ…œ ์•„ํ‚คํ…์ฒ˜ ๋ฐ ํ•˜๋“œ์›จ์–ด: ์‹œ์Šคํ…œ์€ KUKA LBR iiwa7 R800 ๋กœ๋ด‡ ํŒ”๊ณผ Wonik Robotics Allegro Hand๋กœ ๊ตฌ์„ฑ๋œ ๋กœ๋ด‡ ์‹œ์Šคํ…œ๊ณผ, ์กฐ์ž‘์ž์˜ ์†์„ ๊ด€์ฐฐํ•˜๋Š” 4๋Œ€์˜ Intel RealSense D415 RGB-D ์นด๋ฉ”๋ผ๋กœ ๊ตฌ์„ฑ๋œ ์ธ๊ฐ„ ์กฐ์ž‘์ž ์˜์—ญ์œผ๋กœ ๋‚˜๋‰œ๋‹ค. Allegro Hand์—๋Š” SynTouch BioTac ์ด‰๊ฐ ์„ผ์„œ์™€ 3M TB641 ๊ทธ๋ฆฝ ํ…Œ์ดํ”„๊ฐ€ ์žฅ์ฐฉ๋˜์–ด ์ด‰๊ฐ ํ”ผ๋“œ๋ฐฑ ์‹ ํ˜ธ(92๊ฐœ)์™€ ๋งˆ์ฐฐ๋ ฅ์„ ์ œ๊ณตํ•œ๋‹ค. ์‹œ์Šคํ…œ์€ Vision Based Perception, Optimization, Motion Generation, Control ๋ชจ๋“ˆ๋กœ ๊ตฌ์„ฑ๋˜๋ฉฐ, ์•ฝ 1์ดˆ์˜ Latency๋ฅผ ๊ฐ€์ง„๋‹ค.

ํ•ต์‹ฌ ๋ฐฉ๋ฒ•๋ก :

  1. ํ•ธ๋“œ ํŠธ๋ž˜ํ‚น (Hand Tracking): ์กฐ์ž‘์ž์˜ ์† ์ถ”์ ์€ DART ์™€ Deep Neural Networks์˜ ์กฐํ•ฉ์œผ๋กœ ์ด๋ฃจ์–ด์ง„๋‹ค.
    • DART: articulated human hand model ([28, 29] ๊ธฐ๋ฐ˜)์„ ์ž…๋ ฅ Point Cloud์— ๋งค์นญํ•˜์—ฌ ์†์˜ Pose์™€ 20๊ฐœ Joint Angle์„ ์—ฐ์†์ ์œผ๋กœ ์ถ”์ ํ•˜๋Š” ๋ชจ๋ธ ๊ธฐ๋ฐ˜(model-based) ์ถ”์ ๊ธฐ์ด๋‹ค. Nonlinear Optimization ๊ธฐ๋ฐ˜์ด๋ฏ€๋กœ ์ •ํ™•ํ•œ ์ดˆ๊ธฐํ™”๊ฐ€ ํ•„์ˆ˜์ ์ด๋ฉฐ, Spurious Local Minima์— ๋น ์ง€๋Š” ๊ฒƒ์„ ๋ฐฉ์ง€ํ•˜๊ธฐ ์œ„ํ•ด Neural Network๋กœ๋ถ€ํ„ฐ Hand Pose Prior์™€ Hand Segmentation ์ •๋ณด๋ฅผ ํ™œ์šฉํ•œ๋‹ค.
    • Neural Networks: DART์˜ ์ดˆ๊ธฐํ™” ๋ฐ ๊ฐ•๊ฑด์„ฑ(robustness) ํ™•๋ณด๋ฅผ ์œ„ํ•ด ๋‘ ๋‹จ๊ณ„๋กœ ํ•™์Šต๋œ Neural Network๊ฐ€ ์‚ฌ์šฉ๋œ๋‹ค.
      • First Phase (with Glove for Annotation): ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘ ์ดˆ๊ธฐ์—๋Š” ์ƒ‰์ƒ ๋ธ”๋กญ(coloured blobs)์ด ๋ถ€์ฐฉ๋œ ์žฅ๊ฐ‘์„ ์ฐฉ์šฉํ•˜๊ณ  ResNet-50 with spatial-softmax ๊ธฐ๋ฐ˜์˜ GloveNet์„ ํ›ˆ๋ จ์‹œ์ผœ RGB ์ด๋ฏธ์ง€์—์„œ ๋ธ”๋กญ์˜ 2D ์œ„์น˜๋ฅผ ์˜ˆ์ธกํ•œ๋‹ค. ํŠนํžˆ ์†๋ฐ”๋‹ฅ ๋’ท๋ฉด์˜ 3๊ฐœ ๋ธ”๋กญ์„ ์‚ฌ์šฉํ•˜์—ฌ Hand Pose๋ฅผ ์ถ”์ •ํ•œ๋‹ค. 4๋Œ€ ์นด๋ฉ”๋ผ์˜ ์˜ˆ์ธก ๋ฐ Depth ๊ฐ’์„ ์ด์šฉํ•˜์—ฌ 3D Hand Pose๋ฅผ ์–ป๊ณ  Hand Segmentation์„ ์ˆ˜ํ–‰ํ•œ๋‹ค. DART๋Š” ์ด Segmentation๋œ Point Cloud์— ๋Œ€ํ•ด์„œ๋งŒ ์ตœ์ ํ™”ํ•˜์—ฌ Annotation ๋ฐ์ดํ„ฐ๋ฅผ ์ƒ์„ฑํ•œ๋‹ค.
      • Second Phase (Bare Hand Tracking): First Phase์—์„œ ์ƒ์„ฑ๋œ Annotation์„ ์‚ฌ์šฉํ•˜์—ฌ ๋งจ์†(bare hand)์˜ Fused Point Cloud๋ฅผ ์ง์ ‘ ์ฒ˜๋ฆฌํ•œ๋‹ค. PointNet++ ๊ธฐ๋ฐ˜ ์•„ํ‚คํ…์ฒ˜๊ฐ€ ์‚ฌ์šฉ๋˜๋ฉฐ, ํ…Œ์ด๋ธ” Point ์ œ๊ฑฐ ํ›„ Arm๊ณผ Body๋ฅผ ํฌํ•จํ•œ Point Cloud๋ฅผ ์ž…๋ ฅ๋ฐ›์•„ Hand๋ฅผ Localizeํ•˜๊ณ  Hand Pose ๋ฐ Hand Segmentation(Auxiliary Segmentation Loss ์‚ฌ์šฉ)์„ ์ถ”์ •ํ•œ๋‹ค. ์ด ๋„คํŠธ์›Œํฌ๋Š” ์†์˜ 23๊ฐœ Keypoint(์†๊ฐ€๋ฝ๋ณ„ 4๊ฐœ ๊ด€์ ˆ + ์†๋ฐ”๋‹ฅ ๋’ท๋ฉด 3๊ฐœ)์˜ 3D ์ขŒํ‘œ๋ฅผ ์˜ˆ์ธกํ•˜๋„๋ก ํ›ˆ๋ จ๋œ๋‹ค. Uniform Sub-sampling์œผ๋กœ ์ธํ•œ ์†๊ฐ€๋ฝ Keypoint ์˜ˆ์ธก ์ •ํ™•๋„ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด, ์ฒซ ๋‹จ๊ณ„์˜ Pose์™€ Segmentation์„ ๊ธฐ๋ฐ˜์œผ๋กœ ์† ์œ„์˜ ํฌ์ธํŠธ๋ฅผ ๋‹ค์‹œ ์ƒ˜ํ”Œ๋งํ•˜์—ฌ ๋‘ ๋ฒˆ์งธ ๋‹จ๊ณ„(second stage)์—์„œ Keypoint ์˜ˆ์ธก์„ ์ •๋ฐ€ํ™”ํ•œ๋‹ค.
      • JointNet: ์˜ˆ์ธก๋œ 23๊ฐœ Keypoint ์œ„์น˜(23x3 ๋ฒกํ„ฐ)๋ฅผ 20๊ฐœ Joint Angle(์†๊ฐ€๋ฝ ๊ด€์ ˆ)๋กœ ๋งคํ•‘ํ•˜๋Š” 2-layer fully connected network์ธ JointNet์„ ์‚ฌ์šฉํ•˜์—ฌ DART์˜ Joint Prior๋ฅผ ์ œ๊ณตํ•œ๋‹ค.
  2. ์šด๋™ํ•™์  ๋ฆฌํƒ€๊ฒŸํŒ… (Kinematic Retargeting): ์ธ๊ฐ„ ์†์˜ ๊ด€์ ˆ ์›€์ง์ž„์„ Allegro Hand์˜ ๊ด€์ ˆ ์›€์ง์ž„์œผ๋กœ ๋งคํ•‘ํ•˜๋Š” ๊ณผ์ •์ด๋‹ค. ์ธ๊ฐ„ ์†๊ณผ Allegro Hand๋Š” ์šด๋™ํ•™์ ์œผ๋กœ ๋‹ค๋ฅด๋ฏ€๋กœ, Grasping๊ณผ Manipulation์— ์ค‘์š”ํ•œ Fingertip Task-Space Metrics์— ์šฐ์„ ์ˆœ์œ„๋ฅผ ๋‘”๋‹ค.
    • ๋น„์šฉ ํ•จ์ˆ˜(Cost Function)๋Š” ๋‹ค์Œ๊ณผ ๊ฐ™์ด ์ •์˜๋œ๋‹ค: C(q_h, q_a) = \frac{1}{2N}\sum_{i=0} s(d_i)||r_i(q_a) - f(d_i)\hat{r}_i(q_h)||^2 + \gamma||q_a||^2 ์—ฌ๊ธฐ์„œ q_h, q_a๋Š” ๊ฐ๊ฐ ์ธ๊ฐ„ ์†๊ณผ Allegro Hand์˜ ๊ด€์ ˆ ๊ฐ๋„์ด๋ฉฐ, r_i๋Š” ํ•œ ์ขŒํ‘œ๊ณ„(์˜ˆ: ์†๋ฐ”๋‹ฅ)์—์„œ ๋‹ค๋ฅธ ์ขŒํ‘œ๊ณ„(์˜ˆ: ์†๊ฐ€๋ฝ ๋)๊นŒ์ง€์˜ ๋ฒกํ„ฐ์ด๋‹ค. d_i = ||r_i(q_h)||, \hat{r}_i(q_h) = \frac{r_i(q_h)}{||r_i(q_h)||}์ด๋‹ค.
    • s(d_i)๋Š” ์Šค์œ„์นญ ๊ฐ€์ค‘์น˜ ํ•จ์ˆ˜(switching weight function)๋กœ, ์ถ”์  ์˜ค๋ฅ˜๊ฐ€ ์žˆ์„ ๋•Œ ์ •๋ฐ€ ํŒŒ์ง€(precision grasp) ์‹œ ์†๊ฐ€๋ฝ ์ถฉ๋Œ์„ ๋ฐฉ์ง€ํ•˜๊ณ  Thumb์™€์˜ ์ ‘์ด‰์„ ๊ฐ€๊น๊ฒŒ ํ•˜๋Š” Projection Scheme์— ์‚ฌ์šฉ๋œ๋‹ค. d_i๊ฐ€ ์ž„๊ณ„๊ฐ’ \epsilon๋ณด๋‹ค ์ž‘์„ ๊ฒฝ์šฐ, Primary Finger-Thumb ๋ฒกํ„ฐ์—๋Š” 0, Primary Finger-Primary Finger ๋ฒกํ„ฐ์—๋Š” 400์˜ ๊ฐ€์ค‘์น˜๋ฅผ ๋ถ€์—ฌํ•œ๋‹ค (Table I).
    • f(d_i)๋Š” ๊ฑฐ๋ฆฌ ํ•จ์ˆ˜(distancing function)๋กœ, d_i๊ฐ€ \epsilon๋ณด๋‹ค ํด ๊ฒฝ์šฐ \beta d_i (\beta=1.6)๋กœ ๋น„๋ก€ ์Šค์ผ€์ผ๋งํ•˜๊ณ , ์ž‘์„ ๊ฒฝ์šฐ Primary Finger-Thumb ๊ฐ„์˜ ๊ฑฐ๋ฆฌ๋Š” \eta_1 (10^{-4}m), Primary Finger ๊ฐ„์˜ ๊ฑฐ๋ฆฌ๋Š” \eta_2 (3 \times 10^{-2}m)๋กœ ๊ฐ•์ œํ•˜์—ฌ ์ตœ์†Œ/์ตœ๋Œ€ ๊ฑฐ๋ฆฌ๋ฅผ ์œ ์ง€ํ•œ๋‹ค.
    • \gamma||q_a||^2๋Š” ์ •๊ทœํ™” ํ•ญ(regularization term)์œผ๋กœ, \gamma=2.5 \times 10^{-3}์ด๋ฉฐ Allegro ๊ด€์ ˆ ๊ฐ๋„๋ฅผ 0(์™„์ „ํžˆ ์—ด๋ฆฐ ์†)์œผ๋กœ ์ •๊ทœํ™”ํ•˜์—ฌ ํ•ด์˜ ์ค‘๋ณต์„ ์ค„์ด๊ณ  ๋น„์ •์ƒ์ ์ธ ์ž์„ธ๋ฅผ ๋ฐฉ์ง€ํ•œ๋‹ค.
    • ๋ฒกํ„ฐ r_i๋Š” ๊ฑฐ๋ฆฌ์™€ ๋ฐฉํ–ฅ๋ฟ๋งŒ ์•„๋‹ˆ๋ผ ์ขŒํ‘œ๊ณ„์˜ Orientation ์ •๋ณด๋„ ํฌํ•จํ•œ๋‹ค. Allegro Hand์˜ Primary Finger distal ๊ด€์ ˆ์€ medial ๊ด€์ ˆ๊ณผ ๋™์ผํ•˜๊ฒŒ ์ œ์•ฝ๋œ๋‹ค.
    • ์ด ๋น„์šฉ ํ•จ์ˆ˜๋Š” NLopt ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ์˜ Sequential Least-Squares Quadratic Programming (SLSQP) ์•Œ๊ณ ๋ฆฌ์ฆ˜ [35, 36, 37]์„ ์‚ฌ์šฉํ•˜์—ฌ ์‹ค์‹œ๊ฐ„์œผ๋กœ ์ตœ์†Œํ™”๋œ๋‹ค. Forward Kinematic ๊ณ„์‚ฐ์€ Orocos Kinematics and Dynamics library ๋ฅผ ์‚ฌ์šฉํ•œ๋‹ค. ๊ฒฐ๊ณผ๋Š” First-Order Low-Pass Filter๋ฅผ ๊ฑฐ์นœ๋‹ค.
  3. ๋ชจ์…˜ ์ƒ์„ฑ ๋ฐ ์ œ์–ด (Motion Generation and Control):
    • Allegro Palm์˜ Cartesian Pose๋Š” Riemannian Motion Policies (RMPs)๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ์ œ์–ด๋œ๋‹ค. RMPs๋Š” Arm์˜ Torque-Level Impedance Controller์— ๋ชฉํ‘œ Joint Trajectory๋ฅผ 200Hz๋กœ ๋ณด๋‚ธ๋‹ค.
    • Kinematically Retargeting๋œ Allegro Angles๋Š” Allegro Hand์˜ Torque-Level Joint Controller์— 30Hz๋กœ ๋ณด๋‚ธ๋‹ค.
    • ๋กœ๋ด‡๊ณผ ์นด๋ฉ”๋ผ ์‹œ์Šคํ…œ ๊ฐ„์˜ ๊ณต๊ฐ„ ์ •๋ ฌ์€ ์ดˆ๊ธฐ ์† ์ž์„ธ(ํ…Œ์ด๋ธ”๊ณผ ํ‰ํ–‰ํ•œ ์™„์ „ํžˆ ์—ด๋ฆฐ ์†)๋ฅผ ๊ธฐ์ค€์œผ๋กœ ๋ณด์ •ํ•˜์—ฌ ์กฐ์ž‘์ž๊ฐ€ ์ง๊ด€์ ์œผ๋กœ ๋กœ๋ด‡์„ ์ œ์–ดํ•  ์ˆ˜ ์žˆ๊ฒŒ ํ•œ๋‹ค.

์‹คํ—˜ ๋ฐ ๊ฒฐ๊ณผ: DexPilot ์‹œ์Šคํ…œ์€ Pringles ์บ” ์ •๋ ฌ, ์ปต ์‚ฝ์ž…, ๋‘ ๊ฐœ ํ๋ธŒ ํŒŒ์ง€, ์ง€๊ฐ‘์—์„œ ๋ˆ ๊บผ๋‚ด๊ธฐ ๋“ฑ 15๊ฐ€์ง€ ๋‹ค์–‘ํ•œ Task (Table II, Fig. 1)์— ๋Œ€ํ•ด ๋‘ ๋ช…์˜ ์กฐ์ž‘์ž๋ฅผ ๋Œ€์ƒ์œผ๋กœ ํ…Œ์ŠคํŠธ๋˜์—ˆ๋‹ค. ์„ฑ๋Šฅ์€ ํ‰๊ท  ์™„๋ฃŒ ์‹œ๊ฐ„(Mean Completion Time, CT)๊ณผ ์„ฑ๊ณต๋ฅ (Success Rate)๋กœ ์ธก์ •๋˜์—ˆ๋‹ค (Fig. 14, 15). ๊ฒฐ๊ณผ๋Š” ์ด‰๊ฐ ํ”ผ๋“œ๋ฐฑ ๋ถ€์žฌ์—๋„ ๋ถˆ๊ตฌํ•˜๊ณ  ๋‹ค์–‘ํ•œ Task์—์„œ ๋†’์€ ์„ฑ๊ณต๋ฅ ์„ ๋‹ฌ์„ฑํ–ˆ์Œ์„ ๋ณด์—ฌ์ค€๋‹ค. ๋ณต์žกํ•œ Task ์ˆ˜ํ–‰์€ Bare Hand ๊ด€์ฐฐ๋งŒ์œผ๋กœ๋„ ์ •๊ตํ•œ Skill ์ „๋‹ฌ์ด ๊ฐ€๋Šฅํ•จ์„ ์ž…์ฆํ•œ๋‹ค. Task ์ˆ˜ํ–‰ ์ค‘ ์ˆ˜์ง‘๋œ ํ’๋ถ€ํ•œ Sensorimotor ๋ฐ์ดํ„ฐ(BioTac ์‹ ํ˜ธ ๋“ฑ)๋Š” ํ–ฅํ›„ ๋กœ๋ด‡ ํ•™์Šต์— ํ™œ์šฉ๋  ์ˆ˜ ์žˆ๋‹ค.

๋…ผ์˜ ๋ฐ ํ•œ๊ณ„: DexPilot์€ ๋ณต์žกํ•œ Manipulation Task ํ•ด๊ฒฐ์„ ์œ„ํ•œ ์‹คํ–‰ ๊ฐ€๋Šฅํ•˜๊ณ  ์ €๋ ดํ•œ Teleoperation ์†”๋ฃจ์…˜์„ ์ œ๊ณตํ•˜๋ฉฐ, ํ•™์Šต์„ ์œ„ํ•œ ๊ณ ํ’ˆ์งˆ ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•œ๋‹ค. ํ–ฅํ›„ ๊ฐœ์„  ๋ฐฉํ–ฅ์œผ๋กœ๋Š” Deep Learning ๋ฐœ์ „์„ ํ†ตํ•œ Hand Tracking ์ •ํ™•๋„ ํ–ฅ์ƒ, RGB ๋ฐ์ดํ„ฐ ํ™œ์šฉ, ์ž์œจ์ ์ธ ํž˜ ์กฐ์ ˆ ์ œ์–ด ๊ธฐ๋Šฅ ํ†ตํ•ฉ, ์˜๋„ ์ธ์‹ ๋“ฑ์ด ์ œ์‹œ๋œ๋‹ค. ํ•œ๊ณ„์ ์œผ๋กœ๋Š” ์ œํ•œ์ ์ธ ์ž‘์—… ๊ณต๊ฐ„, Projection Scheme์ด Finger Gaiting์ด๋‚˜ ์ž‘์€ ๋ฌผ์ฒด ๋†“๊ธฐ๋ฅผ ๋ฐฉํ•ดํ•˜๋Š” ๋ฌธ์ œ, ์ด‰๊ฐ ํ”ผ๋“œ๋ฐฑ ๋ถ€์žฌ๋กœ ์ธํ•œ ์ •๋ฐ€ Task(์˜ˆ: NIST Peg-in-hole insertion)์˜ ์–ด๋ ค์›€, ์‹œ์Šคํ…œ Latency, ์กฐ์ž‘์ž ์† ํฌ๊ธฐ/๋ชจ์–‘์— ๋Œ€ํ•œ ๊ฐ•๊ฑด์„ฑ ๋“ฑ์ด ์–ธ๊ธ‰๋œ๋‹ค. ํŠนํžˆ Peg-in-hole Insertion๊ณผ ๊ฐ™์€ ๊ณ ์ •๋ฐ€ Task๋Š” ํ˜„์žฌ ์‹œ์Šคํ…œ์œผ๋กœ๋„ ์‹œ๋„๋Š” ๊ฐ€๋Šฅํ•˜๋‚˜ ์„ฑ๊ณต๋ฅ ์ด ๋งค์šฐ ๋‚ฎ์•„ ์ถ”๊ฐ€์ ์ธ ๊ฐœ์„ ์ด ํ•„์š”ํ•˜๋‹ค.

์ฃผ์š” Contribution:

  • Markerless, glove-free, ์ „์ ์œผ๋กœ Vision-based์ธ Teleoperation ์‹œ์Šคํ…œ์œผ๋กœ ๊ณ ์ž์œ ๋„ ๋กœ๋ด‡ ์†-ํŒ” ์‹œ์Šคํ…œ์„ ์ง์ ‘ ๋ชจ๋ฐฉ ์ œ์–ดํ•œ๋‹ค.
  • Hand Joint ์ถ”์  ์˜ค๋ฅ˜ ์กด์žฌ ์‹œ์—๋„ ์† ๊ธฐ๊ต์™€ Precision Grasp์˜ ์‹คํ˜„ ๊ฐ€๋Šฅ์„ฑ์„ ๋ณด์กดํ•˜๋Š” Novel Cost Function ๋ฐ Projection Scheme for Kinematically Retargeting Human Hand Joints to Allegro Hand Joints.
  • Fine Manipulation๊ณผ Dexterity๋ฅผ ํฌํ•จํ•˜๋Š” ๋‹ค์–‘ํ•œ Task์—์„œ์˜ Teleoperation ์‹œ์Šคํ…œ ์‹œ์—ฐ ๋ฐ ํ‰๊ฐ€.
  • ์ด‰๊ฐ ํ”ผ๋“œ๋ฐฑ ๋ถ€์žฌ์—๋„ ๋†’์€ Task ์„ฑ๊ณต๋ฅ  ๋‹ฌ์„ฑ.

Detail Review

1. ์—ฐ๊ตฌ ๊ฐœ์š” ๋ฐ ๊ธฐ์—ฌ

DexPilot์€ ๊ณ ์ž์œ ๋„(23 DoF)์˜ ๋‹ค์ง€๋Šฅ ๋กœ๋ด‡ ์†โ€“ํŒ” ์‹œ์Šคํ…œ์„ ์ €๋น„์šฉยท์‹œ๊ฐ ๊ธฐ๋ฐ˜์œผ๋กœ ์›๊ฒฉ ์กฐ์ž‘ํ•˜๊ธฐ ์œ„ํ•œ ์‹œ์Šคํ…œ์ด๋‹ค. ์ „ํ†ต์ ์ธ ํ…”๋ ˆ์˜คํผ๋ ˆ์ด์…˜ ์‹œ์Šคํ…œ์€ ๊ณ ์ž์œ ๋„ ๋กœ๋ด‡ ์ œ์–ด์‹œ ๊ณ ๊ฐ€์˜ ์„ผ์„œ(๊ธ€๋Ÿฌ๋ธŒ, ๋งˆ์ปค, ๋ชจ์…˜์บก์ฒ˜ ๋“ฑ)๋ฅผ ์š”๊ตฌํ•˜์ง€๋งŒ, DexPilot์€ ์‹ค์ œ ์ธ๊ฐ„์˜ ๋งจ์† ์›€์ง์ž„๋งŒ์œผ๋กœ 23์ž์œ ๋„์˜ Allegro ๋กœ๋ด‡ ์†๊ณผ ๋กœ๋ด‡ ํŒ”์„ ์ง์ ‘ ๋ชจ์‚ฌยท์ œ์–ดํ•œ๋‹ค.

์ฃผ์š” ๊ธฐ์—ฌ๋กœ๋Š”

  1. ๋งˆ์ปค๋‚˜ ์žฅ๊ฐ‘ ์—†์ด ์ˆœ์ˆ˜ RGB-D ์นด๋ฉ”๋ผ๋กœ ์ธ๊ฐ„ ์†์„ ์ถ”์ ํ•˜์—ฌ ๋กœ๋ด‡ ์†์— ์ „์‚ฌํ•˜๋Š” ์‹œ๊ฐ ๊ธฐ๋ฐ˜ ๊ธ€๋Ÿฌ๋ธŒ-ํ”„๋ฆฌ ํ…”๋ ˆ์˜คํผ๋ ˆ์ด์…˜ ๊ตฌํ˜„,
  2. ์† ๋(fingertip) ์œ„์น˜ ๋ฐ ๋ฐฉํ–ฅ์„ ๋ณด์กดํ•˜๋ฉด์„œ ์ธ๊ฐ„ ์† ๊ด€์ ˆ ์ƒํƒœ๋ฅผ Allegro ์† ๊ด€์ ˆ๋กœ ๋งคํ•‘ํ•˜๋Š” ์ƒˆ๋กœ์šด ๋น„์šฉ ํ•จ์ˆ˜ ๋ฐ ํˆฌ์˜(projection) ๊ธฐ๋ฒ• ์ œ์•ˆ,
  3. ์ •๋ฐ€ํ•œ ์ง‘๊ธฐ(pinching)์™€ ๋‹ค์ค‘ ๋‹จ๊ณ„ ์กฐ์ž‘์„ ํฌํ•จํ•œ ๋‹ค์–‘ํ•œ ๊ณผ์ œ(์ง€ํ ์ถ”์ถœ, ์„œ๋ž ์—ด๊ธฐ, ์•ฝ๋ณ‘ ๊ฐœ๋ด‰ ๋“ฑ)์—์„œ 23DoF ์‹œ์Šคํ…œ ์กฐ์ž‘์„ ์‹œ์—ฐ,
  4. ๋‘ ๋ช…์˜ ํŒŒ์ผ๋Ÿฟ์œผ๋กœ ์ง„ํ–‰ํ•œ ์‹คํ—˜์—์„œ ์†๋„ ๋ฐ ์„ฑ๊ณต๋ฅ  ์ง€ํ‘œ๋กœ ์‹œ์Šคํ…œ ์„ฑ๋Šฅ์„ ํ‰๊ฐ€์ด๋‹ค.

์ด ๊ฒฐ๊ณผ ๊ณ ์ž์œ ๋„ ๋กœ๋ด‡ ์กฐ์ž‘์šฉ ๋Œ€์šฉ๋Ÿ‰ ์ƒํƒœยทํ–‰๋™(์ƒํƒœ/์•ก์…˜) ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘์ด ๊ฐ€๋Šฅํ•˜๋ฉฐ, ํ–ฅํ›„ ๋จธ์‹ ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜ ์กฐ์ž‘ ์ •์ฑ… ํ•™์Šต์— ์œ ์šฉํ•œ ๋ฐ์ดํ„ฐ์…‹์„ ์ œ๊ณตํ•  ์ˆ˜ ์žˆ๋‹ค.

2. ์‹œ์Šคํ…œ ๊ตฌ์„ฑ ๋ฐ ์† ์ถ”์  ๋ฐฉ๋ฒ•

DexPilot์˜ ํ•˜๋“œ์›จ์–ด๋Š” KUKA LBR iiwa7 ํ˜‘๋™๋กœ๋ด‡ ํŒ”๊ณผ Wonik Allegro ์†์œผ๋กœ ๊ตฌ์„ฑ๋˜๋ฉฐ, Allegro ์† ๋์—๋Š” Biotac ์ด‰๊ฐ ์„ผ์„œ๋ฅผ ์žฅ์ฐฉํ•˜์˜€๋‹ค. ์‚ฌ๋žŒ ํŒŒ์ผ๋Ÿฟ ์˜์—ญ์—๋Š” ๊ฒ€์€์ƒ‰ ์ฒœ์œผ๋กœ ๋ฎ์ธ ํ…Œ์ด๋ธ” ์œ„์— 4๋Œ€์˜ Intel RealSense D415 RGB-D ์นด๋ฉ”๋ผ๊ฐ€ ๋ฐฐ์น˜๋˜์–ด, ์ธ๊ฐ„ ์†์„ ์—ฌ๋Ÿฌ ์‹œ์ ์—์„œ ๊ด€์ฐฐํ•œ๋‹ค.

์‹œ์Šคํ…œ์€ ์„ธ ๊ฐœ์˜ ์ฒ˜๋ฆฌ ์Šค๋ ˆ๋“œ๋กœ ๋ณ‘๋ ฌ ์‹คํ–‰๋œ๋‹ค.

  • ํ•™์Šต ์Šค๋ ˆ๋“œ๋Š” 4๊ฐœ ์นด๋ฉ”๋ผ์˜ ์œตํ•ฉ๋œ ํฌ์ธํŠธ ํด๋ผ์šฐ๋“œ๋กœ๋ถ€ํ„ฐ ์†์˜ ์ž์„ธ ๋ฐ ๊ด€์ ˆ๊ฐ์„ ์ถ”์ •ํ•˜๋Š” ์‹ ๊ฒฝ๋ง์„ ์‹คํ–‰ํ•˜๋ฉฐ, ์ด๋ฅผ ํ†ตํ•ด ์–ป์€ ์ดˆ๊ธฐ ์ถ”์ •๊ฐ’์„ ํ•˜์œ„ ๋ชจ๋“ˆ์— ์ œ๊ณตํ•œ๋‹ค.
  • ์ถ”์  ์Šค๋ ˆ๋“œ๋Š” DART(Differentiable Articulated Rigid-body Tracker)๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ์ธ๊ฐ„ ์† ๋ชจ๋ธ์˜ 6์ž์œ ๋„ ์œ„์น˜ ๋ฐ 20๊ฐœ ๊ด€์ ˆ(๊ฐ ์†๊ฐ€๋ฝ๋‹น 4๊ฐœ: 1 abduction, 3 flexion)์˜ ์ž์„ธ๋ฅผ ์ง€์†์ ์œผ๋กœ ์ตœ์ ํ™” ์ถ”์ ํ•œ๋‹ค. ์ด๋•Œ, ์‹ ๊ฒฝ๋ง์ด ์ œ๊ณตํ•œ ์† ์œ„์น˜/๊ด€์ ˆ๊ฐ ์˜ˆ์ธก์ด ์ดˆ๊ธฐ๊ฐ’(prior)์œผ๋กœ ์‚ฌ์šฉ๋˜์–ด ๋กœ์ปฌ ๋ฏธ๋‹ˆ๋งˆ๋กœ ๋น ์ง€๋Š” ๊ฒƒ์„ ๋ฐฉ์ง€ํ•œ๋‹ค.
  • ์ œ์–ด ์Šค๋ ˆ๋“œ๋Š” Riemannian Motion Policy(RMP) ๊ธฐ๋ฐ˜์˜ ์ œ์–ด ๋ฐฉ์ •์‹์„ ๊ณ„์‚ฐํ•˜์—ฌ Allegro ์†๋ฐ”๋‹ฅ์˜ ๋ชฉํ‘œ ์œ„์น˜ยท์ž์„ธ์™€ ํŒ” ๋™์ž‘์„ ์ƒ์„ฑํ•œ๋‹ค. ์ „์ฒด ์‹œ์Šคํ…œ์˜ ์—”๋“œ-ํˆฌ-์—”๋“œ ์ง€์—ฐ(latency)์€ ์•ฝ 1์ดˆ ์ •๋„๋กœ ๋ณด๊ณ ๋˜์—ˆ๋‹ค.

์‹œ๊ฐ ๊ธฐ๋ฐ˜ ์† ์ถ”์ ์„ ์œ„ํ•ด DexPilot์€ ๋‘ ๋‹จ๊ณ„์˜ ๋”ฅ๋Ÿฌ๋‹ ๋ชจ๋ธ๊ณผ DART ์ตœ์ ํ™”๋ฅผ ๊ฒฐํ•ฉํ•˜์˜€๋‹ค.

  1. ์ฒซ ๋ฒˆ์งธ ๋‹จ๊ณ„์—์„œ๋Š” ํŒŒ์ผ๋Ÿฟ์ด ์ฐฉ์šฉํ•œ ์ปฌ๋Ÿฌ ์žฅ๊ฐ‘(glove)์„ ํ™œ์šฉํ•˜์—ฌ ํ•™์Šต ๋ฐ์ดํ„ฐ๋ฅผ ์–ป๋Š”๋‹ค. ์žฅ๊ฐ‘์˜ ์†๊ฐ€๋ฝ ๋๊ณผ ์†๋ฐ”๋‹ฅ์— ์„œ๋กœ ๋‹ค๋ฅธ ์ƒ‰์˜ ์ ์„ ๋ถ€์ฐฉํ•˜๊ณ , 4๋Œ€์˜ RGB ์นด๋ฉ”๋ผ๋กœ ๊ด€์ฐฐํ•œ RGB ์˜์ƒ์„ ResNet-50 ๊ธฐ๋ฐ˜์˜ ํšŒ๊ท€ ๋„คํŠธ์›Œํฌ(GloveNet)๋ฅผ ํ†ตํ•ด ์ƒ‰์ ์˜ 2D ์œ„์น˜๋ฅผ ์ถ”์ •ํ•œ๋‹ค. ์ด๋ ‡๊ฒŒ ์–ป์€ 2D ์ขŒํ‘œ์— ๊นŠ์ด(depth)๋ฅผ ๊ฒฐํ•ฉํ•ด 3D ์œ„์น˜๋ฅผ ๊ณ„์‚ฐํ•˜๊ณ , ๊ทธ๋กœ๋ถ€ํ„ฐ ์†์˜ ํฌ์ฆˆ(์„ธ ์ ์˜ ์œ„์น˜)์™€ ๋ถ„ํ• (segmentation)์„ ๊ตฌํ•œ๋‹ค. ์ด ์ •๋ณด๋ฅผ ์ด์šฉํ•ด DART๊ฐ€ ์† ๋ชจ๋ธ์„ ์„ธ๋ถ„ํ™”(segmented point cloud)์— ๋งž์ถ”์–ด ์ตœ์ ํ™”ํ•˜๋„๋ก ํ•จ์œผ๋กœ์จ, ์ดˆ๊ธฐ์—๋Š” ์žฅ๊ฐ‘์„ ์“ด ์ƒํƒœ์—์„œ ์ •ํ™•ํ•œ ์† ๊ด€์ ˆ๊ฐ ์–ด๋…ธํ…Œ์ด์…˜์„ ์ƒ์„ฑํ•œ๋‹ค.
  2. ๋‘ ๋ฒˆ์งธ ๋‹จ๊ณ„์—์„œ๋Š” ์žฅ๊ฐ‘ ์—†์ด ์ƒ ๋ฐ์ดํ„ฐ๋ฅผ ์‚ฌ์šฉํ•œ๋‹ค. 4๊ฐœ ์นด๋ฉ”๋ผ์˜ ๊นŠ์ด ์˜์ƒ์„ ์œตํ•ฉํ•˜์—ฌ ํ…Œ์ด๋ธ” ํ‰๋ฉด์„ ์ œ๊ฑฐํ•œ ํ›„, ๋‚จ์€ ์†ยทํŒ” ํฌ์ธํŠธํด๋ผ์šฐ๋“œ๋ฅผ PointNet++ ๊ธฐ๋ฐ˜ ๋„คํŠธ์›Œํฌ์— ์ž…๋ ฅํ•œ๋‹ค. ์ด ๋„คํŠธ์›Œํฌ๋Š” ์† ๋ถ€๋ถ„์„ ๋ถ„๋ฆฌํ•˜๊ณ (์†๋ถ„ํ• ), ์†๋ผˆ์˜ 23๊ฐœ ์ฃผ์š” ๊ด€์ ˆ์ (keypoints; ์†๊ฐ€๋ฝ๋‹น 4๊ฐœ, ์†๋ฐ”๋‹ฅ ํ›„๋ฉด 3๊ฐœ)๋ฅผ 3D ์ขŒํ‘œ๋กœ ํšŒ๊ท€ํ•œ๋‹ค. ์ฒซ ๋‹จ๊ณ„์˜ ์†๋ฐ”๋‹ฅ ์ปฌ๋Ÿฌ ์žฅ๊ฐ‘ ๋ฐฉ์‹์œผ๋กœ ์ƒ์„ฑ๋œ ์–ด๋…ธํ…Œ์ด์…˜์„ ํ•™์Šต์— ์‚ฌ์šฉํ•˜์—ฌ, ์‹ค์ œ ๋งจ์† ๋ฐ์ดํ„ฐ์—์„œ๋„ ์† ๊ด€์ ˆ ํฌ์ฆˆ๋ฅผ ์˜ˆ์ธกํ•œ๋‹ค. ๋˜ํ•œ, 23๊ฐœ ํ‚คํฌ์ธํŠธ๋ฅผ 20์ฐจ์› ๊ด€์ ˆ๊ฐ์œผ๋กœ ๋ณ€ํ™˜ํ•˜๊ธฐ ์œ„ํ•œ JointNet(2์ธต ์™„์ „์—ฐ๊ฒฐ๋ง)๋„ ํ•จ๊ป˜ ํ•™์Šต์‹œ์ผฐ๋‹ค. ์ด ๋”ฅ ๋„คํŠธ์›Œํฌ๋“ค ๋•๋ถ„์— DART ์ถ”์ ์ด ์žฅ๊ธฐ๊ฐ„ ์•ˆ์ •์ ์œผ๋กœ ์ˆ˜ํ–‰๋˜๋ฉฐ, ๊ฒ€์ฆ ์…‹์—์„œ ํ‰๊ท  ํ‚คํฌ์ธํŠธ ์˜ค์ฐจ๋Š” ์•ฝ 9.7mm, ๊ด€์ ˆ๊ฐ ์˜ค์ฐจ๋Š” ์•ฝ 1.33ยฐ๋กœ ๋ณด๊ณ ๋˜์—ˆ๋‹ค. ๊ฒฐ๊ณผ์ ์œผ๋กœ, DexPilot์€ ์นด๋ฉ”๋ผ ํฌ์ธํŠธํด๋ผ์šฐ๋“œโ†’ํ‚คํฌ์ธํŠธโ†’๊ด€์ ˆ๊ฐ ์ถ”์ •โ†’DART ๋ฏธ์„ธ์กฐ์ •์˜ ํŒŒ์ดํ”„๋ผ์ธ์„ ํ†ตํ•ด ์ธ๊ฐ„ ์†์˜ ํฌ์ฆˆ์™€ ๊ด€์ ˆ ์ƒํƒœ๋ฅผ ์‹ค์‹œ๊ฐ„์œผ๋กœ ์–ป์–ด๋‚ธ๋‹ค.

3. ์ธ๊ฐ„-๋กœ๋ด‡ ์† ๋งคํ•‘ ์ „๋žต ๋ฐ ์ˆ˜ํ•™์  ๋ชจ๋ธ๋ง

์ธ๊ฐ„ ์†๊ณผ Allegro ๋กœ๋ด‡ ์†์€ ๊ด€์ ˆ ์ˆ˜, ๊ด€์ ˆ์ถ• ๋ฐฐ์น˜, ์†๊ฐ€๋ฝ ๊ธธ์ด ๋“ฑ์ด ํฌ๊ฒŒ ๋‹ค๋ฅด๊ธฐ ๋•Œ๋ฌธ์— ๋‹จ์ˆœํ•œ ๋Œ€์‘(mapping)์ด ๋ถˆ๊ฐ€๋Šฅํ•˜๋‹ค. DexPilot์€ ์ •๋ฐ€ ์กฐ์ž‘ ๊ด€์ ์—์„œ ์†๋(fingertip) ์ž‘์—… ๊ณต๊ฐ„(task-space) ์„ ์ตœ์šฐ์„ ์‹œํ•˜์—ฌ ๋‘ ์†์˜ ๋™์ž‘์„ ์—ฐ๊ฒฐํ•œ๋‹ค.

์†๋์„ ์ž‡๋Š” ์œ„์น˜์™€ ๋ฐฉํ–ฅ ์ •๋ณด๊ฐ€ ์ธ๊ฐ„ยท๋กœ๋ด‡ ์†์˜ ์ฃผ์š” ์กฐ์ž‘์„ ๊ฒฐ์ •ํ•œ๋‹ค๊ณ  ๋ณด๊ณ , ์ด๋“ค ์‚ฌ์ด ๊ฑฐ๋ฆฌ๋ฅผ ์ตœ์†Œํ™”ํ•˜๋Š” ์ตœ์ ํ™” ๊ธฐ๋ฐ˜ ๋งคํ•‘(cost function)์„ ์„ค๊ณ„ํ•˜์˜€๋‹ค. ๊ตฌ์ฒด์ ์œผ๋กœ, ์ธ๊ฐ„ ์† ์ž์„ธ q_h์™€ Allegro ์† ๊ด€์ ˆ q_a์— ๋Œ€ํ•ด ๋‹ค์Œ๊ณผ ๊ฐ™์€ ๋น„์šฉ ํ•จ์ˆ˜๋ฅผ ์ •์˜:

C(q_h, q_a) = \frac{1}{2}\sum_{i=1}^N s(d_i)\,|r_i(q_a) - f(d_i)\,\hat{r}_i(q_h)|^2 \;+\; \gamma|q_a|^2,

  • ์—ฌ๊ธฐ์„œ r_i(q)๋Š” ์†๋ฐ”๋‹ฅ(origin)์œผ๋กœ๋ถ€ํ„ฐ i๋ฒˆ์งธ ์†๋๊นŒ์ง€์˜ ๋ฒกํ„ฐ(๋˜๋Š” ์†๊ฐ€๋ฝ ๊ฐ„ ๋ฒกํ„ฐ)๋กœ, ๊ฐ๊ฐ Allegro ์†(r_i(q_a))๊ณผ ์ธ๊ฐ„ ์† ๋ชจ๋ธ(\hat{r}_i(q_h))์˜ ์ž‘์—… ๊ณต๊ฐ„์—์„œ ๊ณ„์‚ฐ๋œ๋‹ค.
  • \hat{r}_i(q_h)=r_i(q_h)/d_i๋Š” ์ •๊ทœํ™”๋œ ์ธ๊ฐ„ ์† ๋ฒกํ„ฐ์ด๋ฉฐ, d_i=|r_i(q_h)|
  • s(d_i)๋Š” ๊ฐ€์ค‘์น˜ ํ•จ์ˆ˜๋กœ์„œ ์ธ๊ฐ„ ์†์˜ ์—„์ง€๊ฐ€ i๋ฒˆ์งธ ๋ฒกํ„ฐ(r_i(q_h))์™€ ๊ฐ€๊น๊ฒŒ ์ ‘์ด‰ํ•  ๋•Œ ์†๋ ๊ฐ„ ๊ฑฐ๋ฆฌ์— ๋” ํฐ ์ค‘์š”๋„๋ฅผ ๋ถ€์—ฌํ•œ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด, ์ž„๊ณ„๊ฑฐ๋ฆฌ \epsilon ์ดํ•˜๋กœ ๊ฐ€๊นŒ์›Œ์ง€๋ฉด ์—„์ง€์™€ ์†๋์ด ๋Œ€์‘๋˜๋Š” ๋ฒกํ„ฐ ์ง‘ํ•ฉ S1์ผ ๋•Œ s(d_i)=200, ์†๋ ์Œ(S2)์— ๋Œ€ํ•ด์„œ๋Š” s(d_i)=400 ๋“ฑ์œผ๋กœ ๊ธ‰๊ฒฉํžˆ ์ฆ๊ฐ€์‹œํ‚จ๋‹ค. ๋ฐ˜๋ฉด ๊ฑฐ๋ฆฌ๊ฐ€ \epsilon ์ด์ƒ์ด๋ฉด s(d_i)=1์œผ๋กœ ์ž‘๊ฒŒ ์ค€๋‹ค.
  • f(d_i)๋Š” ๊ฑฐ๋ฆฌ ์กฐ์ ˆ ํ•จ์ˆ˜๋กœ์„œ, ๋ณดํ†ต f(d_i)=\beta d_i (์ฆํญ๊ณ„์ˆ˜ \beta=1.6)๋กœ ์†๋ ๊ฐ„ ๊ฑฐ๋ฆฌ๋ฅผ ๊ทธ๋Œ€๋กœ ๋ณต์‚ฌํ•˜์ง€๋งŒ, ์ž„๊ณ„๊ฑฐ๋ฆฌ ์ดํ•˜์ผ ๋•Œ ์†๊ฐ€๋ฝ๋ผ๋ฆฌ ๊ฒน์น˜์ง€ ์•Š๋„๋ก ์ผ์ • ๊ฑฐ๋ฆฌ(\eta_1,\eta_2)๋ฅผ ๊ฐ•์ œํ•œ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด ์—„์ง€-์ฃผ์š” ์†๊ฐ€๋ฝ ์‚ฌ์ด๊ฐ€ ๋„ˆ๋ฌด ๊ฐ€๊นŒ์›Œ์ง€๋ฉด \eta_1=0.1\;\mathrm{mm}๋กœ ์ ‘์ด‰ ๊ฑฐ๋ฆฌ๋ฅผ ์œ ์ง€์‹œ์ผœ ํ•€์น˜ ์ง‘๊ธฐ๋ฅผ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•˜๋ฉฐ, ์ฃผ์š” ์†๊ฐ€๋ฝ ๊ฐ„์—๋Š” \eta_2=30\;\mathrm{mm}๋กœ ์ผ์ • ๊ฑฐ๋ฆฌ๋ฅผ ํ™•๋ณดํ•œ๋‹ค.
  • ๋งˆ์ง€๋ง‰์œผ๋กœ \gamma|q_a|^2 ํ•ญ(์ •๊ทœํ™” ํ•ญ)์€ Allegro ์†์„ ์™„์ „ํžˆ ํŽผ์นœ ์ƒํƒœ(q_a=0)๋กœ ์œ ๋„ํ•˜์—ฌ ์ค‘๋ณต์„ฑ(redundancy)์„ ์™„ํ™”ํ•˜๊ณ  ๊ธฐ๊ดดํ•œ ์ตœ์†Œํ•ด(์˜ˆ: ์†๊ฐ€๋ฝ์ด ์†๋ฐ”๋‹ฅ์— ํŒŒ๊ณ ๋“œ๋Š” ํ˜„์ƒ)๋ฅผ ๋ฐฉ์ง€ํ•œ๋‹ค. ์ด๋•Œ ์‚ฌ์šฉํ•˜๋Š” ๋ฒกํ„ฐ ์ง‘ํ•ฉ S1, S2๋Š” ํ‘œ I์— ์ •์˜๋œ ๊ฒƒ์ฒ˜๋Ÿผ โ€œ์—„์ง€์™€ ์ฃผ์š” ์†๊ฐ€๋ฝ(๊ฒ€์ง€ยท์ค‘์ง€ยท์•ฝ์ง€) ์‚ฌ์ด ๋ฒกํ„ฐโ€์™€ โ€œ์—„์ง€์™€ ๊ฐ๊ฐ ๋งคํ•‘๋œ ๋‘ ์ฃผ์š” ์†๊ฐ€๋ฝ ์‚ฌ์ด ๋ฒกํ„ฐโ€๋กœ ๊ตฌ์„ฑํ•œ๋‹ค. ๋˜ํ•œ, ํ•ด ๊ณต๊ฐ„ ํฌ๊ธฐ๋ฅผ ์ค„์ด๊ธฐ ์œ„ํ•ด Allegro ์†์˜ ๊ฒ€์ง€ยท์ค‘์ง€ยท์•ฝ์ง€ ๊ฐ๊ฐ์— ๋Œ€ํ•ด ์›์œ„๊ด€์ ˆ(distal joint)์˜ ๊ฐ๋„๋ฅผ ์ค‘๊ฐ„๊ด€์ ˆ(medial joint)๊ณผ ๊ฐ™๊ฒŒ ๊ณ ์ •ํ•˜๋Š” ์ œ์•ฝ์„ ๋‘์—ˆ๋‹ค. ์ด์™€ ๊ฐ™์ด ์„ค๊ณ„๋œ ๋น„์šฉ ํ•จ์ˆ˜๋ฅผ ๋งค ํ”„๋ ˆ์ž„๋งˆ๋‹ค ์ตœ์†Œํ™”ํ•˜๋ฉด ์ธ๊ฐ„ ์†์˜ ์†๋ ๋ฐฐ์น˜์™€ ์œ ์‚ฌํ•œ Allegro ์† ๊ตฌ์„ฑ์ด ์ƒ์„ฑ๋œ๋‹ค.

์ตœ์ ํ™”๋Š” NLopt ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ์˜ SLSQP(์ˆœ์ฐจ์  ์ด์ฐจ๊ณ„ํš๋ฒ•) ์•Œ๊ณ ๋ฆฌ์ฆ˜์œผ๋กœ ์‹ค์‹œ๊ฐ„ ์ˆ˜ํ–‰๋œ๋‹ค. ์ดˆ๊ธฐ ํ”„๋ ˆ์ž„์—๋Š” Allegro ๊ฐ๋„๋ฅผ ๋ชจ๋‘ 0(์™„์ „ ํŽผ์นจ)์œผ๋กœ ์‹œ์ž‘ํ•˜๊ณ , ์ดํ›„ ๋งค ํ”„๋ ˆ์ž„์€ ์ด์ „ ํ”„๋ ˆ์ž„ ํ•ด๋ฅผ ์ดˆ๊ธฐ๊ฐ’์œผ๋กœ ์‚ฌ์šฉํ•˜์—ฌ ์—ฐ์†์„ฑ์„ ์œ ์ง€ํ•œ๋‹ค. ์ธ๊ฐ„ ์† ๋ชจ๋ธ๊ณผ Allegro ์†์˜ ์ˆœ๋ฐฉํ–ฅ ๊ธฐ๊ตฌํ•™ ๊ณ„์‚ฐ์—๋Š” Orocos KDL ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๋ฅผ ์‚ฌ์šฉํ•˜์˜€๋‹ค. ์ตœ์ ํ™” ๊ฒฐ๊ณผ๋กœ ์–ป์€ Allegro ๊ด€์ ˆ๊ฐ์€ ๊ณ ์ฃผํŒŒ ๋…ธ์ด์ฆˆ๋ฅผ ์–ต์ œํ•˜๊ธฐ ์œ„ํ•ด 1์ฐจ ์ €์—ญ ํ†ต๊ณผ ํ•„ํ„ฐ๋ฅผ ๊ฑฐ์ณ ์ถœ๋ ฅํ•œ๋‹ค. ๊ฒฐ๊ณผ์ ์œผ๋กœ ์ด ๋ฆฌํƒ€๊ฒŸํŒ… ๋ชจ๋“ˆ์€ ์ธ๊ฐ„ ํŒŒ์ผ๋Ÿฟ์ด ์†์„ ๊ตฌ๋ถ€๋ฆฌ๊ฑฐ๋‚˜ ์—„์ง€์™€ ์†๊ฐ€๋ฝ ์‚ฌ์ด ๊ฑฐ๋ฆฌ๋ฅผ ์กฐ์ ˆํ•  ๋•Œ, ๊ทธ ์†๋ ๋™์ž‘์ด ๋กœ๋ด‡ ์†์—์„œ๋„ ์ž์—ฐ์Šค๋Ÿฝ๊ฒŒ ์žฌํ˜„๋˜๋„๋ก ๋™์ž‘ํ•œ๋‹ค.

4. ์† ๋ฆฌํƒ€๊ฒŸํŒ… ๋ชจ๋“ˆ ๋™์ž‘ ์›๋ฆฌ์™€ ์ œ์•ฝ์กฐ๊ฑด

DexPilot์˜ ๋ฆฌํƒ€๊ฒŸํŒ… ๋ชจ๋“ˆ์€ ์ถ”์  ์Šค๋ ˆ๋“œ ๋‚ด๋ถ€์—์„œ ์ž‘๋™ํ•˜๋ฉฐ, ์ธ๊ฐ„ ์† ์ถ”์  ๊ฒฐ๊ณผ๋ฅผ Allegro ์† ์ œ์–ด ๋ช…๋ น์œผ๋กœ ๋ณ€ํ™˜ํ•˜๋Š” ์‹ค์‹œ๊ฐ„ ์ตœ์ ํ™” ์—”์ง„์ด๋‹ค. ๋งค ์ฃผ๊ธฐ๋งˆ๋‹ค ์•ž์„œ ๊ณ„์‚ฐ๋œ ์ธ๊ฐ„ ์† ๊ด€์ ˆ๊ฐ์„ ์ž…๋ ฅ์œผ๋กœ ํ•˜์—ฌ ์œ„์˜ ๋น„์šฉ ํ•จ์ˆ˜๋ฅผ ์ตœ์†Œํ™”ํ•˜๋ฉฐ, ์ด๋•Œ s(d_i)๋‚˜ f(d_i) ๋“ฑ์˜ ๊ธฐ๋ฒ•์œผ๋กœ ์—„์ง€-๊ฒ€์ง€ ๊ฐ„ ํ”ฝ์Šค์ณ ๋™์ž‘์„ ๊ฐ•์ œํ•œ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด, ์ง€ํ๋ฅผ ํ•€์น˜ํ•  ๋•Œ์™€ ๊ฐ™์ด ์—„์ง€์™€ ๊ฒ€์ง€ ์‚ฌ์ด ๊ฑฐ๋ฆฌ๊ฐ€ ์ž‘์•„์ ธ d_i<\epsilon์ด ๋˜๋ฉด ์†๋ ๊ฐ„ ๊ฑฐ๋ฆฌ๋ฅผ ์œ ์ง€ํ•˜๋„๋ก f(d_i)๊ฐ€ ์ž‘์•„์ง€๋ฉฐ, ๋™์‹œ์— ๊ฐ€์ค‘์น˜ s(d_i)๊ฐ€ ์ปค์ ธ ํ•ด๋‹น ์†๋ ๋ฒกํ„ฐ ํ•ญ์ด ๋น„์šฉ์— ํฌ๊ฒŒ ๋ฐ˜์˜๋œ๋‹ค. ์ด๋Ÿฌํ•œ ํˆฌ์˜(projection) ๊ธฐ๋ฒ•์€ ์นด๋ฉ”๋ผ ๊ธฐ๋ฐ˜ ์ถ”์ ์˜ ์˜ค์ฐจ์—๋„ ๋ถˆ๊ตฌํ•˜๊ณ  ์ •ํ™•ํ•œ ํ•€์น˜ ์ž์„ธ๋ฅผ ์œ ๋„ํ•  ์ˆ˜ ์žˆ๊ฒŒ ํ•ด์ฃผ์ง€๋งŒ, ํ›„์ˆ ํ•  ์ž‘์€ ๋ฌผ์ฒด ๋†“๊ธฐ ๋“ฑ์˜ ์ƒํ™ฉ์—์„œ๋Š” ์†๊ฐ€๋ฝ์„ ๋„ˆ๋ฌด ์˜ค๋ž˜ ์œ ์ง€ํ•˜๊ฒŒ ๋งŒ๋“œ๋Š” ๋ถ€์ž‘์šฉ๋„ ๋ฐœ์ƒํ•  ์ˆ˜ ์žˆ๋‹ค.

๋ฆฌํƒ€๊ฒŸํŒ… ์ตœ์ ํ™”๋Š” ์‹ค์‹œ๊ฐ„์œผ๋กœ ์‹คํ–‰๋˜์–ด์•ผ ํ•˜๋ฏ€๋กœ, ๊ณ„์‚ฐ ๋ณต์žก๋„๋ฅผ ์ค„์ด๊ณ  ์†”๋ฃจ์…˜์˜ ์—ฐ์†์„ฑ์„ ๋ณด์žฅํ•˜๋Š” ์—ฌ๋Ÿฌ ์ œ์•ฝ์กฐ๊ฑด๋„ ์ ์šฉ๋œ๋‹ค. ๋จผ์ € \gamma|q_a|^2 ์ •๊ทœํ™” ํ•ญ์„ ํ†ตํ•ด ํ•ด ๊ณต๊ฐ„์˜ ์ค‘๋ณต์„ฑ์„ ์–ต์ œํ•˜๋ฉฐ, ๋™์ผํ•œ ํšจ๊ณผ๋กœ ์•ž์„œ ์–ธ๊ธ‰ํ•œ ๊ฒ€์ง€ยท์ค‘์ง€ยท์•ฝ์ง€์˜ distal=medial ๊ณ ์ • ์ œ์•ฝ๋„ ๋„์ž…ํ•œ๋‹ค. ์ด์™€ ํ•จ๊ป˜, ์ตœ์ ํ™” ์ดˆ๊ธฐ๊ฐ’์„ ์ด์ „ ๊ฒฐ๊ณผ๋กœ ์„ค์ •ํ•˜์—ฌ ์—ฐ์‚ฐ ๋น„์šฉ๊ณผ ์ง„๋™์„ ์™„ํ™”ํ•œ๋‹ค. ๋งˆ์ง€๋ง‰์œผ๋กœ ๋กœ๋ด‡๊ณผ ์นด๋ฉ”๋ผ ์ขŒํ‘œ๊ณ„ ์ •ํ•ฉ(calibration)์„ ํ†ตํ•ด ์›ํ•˜๋Š” ์ดˆ๊ธฐ ์† ์ž์„ธ(ํŽผ์นœ ์†, ์†๋ฐ”๋‹ฅ ํ‰ํ–‰)๋ฅผ ์‹œ์Šคํ…œ์— ๋งž์ถ”์–ด ํŒŒ์ผ๋Ÿฟ์˜ ์†๊ณผ ๋กœ๋ด‡ ์†์ด ์ผ์น˜ํ•˜๋„๋ก ์„ค์ •ํ•œ๋‹ค. ์ข…ํ•ฉํ•˜๋ฉด, DexPilot์˜ ๋ฆฌํƒ€๊ฒŸํŒ… ๋ชจ๋“ˆ์€ ๋น„์„ ํ˜• ์ตœ์ ํ™” ๊ธฐ๋ฐ˜์ด๋ฉฐ, ์†๋ ์œ„์น˜ยท๋ฐฉํ–ฅ ์ž‘์—… ๊ณต๊ฐ„์„ ๋ณด์กดํ•˜๊ธฐ ์œ„ํ•œ ๋น„์šฉ ํ•จ์ˆ˜์— ์˜ํ•ด ์ธ๊ฐ„ ์†๋™์ž‘์„ Allegro ๊ด€์ ˆ๊ฐ’์œผ๋กœ ๋ณ€ํ™˜ํ•œ๋‹ค. ์ถ”๊ฐ€์ ์ธ ํ•„ํ„ฐ๋ง๊ณผ ์ œ์•ฝ์„ ํ†ตํ•ด ๋ถ€๋“œ๋Ÿฝ๊ณ  ๋ฌผ๋ฆฌ์ ์œผ๋กœ ํƒ€๋‹นํ•œ ์›€์ง์ž„์„ ๋ณด์žฅํ•˜๋ฉฐ, ์ด๋ฅผ ํ†ตํ•ด ์ธ๊ฐ„ ํŒŒ์ผ๋Ÿฟ์˜ ์† ์ œ์Šค์ฒ˜๋Š” ์ •๊ตํ•˜๊ฒŒ ๋กœ๋ด‡ ์†์œผ๋กœ ๋ณต์ œ๋œ๋‹ค.

5. ์‹คํ—˜ ์„ค์ • ๋ฐ ์„ฑ๋Šฅ ํ‰๊ฐ€

DexPilot ์‹œ์Šคํ…œ์˜ ์„ฑ๋Šฅ์€ ๋‹ค์–‘ํ•œ ์กฐ์ž‘ ๊ณผ์ œ(task)์—์„œ ์ธก์ •๋˜์—ˆ๋‹ค. ์‚ฌ์šฉ๋œ ์‹คํ—˜ ์žฅ๋น„๋Š” ์•ž์„œ ์„ค๋ช…ํ•œ KUKA iiwa7+Allegro ์†, 4๋Œ€์˜ Intel RealSense D415 ์นด๋ฉ”๋ผ์ด๋‹ค[9]. ์‹คํ—˜์—์„œ ํŒŒ์ผ๋Ÿฟ(์กฐ์ข…์‚ฌ)์€ ํ…Œ์ด๋ธ” ์œ„์—์„œ ์ •ํ•ด์ง„ ๋ฌผ์ฒด๋ฅผ ์กฐ์ž‘ํ•ด์•ผ ํ–ˆ์œผ๋ฉฐ, ์‹คํ—˜ ๊ณผ์ œ๋Š” ์ด 15๊ฐ€์ง€๊ฐ€ ์ œ์‹œ๋˜์—ˆ๋‹ค(ํ‘œ II ์ฐธ์กฐ). ์—ฌ๊ธฐ์—๋Š” ๋‹จ์ˆœ ๋ฌผ์ฒด ์˜ฎ๊ธฐ๊ธฐ(pick-and-place)๋ถ€ํ„ฐ, ๋™์ „ ๋‚ด์ง€ ์ง€ํ๋ฅผ ์ง€๊ฐ‘์—์„œ ๊บผ๋‚ด๊ธฐ(๊ทธ๋ฆผ 11), ์„œ๋ž ์—ด๊ธฐ ๋ฐ ํ‹ฐ๋ฐฑ ๊บผ๋‚ด๊ธฐ(๊ทธ๋ฆผ 12), ๋•…์ฝฉํ†ต ๋šœ๊ป‘ ํ’€๊ธฐ(๊ทธ๋ฆผ 13) ๊ฐ™์€ ๋‹ค๋‹จ๊ณ„ ์ž‘์—…๋“ค์ด ํฌํ•จ.

๊ฐ ๊ณผ์ œ๋งˆ๋‹ค ํŒŒ์ผ๋Ÿฟ 2๋ช…์ด 5ํšŒ ์—ฐ์† ์‹œ๋„ํ•˜๋ฉฐ ์„ฑ๊ณต๋ฅ ์„ ์ธก์ •ํ–ˆ๊ณ , ์™„๋ฃŒ ์‹œ๊ฐ„(mean completion time)๋„ ๊ธฐ๋กํ–ˆ๋‹ค. ๊ฒฐ๊ณผ์ ์œผ๋กœ, DexPilot์€ ๋Œ€๋ถ€๋ถ„ ๊ณผ์ œ์—์„œ ๋†’์€ ์„ฑ๊ณต๋ฅ ์„ ๋ณด์˜€๋‹ค(Fig. 15). ํŠนํžˆ ๋‹จ์ˆœ ํ”ผํ‚น/ํ”Œ๋ ˆ์ด์Šค ์ž‘์—…์ด๋‚˜ ๋น„๊ต์  ํฐ ๋ฌผ์ฒด ์กฐ์ž‘ ์ž‘์—…๋“ค์€ ๋Œ€๋ถ€๋ถ„ ์„ฑ๊ณต๋ฅ  90โ€“100%์— ๋‹ฌํ–ˆ๋‹ค. ํ‰๊ท  ์™„๋ฃŒ ์‹œ๊ฐ„์€ ๊ณผ์ œ ๋‚œ์ด๋„์™€ ๋ณต์žก๋„์— ๋”ฐ๋ผ ๋‹ค์–‘ํ–ˆ๋Š”๋ฐ, ๋ฉ€ํ‹ฐ์Šคํ… ์ž‘์—…(์˜ˆ: ์„œ๋ž ์† ๋ฌผ๊ฑด ๊บผ๋‚ด๊ธฐ)์ผ์ˆ˜๋ก ์ˆ˜ ๋ถ„์ด ์†Œ์š”๋˜์—ˆ๋‹ค. ์ „๋ฐ˜์ ์œผ๋กœ ์‹œ์Šคํ…œ์€ ์ •๋ฐ€ ์ง‘๊ธฐยทํŒŒ์ง€, ๋‹ค์ง€ ๊ฐ„ ์กฐ์ž‘, ๋น„ํŒŒ์ง€(non-prehensile) ๋™์ž‘ ๋“ฑ์„ ๋ชจ๋‘ ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ๋Š” ์ถฉ๋ถ„ํ•œ ์œ ์—ฐ์„ฑ๊ณผ ์•ˆ์ •์„ฑ์„ ๋ณด์˜€๋‹ค. ์ •์„ฑ์  ํ‰๊ฐ€์—์„œ๋„ DexPilot์˜ ์„ฑ๋Šฅ์„ ํ™•์ธํ•  ์ˆ˜ ์žˆ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด ๊ทธ๋ฆผ 11์˜ ์ง€๊ฐ‘ ๊ณผ์ œ์—์„œ ํŒŒ์ผ๋Ÿฟ์€ ์ง€ํ๋ฅผ ์†๊ฐ€๋ฝ ์‚ฌ์ด์— ํ•€์น˜ํ•œ ์ฑ„๋กœ ์„ฑ๊ณต์ ์œผ๋กœ ์ง€๊ฐ‘ ๋ฐ”๊นฅ์œผ๋กœ ๋„์ง‘์–ด๋ƒˆ์œผ๋ฉฐ, ์ด๋•Œ ๋กœ๋ด‡ ์†๋„ ์ง€ํ๋ฅผ ๋†“์น˜์ง€ ์•Š๊ณ  ์œ ์ง€ํ–ˆ๋‹ค. ๊ทธ๋ฆผ 12์—์„œ๋Š” ์„œ๋ž์„ ์—ด๊ณ  ํ‹ฐ๋ฐฑ์„ ์žก์•„ ๋‹น๊ธฐ๊ธฐ ์œ„ํ•œ ์†๊ฐ€๋ฝ์˜ ํšŒ์ „ ๋ฐ ์ ‘์ด‰ ๋™์ž‘์ด ๋ช…ํ™•ํžˆ ๊ตฌํ˜„๋˜์—ˆ์œผ๋ฉฐ, ๊ทธ๋ฆผ 13์˜ ๋•…์ฝฉํ†ต ๋šœ๊ป‘ ๊ณผ์ œ์—์„œ๋Š” ๋šœ๊ป‘์„ ๋ฐ˜๋ณต ํšŒ์ „์‹œํ‚ค๋Š” ๋™์ž‘์ด ๋กœ๋ด‡์—๋„ ๊ทธ๋Œ€๋กœ ์ „๋‹ฌ๋˜์—ˆ๋‹ค. ์ด์ฒ˜๋Ÿผ ์ž‘์€ ๋ฌผ์ฒด๋ฅผ ์ง‘๊ฑฐ๋‚˜ ๋Œ๋ฆฌ๋Š” ์ •๋ฐ€ ๋™์ž‘ ๋ฟ ์•„๋‹ˆ๋ผ, ๋‘ ์†๊ฐ€๋ฝ์œผ๋กœ ๋ฌผ์ฒด๋ฅผ ์žก์€ ์ƒํƒœ์—์„œ ๋‚จ์€ ์†๊ฐ€๋ฝ์„ ์ด์šฉํ•ด ์ถ”๊ฐ€ ์กฐ์ž‘์„ ์ˆ˜ํ–‰ํ•˜๋Š” ๋ณตํ•ฉ ์กฐ์ž‘(compound manipulation)๋„ ๋ชจ๋‘ ์‚ฌ๋žŒ์ด ํ–‰ํ•˜๋“ฏ ์ˆ˜ํ–‰ ๊ฐ€๋Šฅํ•จ์„ ๋ณด์˜€๋‹ค.

๊ทธ๋Ÿฌ๋‚˜ ์ž‘์€ ๋ฌผ์ฒด๋ฅผ ๋‹ค๋ฃจ๋Š” ์ž‘์—…์—์„œ๋Š” ํ•œ๊ณ„๋„ ๊ด€์ฐฐ๋˜์—ˆ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด ํฌ๊ธฐ๊ฐ€ ์ž‘์€ ๋ธ”๋ก(pick blocks small)์ด๋‚˜ ์ปจํ…Œ์ด๋„ˆ ์† ๋ฌผ์ฒด ๋ฝ‘๊ธฐ(Container) ๋“ฑ์˜ ์ž‘์—…์€ ์™„๋ฃŒ ์‹œ๊ฐ„์ด ๊ธธ๊ฑฐ๋‚˜ ์„ฑ๊ณต๋ฅ ์ด ๋‚ฎ์•˜๋‹ค. ํŠนํžˆ ์ž‘์€ ๋ธ”๋ก์„ ์ฅ์—ˆ๋‹ค๊ฐ€ ๋†“๋Š” ๊ณผ์ •์—์„œ, ์•ž์„œ ์„ค๋ช…ํ•œ ํˆฌ์˜ ๊ธฐ๋ฒ•์ด ์†๊ฐ€๋ฝ ๊ฐ„ ๊ฑฐ๋ฆฌ๋ฅผ ๊ฐ•ํ•˜๊ฒŒ ์กฐ์ ˆํ•˜์—ฌ ๋ฌผ์ฒด๋ฅผ ๋Šฆ๊ฒŒ ๋†“๊ฒŒ ๋งŒ๋“ค๊ฑฐ๋‚˜ ์†๊ฐ€๋ฝ์ด ๊ฐ„์„ญํ•˜๋Š” ํ˜„์ƒ์ด ๋ฐœ์ƒํ–ˆ๋‹ค. ๊ฒฐ๊ณผ์ ์œผ๋กœ ์ž‘์€ ๋ธ”๋ก ์˜ฎ๊ธฐ๊ธฐ ๊ณผ์ œ์˜ ๊ฒฝ์šฐ ์„ฑ๊ณต๋ฅ ์ด ์ƒ๋Œ€์ ์œผ๋กœ ํ˜„์ €ํžˆ ๋‚ฎ์•˜๊ณ , ์™„๋ฃŒ ์‹œ๊ฐ„์ด ๋งค์šฐ ๊ธธ์–ด์กŒ๋‹ค. ์ด๋Ÿฌํ•œ ํ˜„์ƒ์€ ์žฅ๊ฐ‘ ๊ธฐ๋ฐ˜ ์ถ”์  ๋ฐ์ดํ„ฐ์˜ ๋ถ€์ •ํ™•์„ฑ์ด๋‚˜ ํˆฌ์˜ ํŒŒ๋ผ๋ฏธํ„ฐ ์กฐ์ •์— ๊ธฐ์ธํ•œ ๊ฒƒ์œผ๋กœ ๋ถ„์„๋œ๋‹ค.

6. ๊ธฐ์กด ๋ฐฉ๋ฒ•๊ณผ ๋น„๊ต ๋ฐ ๊ธฐ์ˆ ์  ํ•œ๊ณ„

DexPilot์€ ๊ธ€๋Ÿฌ๋ธŒยท๋งˆ์ปค๋ฅผ ์‚ฌ์šฉํ•˜์ง€ ์•Š๋Š” ์ˆœ์ˆ˜ ์‹œ๊ฐ ๊ธฐ๋ฐ˜ ์‹œ์Šคํ…œ์ด๋ผ๋Š” ์ ์—์„œ ๋…์ฐฝ์ ์ด๋‹ค. ๊ธฐ์กด ์ƒ์šฉ ์‹œ์Šคํ…œ๋“ค(์˜ˆ: CyberGlove, HaptX)์€ ๋†’์€ ์ •ํ™•๋„์˜ ๊ด€์ ˆ ์ถ”์ •๊ณผ ์ด‰๊ฐ ํ”ผ๋“œ๋ฐฑ์„ ์ œ๊ณตํ•˜์ง€๋งŒ, ์žฅ๋น„ ๋น„์šฉ๊ณผ ๋ถ€ํ”ผ๊ฐ€ ํฌ๊ณ  ์‚ฌ์šฉ์ž์˜ ์ž์œ ๋กœ์šด ์›€์ง์ž„์„ ์ œํ•œํ•œ๋‹ค. ๋ฐ˜๋ฉด DexPilot์€ ์ €๋ ดํ•œ RGB-D ์นด๋ฉ”๋ผ ๋„คํŠธ์›Œํฌ๋งŒ์œผ๋กœ 23DoF ์ œ์–ด๋ฅผ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•˜์˜€๊ณ , ์ด๋Š” ์ข…๋ž˜์˜ ๊ธ€๋Ÿฌ๋ธŒ๋‚˜ ๋ชจ์…˜์บก์ฒ˜ ์—†์ด ๋ณต์žก ์กฐ์ž‘์„ ์ˆ˜ํ–‰ํ•œ ์‚ฌ๋ก€๋กœ๋Š” ๋“œ๋ฌผ๋‹ค. ๊ธฐ์กด ํ•™์ˆ  ์—ฐ๊ตฌ์™€ ๋น„๊ตํ•ด ๋ณด๋ฉด, Li ๋“ฑ์€ ๋”ฅ๋Ÿฌ๋‹์œผ๋กœ ์„€๋„์šฐ ํ•ธ๋“œ(Shadow Hand) ๊ด€์ ˆ๊ฐ์„ ์ถ”์ •ํ•˜์˜€์œผ๋‚˜ ์‹œ์Šคํ…œ ์ „์ฒด ์ ์šฉ๊ณผ ์ •๋ฐ€ ์ง‘๊ธฐ์—๋Š” ํ•œ๊ณ„๊ฐ€ ์žˆ์—ˆ๋‹ค. Antotsiou ๋“ฑ์€ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ์ƒ์˜ ๊ฐ„๋‹จํ•œ ์กฐ์ž‘ ์ž‘์—…๋งŒ ๋ณด์˜€๋˜ ๋ฐ˜๋ฉด, DexPilot์€ ์‹ค์ œ ๋ฌผ๋ฆฌ์  ํ™˜๊ฒฝ์—์„œ ์†๋ ์ ‘์ด‰๊ณผ ์—ฐ๊ด€๋œ ๋ณต์žก ์ž‘์—…์„ ์ˆ˜ํ–‰ํ–ˆ๋‹ค. ์ด์ฒ˜๋Ÿผ DexPilot์€ ์‹œ๊ฐ-๋ชจ๋ธ ์ถ”์ , ์ตœ์ ํ™” ๊ธฐ๋ฐ˜ ๋ฆฌํƒ€๊ฒŸํŒ…, ์ž„ํ”ผ๋˜์Šค ์ œ์–ด๋ฅผ ๊ฒฐํ•ฉํ•˜์—ฌ ํ˜„์žฅ์ž‘์—…์— ํ•„์š”ํ•œ ์ˆ˜์ค€์˜ ์กฐ์ž‘ ์„ฑ๋Šฅ์„ ๋ณด์—ฌ์ฃผ์—ˆ๋‹ค๋Š” ์ ์—์„œ ์ƒˆ๋กœ์šด ํŒจ๋Ÿฌ๋‹ค์ž„์„ ์ œ์‹œํ–ˆ๋‹ค.

๊ทธ๋Ÿผ์—๋„ ๋ช‡ ๊ฐ€์ง€ ๊ธฐ์ˆ ์  ํ•œ๊ณ„๊ฐ€ ๋ณด๊ณ ๋˜์—ˆ๋‹ค.

  • ์ฒซ์งธ, ๊ด€์ฐฐ ์˜์—ญ(workspace)์ด ์นด๋ฉ”๋ผ ๋ฒ”์œ„๋กœ ์ œํ•œ๋˜์–ด ์žˆ์–ด ๋„“์€ ๊ณต๊ฐ„์—์„œ์˜ ์กฐ์ž‘์—๋Š” ๋ถ€์ ํ•ฉํ•˜๋‹ค. ์‹คํ—˜์—์„œ๋Š” ์นด๋ฉ”๋ผ๊ฐ€ ๊ด€์ ˆ ๊ฑฐ๋ฆฌ 1m ์ด๋‚ด์—์„œ ์ข‹์€ ํ’ˆ์งˆ์„ ๋ณด์˜€์œผ๋‚˜, ๋ฒ”์œ„๋ฅผ ๋ฒ—์–ด๋‚˜๋ฉด ๊นŠ์ด ์„ผ์‹ฑ ์ •ํ™•๋„๊ฐ€ ๊ธ‰๊ฒฉํžˆ ๋–จ์–ด์ง„๋‹ค.
  • ๋‘˜์งธ, ์•ž์„œ ์–ธ๊ธ‰ํ•œ ๋ฆฌํƒ€๊ฒŸํŒ… ํˆฌ์˜ ๊ธฐ๋ฒ•์˜ ๋ถ€์ž‘์šฉ์ด๋‹ค. ์—„์ง€-๊ฒ€์ง€ ํ•€์น˜ ์œ ์ง€ ์‹œ ์žก์€ ๋ฌผ์ฒด๋ฅผ ๋Šฆ๊ฒŒ ๋†“๊ฑฐ๋‚˜ ์†๊ฐ€๋ฝ๋ผ๋ฆฌ ๊ฐ„์„ญ์ด ๋ฐœ์ƒํ•  ์ˆ˜ ์žˆ์œผ๋ฉฐ, ์ด๋Š” ์ž‘์€ ๋ฌผ์ฒด ์ž‘์—…์—์„œ ํšจ์œจ์„ ๋–จ์–ด๋œจ๋ฆฐ๋‹ค. ํ˜„์žฌ๋Š” ์ด ๊ธฐ๋Šฅ์„ ์˜ต์…˜์œผ๋กœ ๋Œ ์ˆ˜ ์žˆ๋„๋ก ํ•˜์˜€์œผ๋‚˜, ๊ถ๊ทน์ ์œผ๋กœ๋Š” ์† ์ถ”์  ์ •ํ™•๋„๋ฅผ ๋†’์—ฌ ์ด๋Ÿฌํ•œ ๋ณด์ •์ด ํ•„์š” ์—†๋„๋ก ํ•ด์•ผ ํ•œ๋‹ค.
  • ์…‹์งธ, ์ด‰๊ฐ ํ”ผ๋“œ๋ฐฑ ๋ถ€์žฌ๋กœ ์ธํ•ด ๋ฏธ์„ธ ์กฐ์ž‘์ด ์–ด๋ ต๋‹ค. DexPilot์—๋Š” ์ด‰๊ฐ ์„ผ์„œ๊ฐ€ ์žˆ์–ด ๋ฐ์ดํ„ฐ๋ฅผ ๊ธฐ๋กํ•  ์ˆ˜ ์žˆ์œผ๋‚˜, ํŒŒ์ผ๋Ÿฟ์—๊ฒŒ๋Š” ํž˜/์ด‰๊ฐ ์ •๋ณด๊ฐ€ ๋Œ์•„๊ฐ€์ง€ ์•Š๋Š”๋‹ค. ์ด๋กœ ์ธํ•ด ๋ฌผ์ฒด๊ฐ€ ๋ฏธ๋„๋Ÿฌ์งˆ ๋•Œ ์ง๊ด€์ ์œผ๋กœ ๊ฐ์ง€ํ•˜๊ธฐ ์–ด๋ ค์›Œ ์กฐ์ž‘ ์‹คํŒจ์œจ์ด ๋†’์•„์งˆ ์ˆ˜ ์žˆ๋‹ค. ํ–ฅํ›„ ์ด‰๊ฐ ํ”ผ๋“œ๋ฐฑ ์ „๋‹ฌ์ด๋‚˜ ๋ฐ˜์ž๋™ ํž˜ ์ œ์–ด(์žก๊ธฐ ๊ฐ•๋„ ์ž๋™ ์กฐ์ ˆ) ๊ธฐ์ˆ ์„ ๊ฒฐํ•ฉํ•œ๋‹ค๋ฉด ์ด ๋ถ€๋‹ด์„ ์ค„์ผ ์ˆ˜ ์žˆ์„ ๊ฒƒ์ด๋‹ค.
  • ๋„ท์งธ, ์ถ”์  ๋ฐ ์ œ์–ด ์ง€์—ฐ์ด๋‹ค. ์ „์ฒด ์‹œ์Šคํ…œ์˜ ์‘๋‹ต ์ง€์—ฐ์€ ์•ฝ 1์ดˆ์ด๋ฉฐ, RMP ์ œ์–ด์˜ ํŒŒ๋ผ๋ฏธํ„ฐ ํŠœ๋‹๊ณผ ๋„คํŠธ์›Œํฌ ์ธํผ๋Ÿฐ์Šค ์ง€์—ฐ์„ ์ตœ์ ํ™”ํ•  ์ˆ˜ ์žˆ๋‹ค. ๋งˆ์ง€๋ง‰์œผ๋กœ ๊ณ ์ •๋ฐ€ ์‚ฝ์ž… ์ž‘์—…(์˜ˆ: ํŽ˜๊ทธ-์ธ-ํ™€)์€ ์•„์ง ์™„๋ฒฝํžˆ ์ˆ˜ํ–‰๋˜์ง€ ๋ชปํ–ˆ๋‹ค. ์‹ค์ œ๋กœ NIST ์‚ฝ์ž… ๊ณผ์ œ๋ฅผ ์‹œ๋„ํ–ˆ์ง€๋งŒ, ๋งค์šฐ ํ˜‘์†Œํ•œ ๊ฐ„๊ฒฉ(0.1mm)์—์„œ๋Š” ์„ฑ๊ณต๋ฅ ์ด 10% ์ดํ•˜๋กœ ์ €์กฐํ–ˆ๋‹ค. ์ด๋Š” ์นด๋ฉ”๋ผ ํ•ด์ƒ๋„, ์† ์ถ”์  ์ •๋ฐ€๋„, ์ œ์–ด ์‘๋‹ต ์†๋„ ๋“ฑ ๋‹ค์–‘ํ•œ ์š”์ธ์ด ๋ณตํ•ฉ์ ์œผ๋กœ ์ž‘์šฉํ•œ ๊ฒฐ๊ณผ๋กœ, ํ•œ๊ณ„๋ฅผ ๊ทน๋ณตํ•˜๊ธฐ ์œ„ํ•ด ๋” ์ •๋ฐ€ํ•œ ์ถ”์ ๊ณผ ์ž๋™ ์ œ์–ด ๋ณด์กฐ ๊ธฐ๋Šฅ์ด ํ•„์š”ํ•˜๋‹ค.

์ข…ํ•ฉํ•˜๋ฉด, DexPilot์€ ์ €๋น„์šฉ ์‹œ๊ฐ ๊ธฐ๋ฐ˜ ๋ฐฉ์‹์œผ๋กœ ๊ณ ์ž์œ ๋„ ๋กœ๋ด‡ ์†์„ ์กฐ์ž‘ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•œ ํš๊ธฐ์ ์ธ ์‹œ์Šคํ…œ์ด์ง€๋งŒ, ์นด๋ฉ”๋ผ ๊ด€์ธก ๋ฒ”์œ„, ์† ์ถ”์  ์ •ํ™•๋„, ์ด‰๊ฐ ๋ถ€์žฌ ๋“ฑ ์‹ค์ œ ํ™œ์šฉ ์‹œ ๊ณ ๋ คํ•ด์•ผ ํ•  ํ•œ๊ณ„์ ๋“ค๋„ ๋™์‹œ์— ์ง€๋‹ˆ๊ณ  ์žˆ๋‹ค. ์ด๋Ÿฌํ•œ ํ•œ๊ณ„๋“ค์„ ๊ฐœ์„ ํ•˜๋ฉด ์•ž์œผ๋กœ ๋ณด๋‹ค ์ •๊ตํ•œ ํ…”๋ ˆ์˜คํผ๋ ˆ์ด์…˜๊ณผ ๋กœ๋ด‡ ํ•™์Šต ์‘์šฉ์— ํฐ ๊ธฐ์—ฌ๋ฅผ ํ•  ์ˆ˜ ์žˆ์„ ๊ฒƒ์ด๋‹ค.


Additional Review

์‹œ์ž‘ํ•˜๋ฉฐ: โ€œ์žฅ๊ฐ‘ ์—†์ด๋„ ์†์„ ๋นŒ๋ ค์ค„ ์ˆ˜ ์žˆ์„๊นŒ?โ€

๋กœ๋ด‡ ์†์„ ์‚ฌ๋žŒ ์†์ฒ˜๋Ÿผ ์กฐ์ข…ํ•˜๋Š” ๊ฐ€์žฅ ์†”์งํ•œ ๋ฐฉ๋ฒ•์€ ๋ฌด์—‡์ผ๊นŒ. ๋ฐ์ดํ„ฐ ๊ธ€๋Ÿฌ๋ธŒ๋ฅผ ๋ผ๊ณ , ์ž์„ ํŠธ๋ž˜์ปค๋ฅผ ์†๋“ฑ์— ๋ถ™์ด๊ณ , ์†๊ฐ€๋ฝ ๋งˆ๋””๋งˆ๋‹ค IMU๋ฅผ ๋งค๋‹ค๋Š” ๊ฒƒ์ด๋‹ค. ์ž˜ ์ž‘๋™ํ•˜์ง€๋งŒ ๋น„์‹ธ๊ณ  ๊ฑฐ์ถ”์žฅ์Šค๋Ÿฝ๋‹ค. ํ•œ ์‹œ๊ฐ„๋งŒ ๋ผ๊ณ  ์žˆ์–ด๋„ ์†์ด ๋•€์— ์ ˆ๊ณ  ์บ˜๋ฆฌ๋ธŒ๋ ˆ์ด์…˜์€ ์–ด๊ธ‹๋‚œ๋‹ค. ๊ทธ๋ž˜์„œ ์‚ฌ๋žŒ๋“ค์ด ์ž์—ฐ์Šค๋Ÿฝ๊ฒŒ ๋„๋‹ฌํ•˜๋Š” ์งˆ๋ฌธ์ด ์žˆ๋‹ค โ€” ์นด๋ฉ”๋ผ๋งŒ์œผ๋กœ๋Š” ์ •๋ง ์•ˆ ๋˜๋Š” ๊ฑธ๊นŒ?

DexPilot์˜ ๋‹ต์€ โ€œ๋œ๋‹ค, ๊ทธ๊ฒƒ๋„ 23 ์ž์œ ๋„(degree of actuation, DoA) ์ „๋ถ€๋ฅผ ๋‹ค.โ€ ์ด๋‹ค. ์‚ฌ๋žŒ์˜ ๋งจ์†์„ 4๋Œ€์˜ RealSense ๊นŠ์ด ์นด๋ฉ”๋ผ๋กœ ๊ด€์ฐฐํ•˜๊ณ , GPU ๋‘ ์žฅ์œผ๋กœ ์ž์„ธ๋ฅผ ์ถ”์ •ํ•˜๊ณ , ์ตœ์ ํ™” ํ•œ ๋ฒˆ์œผ๋กœ Allegro ์†๊ณผ 7-DoF ํŒ”์— ๋™์‹œ์— ๋ช…๋ น์„ ๋‚ด๋ฆฐ๋‹ค. ์ง€๊ฐ‘์—์„œ ์ง€ํ๋ฅผ ๋นผ๊ณ , ์„œ๋ž์„ ์—ด๊ณ  ํ‹ฐ๋ฐฑ์„ ๊บผ๋‚ด๊ณ , ๋„ค ์†๊ฐ€๋ฝ ์‚ฌ์ด์— ํ๋ธŒ ๋‘ ๊ฐœ๋ฅผ ๋ผ์šฐ๋Š” ๋ฌ˜๊ธฐ๊นŒ์ง€. ์ด‰๊ฐ ํ”ผ๋“œ๋ฐฑ ์—†์ด.

์ด ๋ฆฌ๋ทฐ๋Š” ๊ทธ ์‹œ์Šคํ…œ์ด ์–ด๋–ป๊ฒŒ ๊ทธ๋ฆฌ๊ณ  ์™œ ์ž‘๋™ํ•˜๋Š”์ง€๋ฅผ, ์ˆ˜์‹๊ณผ ์•Œ๊ณ ๋ฆฌ์ฆ˜๊ณผ ํ•จ๊ป˜ ์ง๊ด€์ ์œผ๋กœ ํ’€์–ด๋ณธ๋‹ค. ํŠนํžˆ ์šด๋™ํ•™์  ๋ฆฌํƒ€๊ฒŒํŒ…(kinematic retargeting)์˜ ๋น„์šฉ ํ•จ์ˆ˜ ์„ค๊ณ„ โ€” ์ด ๋…ผ๋ฌธ์ด ํ›„์† ์—ฐ๊ตฌ(AnyTeleop, GELLO, Bunny-VisionPro ๋“ฑ)์— ๋‚จ๊ธด ์ง„์งœ ์œ ์‚ฐ โ€” ๋Š” ํ•œ ์ค„ ํ•œ ์ค„ ๋œฏ์–ด๋ณผ ๊ฐ€์น˜๊ฐ€ ์žˆ๋‹ค.

๋ฌธ์ œ์˜ ๋ณธ์งˆ: ์™œ ์† ํ…”๋ ˆ์˜คํผ๋ ˆ์ด์…˜์€ ์–ด๋ ค์šด๊ฐ€

์†์„ ์˜ฎ๊ธฐ๋Š” ๊ฒƒ๊ณผ ์†๊ฐ€๋ฝ์„ ์›€์ง์ด๋Š” ๊ฒƒ์€ ๋‹ค๋ฅธ ๋ฌธ์ œ๋‹ค. ํŒ”์€ 6-DoF, ์ž˜ ํ’€๋ฆฐ IK๊ฐ€ ์žˆ์œผ๋ฉด ๋์ด๋‹ค. ์†์€ ๋‹ค๋ฅด๋‹ค.

์ฒซ์งธ, ์ž์œ ๋„ ๋ถˆ์ผ์น˜

์‚ฌ๋žŒ ์†์€ ๋Œ€๋žต 27 DoF, Allegro ์†์€ 16 DoF๋‹ค. ์ž์œ ๋„๊ฐ€ ๋‹ค๋ฅธ ๋‘ ๋ฉ”์ปค๋‹ˆ์ฆ˜ ์‚ฌ์ด์— ์ผ๋Œ€์ผ ๋Œ€์‘์„ ๋งŒ๋“ค ์ˆ˜ ์žˆ๋Š” ์ž์—ฐ์Šค๋Ÿฌ์šด ๋ฐฉ๋ฒ•์€ ์กด์žฌํ•˜์ง€ ์•Š๋Š”๋‹ค. ๊ด€์ ˆ ๊ฐ๋„๋ฅผ ์ง์ ‘ ๋ณต์‚ฌํ•˜๋Š” ๋ฐฉ์‹(joint-space copy)์€ ์†๊ฐ€๋ฝ ๊ธธ์ด๊ฐ€ ๋‹ค๋ฅด๊ณ  ๋ฒ ์ด์Šค ์œ„์น˜๊ฐ€ ๋‹ค๋ฅธ ์ˆœ๊ฐ„ ๋ฌด๋„ˆ์ง„๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด ์‚ฌ๋žŒ์ด ์—„์ง€์™€ ๊ฒ€์ง€๋กœ ๋™์ „์„ ์ง‘๋Š” ์ž์„ธ๋ฅผ Allegro์— ๊ทธ๋Œ€๋กœ ๋ณต์‚ฌํ•˜๋ฉด, ์†๋์ด ๋งŒ๋‚˜์•ผ ํ•  ๊ณณ์—์„œ 5 cm์”ฉ ๋น—๋‚˜๊ฐ„๋‹ค.

์ด๊ฑธ ํ•ด๊ฒฐํ•˜๋ ค๋ฉด ๋ฌด์—‡์„ ๋ณด์กดํ• ์ง€ ๊ฒฐ์ •ํ•ด์•ผ ํ•œ๋‹ค. ์†๋ ์œ„์น˜? ์†๊ฐ€๋ฝ ๋ฐฉํ–ฅ? ์†๋ฐ”๋‹ฅ ๋Œ€๋น„ ์ƒ๋Œ€ ์ขŒํ‘œ? ์ •๋‹ต์€ โ€œ์ •ํ™•ํ•œ ๊ทธ๋ฆฝ์„ ์œ„ํ•ด ํ•„์š”ํ•œ ๊ฒƒโ€์ด๋‹ค. DexPilot์˜ ๋‹ต์€ ์†๊ฐ€๋ฝ ๋๊ณผ ์†๋ฐ”๋‹ฅ ์‚ฌ์ด, ๊ทธ๋ฆฌ๊ณ  ์†๊ฐ€๋ฝ ๋๋ผ๋ฆฌ์˜ ๋ฒกํ„ฐ๋‹ค. ์ด ์„ ํƒ์ด ์™œ ์ž์—ฐ์Šค๋Ÿฌ์šด์ง€๋Š” ์ž ์‹œ ํ›„์— ๋ณธ๋‹ค.

๋‘˜์งธ, ์‹œ๊ฐ๋งŒ์œผ๋กœ ์†์„ ์ถ”์ ํ•˜๋Š” ์ผ

์†์€ ๊ฐ€๋ ธ๋‹ค, ๋˜ ๊ฐ€๋ ธ๋‹ค, ์ž๊ธฐ ์ž์‹ ์„ ๊ฐ€๋ฆฐ๋‹ค. ์ƒˆ๋ผ์†๊ฐ€๋ฝ์„ ๊ตฝํžˆ๋ฉด ์•ฝ์ง€๊ฐ€ ๊ฐ€๋ฆฌ๊ณ , ์•ฝ์ง€๋ฅผ ๊ตฝํžˆ๋ฉด ์ค‘์ง€๊ฐ€ ๊ฐ€๋ฆฐ๋‹ค. ํ•œ ๋Œ€์˜ ์นด๋ฉ”๋ผ๋กœ๋Š” ๊ฑฐ์˜ ํ•ญ์ƒ ์–ด๋–ค ์†๊ฐ€๋ฝ์€ ๋ณด์ด์ง€ ์•Š๋Š”๋‹ค.

DexPilot์€ ์นด๋ฉ”๋ผ๋ฅผ ๋„ค ๋Œ€ โ€” ์‚ฌ์šฉ์ž ์‹œ์ ์—์„œ ์ •๋ฉด 1๋Œ€, ์ขŒ์šฐ 2๋Œ€, ์•„๋ž˜ 1๋Œ€ โ€” ๋ฐฐ์น˜ํ•ด์„œ ์†์ด ์–ด๋А ๋ฐฉํ–ฅ์œผ๋กœ ํšŒ์ „ํ•ด๋„ ์ ์–ด๋„ ํ•œ ์‹œ์ ์€ ์‚ด์•„๋‚จ๋„๋ก ๋งŒ๋“ ๋‹ค. ๊ทธ๋ฆฌ๊ณ  ํ•œ ๊ฐ€์ง€ ๋”: ๋ชจ๋ธ ํ”„๋ฆฌ ์ถ”์ •๊ธฐ์™€ ๋ชจ๋ธ ๊ธฐ๋ฐ˜ ํŠธ๋ž˜์ปค๋ฅผ ๊ฐ™์ด ์“ด๋‹ค. ์‹ ๊ฒฝ๋ง์€ ๋น ๋ฅด๊ณ  ๊ฐ•๊ฑดํ•˜์ง€๋งŒ ๋–จ๋ฆฐ๋‹ค. ๋ชจ๋ธ ๊ธฐ๋ฐ˜ ์ถ”์ (DART)์€ ๋งค๋„๋Ÿฝ์ง€๋งŒ ํ•œ ๋ฒˆ ๋†“์น˜๋ฉด ์˜์›ํžˆ ๋ชป ๋”ฐ๋ผ๊ฐ„๋‹ค. ๋‘˜์„ ๊ฒฐํ•ฉํ•˜๋ฉด โ€œ์ดˆ๊ธฐ๊ฐ’์€ ์‹ ๊ฒฝ๋ง, ์ •๋ฐ€ํ™”๋Š” ์ตœ์ ํ™”โ€๋ผ๋Š” ๊ณ ์ „์  ์ปดํ“จํ„ฐ๋น„์ „ ๋ ˆ์‹œํ”ผ๊ฐ€ ๋œ๋‹ค.

์…‹์งธ, ์ •๋ฐ€ ๊ทธ๋ฆฝ์˜ ๋น„๋Œ€์นญ์„ฑ

์†๋๊ณผ ์†๋์ด ๋‹ฟ์•„์•ผ ํ•˜๋Š” ๊ทธ๋ฆฝ(precision grasp)์€ ์‹คํŒจ์— ๋งค์šฐ ๋ฏผ๊ฐํ•˜๋‹ค. 9 mm ์ถ”์  ์˜ค์ฐจ๋Š” ์†๋ฐ”๋‹ฅ ์œ„ ์ž์„ธ์—์„œ๋Š” ๋ฌด์‹œํ•  ๋งŒํ•˜์ง€๋งŒ, โ€œ์—„์ง€์™€ ๊ฒ€์ง€๋กœ ์ข…์ด ํ•œ ์žฅ ์ง‘๊ธฐโ€์—์„œ๋Š” ์น˜๋ช…์ ์ด๋‹ค. 9 mm ์˜ค์ฐจ๋กœ ์†๊ฐ€๋ฝ์ด 1 mm ๋–จ์–ด์ง€๋ฉด ์ข…์ด๋Š” ๋–จ์–ด์ง„๋‹ค.

์ด ๋น„๋Œ€์นญ์„ ๋‹ค๋ฃจ๋Š” ๊ฒŒ DexPilot์˜ ์˜๋ฆฌํ•œ ๋ถ€๋ถ„์ด๊ณ , ๋’ค์—์„œ projection scheme์ด๋ผ๋Š” ์ด๋ฆ„์œผ๋กœ ๋“ฑ์žฅํ•œ๋‹ค.

์‹œ์Šคํ…œ ํ•œ๋ˆˆ์— ๋ณด๊ธฐ

์ „์ฒด ํŒŒ์ดํ”„๋ผ์ธ์€ ๋‹ค์Œ๊ณผ ๊ฐ™๋‹ค:

flowchart LR
    A[4x RealSense<br/>Depth Cameras] --> B[Hand Segmentation<br/>plane removal]
    B --> C[PointNet++ Stage 1<br/>coarse hand pose]
    C --> D[PointNet++ Stage 2<br/>refined pose]
    D --> E[DART Model-based<br/>Tracker]
    E --> F{Hand Joints<br/>23 keypoints}
    F --> G[Kinematic Retargeting<br/>SLSQP nonlinear opt.]
    F --> H[Palm Pose<br/>6-DoF]
    G --> I[Allegro 16 joint angles]
    H --> J[RMPs<br/>Arm Motion Policy]
    J --> K[KUKA LBR4+<br/>7-DoF arm]
    I --> L[Allegro Hand<br/>torque controller]
    K --> M[Robot World]
    L --> M

ํ•ต์‹ฌ ์ˆซ์ž๋งŒ ์ •๋ฆฌํ•˜๋ฉด:

๊ตฌ์„ฑ์š”์†Œ ์‚ฌ์–‘
๊นŠ์ด ์นด๋ฉ”๋ผ Intel RealSense D415 x 4
GPU NVIDIA GPU x 2 (PointNet++ + DART)
์† Allegro Hand V4, 16 DoF
ํŒ” KUKA LBR4+, 7 DoF
์ด ์ž์œ ๋„ 23 DoA
์ œ์–ด ์ฃผ๊ธฐ 30 Hz
ํ‰๊ท  keypoint ์˜ค์ฐจ 9.7 mm
๋น„์šฉ ํ•จ์ˆ˜ ์ตœ์ ํ™” SLSQP (NLopt)
ํŒ” ์ œ์–ด๊ธฐ Riemannian Motion Policies
์† ์ž์„ธ ์ถ”์ • ๋ชจ๋ธ PointNet++ 2-stage + DART
์†๋ 2D ๊ฒ€์ถœ ๋ณด์กฐ GloveNet (ResNet-50 + spatial-softmax)

๋น„์šฉ์ด ์™œ ๋‚ฎ์€์ง€๋Š” ํ‘œ ์•ˆ์— ๋‹ค ์žˆ๋‹ค. ์นด๋ฉ”๋ผ ํ•œ ๋Œ€๊ฐ€ ์•ฝ 150๋‹ฌ๋Ÿฌ, GPU์™€ ์›Œํฌ์Šคํ…Œ์ด์…˜์„ ๋”ํ•ด๋„ ๊ธ€๋Ÿฌ๋ธŒ ๊ธฐ๋ฐ˜ ์†”๋ฃจ์…˜(์ˆ˜์ฒœ๋งŒ ์›)์˜ ์ผ๋ถ€์— ๋ถˆ๊ณผํ•˜๋‹ค.

์‹œ๊ฐ ํŒŒ์ดํ”„๋ผ์ธ: ๋ชจ๋ธ ํ”„๋ฆฌ์™€ ๋ชจ๋ธ ๊ธฐ๋ฐ˜์˜ ๊ฒฐํ˜ผ

1๋‹จ๊ณ„: PointNet++๋กœ ๊ฑฐ์นœ ์ž์„ธ ์žก๊ธฐ

๋„ค ๋Œ€์˜ ์นด๋ฉ”๋ผ์—์„œ ๋“ค์–ด์˜จ ๊นŠ์ด ์˜์ƒ์€ ํ•ฉ์ณ์ ธ ํ•˜๋‚˜์˜ ํฌ์ธํŠธ ํด๋ผ์šฐ๋“œ๊ฐ€ ๋œ๋‹ค. ์ฑ…์ƒ์€ ํ‰๋ฉด ํ”ผํŒ…(RANSAC)์œผ๋กœ ์ œ๊ฑฐํ•œ๋‹ค. ๋‚จ์€ ์ ๋“ค์—๋Š” ์†, ํŒ”, ๊ทธ๋ฆฌ๊ณ  ์‚ฌ๋žŒ์˜ ๋ชธํ†ต ์ผ๋ถ€๊ฐ€ ๋“ค์–ด ์žˆ๋‹ค.

์—ฌ๊ธฐ์„œ PointNet++๊ฐ€ ๋“ฑ์žฅํ•œ๋‹ค. PointNet++์˜ ๋‘๋“œ๋Ÿฌ์ง„ ํŠน์ง•์€ ์  ์ง‘ํ•ฉ์„ ์ž…๋ ฅ์œผ๋กœ ๋ฐ›์•„ ์ˆœ์„œ์— ๋ถˆ๋ณ€(permutation invariant)ํ•˜๋ฉด์„œ๋„ ์ง€์—ญ์  ๊ตฌ์กฐ๋ฅผ ๋ณธ๋‹ค๋Š” ๊ฒƒ์ด๋‹ค. ์†๋ฐ”๋‹ฅ ํ•œ ์ ์„ ์ฒ˜๋ฆฌํ•  ๋•Œ, ๊ทธ ์  ์ฃผ๋ณ€์˜ ์ด์›ƒ ์ ๋“ค์ด ์ด๋ฃจ๋Š” ์ž‘์€ ํŒจ์น˜๋ฅผ ํ•จ๊ป˜ ๋ณธ๋‹ค. ์ด ์ง€์—ญ ํŒจ์น˜๋“ค์ด ๊ณ„์ธต์ ์œผ๋กœ ๋ชจ์—ฌ ๊ฒฐ๊ตญ ์† ์ „์ฒด์˜ ์ž์„ธ๋ฅผ ๊ฒฐ์ •ํ•œ๋‹ค.

DexPilot์˜ ๋„คํŠธ์›Œํฌ๋Š” ๋‘ ๋‹จ๊ณ„๋กœ ๊ตฌ์„ฑ๋œ๋‹ค:

  • Stage 1: ์ž…๋ ฅ ํฌ์ธํŠธ ํด๋ผ์šฐ๋“œ ์ „์ฒด์—์„œ ์†์˜ ๊ฑฐ์นœ ์ž์„ธ(global rotation + translation + 23๊ฐœ keypoint์˜ ๋Œ€๋žต์  ์œ„์น˜)๋ฅผ ์ถ”์ •ํ•œ๋‹ค. ์ฑ…์ƒ์€ ๋น ์กŒ์ง€๋งŒ ํŒ”๊ณผ ๋ชธํ†ต์ด ๋‚จ์•„ ์žˆ์–ด ๋…ธ์ด์ฆˆ๊ฐ€ ํฌ๋‹ค.
  • Stage 2: Stage 1์˜ ์ถœ๋ ฅ์„ ๋ฐ”ํƒ•์œผ๋กœ ์† ์˜์—ญ์„ ์ž˜๋ผ๋‚ด๊ณ , ๊ทธ ์•ˆ์—์„œ ์ •๋ฐ€ํ•œ keypoint ์œ„์น˜๋ฅผ ๋‹ค์‹œ ์ถ”์ •ํ•œ๋‹ค. ๊ฐ™์€ PointNet++ ๋ฐฑ๋ณธ์— MLP ํ—ค๋“œ๋ฅผ ๋ถ™์ธ ๊ตฌ์กฐ.

7,000์žฅ์˜ ๊ฒ€์ฆ ์˜์ƒ์—์„œ ํ‰๊ท  9.7 mm ์˜ค์ฐจ. ์ด๊ฑด ์†๋ชฉ ๋„ˆ๋น„์˜ 1/3 ์ˆ˜์ค€์ด๊ณ , ์†๊ฐ€๋ฝ ๋งˆ๋”” ๊ธธ์ด์˜ 1/5 ์ˆ˜์ค€์ด๋‹ค. ์ •๋ฐ€ ๊ทธ๋ฆฝ์—๋Š” ๋ถ€์กฑํ•˜์ง€๋งŒ ์‹œ์ž‘์ ์œผ๋กœ๋Š” ์ถฉ๋ถ„ํ•˜๋‹ค.

2๋‹จ๊ณ„: DART๋กœ ์ •๋ฐ€ ์ถ”์ 

DART(Dense Articulated Real-time Tracking)๋Š” GPU ๊ธฐ๋ฐ˜ ๋ชจ๋ธ ๊ธฐ๋ฐ˜ ์ถ”์ ๊ธฐ๋‹ค. ํ•ต์‹ฌ ์•„์ด๋””์–ด๋Š” ๋‹ค์Œ๊ณผ ๊ฐ™๋‹ค:

  1. ์†์˜ ์—ฐ๊ฒฐ๋œ ๊ฐ•์ฒด ๋ชจ๋ธ(kinematic tree)์„ ๋ฏธ๋ฆฌ ๋งŒ๋“ค์–ด ๋‘”๋‹ค.
  2. ๊ฐ ๊ฐ•์ฒด์— ๋Œ€ํ•ด signed distance function (SDF) โ€” โ€œ์ด ์ ์—์„œ ํ‘œ๋ฉด๊นŒ์ง€์˜ ๋ถ€ํ˜ธ ์žˆ๋Š” ๊ฑฐ๋ฆฌโ€ โ€” ์„ ์ •์˜ํ•œ๋‹ค.
  3. ๊ด€์ธก๋œ ํฌ์ธํŠธ ํด๋ผ์šฐ๋“œ์˜ ๊ฐ ์ ์— ๋Œ€ํ•ด, ๊ฐ€๊นŒ์šด ๊ฐ•์ฒด์˜ SDF ๊ฐ’์„ ๊ณ„์‚ฐํ•œ๋‹ค.
  4. ๋ชจ๋“  ์ ์˜ SDF ํ•ฉ์„ ์ตœ์†Œํ™”ํ•˜๋Š” ๊ด€์ ˆ ๊ฐ๋„๋ฅผ ๋น„์„ ํ˜• ์ตœ์ ํ™”๋กœ ๊ตฌํ•œ๋‹ค.

PointNet++๊ฐ€ โ€œ์–ด๋””์ฏคโ€์ด๋ผ๊ณ  ์™ธ์น˜๋ฉด DART๊ฐ€ โ€œ์ •ํ™•ํžˆ ์—ฌ๊ธฐโ€๋ผ๊ณ  ๋‹ค๋“ฌ๋Š” ๊ตฌ์กฐ๋‹ค. ๋‘ ๊ฐ€์ง€๊ฐ€ ๊ฒฐํ•ฉ๋˜๋Š” ๋ฐฉ์‹์ด ์ค‘์š”ํ•˜๋‹ค:

  • PointNet++์˜ keypoint ์˜ˆ์ธก์€ DART์˜ ์ดˆ๊ธฐํ™” ๋ฐ ์„ ํ—˜์  ์ œ์•ฝ์œผ๋กœ ๋“ค์–ด๊ฐ„๋‹ค.
  • ๋งค ํ”„๋ ˆ์ž„ DART๋Š” ์ด์ „ ํ”„๋ ˆ์ž„์˜ ๊ฒฐ๊ณผ๋ฅผ ๋”ฐ๋ผ๊ฐ€์ง€๋งŒ, ํฐ ์ฐจ์ด๊ฐ€ ๋‚˜๋ฉด PointNet++ ์˜ˆ์ธก์œผ๋กœ ๋ฆฌ์…‹ํ•œ๋‹ค.

์ด ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ๋•๋ถ„์— DexPilot์€ ์†์ด ๋น ๋ฅด๊ฒŒ ์›€์ง์—ฌ๋„(jerk) ์•ˆ ์žƒ์–ด๋ฒ„๋ฆฌ๊ณ , ์†๊ฐ€๋ฝ์ด ๊ผฌ์—ฌ๋„ ํ•œ ์†๊ฐ€๋ฝ์ด ๋‹ค๋ฅธ ์†๊ฐ€๋ฝ์œผ๋กœ โ€œํŠ€๋Š”โ€ ํ˜„์ƒ์ด ์ ๋‹ค. ์ด ๊ตฌ์กฐ๋Š” ์ดํ›„ ๋งŽ์€ ํ…”๋ ˆ์˜คํผ๋ ˆ์ด์…˜ ์‹œ์Šคํ…œ์˜ ํ‘œ์ค€์ด ๋œ๋‹ค.

flowchart TB
    PC[Point Cloud<br/>~30K points] --> PN[PointNet++<br/>Stage 1+2]
    PN -->|23 keypoints<br/>coarse pose| INIT[DART Init/<br/>Re-init check]
    PN -.->|fallback if<br/>DART loses track| INIT
    INIT --> DART[DART SDF<br/>Optimization]
    DART -->|refined joints| OUT[Final Hand Pose]
    OUT -.->|prev frame| DART

๋ณด์กฐ: GloveNet์œผ๋กœ ์†๋ 2D ์œ„์น˜ ๋ณด๊ฐ•

์†๋ ์ถ”์ ์ด ํŠนํžˆ ์–ด๋ ค์šฐ๋‹ˆ ๋ณ„๋„ ๋ชจ๋ธ์„ ๋‘”๋‹ค. GloveNet์€ RGB ์ด๋ฏธ์ง€์—์„œ ResNet-50 ๋ฐฑ๋ณธ ์œ„์— spatial-softmax ํ—ค๋“œ๋ฅผ ์–น์–ด ์†๋์˜ 2D ์œ„์น˜๋ฅผ ํšŒ๊ท€ํ•œ๋‹ค. ๊นŠ์ด ๋งต์˜ ์†๋์€ ์ข…์ข… ๋…ธ์ด์ฆˆ์— ๋ฌปํ˜€ ์‚ฌ๋ผ์ง€๋ฏ€๋กœ, ์ด RGB ๊ธฐ๋ฐ˜ ๋‹จ์„œ๊ฐ€ ๊นŠ์ด ์‹ ํ˜ธ์˜ ๋นˆ ๊ณณ์„ ๋ฉ”์šด๋‹ค.

์šด๋™ํ•™์  ๋ฆฌํƒ€๊ฒŒํŒ…: ์ด ๋…ผ๋ฌธ์˜ ์ง„์งœ ํ•ต์‹ฌ

์—ฌ๊ธฐ์„œ๋ถ€ํ„ฐ๊ฐ€ DexPilot์˜ ์ง„์งœ ๊ธฐ์—ฌ๋‹ค. ์† ์ž์„ธ๋ฅผ ์•Œ์•˜๋‹ค๊ณ  ์น˜์ž โ€” 23๊ฐœ keypoint์˜ 3D ์ขŒํ‘œ๊ฐ€ ๋งค ํ”„๋ ˆ์ž„ ๋“ค์–ด์˜จ๋‹ค. ์ด๊ฑธ Allegro์˜ 16๊ฐœ ๊ด€์ ˆ ๊ฐ๋„๋กœ ์–ด๋–ป๊ฒŒ ๋ฐ”๊พธ๋‚˜?

์ž˜๋ชป๋œ ๋ฐฉ๋ฒ•๋“ค

๋จผ์ € ์•ˆ ๋˜๋Š” ๋ฐฉ๋ฒ•๋ถ€ํ„ฐ ๋ณด์ž. ์ง๊ด€๊ณผ ์–ด๊ธ‹๋‚˜๋Š” ๊ฒŒ ์–ด๋””์ธ์ง€๊ฐ€ ๋” ๊ตํ›ˆ์ ์ด๋‹ค.

  1. ๊ด€์ ˆ ๊ฐ๋„ ๋ณต์‚ฌ: ์‚ฌ๋žŒ์˜ ๊ฒ€์ง€ PIP ๊ด€์ ˆ์ด 30ยฐ๋ฉด Allegro ๊ฒ€์ง€ PIP๋„ 30ยฐ. ๊ธธ์ด๋„ ๋ฒ ์ด์Šค ์œ„์น˜๋„ ๋‹ค๋ฅด๋‹ˆ ์†๋์ด ๋งŒ๋‚˜์•ผ ํ•  ๊ณณ์—์„œ ๋น—๋‚˜๊ฐ„๋‹ค.
  2. ์†๋ ์œ„์น˜ ๋ณต์‚ฌ: ์‚ฌ๋žŒ ์†๋ฐ”๋‹ฅ ์ขŒํ‘œ๊ณ„์—์„œ ์†๋ ์œ„์น˜๋ฅผ ์ธก์ •ํ•˜๊ณ  Allegro ์†๋ฐ”๋‹ฅ ์ขŒํ‘œ๊ณ„์— ๊ทธ๋Œ€๋กœ ์ž…๋ ฅ. ์†๊ฐ€๋ฝ ๊ธธ์ด๊ฐ€ ์งง์€ Allegro๋Š” ๋„๋‹ฌ ๋ถˆ๊ฐ€๋Šฅํ•œ ์ ์ด ๋งŽ๊ณ , ๋„๋‹ฌํ•˜๋”๋ผ๋„ ์†๊ฐ€๋ฝ์ด ์ด์ƒํ•˜๊ฒŒ ํœœ๋‹ค.
  3. ๋‹จ์ˆœ BioIK ํ’€์ด: ์†๋ฐ”๋‹ฅ์—์„œ ์†๋๊นŒ์ง€์˜ ๋ฒกํ„ฐ๋ฅผ ๋งค์นญ. ์†๋๋ผ๋ฆฌ์˜ ์ƒ๋Œ€ ์œ„์น˜๋Š” ๋ณด์กด๋˜์ง€ ์•Š์œผ๋‹ˆ ์ •๋ฐ€ ๊ทธ๋ฆฝ์ด ๋ง๊ฐ€์ง„๋‹ค.

์˜ณ์€ ์งˆ๋ฌธ: โ€œ๊ทธ๋ฆฝ์˜ ๊ธฐํ•˜ํ•™์—์„œ ๋ฌด์—‡์ด ์ค‘์š”ํ•œ๊ฐ€?โ€

์ง‘๊ธฐ๋ฅผ ๊ฒฐ์ •ํ•˜๋Š” ๊ฑด ์†๋๊ณผ ์†๋ ์‚ฌ์ด์˜ ์ƒ๋Œ€ ๋ฒกํ„ฐ๋‹ค. ์—„์ง€๊ฐ€ ๊ฒ€์ง€์— ๋‹ฟ๋Š” ํ•€์น˜ ๊ทธ๋ฆฝ์ด๋ผ๋ฉด โ€œ์—„์ง€ ๋๊ณผ ๊ฒ€์ง€ ๋ ์‚ฌ์ด์˜ ๋ฒกํ„ฐโ€๊ฐ€ ์˜๋ฒกํ„ฐ๊ฐ€ ๋˜์–ด์•ผ ํ•œ๋‹ค. ์†๋ฐ”๋‹ฅ์—์„œ ์†๋๊นŒ์ง€์˜ ๋ฒกํ„ฐ๋„ ์ค‘์š”ํ•œ๋ฐ, ๊ทธ๊ฑด ๊ทธ๋ฆฝ์˜ ๊นŠ์ด์™€ ๋ฐฉํ–ฅ์„ ๊ฒฐ์ •ํ•œ๋‹ค.

DexPilot์€ ๊ทธ๋ž˜์„œ task-space ๋ฒกํ„ฐ์˜ ๋ชจ์Œ์„ ์ •์˜ํ•œ๋‹ค (Fig. 8 ์ฐธ์กฐ):

  • ์†๋ฐ”๋‹ฅ-์—„์ง€๋ ๋ฒกํ„ฐ
  • ์†๋ฐ”๋‹ฅ-๊ฒ€์ง€๋ ๋ฒกํ„ฐ
  • ์†๋ฐ”๋‹ฅ-์ค‘์ง€๋ ๋ฒกํ„ฐ
  • ์†๋ฐ”๋‹ฅ-์•ฝ์ง€๋ ๋ฒกํ„ฐ
  • ์—„์ง€๋-๊ฒ€์ง€๋ ๋ฒกํ„ฐ
  • ์—„์ง€๋-์ค‘์ง€๋ ๋ฒกํ„ฐ
  • ์—„์ง€๋-์•ฝ์ง€๋ ๋ฒกํ„ฐ

๊ฐ ๋ฒกํ„ฐ๋ฅผ ์‚ฌ๋žŒ ์†์—์„œ ์ธก์ •ํ•œ ๊ฐ’๊ณผ Allegro์—์„œ ๊ณ„์‚ฐํ•œ ๊ฐ’(๊ด€์ ˆ ๊ฐ๋„์˜ ํ•จ์ˆ˜)์ด ์ผ์น˜ํ•˜๋„๋ก ์ตœ์ ํ™”ํ•œ๋‹ค.

๋น„์šฉ ํ•จ์ˆ˜ ๋“ค์—ฌ๋‹ค๋ณด๊ธฐ

์ „์ฒด ๋น„์šฉ ํ•จ์ˆ˜๋Š” ์ง๊ด€์ ์œผ๋กœ ๋‹ค์Œ๊ณผ ๊ฐ™์ด ์“ธ ์ˆ˜ ์žˆ๋‹ค (๋…ผ๋ฌธ ํ‘œ๊ธฐ๋ฅผ ์‚ด์ง ์ •๋ฆฌ):

J(q) \;=\; \sum_{i=1}^{N} w_i \,\bigl\| f_i(q) - s\cdot r_i^{h} \bigr\|^2 \;+\; \gamma \,\| q \|^2

์—ฌ๊ธฐ์„œ:

  • q \in \mathbb{R}^{16}: Allegro ๊ด€์ ˆ ๊ฐ๋„.
  • r_i^{h} \in \mathbb{R}^3: ์‚ฌ๋žŒ ์†์—์„œ ์ธก์ •ํ•œ i๋ฒˆ์งธ task-space ๋ฒกํ„ฐ.
  • f_i(q): ๊ฐ™์€ ์ •์˜๋กœ Allegro์—์„œ ๊ณ„์‚ฐํ•œ ๋ฒกํ„ฐ (์ˆœ๋ฐฉํ–ฅ ์šด๋™ํ•™์œผ๋กœ ์–ป์Œ).
  • s: ์‚ฌ๋žŒ ์†๊ณผ Allegro ์‚ฌ์ด์˜ ์Šค์ผ€์ผ ์ธ์ž (์† ํฌ๊ธฐ ๋ณด์ •).
  • w_i: ๋ฒกํ„ฐ๋ณ„ ๊ฐ€์ค‘์น˜.
  • \gamma = 2.5 \times 10^{-3}: Allegro ๊ฐ๋„๋ฅผ 0(์™„์ „ํžˆ ํŽด์ง„ ์†)์— ๊ฐ€๊น๊ฒŒ ๋„๋Š” ์ •์น™ํ™”.

์ •์น™ํ™” ํ•ญ \gamma \|q\|^2์ด ์™œ ์žˆ๋Š”์ง€๊ฐ€ ํฅ๋ฏธ๋กญ๋‹ค. ๋น„์šฉ ํ•จ์ˆ˜๊ฐ€ ๋น„๋ณผ๋ก(non-convex)์ด๊ณ , ๊ฐ™์€ task-space ๋ฒกํ„ฐ๋ฅผ ๋งŒ๋“œ๋Š” ๊ด€์ ˆ ์กฐํ•ฉ์ด ์—ฌ๋Ÿฌ ๊ฐœ ์กด์žฌํ•œ๋‹ค. ์ •์น™ํ™”๋ฅผ ๋นผ๋ฉด ์ตœ์ ํ™”๊ธฐ๊ฐ€ ์†๊ฐ€๋ฝ์„ ์†๋ฐ”๋‹ฅ ์•ˆ์ชฝ์œผ๋กœ ๋ง์•„ ๋„ฃ๋Š” ํฌํ•œํ•œ ์ง€์—ญ ์ตœ์†Œ์ ์— ๋น ์ง€๋Š” ๊ฒฝ์šฐ๊ฐ€ ์ž์ฃผ ์ƒ๊ธด๋‹ค. ํ•œ ๋ฒˆ ๊ทธ๋Ÿฐ ์ž์„ธ์— ๋“ค์–ด๊ฐ€๋ฉด ๋‹ค์Œ ํ”„๋ ˆ์ž„์—์„œ ๋น ์ ธ๋‚˜์˜ค๊ธฐ ์–ด๋ ต๋‹ค. \gamma๋Š” ๊ทธ ํ•จ์ •์— ๋น ์ง€์ง€ ์•Š๋„๋ก ์‚ด์ง โ€œํŽด์ง„ ์ž์„ธ ์ชฝ์œผ๋กœ ๋ฐ€์–ด์ฃผ๋Š”โ€ ์Šคํ”„๋ง ์—ญํ• ์ด๋‹ค.

๋˜ ํ•˜๋‚˜์˜ ํŠธ๋ฆญ: distal ๊ด€์ ˆ์„ medial ๊ด€์ ˆ๊ณผ ๋ฌถ๋Š”๋‹ค. ์‚ฌ๋žŒ ๊ฒ€์ง€์˜ DIP๋Š” PIP์™€ ๊ฐ•ํ•œ ์ƒ๊ด€์ด ์žˆ๋‹ค(์ž์—ฐ์Šค๋Ÿฝ๊ฒŒ ๊ฐ™์ด ๊ตฝํ˜€์ง„๋‹ค). Allegro์˜ ๊ฒ€์ง€/์ค‘์ง€/์•ฝ์ง€์—์„œ๋„ distal = medial๋กœ ์ œ์•ฝ์„ ๊ฑธ์–ด ํƒ์ƒ‰ ๊ณต๊ฐ„์„ ์ค„์ธ๋‹ค. 16 DoF๊ฐ€ ์‚ฌ์‹ค์ƒ 13 DoF์ฒ˜๋Ÿผ ๋™์ž‘ํ•˜์ง€๋งŒ ์ž์—ฐ์Šค๋Ÿฌ์›€์€ ์žƒ์ง€ ์•Š๋Š”๋‹ค.

Projection Scheme: ์ •๋ฐ€ ๊ทธ๋ฆฝ์˜ ๋น„๋Œ€์นญ์„ ๊ฐ๋‹นํ•˜๋Š” ๋ฒ•

์ด๊ฒŒ ์ง„์งœ ์˜๋ฆฌํ•œ ๋ถ€๋ถ„์ด๋‹ค. ์‚ฌ๋žŒ ์†์—์„œ ์—„์ง€์™€ ๊ฒ€์ง€ ๋ ์‚ฌ์ด์˜ ๋ฒกํ„ฐ๊ฐ€ ์ž‘์•„์งˆ์ˆ˜๋ก(์ฆ‰ ํ•€์น˜ ๊ทธ๋ฆฝ์— ๊ฐ€๊นŒ์›Œ์งˆ์ˆ˜๋ก), 9 mm ์ถ”์  ์˜ค์ฐจ๊ฐ€ ์ƒ๋Œ€์ ์œผ๋กœ ์ปค์ง„๋‹ค. ์‚ฌ๋žŒ์ด ์˜๋„ํ•œ ๊ฒƒ์€ โ€œ๋‹ฟ์Œโ€์ธ๋ฐ, ์ถ”์  ๊ฒฐ๊ณผ๋Š” โ€œ8 mm ๋–จ์–ด์งโ€์ด ๋˜์–ด Allegro๋„ 8 mm ๋–จ์–ด์ง„๋‹ค. ์ข…์ด๋Š” ๋–จ์–ด์ง„๋‹ค.

DexPilot์˜ ํ•ด๊ฒฐ์ฑ…์€ ์ž„๊ณ„๊ฐ’ ๊ธฐ๋ฐ˜ ํˆฌ์˜์ด๋‹ค. ๋ฒกํ„ฐ์˜ ํฌ๊ธฐ๊ฐ€ ์–ด๋–ค ๊ฑฐ๋ฆฌ d_{\text{thresh}} ์ดํ•˜๋กœ ์ค„์–ด๋“ค๋ฉด, ๋น„์šฉ ํ•จ์ˆ˜์˜ ๊ทธ ํ•ญ์„ ๋ชฉํ‘œ = 0 (์™„์ „ ์ ‘์ด‰)์œผ๋กœ ๋ฐ”๊ฟ”์น˜๊ธฐํ•œ๋‹ค. ์ฆ‰:

\tilde r_i^{h} \;=\; \begin{cases} r_i^{h}, & \|r_i^{h}\| \geq d_{\text{thresh}} \\ \mathbf{0}, & \|r_i^{h}\| < d_{\text{thresh}} \end{cases}

๋˜ํ•œ ์ด ๋ชจ๋“œ์—์„œ๋Š” ํ•ด๋‹น ๋ฒกํ„ฐ์˜ ๊ฐ€์ค‘์น˜ w_i๋ฅผ ๋Œ€ํญ ์˜ฌ๋ ค์„œ ์ตœ์ ํ™”๊ธฐ๊ฐ€ โ€œ์–ด๋–ป๊ฒŒ ํ•ด์„œ๋“  ์†๋์„ ๋ถ™์—ฌ๋ผโ€๋ผ๊ณ  ๋ช…๋ น๋ฐ›๊ฒŒ ํ•œ๋‹ค.

์ง๊ด€์ ์œผ๋กœ ํ‘œํ˜„ํ•˜๋ฉด: ์‚ฌ๋žŒ์ด โ€œ๊ฑฐ์˜โ€ ์†๊ฐ€๋ฝ์„ ๋ถ™์˜€๋‹ค ์‹ถ์œผ๋ฉด ์‹œ์Šคํ…œ์€ โ€œ๋ถ™์ด๋ ค ํ–ˆ๊ตฌ๋‚˜โ€๋กœ ํ•ด์„ํ•œ๋‹ค. ์ด๊ฒŒ ๋„ˆ๋ฌด ๊ณต๊ฒฉ์ ์ด๋ฉด ์‚ฌ์šฉ์ž๊ฐ€ ์†๊ฐ€๋ฝ์„ ์‚ด์ง ๋ฒŒ๋ฆฌ๋ ค๋Š” ๋ฏธ์„ธ ์กฐ์ •์— ์‹œ์Šคํ…œ์ด ์‘๋‹ตํ•˜์ง€ ์•Š๋Š”๋‹ค โ€” ์ด ๋ถ€์ž‘์šฉ์€ ๋…ผ๋ฌธ๋„ ์ธ์ •ํ•œ๋‹ค. ๊ทธ๋ž˜์„œ ์‚ฌ์šฉ์ž๊ฐ€ projection์„ ๋Œ ์ˆ˜ ์žˆ๋Š” ํ† ๊ธ€์ด ์žˆ๋‹ค.

์˜์‚ฌ์ฝ”๋“œ๋กœ ์ •๋ฆฌํ•˜๋ฉด

# Per-frame retargeting (called at 30 Hz)
def retarget(human_keypoints_3d, q_prev):
    # 1) Build task-space vectors from human hand
    r_h = compute_task_vectors(human_keypoints_3d)   # 7 vectors

    # 2) Apply projection for precision grasps
    for i in range(len(r_h)):
        if norm(r_h[i]) < d_thresh:
            r_h_tilde[i] = zeros(3)
            w[i] = w_high           # snap together
        else:
            r_h_tilde[i] = r_h[i]
            w[i] = w_nominal

    # 3) Solve nonlinear least-squares with SLSQP
    def cost(q):
        r_a = forward_kinematics_vectors(q)  # Allegro vectors
        err = 0
        for i in range(len(r_h_tilde)):
            err += w[i] * norm(r_a[i] - s * r_h_tilde[i])**2
        err += gamma * norm(q)**2
        return err

    q_star = slsqp_minimize(
        cost,
        x0=q_prev,                # warm-start from previous solution
        constraints=joint_limits,
        equality=[q.distal == q.medial for primary fingers]
    )

    # 4) Low-pass filter to smooth jitter and projection switches
    q_out = lowpass(q_star, q_prev_out)
    return q_out

์„ธ ๊ฐ€์ง€ ๋””ํ…Œ์ผ์— ์ฃผ๋ชฉํ•  ๋งŒํ•˜๋‹ค:

  1. Warm start: ์ด์ „ ํ”„๋ ˆ์ž„ ํ•ด๋ฅผ ์ดˆ๊ธฐ๊ฐ’์œผ๋กœ ์“ฐ๋ฉด SLSQP๊ฐ€ ๋ณดํ†ต 5-10 iteration ๋งŒ์— ์ˆ˜๋ ดํ•œ๋‹ค. 30 Hz ์‹ค์‹œ๊ฐ„ ์ œ์–ด๊ฐ€ ๊ฐ€๋Šฅํ•ด์ง€๋Š” ํ•ต์‹ฌ์ด๋‹ค.
  2. ์ €์—ญ ํ†ต๊ณผ ํ•„ํ„ฐ: ์ถ”์  ๋…ธ์ด์ฆˆ์™€ projection switch๊ฐ€ ๋งŒ๋“œ๋Š” ๊ณ ์ฃผํŒŒ ์ ํ”„๋ฅผ ๋ถ€๋“œ๋Ÿฝ๊ฒŒ ๋งŒ๋“ ๋‹ค. 1์ฐจ ํ•„ํ„ฐ ํ•œ ์ค„๋กœ ์†์ด ๋–จ๋ฆฌ์ง€ ์•Š๊ฒŒ ๋œ๋‹ค.
  3. SLSQP๋Š” KKT ๋งŒ์กฑํ•˜๋Š” SQP: ๊ด€์ ˆ ํ•œ๊ณ„, distal=medial ๊ฐ™์€ ๋“ฑ์‹/๋ถ€๋“ฑ์‹ ์ œ์•ฝ์„ ์ž์—ฐ์Šค๋Ÿฝ๊ฒŒ ๋‹ค๋ฃฌ๋‹ค.

ํŒ” ์ œ์–ด: Riemannian Motion Policies

์† ๊ด€์ ˆ์€ retargeting์œผ๋กœ ํ’€๋ ธ๋‹ค. ๊ทธ๋Ÿผ ํŒ”์€? ์‚ฌ๋žŒ ์†๋ฐ”๋‹ฅ์˜ 6-DoF ์ž์„ธ๋ฅผ ๋ฐ›์•„ KUKA LBR4+ 7-DoF ํŒ”์ด ๋”ฐ๋ผ๊ฐ€์•ผ ํ•œ๋‹ค. ๊ทธ๋ƒฅ IK๋ฅผ ํ’€๋ฉด ๋  ๊ฒƒ ๊ฐ™์ง€๋งŒ โ€” ๊ทธ๋Ÿฌ๋ฉด ์ถฉ๋Œ๊ณผ ํŠน์ด์ ์ด ๋ฌธ์ œ๋‹ค.

DexPilot์€ RMPflow ๊ณ„์—ด์˜ Riemannian Motion Policies (RMPs)๋ฅผ ์“ด๋‹ค. ์ง๊ด€์€ ์ด๋ ‡๋‹ค:

  • ๊ฐ ์„œ๋ธŒํƒœ์Šคํฌ(์˜ˆ: โ€œ์—”๋“œ ์ดํŽ™ํ„ฐ๋ฅผ ๋ชฉํ‘œ ์ž์„ธ๋กœโ€, โ€œ์žฅ์• ๋ฌผ ํ”ผํ•˜๊ธฐโ€, โ€œ๊ด€์ ˆ ํ•œ๊ณ„ ๋ฉ€๋ฆฌํ•˜๊ธฐโ€)์— ๋Œ€ํ•ด ๊ฐ€์†๋„ ํ•„๋“œ์™€ ๊ทธ ์‹ ๋ขฐ๋„๋ฅผ ํ‘œํ˜„ํ•˜๋Š” ๋ฉ”ํŠธ๋ฆญ์„ ์ •์˜ํ•œ๋‹ค.
  • ๋ชจ๋“  ์„œ๋ธŒํƒœ์Šคํฌ์˜ ๊ฐ€์†๋„๋ฅผ ๋ฉ”ํŠธ๋ฆญ์œผ๋กœ ๊ฐ€์ค‘ ํ‰๊ท ํ•ด์„œ ํ•ฉ์„ฑ ๊ฐ€์†๋„๋ฅผ ๋งŒ๋“ ๋‹ค.
  • ํ•ฉ์„ฑ ๊ฐ€์†๋„๋ฅผ ๊ด€์ ˆ ๊ณต๊ฐ„์œผ๋กœ ํ’€๋ฐฑ(pullback)ํ•ด์„œ ํ† ํฌ ๋ช…๋ น์œผ๋กœ ๋ณ€ํ™˜ํ•œ๋‹ค.

์ˆ˜์‹์„ ๋นŒ๋ฆฌ๋ฉด, i๋ฒˆ์งธ ์ •์ฑ…์ด ๊ฐ€์†๋„ \ddot x_i์™€ ๋ฉ”ํŠธ๋ฆญ M_i๋ฅผ ๋‚ด๋†“์„ ๋•Œ ํ•ฉ์„ฑ ์ •์ฑ…์€:

\ddot x^* \;=\; \Bigl(\sum_i M_i\Bigr)^{-1} \sum_i M_i \, \ddot x_i

๋ฌผ๋ฆฌ์ ์œผ๋กœ๋Š” โ€œ์‹ ๋ขฐ๋„ ๊ฐ€์ค‘ ํ•ฉ์˜(consensus by precision-weighted averaging)โ€๋‹ค. ๊ฐ€๊นŒ์šด ์žฅ์• ๋ฌผ์€ ๋ฉ”ํŠธ๋ฆญ์ด ์ปค์ง€๊ณ , ๋”ฐ๋ผ์„œ ํšŒํ”ผ ๊ฐ€์†๋„๊ฐ€ ๋‹ค๋ฅธ ๋ชจ๋“  ์ •์ฑ…์„ ์••๋„ํ•œ๋‹ค. ๊ฑฐ๋ฆฌ๊ฐ€ ๋ฉ€์–ด์ง€๋ฉด ๋ฉ”ํŠธ๋ฆญ์ด ์ž‘์•„์ ธ ์‚ฌ์‹ค์ƒ ๋ฌด์‹œ๋œ๋‹ค.

DexPilot ๋งฅ๋ฝ์—์„œ ์ค‘์š”ํ•œ ๊ฒƒ์€ RMP๊ฐ€ ๋‹ซํžŒ ํ˜•ํƒœ๋กœ ์ถฉ๋Œ ํšŒํ”ผ์™€ ์ถ”์ข…์„ ๋™์‹œ์— ์ฒ˜๋ฆฌํ•œ๋‹ค๋Š” ์ ์ด๋‹ค. ์‚ฌ์šฉ์ž๊ฐ€ ์†์„ ๋น ๋ฅด๊ฒŒ ํœ˜๋‘˜๋Ÿฌ๋„ Allegro ์†๋ฐ”๋‹ฅ์ด ํŒ”๊ณผ ์ž๊ธฐ ์ž์‹ ์„ ๋“ค์ด๋ฐ›์ง€ ์•Š๋Š”๋‹ค.

์‹คํ—˜: 15๊ฐœ ๊ณผ์ œ, ๋‘ ๋ช…์˜ ํŒŒ์ผ๋Ÿฟ, ๋‹ค์„ฏ ๋ฒˆ์”ฉ

ํ‰๊ฐ€ ์„ค๊ณ„

๋…ผ๋ฌธ์€ ์ •๋ฐ€ ๊ทธ๋ฆฝ, ํŒŒ์›Œ ๊ทธ๋ฆฝ, prehensile/non-prehensile ์กฐ์ž‘, ์†๊ฐ€๋ฝ ๋ณดํ–‰(finger gaiting)์„ ๋‘๋ฃจ ํฌํ•จํ•˜๋Š” 15๊ฐœ ๊ณผ์ œ๋ฅผ ์ •์˜ํ•œ๋‹ค. ๊ฐ ๊ณผ์ œ๋Š” ๋‘ ๋ช…์˜ ํ›ˆ๋ จ๋œ โ€œํŒŒ์ผ๋Ÿฟโ€์ด ๋‹ค์„ฏ ๋ฒˆ์”ฉ ์ˆ˜ํ–‰ํ•œ๋‹ค. ์ธก์ • ์ง€ํ‘œ:

  • ์™„๋ฃŒ ์‹œ๊ฐ„ (mean completion time): ์‹œ๊ฐ„์ด ์งง์„์ˆ˜๋ก ์‹œ์Šคํ…œ์ด ์ง๊ด€์ ์ด๊ณ  ์ •ํ™•ํ•˜๋‹ค.
  • ์„ฑ๊ณต๋ฅ  (success rate): 5ํšŒ ์ค‘ ์„ฑ๊ณตํ•œ ํšŸ์ˆ˜.

๋Œ€ํ‘œ ๊ณผ์ œ์™€ ๊ฒฐ๊ณผ

๋…ผ๋ฌธ์— ๋‚˜์˜ค๋Š” ์ธ์ƒ์ ์ธ ๊ณผ์ œ๋“ค:

  • ์ง€๊ฐ‘์—์„œ ์ง€ํ ๋นผ๊ธฐ: ํ•œ ์†์œผ๋กœ ์ง€๊ฐ‘์„ ์žก๊ณ , ๋‹ค๋ฅธ ์†๊ฐ€๋ฝ ์กฐํ•ฉ์œผ๋กœ ์ข…์ด๋ฅผ ํ•€์น˜ํ•ด ๋นผ๋‚ธ๋‹ค. ๋‘ ์†๊ฐ€๋ฝ ๋ ์‚ฌ์ด ์••๋ ฅ์„ ์œ ์ง€ํ•ด์•ผ ํ•˜๋ฏ€๋กœ projection scheme์ด ๊ฒฐ์ •์ ์ด๋‹ค.
  • ํ‹ฐ ์„œ๋ž ์—ด๊ธฐ, ํ‹ฐ๋ฐฑ ๊บผ๋‚ด๊ธฐ, ๋‹ค์‹œ ๋‹ซ๊ธฐ: ๊ธด ์‹œ๊ณ„์—ด์˜ multi-step ๊ณผ์ œ. ๋ˆ„์  ์˜ค์ฐจ์— ๊ฐ•๊ฑดํ•ด์•ผ ํ•œ๋‹ค.
  • Pringles ํ†ต ์„ธ์šฐ๊ธฐ, ๋นจ๊ฐ„ ํ†ต ์•ˆ์— ๋„ฃ๊ธฐ: ๋Œ€ํ˜• ๋ฌผ์ฒด์˜ ๋น„ํŒŒ์ง€(non-prehensile) ์กฐ์ž‘ + ํŒŒ์ง€ ์ „ํ™˜.
  • ๋‘ ํ๋ธŒ๋ฅผ ๋„ค ์†๊ฐ€๋ฝ ์‚ฌ์ด์— ๋ผ์šฐ๊ธฐ: ์†๊ฐ€๋ฝ ์‚ฌ์ด ์••๋ ฅ์„ ๋ถ„์‚ฐํ•ด์„œ ์œ ์ง€ํ•ด์•ผ ํ•˜๋Š” ์–ด๋ ค์šด multi-finger grasp.

ํ‰๊ท ์ ์œผ๋กœ ๋‘ ํŒŒ์ผ๋Ÿฟ ๋ชจ๋‘ ๋Œ€๋ถ€๋ถ„ ๊ณผ์ œ์—์„œ 80% ์ด์ƒ์˜ ์„ฑ๊ณต๋ฅ ์„ ๊ธฐ๋กํ–ˆ๊ณ , ์™„๋ฃŒ ์‹œ๊ฐ„์€ ๊ณผ์ œ๋ณ„๋กœ 5-30์ดˆ ๋ฒ”์œ„์˜€๋‹ค. ์ •๋ฐ€ ๊ทธ๋ฆฝ ๊ณผ์ œ๋Š” ํ›ˆ๋ จ ์‹œ๊ฐ„์ด ๊ธธ์—ˆ๋‹ค๋Š” ์ ์ด ๋ณด๊ณ ๋œ๋‹ค โ€” ํŒŒ์ผ๋Ÿฟ์ด ์‹œ์Šคํ…œ์˜ ์‘๋‹ต ํŠน์„ฑ์— ์ ์‘ํ•˜๋Š” ์‹œ๊ฐ„์ด ํ•„์š”ํ•˜๋‹ค. ํ…”๋ ˆ์˜คํผ๋ ˆ์ด์…˜์˜ โ€œ์œ ๋Šฅํ•จโ€์ด ์‹œ์Šคํ…œ๋ฟ ์•„๋‹ˆ๋ผ ์‚ฌ๋žŒ-์‹œ์Šคํ…œ ๊ฒฐํ•ฉ์˜ ํ•จ์ˆ˜์ž„์„ ๋ณด์—ฌ์ฃผ๋Š” ๋Œ€๋ชฉ์ด๋‹ค.

๊ฒฐ๊ณผ์˜ ์ง„์งœ ์˜๋ฏธ

์ด ์‹คํ—˜์ด ๋ณด์—ฌ์ฃผ๋Š” ๊ฒƒ์€ ๋‹จ์ง€ โ€œDexPilot์ด ์ž‘๋™ํ•œ๋‹คโ€๊ฐ€ ์•„๋‹ˆ๋‹ค. ๋” ๊นŠ์€ ๋ฉ”์‹œ์ง€๋Š” ๋‘ ๊ฐ€์ง€๋‹ค:

  1. ์ด‰๊ฐ ์—†์ด๋„ ์‹œ๊ฐ๋งŒ์œผ๋กœ ์ •๋ฐ€ ์กฐ์ž‘์ด ๊ฐ€๋Šฅํ•˜๋‹ค. ์ธ๊ฐ„ ์†-๋ˆˆ ํ˜‘์‘์˜ ์˜ˆ์ธก ๋ชจ๋ธ(์‚ฌ๋žŒ์ด ์‹œ๊ฐ์œผ๋กœ ์ ‘์ด‰์„ ์˜ˆ์ธกํ•œ๋‹ค๋Š” ์ธ์ง€์‹ฌ๋ฆฌ ์—ฐ๊ตฌ [12])์ด ํ…”๋ ˆ์˜คํผ๋ ˆ์ด์…˜ ํšŒ๋กœ ์•ˆ์—์„œ๋„ ํ™œ์šฉ๋œ๋‹ค.
  2. ๊ณ ํ’ˆ์งˆ ์‹œ์—ฐ ๋ฐ์ดํ„ฐ์˜ ์ˆ˜์ง‘ ํ†ต๋กœ๊ฐ€ ์—ด๋ฆฐ๋‹ค. 23 DoA ์ƒํƒœ-ํ–‰๋™ ์‹œํ€€์Šค, ๊นŠ์ด ์˜์ƒ, RGB ์˜์ƒ์ด ๋™๊ธฐํ™”๋˜์–ด ์Œ“์ธ๋‹ค. ์ด ๋ฐ์ดํ„ฐ๊ฐ€ ์ดํ›„ imitation learning, dexterous RL, VLA ํ•™์Šต์˜ ์—ฐ๋ฃŒ๊ฐ€ ๋œ๋‹ค. DexPilot์˜ ์ง„์ •ํ•œ ์˜ํ–ฅ์€ ์‹œ์Šคํ…œ ๊ทธ ์ž์ฒด๊ฐ€ ์•„๋‹ˆ๋ผ ๊ทธ๊ฒƒ์ด ๋งŒ๋“ค์–ด ๋‚ผ ๋ฐ์ดํ„ฐ์…‹์ด๋‹ค.

๋น„ํŒ์  ๊ณ ์ฐฐ

์ž˜ ๋œ ๊ฒƒ

  • ์ €๋น„์šฉ์œผ๋กœ 23 DoA ์ „์ฒด ์ œ์–ด: ๊ธ€๋Ÿฌ๋ธŒ ์†”๋ฃจ์…˜ ๋Œ€๋น„ ํ•œ ์ž๋ฆฟ์ˆ˜ ์ด์ƒ ์ €๋ ด.
  • ๋ชจ๋ธ ๊ธฐ๋ฐ˜ + ๋ชจ๋ธ ํ”„๋ฆฌ ์ถ”์ ์˜ ๋ชจ๋ฒ” ์‚ฌ๋ก€: ์ดํ›„ ๊ฑฐ์˜ ๋ชจ๋“  vision-based teleop์ด ์ด ๊ตฌ์กฐ๋ฅผ ๋”ฐ๋ผ๊ฐ„๋‹ค.
  • Task-space ๋ฒกํ„ฐ + projection์˜ retargeting ๊ณต์‹: ๋‹จ์ˆœํ•˜๋ฉด์„œ๋„ ์ •๋ฐ€ ๊ทธ๋ฆฝ์„ ๊ฐ€๋Šฅ์ผ€ ํ•œ ๊ฒฐ์ •์  ์„ค๊ณ„.
  • ์ฆ‰์‹œ ์‚ฌ์šฉ ๊ฐ€๋Šฅํ•œ ์ฝ”๋“œ ์œ ์‚ฐ: dex-retargeting ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ ๋“ฑ์— โ€œDexPilot retargetingโ€ ๋ชจ๋“œ๋กœ ๊ทธ๋Œ€๋กœ ์‚ด์•„ ์žˆ๋‹ค.

ํ•œ๊ณ„์™€ ๋น„ํŒ์ 

๋…ผ๋ฌธ์ด ์Šค์Šค๋กœ ์ธ์ •ํ•˜๋Š” ํ•œ๊ณ„์™€ ํ•„์ž๊ฐ€ ๋ณดํƒœ๋Š” ๋น„ํŒ์„ ๊ฐ™์ด ์ •๋ฆฌํ•œ๋‹ค.

1. ์›Œํฌ ๋ณผ๋ฅจ์ด ์ž‘๋‹ค

๊ฐ RealSense ์นด๋ฉ”๋ผ๋Š” 1 m ์•ˆ์—์„œ๋งŒ ๊นŠ์ด ํ’ˆ์งˆ์ด ์‚ด์•„ ์žˆ๋‹ค. ๋”ฐ๋ผ์„œ ์‚ฌ์šฉ์ž์˜ ์†์ด ์›€์ง์ผ ์ˆ˜ ์žˆ๋Š” ์‹คํšจ ์ž‘์—… ๊ณต๊ฐ„์ด ์ข๋‹ค. ์นด๋ฉ”๋ผ๋ฅผ ๋” ์ข‹์€ ๊ฒƒ์œผ๋กœ ๋ฐ”๊พธ๋ฉด ๋Š˜์–ด๋‚˜์ง€๋งŒ, ๊ทธ๋Ÿฌ๋ฉด โ€œ์ €๋น„์šฉโ€์ด๋ผ๋Š” ์ •์ฒด์„ฑ์ด ํ”๋“ค๋ฆฐ๋‹ค. ํ›„์† ์—ฐ๊ตฌ(AnyTeleop ๋“ฑ)๋Š” VR ํ—ค๋“œ์…‹๊ณผ ์† ์œ„ ๋งˆ์šดํŠธ๋กœ ์ด ํ•œ๊ณ„๋ฅผ ๋น„๊ปด๊ฐ„๋‹ค.

2. Projection scheme์˜ ๋ถ€์ž‘์šฉ

ํ•€์น˜ ๋ชจ๋“œ๋กœ ์ผ๋‹จ snap๋˜๋ฉด, ์‚ฌ์šฉ์ž๊ฐ€ ์†๊ฐ€๋ฝ์„ ์ฒœ์ฒœํžˆ ๋–ผ๋ ค ํ•  ๋•Œ ์‹œ์Šคํ…œ์ด ๊ทธ ์˜๋„๋ฅผ ๋ชป ์•Œ์•„์ฑˆ๋‹ค. ์ž‘์€ ๋ฌผ์ฒด๋ฅผ ๋†“๋Š” ์ˆœ๊ฐ„ ์†์ด ๊ตณ์–ด ์žˆ๋Š” ์…ˆ์ด๋‹ค. Finger gaiting(๊ณต์„ ์†๊ฐ€๋ฝ ์‚ฌ์ด๋กœ ๊ตด๋ฆฌ๊ธฐ) ๊ณผ์ œ๋„ ๊ฐ™์€ ์ด์œ ๋กœ ์–ด๋ ต๋‹ค. ๋…ผ๋ฌธ์€ ํ† ๊ธ€ ์˜ต์…˜์„ ์ œ๊ณตํ•˜์ง€๋งŒ ์ด์ƒ์ ์ธ ํ•ด๋ฒ•์€ โ€œ๋” ์ •ํ™•ํ•œ ์ถ”์ ์œผ๋กœ projection ์ž์ฒด๊ฐ€ ํ•„์š” ์—†๊ฒŒ ๋งŒ๋“ค๊ธฐโ€๋ผ๊ณ  ๋งํ•œ๋‹ค.

3. ์ด‰๊ฐ์ด ์—†๋‹ค

์ด‰๊ฐ์ด ์—†์œผ๋ฉด ์ ‘์ด‰ ์ง์ „๊ณผ ์งํ›„๋ฅผ ๊ตฌ๋ณ„ ๋ชป ํ•œ๋‹ค. ์‚ฌ๋žŒ์€ ์‹œ๊ฐ์œผ๋กœ ๊ทธ ๊ฐญ์„ ๋ฉ”์šฐ์ง€๋งŒ, ํ•™์Šต๋œ ์ •์ฑ…์€ ๋ชป ํ•  ์ˆ˜ ์žˆ๋‹ค. DexPilot์œผ๋กœ ์ˆ˜์ง‘ํ•œ ๋ฐ์ดํ„ฐ๋Š” ์ƒํƒœ = ์‹œ๊ฐ + ๊ด€์ ˆ์ด์ง€ ์ƒํƒœ = ์‹œ๊ฐ + ๊ด€์ ˆ + ์ด‰๊ฐ์ด ์•„๋‹ˆ๋ผ๋Š” ํ•œ๊ณ„๊ฐ€ ์žˆ๋‹ค. ์ดํ›„ NeuralFeels, TacSL, AnyRotate ๊ฐ™์€ ์—ฐ๊ตฌ๊ฐ€ ์ด‰๊ฐ ํ†ตํ•ฉ ๋ฐฉํ–ฅ์œผ๋กœ ๊ฐ”๋‹ค.

4. ์†๋ชฉ/ํŒ”๋š ๋น„ํ‹€๋ฆผ์˜ ํ•œ๊ณ„

์‚ฌ๋žŒ ์†๋ชฉ์€ ์•ฝ 180ยฐ ํšŒ์ „ํ•˜์ง€๋งŒ, ์‹œ๊ฐ ๊ธฐ๋ฐ˜ ์ถ”์ ์€ ์ž๊ธฐ ๊ฐ€๋ฆผ(self-occlusion) ๋•Œ๋ฌธ์— ๊ทน๋‹จ ์ž์„ธ์—์„œ ์‹ ๋ขฐ๋„๊ฐ€ ๋–จ์–ด์ง„๋‹ค. ๋˜ํ•œ KUKA ํŒ”์˜ ์šด๋™ํ•™์  ํŠน์ด์  ํšŒํ”ผ ๋•Œ๋ฌธ์— ์ผ๋ถ€ ํšŒ์ „์€ ํ‘œํ˜„์ด ์–ด๋ ต๋‹ค.

5. ์‚ฌ๋žŒ ์† ๋ชจ๋ธ์ด universalํ•˜์ง€ ์•Š๋‹ค

DART์˜ ์† ๋ชจ๋ธ์€ ํ‘œ์ค€ ํฌ๊ธฐ์— ์บ˜๋ฆฌ๋ธŒ๋ ˆ์ด์…˜๋œ๋‹ค. ์†์ด ๋งค์šฐ ํฌ๊ฑฐ๋‚˜ ์ž‘์€ ์‚ฌ์šฉ์ž๋Š” ์Šค์ผ€์ผ ์ธ์ž s๋กœ ๋ณด์ •ํ•˜์ง€๋งŒ, ์†๊ฐ€๋ฝ ๋น„์œจ์ด ๋‹ค๋ฅด๋ฉด retargeting ํ’ˆ์งˆ์ด ๋–จ์–ด์ง„๋‹ค. ํ›„์† ์—ฐ๊ตฌ์—์„œ MANO ๊ฐ™์€ ํŒŒ๋ผ๋ฏธํ„ฐ ๋ชจ๋ธ์ด ๋ณดํŽธํ™”๋œ ์ด์œ ๋‹ค.

6. ํ•™์Šต ๊ฐ€๋Šฅํ•œ retargeting์˜ ๋ถ€์žฌ

DexPilot์˜ retargeting์€ ์ตœ์ ํ™” ๊ธฐ๋ฐ˜์ด๊ณ , ๋น„์šฉ ํ•จ์ˆ˜์˜ ๊ฐ€์ค‘์น˜ w_i, \gamma, d_{\text{thresh}}๋Š” ์†์œผ๋กœ ํŠœ๋‹๋œ๋‹ค. ๋” ์ผ๋ฐ˜์ ์ธ ์†/๋กœ๋ด‡ ์กฐํ•ฉ์œผ๋กœ ํ™•์žฅํ•˜๋ ค๋ฉด ์ด ํŠœ๋‹์„ ์ž๋™ํ™”ํ•˜๊ฑฐ๋‚˜ ์‹ ๊ฒฝ๋ง์œผ๋กœ ๋Œ€์ฒดํ•ด์•ผ ํ•œ๋‹ค. ์ด๊ฒŒ ์ตœ๊ทผ์˜ learned retargeting ํ๋ฆ„์ด๋‹ค.

๊ด€๋ จ ์—ฐ๊ตฌ ์ง€ํ˜• ์†์—์„œ

์ง์ „ ์—ฐ๊ตฌ์™€์˜ ์ฐจ๋ณ„์ 

  • ๊ธ€๋Ÿฌ๋ธŒ ๊ธฐ๋ฐ˜: CyberGlove + Polhemus. ์ •ํ™•ํ•˜์ง€๋งŒ ๋น„์‹ธ๊ณ  ์˜๋ฅ˜์„ฑ ๋ฌธ์ œ.
  • ๋งˆ์ปค ๊ธฐ๋ฐ˜ ๋ชจ์…˜์บก์ณ: OptiTrack + ์†๊ฐ€๋ฝ ๋งˆ์ปค. ์ •ํ™•ํ•˜์ง€๋งŒ ๋งˆ์ปค ์ถ”์ ์ด ์†๊ฐ€๋ฝ ๊ฐ€๋ฆผ์— ์•ฝํ•˜๋‹ค.
  • ๋‹จ์ผ ์นด๋ฉ”๋ผ RGB ๊ธฐ๋ฐ˜: MediaPipe ๋ฅ˜. ๋น ๋ฅด์ง€๋งŒ ๊นŠ์ด๊ฐ€ ์•ฝํ•ด 3D ์ž์„ธ ์ถ”์ •์ด ํ”๋“ค๋ฆฐ๋‹ค.

DexPilot์€ ๋ฉ€ํ‹ฐ๋ทฐ ๊นŠ์ด + ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ์ถ”์  + task-space ๋ฆฌํƒ€๊ฒŒํŒ…์˜ ์กฐํ•ฉ์œผ๋กœ ์ด ์…‹์˜ ๋‹จ์ ์„ ๋™์‹œ์— ํ”ผํ•œ๋‹ค.

DexPilot ์ดํ›„์˜ ํ๋ฆ„

DexPilot์˜ retargeting ๊ณต์‹์€ ํ›„์† ์—ฐ๊ตฌ์— ๊นŠ๊ฒŒ ์Šค๋ฉฐ๋“ค์—ˆ๋‹ค.

  • AnyTeleop (Qin et al., 2023): ๋‹ค์–‘ํ•œ ๋กœ๋ด‡ ์†/ํŒ” ์กฐํ•ฉ์„ ์ง€์›ํ•˜๋Š” ์ผ๋ฐ˜ํ™”๋œ ์‹œ์Šคํ…œ. DexPilot์˜ retargeting์„ genericํ•˜๊ฒŒ ์žฌํฌ์žฅ.
  • GELLO (Wu et al., 2024): VR ์ปจํŠธ๋กค๋Ÿฌ ๋Œ€๋น„ ๋” ์ง๊ด€์ ์ธ ์ €๋น„์šฉ ํ•˜๋“œ์›จ์–ด ํ…”๋ ˆ์˜คํผ๋ ˆ์ด์…˜.
  • Bunny-VisionPro (2024): Vision Pro ํ—ค๋“œ์…‹์„ ์ด์šฉํ•œ ์–‘์† ์ •๋ฐ€ ํ…”๋ ˆ์˜คํผ๋ ˆ์ด์…˜.
  • TeleOpBench: ํ…”๋ ˆ์˜คํผ๋ ˆ์ด์…˜ ๋ฒค์น˜๋งˆํฌ โ€” DexPilot์ด baseline ์ค‘ ํ•˜๋‚˜๋กœ ๋“ค์–ด๊ฐ„๋‹ค.
  • dex-retargeting ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ: ์˜คํ”ˆ์†Œ์Šค์—์„œ โ€œDexPilot retargetingโ€์ด ํ‘œ์ค€ ์˜ต์…˜ ์ค‘ ํ•˜๋‚˜.

๋ฐ์ดํ„ฐ ์ˆ˜์ง‘ ์ธก๋ฉด์—์„œ๋„ DexPilot์€ ์ดํ›„ RoboTurk, DexCap, OpenTeach ๊ฐ™์€ ์‹œ์—ฐ ์ˆ˜์ง‘ ํŒŒ์ดํ”„๋ผ์ธ์˜ ์ง€์  ์กฐ์ƒ ์—ญํ• ์„ ํ•œ๋‹ค.

๋‹ค์‹œ ์‚ดํŽด๋ณด๋Š” ํ•ต์‹ฌ ํ†ต์ฐฐ

์ด ๋…ผ๋ฌธ์—์„œ ์–ป์„ ์ˆ˜ ์žˆ๋Š” ์ผ๋ฐ˜์  ๊ตํ›ˆ์„ ์„ธ ๊ฐ€์ง€๋กœ ์••์ถ•ํ•˜๋ฉด:

1. ๋น„์šฉ ํ•จ์ˆ˜ ์„ค๊ณ„๊ฐ€ ๊ณง ๋ฌธ์ œ ์ •์˜๋‹ค

DexPilot์˜ retargeting์€ โ€œ๊ด€์ ˆ์„ ์–ด๋–ป๊ฒŒ ๋งคํ•‘ํ•  ๊ฒƒ์ธ๊ฐ€โ€๊ฐ€ ์•„๋‹ˆ๋ผ โ€œ์–ด๋–ค ๊ธฐํ•˜ํ•™์„ ๋ณด์กดํ•  ๊ฒƒ์ธ๊ฐ€โ€๋ฅผ ๋น„์šฉ ํ•จ์ˆ˜๋กœ ์„ ์–ธํ•œ๋‹ค. Task-space ๋ฒกํ„ฐ์˜ ์„ ํƒ, ์ •์น™ํ™”์˜ ๋ฐฉํ–ฅ, projection์˜ ์ž„๊ณ„๊ฐ’ โ€” ๋ชจ๋‘ ๊ทธ ์„ ์–ธ์˜ ์ผ๋ถ€๋‹ค. ๋น„์šฉ ํ•จ์ˆ˜๋ฅผ ํ•œ ์ค„ ๋ฐ”๊พธ๋ฉด retargeting์˜ ์„ฑ๊ฒฉ์ด ๋ฐ”๋€๋‹ค. ์ด๊ฑด ๊ฑฐ์˜ ๋ชจ๋“  manipulation ๋ฌธ์ œ์— ๋˜‘๊ฐ™์ด ์ ์šฉ๋œ๋‹ค.

2. ๋ชจ๋ธ ํ”„๋ฆฌ์™€ ๋ชจ๋ธ ๊ธฐ๋ฐ˜์€ ์ ์ด ์•„๋‹ˆ๋‹ค

์‹ ๊ฒฝ๋ง์˜ ๊ฐ•๊ฑดํ•จ๊ณผ ์ตœ์ ํ™”์˜ ์ •๋ฐ€ํ•จ์„ ์ง๋ ฌ๋กœ ์—ฐ๊ฒฐํ•œ ๊ตฌ์กฐ๋Š” ๋‹จ์ˆœํ•˜์ง€๋งŒ ๊ฐ•๋ ฅํ•˜๋‹ค. Stage 1์—์„œ ์™ธ์ณ์ฃผ๊ณ  Stage 2์—์„œ ๋‹ค๋“ฌ๋Š” ํŒจํ„ด์€ hand pose estimation, object pose tracking, articulated tracking ์–ด๋””์—๋‚˜ ์ ์šฉ๋œ๋‹ค. โ€œ๋‘˜ ์ค‘ ๋ญ˜ ์“ฐ์ง€โ€๋ผ๋Š” ์งˆ๋ฌธ์€ ๋ณดํ†ต ์ž˜๋ชป๋œ ์งˆ๋ฌธ์ด๋‹ค.

3. ์‹œ์—ฐ ์ˆ˜์ง‘์€ ์ถ”์ •๋ณด๋‹ค ๋น„์‹ผ ์ž์›์ด๋‹ค

์•Œ๊ณ ๋ฆฌ์ฆ˜์€ ๋…ผ๋ฌธ ํ•œ ํŽธ์ด์ง€๋งŒ, 23 DoA ร— 30 Hz ร— ์ˆ˜์‹ญ ์‹œ๊ฐ„์˜ ๋™๊ธฐํ™”๋œ ์‹œ์—ฐ ๋ฐ์ดํ„ฐ๋Š” ํ›„์† ์—ฐ๊ตฌ์˜ ๋ชจ๋“  imitation learning ์ •์ฑ…์„ ๋งŒ๋“ค ์ˆ˜ ์žˆ๋Š” ์ธํ”„๋ผ๋‹ค. ํ…”๋ ˆ์˜คํผ๋ ˆ์ด์…˜ ์‹œ์Šคํ…œ์˜ ๊ฐ€์น˜๋Š” ๊ทธ ์‹œ์Šคํ…œ์˜ ์šฐ์•„ํ•จ์ด ์•„๋‹ˆ๋ผ ๊ทธ๊ฒƒ์ด ๋งŒ๋“ค์–ด ๋‚ผ ๋ฐ์ดํ„ฐ์˜ ํฌ๊ธฐ์™€ ํ’ˆ์งˆ๋กœ ํ‰๊ฐ€ํ•ด์•ผ ํ•œ๋‹ค.

๊ฒฐ๋ก 

DexPilot์€ ์นด๋ฉ”๋ผ ๋„ค ๋Œ€์™€ GPU ๋‘ ์žฅ์œผ๋กœ 23 DoA ์†-ํŒ” ์‹œ์Šคํ…œ์„ ์ง๊ด€์ ์œผ๋กœ ์กฐ์ข…ํ•  ์ˆ˜ ์žˆ๋‹ค๋Š” ๊ฒƒ์„ ๋ณด์—ฌ์คฌ๋‹ค. ๊ทธ ๊ธธ์— ๋†“์ธ ๊ธฐ์ˆ ์  ๋””ํ…Œ์ผ โ€” ๋ฉ€ํ‹ฐ๋ทฐ ๊นŠ์ด ํ†ตํ•ฉ, PointNet++/DART ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ์ถ”์ , task-space ๋ฒกํ„ฐ ๊ธฐ๋ฐ˜ retargeting, projection scheme, RMP ํŒ” ์ œ์–ด โ€” ์€ ๊ฐ๊ฐ ๋ณ„๊ฐœ์˜ ์—ฐ๊ตฌ ์ฃผ์ œ๋กœ ๋ฐœ์ „ํ•  ๊ฐ€์น˜๊ฐ€ ์žˆ๋‹ค.

์ด ๋…ผ๋ฌธ์ด ์ง€๊ธˆ๋„ ์ธ์šฉ๋˜๋Š” ์ด์œ ๋Š” ์–ด๋–ค trick ๋•Œ๋ฌธ์ด ์•„๋‹ˆ๋ผ ์–ด๋–ค ์‚ฌ๊ณ ๋ฐฉ์‹ ๋•Œ๋ฌธ์ด๋‹ค: ๋ฌธ์ œ๋ฅผ ํ’€ ๋•Œ ๋ฌด์—‡์„ ๋ณด์กดํ•ด์•ผ ํ•˜๋Š”์ง€๋ฅผ ๋จผ์ € ๋ฌป๊ณ , ๊ทธ๊ฒƒ์„ ๋น„์šฉ ํ•จ์ˆ˜๋กœ ์˜ฎ๊ธฐ๊ณ , ์ •๋ฐ€๋„๊ฐ€ ๋น„๋Œ€์นญ์ ์œผ๋กœ ์ค‘์š”ํ•œ ์˜์—ญ(์ •๋ฐ€ ๊ทธ๋ฆฝ)์—๋Š” ๋น„๋Œ€์นญ์  ์ฒ˜๋ฆฌ๋ฅผ ๋ช…์‹œ์ ์œผ๋กœ ๋„ฃ๋Š”๋‹ค. Robotics ์—ฐ๊ตฌ์—์„œ ์ด ์‚ฌ๊ณ ๋ฐฉ์‹์€ ์‹œ๋ฎฌ๋ ˆ์ด์…˜-์‹ค์„ธ๊ณ„ ๊ฐญ, sim2real, ๋„๋ฉ”์ธ ์ ์‘ ์–ด๋””์—์„œ๋‚˜ ๊ฐ™์€ ๋ชจ์–‘์œผ๋กœ ๋‚˜ํƒ€๋‚œ๋‹ค.

Allegro ์†์œผ๋กœ dexterous manipulation์„ ์—ฐ๊ตฌํ•œ๋‹ค๋ฉด, DexPilot์€ ๋ฐ˜๋“œ์‹œ ์ฝ๊ณ  ๋ฐ˜๋“œ์‹œ ํ•œ ๋ฒˆ ๊ตฌํ˜„ํ•ด ๋ด์•ผ ํ•˜๋Š” ๋…ผ๋ฌธ์ด๋‹ค. ๋น„์šฉ ํ•จ์ˆ˜์˜ ๊ฐ€์ค‘์น˜๋ฅผ ์ง์ ‘ ๋งŒ์ ธ ๋ณด๊ณ , projection ์ž„๊ณ„๊ฐ’์„ ๋ฐ”๊ฟ” ๋ณด๊ณ , distal=medial ์ œ์•ฝ์„ ํ’€์–ด ๋ณด๋ฉด, ํ…”๋ ˆ์˜คํผ๋ ˆ์ด์…˜์˜ ์–ด๋””๊ฐ€ ๋ถ€๋“œ๋Ÿฝ๊ณ  ์–ด๋””๊ฐ€ ๋นก๋นกํ•œ์ง€๊ฐ€ ์†๋์œผ๋กœ ๋А๊ปด์ง„๋‹ค. ๊ทธ ๊ฐ๊ฐ์ด ๋‹ค์Œ ์‹œ์Šคํ…œ์„ ์„ค๊ณ„ํ•  ๋•Œ์˜ ์ถœ๋ฐœ์ ์ด ๋œ๋‹ค.

์ฐธ๊ณ 

  • Handa, A., Van Wyk, K., Yang, W., Liang, J., Chao, Y.-W., Wan, Q., Birchfield, S., Ratliff, N. D., Fox, D. DexPilot: Vision-Based Teleoperation of Dexterous Robotic Hand-Arm System. ICRA 2020. arXiv:1910.03135
  • ํ”„๋กœ์ ํŠธ ์˜์ƒ: sites.google.com/view/dex-pilot
  • ๊ด€๋ จ ์ฝ”๋“œ: dexsuite/dex-retargeting
  • ํ›„์† ์—ฐ๊ตฌ ํ๋ฆ„: AnyTeleop, GELLO, Bunny-VisionPro, TeleOpBench

Copyright 2026, JungYeon Lee