Curieux.JY
  • JungYeon Lee
  • Post
  • Lecture
  • Note

On this page

  • ๐Ÿ” Ping Review
  • ๐Ÿ”” Ring Review
    • ์„œ๋ก 
      • ์™œ โ€œ์ด‰๊ฐ ์„ผ์„œ ์‹œ๋ฎฌ๋ ˆ์ด์…˜โ€์ด ์ด๋ ‡๊ฒŒ ์–ด๋ ค์šด๊ฐ€
      • ํ•ต์‹ฌ ์•„์ด๋””์–ด ํ•œ ์ค„ ์š”์•ฝ
    • ๋ฐฉ๋ฒ•
      • 1๋‹จ๊ณ„: FEM์œผ๋กœ ์ •๋‹ต ๋ฐ์ดํ„ฐ ๋งŒ๋“ค๊ธฐ
      • 2๋‹จ๊ณ„: ๋ฉ”์‹œ๋ฅผ ๊ทธ๋ž˜ํ”„๋กœ ๋ฐ”๊พธ๊ธฐ โ€” ์™œ GNN์ธ๊ฐ€
      • 3๋‹จ๊ณ„: GNN ๋ชจ๋ธ โ€” Encode-Process-Decode
      • ์ž…๋ ฅ ๊ตฌ์„ฑ์— ๋Œ€ํ•œ ์ ˆ์ œ(ablation): force-only vs. translation
    • ์‹คํ—˜
      • ์„ค์ •
      • ๊ฒฐ๊ณผ 1: ๋‹จ์ผ ๋ฌผ์ฒด โ€” ๋ณด์ง€ ๋ชปํ•œ ์ž์„ธ๋กœ ์ผ๋ฐ˜ํ™”
      • ๊ฒฐ๊ณผ 2: ๋‹ค์ค‘ ๋ฌผ์ฒด โ€” ํ˜•์ƒ ๊ฐ„ ์ผ๋ฐ˜ํ™”
      • ์†๋„
    • ๋น„ํŒ์  ๊ณ ์ฐฐ
      • ๊ฐ•์ 
      • ์•ฝ์ ยทํ•œ๊ณ„
    • ์š”์•ฝ ๋ฐ ๊ฒฐ๋ก 
      • ๊ด€๋ จ ์—ฐ๊ตฌ์™€์˜ ์œ„์น˜

๐Ÿ“ƒDeformable Tactile Simulation with GNN

tactile
simulation
gnn
Real-Time Simulation of Deformable Tactile Sensors and Objects in Robotic Grasping using Graph Neural Networks with Inductive Biases
Published

May 15, 2026

  • Paper Link
  • Poster Link

๐Ÿ” Ping Review

๐Ÿ” Ping โ€” A light tap on the surface. Get the gist in seconds.


๐Ÿ”” Ring Review

๐Ÿ”” Ring โ€” An idea that echoes. Grasp the core and its value.

์„œ๋ก 

์™œ โ€œ์ด‰๊ฐ ์„ผ์„œ ์‹œ๋ฎฌ๋ ˆ์ด์…˜โ€์ด ์ด๋ ‡๊ฒŒ ์–ด๋ ค์šด๊ฐ€

๋กœ๋ด‡์ด ๋ฌผ๊ฑด์„ ์ง‘์„ ๋•Œ, ์†๊ฐ€๋ฝ ๋์—์„œ ๋ฌด์Šจ ์ผ์ด ๋ฒŒ์–ด์ง€๋Š”์ง€ ์šฐ๋ฆฌ๋Š” ์‚ฌ์‹ค ๊ฑฐ์˜ ๋ชจ๋ฆ…๋‹ˆ๋‹ค. ์‚ฌ๋žŒ์€ ์‚ฌ๊ณผ๋ฅผ ์ฅ๋ฉด ์†๋์˜ ์‚ด์ด ๋ˆŒ๋ฆฌ๊ณ , ๊ทธ ๋ˆŒ๋ฆผ์˜ ๋ถ„ํฌ๋กœ๋ถ€ํ„ฐ โ€œ์ด ์ •๋„๋ฉด ์•ˆ ๋–จ์–ด๋œจ๋ฆฌ๊ฒ ๋‹คโ€, โ€œ๋” ์„ธ๊ฒŒ ์ฅ๋ฉด ๋ฉ๋“ค๊ฒ ๋‹คโ€๋ฅผ ์ง๊ฐ์ ์œผ๋กœ ์••๋‹ˆ๋‹ค. GelSight ๊ฐ™์€ ์‹œ๊ฐ ๊ธฐ๋ฐ˜ ์ด‰๊ฐ ์„ผ์„œ(visual tactile sensor)๋Š” ์ด ์ง๊ด€์„ ๋กœ๋ด‡์—๊ฒŒ ์ฃผ๋ ค๋Š” ์žฅ์น˜์ž…๋‹ˆ๋‹ค. ๋ถ€๋“œ๋Ÿฌ์šด ์ ค(gel) ํ‘œ๋ฉด์ด ๋ฌผ์ฒด์— ๋ˆŒ๋ฆฌ๋ฉด ๊ทธ ๋ณ€ํ˜•์„ ๋‚ด๋ถ€ ์นด๋ฉ”๋ผ๋กœ ์ฐ์–ด โ€œ์ด‰๊ฐ ์ด๋ฏธ์ง€โ€๋กœ ๋งŒ๋“ญ๋‹ˆ๋‹ค.

๋ฌธ์ œ๋Š” ์ด ์„ผ์„œ๋ฅผ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ํ•˜๋Š” ๊ฒƒ์ด ๋งค์šฐ ๋น„์‹ธ๋‹ค๋Š” ์ ์ž…๋‹ˆ๋‹ค. ์ด ๋…ผ๋ฌธ์€ ์ด‰๊ฐ ์‹œ๋ฎฌ๋ ˆ์ด์…˜์„ ๋‘ ๋ถ€๋ถ„์œผ๋กœ ๋‚˜๋ˆ•๋‹ˆ๋‹ค.

  1. ๋ฌผ๋ฆฌ(physics): ์ ค๊ณผ ๋ฌผ์ฒด๊ฐ€ ์ ‘์ด‰ํ•˜๋ฉด์„œ ์‹ค์ œ๋กœ ์–ด๋–ป๊ฒŒ ๋ณ€ํ˜•๋˜๊ณ , ์–ด๋””์— ์–ผ๋งˆ๋‚˜ ์‘๋ ฅ(stress)์ด ๊ฑธ๋ฆฌ๋Š”๊ฐ€.
  2. ๋ Œ๋”๋ง(rendering): ๊ทธ ๋ณ€ํ˜•์œผ๋กœ๋ถ€ํ„ฐ ์นด๋ฉ”๋ผ๊ฐ€ ๋ณด๋Š” ์ด๋ฏธ์ง€๋ฅผ ๋งŒ๋“ค์–ด๋‚ด๋Š”๊ฐ€.

์ด ๋…ผ๋ฌธ์€ ์˜ค์ง ๋ฌผ๋ฆฌ ๋ถ€๋ถ„๋งŒ ๋‹ค๋ฃน๋‹ˆ๋‹ค(๋ Œ๋”๋ง์€ ๋ฒ”์œ„ ๋ฐ–). ๊ทธ๋ฆฌ๊ณ  ๋ฌผ๋ฆฌ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ์ง„์˜์€ ํฌ๊ฒŒ ๋‘˜๋กœ ๋‚˜๋‰ฉ๋‹ˆ๋‹ค.

  • ๊ฐ•์ฒด(rigid-body) ์‹œ๋ฎฌ๋ ˆ์ด์…˜: ๋น ๋ฆ…๋‹ˆ๋‹ค. ๊ฐ•ํ™”ํ•™์Šต์ฒ˜๋Ÿผ ๋Œ€๋Ÿ‰์˜ ๋ฐ์ดํ„ฐ๋ฅผ ๋ฝ‘์•„์•ผ ํ•  ๋•Œ ์œ ๋ฆฌํ•ฉ๋‹ˆ๋‹ค. ํ•˜์ง€๋งŒ ์ ค์ด ๋ˆŒ๋ฆฌ๋Š” ์ „๋‹จ๋ ฅ(shear)์ด๋‚˜ ๋ฏธ์„ธ ๋ณ€ํ˜•์„ ์ œ๋Œ€๋กœ ๋ชป ๋‹ด์Šต๋‹ˆ๋‹ค. ์ฆ‰ โ€œํ˜„์‹ค๊ฐโ€์ด ๋–จ์–ด์ง‘๋‹ˆ๋‹ค.
  • ์—ฐ์ฒด(soft-body) ์‹œ๋ฎฌ๋ ˆ์ด์…˜: ๋ณดํ†ต ์œ ํ•œ์š”์†Œ๋ฒ•(FEM, Finite Element Method)์œผ๋กœ ํ’‰๋‹ˆ๋‹ค. ์ ค๊ณผ ๋ฌผ์ฒด๋ฅผ ์ž‘์€ ์‚ฌ๋ฉด์ฒด(tetrahedron) ์š”์†Œ๋กœ ์ž˜๊ฒŒ ์ชผ๊ฐœ๊ณ , ๊ฐ ์š”์†Œ๊ฐ€ ๋ฐ›๋Š” ํž˜๊ณผ ๋ณ€ํ˜•์„ ๋ฌผ๋ฆฌ ๋ฐฉ์ •์‹์œผ๋กœ ์ •๋ฐ€ํ•˜๊ฒŒ ๊ณ„์‚ฐํ•ฉ๋‹ˆ๋‹ค. ํ˜„์‹ค๊ฐ์€ ๋›ฐ์–ด๋‚˜์ง€๋งŒ, ๊ฐ•์ฒด ๋Œ€๋น„ ์ˆ˜๋ฐฑ~์ˆ˜์ฒœ ๋ฐฐ ๋А๋ฆฝ๋‹ˆ๋‹ค. ๊ทธ๋ž˜์„œ ์‹ค์‹œ๊ฐ„ ์ œ์–ด๋‚˜ ๋Œ€๊ทœ๋ชจ ๋ฐ์ดํ„ฐ ์ƒ์„ฑ์—๋Š” ์‚ฌ์‹ค์ƒ ๋ชป ์”๋‹ˆ๋‹ค.

๋น„์œ ํ•˜์ž๋ฉด, FEM์€ โ€œ์ •๋‹ต์„ ์•Œ์ง€๋งŒ ์‹œํ—˜ ์‹œ๊ฐ„์ด ๋๋‚˜๋„๋ก ํ•œ ๋ฌธ์ œ๋„ ๋‹ค ๋ชป ํ‘ธ๋Š” ์šฐ๋“ฑ์ƒโ€์ด๊ณ , ๊ฐ•์ฒด ์‹œ๋ฎฌ๋ ˆ์ด์…˜์€ โ€œ๋Œ€์ถฉ ์ฐ์–ด์„œ ๋นจ๋ฆฌ ๋‚ด์ง€๋งŒ ์ ˆ๋ฐ˜์€ ํ‹€๋ฆฌ๋Š” ํ•™์ƒโ€์ž…๋‹ˆ๋‹ค. ์ด ๋…ผ๋ฌธ์˜ ๋ชฉํ‘œ๋Š” FEM์˜ ์ •๋‹ต์„ ํ‰๋‚ด ๋‚ด๋˜ ๊ฐ•์ฒด๋งŒํผ ๋น ๋ฅธ ์ œ3์˜ ํ•™์ƒ์„ ๋งŒ๋“œ๋Š” ๊ฒƒ์ž…๋‹ˆ๋‹ค.

ํ•ต์‹ฌ ์•„์ด๋””์–ด ํ•œ ์ค„ ์š”์•ฝ

FEM์œผ๋กœ ์ •๋‹ต ๋ฐ์ดํ„ฐ๋ฅผ ๋งŒ๋“ค์–ด ๋‘๊ณ , ๊ทธ๋ž˜ํ”„ ์‹ ๊ฒฝ๋ง(GNN)์—๊ฒŒ โ€œFEM์ด๋ผ๋ฉด ์–ด๋–ป๊ฒŒ ๋ณ€ํ˜•ยท์‘๋ ฅ์„ ์ค„์ง€โ€๋ฅผ ํ•™์Šต์‹œํ‚จ๋‹ค. ๊ฒฐ๊ณผ์ ์œผ๋กœ FEM ๋Œ€๋น„ 10^2 \sim 10^3๋ฐฐ ๋น ๋ฅด๋ฉด์„œ๋„ ๋ณ€ํ˜•๊ณผ ์‘๋ ฅ์„ ๋™์‹œ์— ์˜ˆ์ธกํ•œ๋‹ค.

์ด ๋…ผ๋ฌธ์˜ ๊ธฐ์—ฌ๋Š” ๋‹ค์Œ๊ณผ ๊ฐ™์Šต๋‹ˆ๋‹ค.

  • ์‹œ๊ฐ ์ด‰๊ฐ ์„ผ์„œ ์‹œ๋ฎฌ๋ ˆ์ด์…˜์— GNN ๊ธฐ๋ฐ˜ ๋ฌผ๋ฆฌ ํ•™์Šต์„ ์ฒ˜์Œ์œผ๋กœ ์ ์šฉํ•œ ์—ฐ๊ตฌ์ž…๋‹ˆ๋‹ค. (๊ธฐ์กด GNN ๋ฌผ๋ฆฌ ํ•™์Šต์€ ๋ณ€ํ˜• ๋ฌผ์ฒด ์ž์ฒด์— ์ง‘์ค‘ํ–ˆ์ง€, ์„ผ์„œ๋ฅผ ๋‹ค๋ฃจ์ง€๋Š” ์•Š์•˜์Šต๋‹ˆ๋‹ค.)
  • FEM ๋Œ€๋น„ 10^2~10^3๋ฐฐ ๊ฐ€์†ํ•˜๋ฉด์„œ, ํ•™์Šต ๋•Œ ๋ณด์ง€ ๋ชปํ•œ ๊ทธ๋ฆฝ ์ž์„ธ(unseen grasping pose)๋กœ๋„ ์ผ๋ฐ˜ํ™”ํ•˜๋Š” ์˜ˆ์ธก ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.

์ €์ž๋Š” Centrale Lyon(LIRIS)๊ณผ TU Darmstadt(IAS Lab) ์—ฐ๊ตฌ์ง„(Guillaume Duret, Danylo Mazurak, Frederik Heller, Florence Zara, Jan Peters, Liming Chen)์ด๋ฉฐ, ์ฝ”๋“œ๋Š” tacgraspnets.github.io์— ๊ณต๊ฐœ๋˜์–ด ์žˆ์Šต๋‹ˆ๋‹ค. ๋ณธ ์›Œํฌ์ˆ ๋…ผ๋ฌธ์€ ๋™์ผ ๊ทธ๋ฃน์˜ ์„ ํ–‰ ์—ฐ๊ตฌ ๋‘ ํŽธ(๋ฐ์ดํ„ฐ ์ƒ์„ฑ ํŒŒ์ดํ”„๋ผ์ธ TacGraspSim, ๊ทธ๋ฆฌ๊ณ  โ€œInductive Biasesโ€ GNN ๊ฐœ์„  ์—ฐ๊ตฌ)์„ ์ด‰๊ฐ ์„ผ์„œ ์‹œ๋ฎฌ๋ ˆ์ด์…˜์œผ๋กœ ํ™•์žฅํ•œ ์ž‘์—…์ž…๋‹ˆ๋‹ค.

๋ฐฉ๋ฒ•

์ „์ฒด ๊ทธ๋ฆผ์€ โ€œ๋ฐ์ดํ„ฐ ๋งŒ๋“ค๊ธฐ โ†’ ๊ทธ๋ž˜ํ”„๋กœ ๋ฐ”๊พธ๊ธฐ โ†’ GNN์œผ๋กœ ์˜ˆ์ธกํ•˜๊ธฐโ€์˜ 3๋‹จ์ž…๋‹ˆ๋‹ค. ๋จผ์ € ํฐ ๊ทธ๋ฆผ์„ ๋ณด๊ฒ ์Šต๋‹ˆ๋‹ค.

flowchart TD
    subgraph DataGen["A. Dataset Generation (FEM in Isaac Gym)"]
        O[Load object] --> G[Load gripper at grasp pose]
        G --> C[Close gripper until contact]
        C --> F[Ramp grasp force to threshold N]
        F --> R[Record 50 frames per run:<br/>node deformation, rigid poses,<br/>stress, finger translation]
    end

    R --> GR

    subgraph Graph["B. Graph Construction"]
        GR[Nodes: gripper tetra vertices + object vertices] --> ME[Mesh edges:<br/>neighbor nodes within a body]
        GR --> CE[Contact edges:<br/>object node to gripper node]
    end

    ME --> GNN
    CE --> GNN

    subgraph Model["C. GNN: Encode-Process-Decode"]
        GNN[Encode node/edge features] --> MP[Multiple message-passing rounds]
        MP --> DEC[Decode]
        DEC --> OUT1[Deformation field]
        DEC --> OUT2[Stress field]
    end

1๋‹จ๊ณ„: FEM์œผ๋กœ ์ •๋‹ต ๋ฐ์ดํ„ฐ ๋งŒ๋“ค๊ธฐ

GNN์€ ์ง€๋„ํ•™์Šต ๋ชจ๋ธ์ด๋ฏ€๋กœ โ€œ์ •๋‹ตโ€์ด ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค. ๊ทธ ์ •๋‹ต์„ FEM์ด ๋งŒ๋“ค์–ด ์ค๋‹ˆ๋‹ค.

  • ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ: NVIDIA Isaac Gym(GPU ๊ธฐ๋ฐ˜ ๊ณ ์„ฑ๋Šฅ ๋ฌผ๋ฆฌ ์—”์ง„) ์œ„์—์„œ FEM์„ ๋Œ๋ฆฝ๋‹ˆ๋‹ค.
  • ๊ธฐ๋ฐ˜: DefGraspSim(3D ๋ณ€ํ˜• ๋ฌผ์ฒด์˜ ๊ทธ๋ฆฝ ๊ฒฐ๊ณผ๋ฅผ FEM์œผ๋กœ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ํ•˜๋Š” ์„ ํ–‰ ์—ฐ๊ตฌ)์„ ๊ฐ€์ ธ์˜ค๋˜, ์ •์  ๊ทธ๋ฆฝ์ด ์•„๋‹ˆ๋ผ ๋™์  ๊ทธ๋ฆฝ(dynamic grasping) ์‹œ๋‚˜๋ฆฌ์˜ค๋กœ ํ™•์žฅํ•œ ์ž์ฒด ํŒŒ์ดํ”„๋ผ์ธ TacGraspSim์„ ์”๋‹ˆ๋‹ค.
  • ํ•˜๋“œ์›จ์–ด ์„ค์ •: ํ‰ํ–‰ ๊ทธ๋ฆฌํผ(parallel gripper)์— GelSight Mini ์ด‰๊ฐ ์„ผ์„œ๋ฅผ URDF๋กœ ๋ชจ๋ธ๋งํ•ด์„œ ๋ถ™์˜€์Šต๋‹ˆ๋‹ค.

๋ฐ์ดํ„ฐ ํ•œ ๋ฒˆ ๋ฝ‘๋Š” ์ ˆ์ฐจ๋Š” ์ด๋ ‡์Šต๋‹ˆ๋‹ค.

  1. ๋ฌผ์ฒด๋ฅผ ๋ถˆ๋Ÿฌ์˜จ๋‹ค.
  2. ๊ทธ๋ฆฌํผ๋ฅผ ๊ทธ๋ฆฝ ์œ„์น˜์— ๋†“๋Š”๋‹ค.
  3. ์ ‘์ด‰ํ•  ๋•Œ๊นŒ์ง€ ๊ทธ๋ฆฌํผ๋ฅผ ๋‹ซ๋Š”๋‹ค.
  4. ๊ทธ๋ฆฝ ํž˜์„ ์ž„๊ณ„๊ฐ’ N๊นŒ์ง€ ์„œ์„œํžˆ ํ‚ค์šฐ๋ฉด์„œ ๋ฐ์ดํ„ฐ๋ฅผ ๊ธฐ๋กํ•œ๋‹ค.

ํ•œ ๋ฒˆ์˜ ์‹คํ–‰(run)๋งˆ๋‹ค 50ํ”„๋ ˆ์ž„์„ ์ €์žฅํ•˜๋ฉฐ, ๊ฐ ํ”„๋ ˆ์ž„์—์„œ (i) ๋…ธ๋“œ๋ณ„ ๋ณ€ํ˜•(node-wise deformation), (ii) ๊ฐ•์ฒด ์ž์„ธ(rigid body pose), (iii) ์‘๋ ฅ ๋ถ„ํฌ(stress distribution), (iv) ๊ทธ๋ฆฌํผ ์†๊ฐ€๋ฝ ์ด๋™๋Ÿ‰(finger translation)์„ ๊ธฐ๋กํ•ฉ๋‹ˆ๋‹ค. ๋ฐ์ดํ„ฐ์…‹์€ 10์ข…์˜ ๋ฌผ์ฒด ร— ๋ฌผ์ฒด๋‹น 100๊ฐœ์˜ ๊ทธ๋ฆฝ ์ž์„ธ๋กœ ๊ตฌ์„ฑ๋ฉ๋‹ˆ๋‹ค(์‚ฌ๊ณผ, ๊ฐ์ž, ๋ ˆ๋ชฌ, ์˜ค์ด, ๋”ธ๊ธฐ ๋“ฑ ์‹๋ฃŒํ’ˆ๋ฅ˜ ํ˜•์ƒ).

2๋‹จ๊ณ„: ๋ฉ”์‹œ๋ฅผ ๊ทธ๋ž˜ํ”„๋กœ ๋ฐ”๊พธ๊ธฐ โ€” ์™œ GNN์ธ๊ฐ€

GNN์„ ์„ ํƒํ•œ ์ด์œ ๋Š” ์ง๊ด€์ ์ž…๋‹ˆ๋‹ค. FEM์˜ ๋ฉ”์‹œ ์ž์ฒด๊ฐ€ ์ด๋ฏธ ๊ทธ๋ž˜ํ”„์ด๊ธฐ ๋•Œ๋ฌธ์ž…๋‹ˆ๋‹ค.

  • FEM์€ ๋ฌผ์ฒด๋ฅผ ์‚ฌ๋ฉด์ฒด ์š”์†Œ๋“ค์˜ ๊ทธ๋ฌผ๋ง์œผ๋กœ ์ž˜๊ฒŒ ๋‚˜๋ˆ•๋‹ˆ๋‹ค. ๊ฐ ๊ผญ์ง“์ (vertex)์€ ๊ทธ๋ž˜ํ”„์˜ ๋…ธ๋“œ(node), ์ธ์ ‘ ๊ผญ์ง“์ ์„ ์ž‡๋Š” ๋ณ€์€ ๊ทธ๋ž˜ํ”„์˜ ์—ฃ์ง€(edge)๋กœ ์ž์—ฐ์Šค๋Ÿฝ๊ฒŒ ๋Œ€์‘๋ฉ๋‹ˆ๋‹ค.
  • ๋” ๊ฒฐ์ •์ ์œผ๋กœ, GNN์˜ ๋ฉ”์‹œ์ง€ ์ „๋‹ฌ(message passing)์€ FEM์—์„œ ํž˜์ด ์ด์›ƒ ์š”์†Œ๋กœ ์ „ํŒŒ๋˜๋Š” ๊ณผ์ •๊ณผ ๋‹ฎ์•˜์Šต๋‹ˆ๋‹ค. FEM์—์„œ ํ•œ ๊ณณ์„ ๋ˆ„๋ฅด๋ฉด ๊ทธ ํž˜์ด ์ธ์ ‘ ์š”์†Œ๋ฅผ ํ†ตํ•ด ์˜†์œผ๋กœ, ๋˜ ์˜†์œผ๋กœ ํผ์ ธ ๋‚˜๊ฐ‘๋‹ˆ๋‹ค. GNN์—์„œ ํ•œ ๋…ธ๋“œ์˜ ์ •๋ณด๊ฐ€ ๋ฉ”์‹œ์ง€๋ฅผ ํ†ตํ•ด ์ด์›ƒ ๋…ธ๋“œ๋กœ ํผ์ง€๋Š” ๊ฒƒ๊ณผ ๊ตฌ์กฐ์ ์œผ๋กœ ๋™์ผํ•ฉ๋‹ˆ๋‹ค. ์ด๊ฒƒ์ด ์ด ๋…ผ๋ฌธ์ด ๊ฐ•์กฐํ•˜๋Š” ๊ท€๋‚ฉ ํŽธํ–ฅ(inductive bias)์˜ ํ•ต์‹ฌ์ž…๋‹ˆ๋‹ค. ์ฆ‰ โ€œ๋ฌผ๋ฆฌ์ ์œผ๋กœ ์ •๋ณด๊ฐ€ ๊ตญ์†Œ์ ์œผ๋กœ ์ „ํŒŒ๋œ๋‹คโ€๋Š” ์‚ฌ์‹ค์„ ๋ชจ๋ธ ๊ตฌ์กฐ์— ์ฒ˜์Œ๋ถ€ํ„ฐ ๋ฐ•์•„ ๋„ฃ์€ ๊ฒƒ์ž…๋‹ˆ๋‹ค.

๊ทธ๋ž˜ํ”„ ๊ตฌ์„ฑ์€ ๋‹ค์Œ๊ณผ ๊ฐ™์Šต๋‹ˆ๋‹ค.

  • ๋…ธ๋“œ: ๊ทธ๋ฆฌํผ(์ ค์˜ ์‚ฌ๋ฉด์ฒด ๋ฉ”์‹œ ๊ผญ์ง“์ )์™€ ๋ฌผ์ฒด ํ˜•์ƒ์˜ ๋…ธ๋“œ๋ฅผ ๋ชจ๋‘ ํฌํ•จํ•ฉ๋‹ˆ๋‹ค.
  • ๋‘ ์ข…๋ฅ˜์˜ ์—ฃ์ง€:
    • ๋ฉ”์‹œ ์—ฃ์ง€(mesh edge): ํ•œ ๋ฌผ์ฒด ๋‚ด๋ถ€์˜ ์ด์›ƒ ๋…ธ๋“œ๋ฅผ ์—ฐ๊ฒฐํ•ฉ๋‹ˆ๋‹ค. โ†’ โ€œ์ด ๋ชธ์ฒด๋Š” ์–ด๋–ป๊ฒŒ ์ƒ๊ฒผ๋Š”๊ฐ€โ€๋ผ๋Š” ๊ธฐํ•˜ ์ •๋ณด๋ฅผ ๋‹ด์Œ.
    • ์ ‘์ด‰ ์—ฃ์ง€(contact edge): ๋ฌผ์ฒด ๋…ธ๋“œ์™€ ๊ทธ๋ฆฌํผ ๋…ธ๋“œ๋ฅผ ์—ฐ๊ฒฐํ•ฉ๋‹ˆ๋‹ค. โ†’ โ€œ์–ด๋””์„œ ๋‹ฟ์•˜๊ณ  ์–ผ๋งˆ๋‚˜ ํž˜์ด ๊ฑธ๋ฆฌ๋Š”๊ฐ€โ€๋ผ๋Š” ์ ‘์ด‰/ํž˜ ์ •๋ณด๋ฅผ ๋‹ด์Œ.

์ด๋ ‡๊ฒŒ ์—ฃ์ง€๋ฅผ ๋‘˜๋กœ ๋ถ„๋ฆฌํ•œ ๊ฒƒ ์—ญ์‹œ ๊ท€๋‚ฉ ํŽธํ–ฅ์ž…๋‹ˆ๋‹ค. โ€œ๋ฌผ์ฒด ๋‚ด๋ถ€์˜ ๋ณ€ํ˜• ์ „ํŒŒโ€์™€ โ€œ๋ฌผ์ฒด-๊ทธ๋ฆฌํผ ์‚ฌ์ด์˜ ์ ‘์ด‰๋ ฅโ€์€ ๋ณธ์งˆ์ด ๋‹ค๋ฅธ ์ƒํ˜ธ์ž‘์šฉ์ด๋ฏ€๋กœ ๋‹ค๋ฅธ ์—ฃ์ง€ ํƒ€์ž…์œผ๋กœ ๊ตฌ๋ถ„ํ•ด ์ฃผ๋ฉด ํ•™์Šต์ด ์‰ฌ์›Œ์ง‘๋‹ˆ๋‹ค.

๊ทธ๋ž˜ํ”„ ์š”์†Œ ๋ฌด์—‡์„ ๋‹ด๋Š”๊ฐ€ ๋ฌผ๋ฆฌ์  ์˜๋ฏธ
๋…ธ๋“œ(node) ๊ธฐํ•˜ ์ƒํƒœ(state), ํƒ€์ž…(type), ์šด๋™ ๋‹จ์„œ(motion cue) โ€œ์ด ์ ์€ ์ ค์ธ๊ฐ€ ๋ฌผ์ฒด์ธ๊ฐ€, ์ง€๊ธˆ ์–ด๋”” ์žˆ๊ณ  ์–ด๋””๋กœ ์›€์ง์ด๋‚˜โ€
๋ฉ”์‹œ ์—ฃ์ง€ ๊ธฐํ•˜ ์ •๋ณด(geometric) โ€œํ•œ ๋ชธ์ฒด ๋‚ด๋ถ€์˜ ์ด์›ƒ ์—ฐ๊ฒฐ = ํ˜•์ƒโ€
์ ‘์ด‰ ์—ฃ์ง€ ํž˜ ์‹ ํ˜ธ(force) โ€œ๋ฌผ์ฒด์™€ ๊ทธ๋ฆฌํผ๊ฐ€ ๋‹ฟ๋Š” ๊ณณ์˜ ์ƒํ˜ธ์ž‘์šฉโ€

3๋‹จ๊ณ„: GNN ๋ชจ๋ธ โ€” Encode-Process-Decode

๋ชจ๋ธ์€ ๋ฌผ๋ฆฌ ํ•™์Šต GNN์˜ ํ‘œ์ค€ ๊ณจ๊ฒฉ์ธ Encode-Process-Decode ๊ตฌ์กฐ๋ฅผ ๋”ฐ๋ฆ…๋‹ˆ๋‹ค.

  • Encode: ๊ฐ ๋…ธ๋“œ์™€ ์—ฃ์ง€์˜ ์›์‹œ ํŠน์ง•(feature)์„ ์ž ์žฌ ๋ฒกํ„ฐ(latent vector)๋กœ ์ธ์ฝ”๋”ฉํ•ฉ๋‹ˆ๋‹ค.
    • ๋…ธ๋“œ ํŠน์ง•: ๊ธฐํ•˜ ์ƒํƒœ + ๋…ธ๋“œ ํƒ€์ž… + ์šด๋™ ๋‹จ์„œ.
    • ์—ฃ์ง€ ํŠน์ง•: ๋ฉ”์‹œ ์—ฃ์ง€๋Š” ๊ธฐํ•˜ ์ •๋ณด, ์ ‘์ด‰ ์—ฃ์ง€๋Š” ํž˜ ์‹ ํ˜ธ.
  • Process: ์—ฌ๋Ÿฌ ๋ฒˆ์˜ ๋ฉ”์‹œ์ง€ ์ „๋‹ฌ ๋ผ์šด๋“œ(multiple message-passing rounds)๋ฅผ ๋Œ๋ฆฝ๋‹ˆ๋‹ค. ๊ฐ ๋ผ์šด๋“œ์—์„œ ๋…ธ๋“œ๋Š” ์ด์›ƒ์œผ๋กœ๋ถ€ํ„ฐ ๋ฉ”์‹œ์ง€๋ฅผ ๋ชจ์•„ ์ž๊ธฐ ์ƒํƒœ๋ฅผ ๊ฐฑ์‹ ํ•ฉ๋‹ˆ๋‹ค. ๋ผ์šด๋“œ๋ฅผ ๊ฑฐ๋“ญํ• ์ˆ˜๋ก ์ •๋ณด๊ฐ€ ๊ทธ๋ž˜ํ”„ ์ „์ฒด๋กœ ํผ์ ธ ๋‚˜๊ฐ€๋ฉฐ, ์ด๊ฒƒ์ด FEM์˜ ํž˜ ์ „ํŒŒ๋ฅผ ํ‰๋‚ด ๋ƒ…๋‹ˆ๋‹ค.
  • Decode: ์ตœ์ข… ๋…ธ๋“œ ํ‘œํ˜„์œผ๋กœ๋ถ€ํ„ฐ ๋ณ€ํ˜•(deformation)๊ณผ ์‘๋ ฅ(stress)์„ ๋™์‹œ์— ์ถœ๋ ฅํ•ฉ๋‹ˆ๋‹ค.

๋ฉ”์‹œ์ง€ ์ „๋‹ฌ์„ ์ง๊ด€์ ์ธ ์˜์‚ฌ์ฝ”๋“œ๋กœ ํ‘œํ˜„ํ•˜๋ฉด ๋‹ค์Œ๊ณผ ๊ฐ™์Šต๋‹ˆ๋‹ค(๊ฐœ๋… ์„ค๋ช…์šฉ).

# Encode-Process-Decode, conceptual
h_v = encode_node(features_v)          for each node v
h_e = encode_edge(features_e)          for each edge e (mesh or contact)

for round in 1..K:                     # multiple message-passing rounds
    for each edge e = (u, v):
        m_e = edge_update(h_e, h_u, h_v)   # build message along the edge
    for each node v:
        agg = aggregate(m_e for e incident to v)
        h_v = node_update(h_v, agg)        # update node from neighbors

deformation_v = decode_def(h_v)        for each node v
stress        = decode_stress(...)     # predicted as field over the body

์ด ๋…ผ๋ฌธ์˜ ๋ชจํƒœ๊ฐ€ ๋œ ์„ ํ–‰ ์—ฐ๊ตฌ(โ€œInductive Biases โ€ฆโ€ ๋…ผ๋ฌธ)๋Š” ํ‘œ์ค€ GNN(DefGraspNets)์— ๋‘ ๊ฐ€์ง€ ๊ท€๋‚ฉ ํŽธํ–ฅ์„ ๋”ํ–ˆ์Šต๋‹ˆ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์€ ๊ทธ ๊ฐœ์„ ํŒ์„ ๊ธฐ๋ฐ˜(baseline)์œผ๋กœ ์‚ผ์Šต๋‹ˆ๋‹ค.

  1. ์‚ฌ๋ฉด์ฒด ๋‹จ์œ„ ํŠน์ง•(tetrahedral features): ์‘๋ ฅ์„ ๊ผญ์ง“์ (vertex)์ด ์•„๋‹ˆ๋ผ ์‚ฌ๋ฉด์ฒด ์š”์†Œ ์ž์ฒด์—์„œ ์˜ˆ์ธกํ•ฉ๋‹ˆ๋‹ค. FEM์ด ๋ณธ๋ž˜ ์š”์†Œ ๋‹จ์œ„๋กœ ์‘๋ ฅ์„ ๊ณ„์‚ฐํ•˜๋ฏ€๋กœ, ์˜ˆ์ธก์„ ๊ทธ ๋ฌผ๋ฆฌ ๋ชจ๋ธ์— ์ •๋ ฌ์‹œ์ผœ ๋” ์‚ฌ์‹ค์ ์ธ ๊ฒฐ๊ณผ๋ฅผ ๋ƒ…๋‹ˆ๋‹ค.
  2. ์ „์—ญ ํŠน์ง• ์ง€๋ฆ„๊ธธ(global feature shortcut): ๋ฉ”์‹œ ํ•ด์ƒ๋„๊ฐ€ ๋†’๊ณ  ๋ฉ”์‹œ์ง€ ์ „๋‹ฌ ๋ผ์šด๋“œ ์ˆ˜๊ฐ€ ์ ์œผ๋ฉด, ์ •๋ณด๊ฐ€ ๊ทธ๋ž˜ํ”„ ๋๊นŒ์ง€ ๋ชป ํผ์ง€๋Š” ํ•œ๊ณ„๊ฐ€ ์ƒ๊น๋‹ˆ๋‹ค. ์ „์—ญ ํŠน์ง•์€ ๊ทธ๋ž˜ํ”„ ์ „์ฒด์— ๊ด€๋ จ๋œ ์ •๋ณด๋ฅผ ๊ณง์žฅ ์ „๋‹ฌํ•˜๋Š” ์ง€๋ฆ„๊ธธ ์—ญํ• ์„ ํ•ด ์ด ๋ฌธ์ œ๋ฅผ ์™„ํ™”ํ•ฉ๋‹ˆ๋‹ค.

์ž…๋ ฅ ๊ตฌ์„ฑ์— ๋Œ€ํ•œ ์ ˆ์ œ(ablation): force-only vs. translation

์ €์ž๋Š” ๋‘ ๊ฐ€์ง€ ์ž…๋ ฅ ๋ฐฉ์‹์„ ๋น„๊ตํ•ฉ๋‹ˆ๋‹ค. ์ด ๋น„๊ต๊ฐ€ ๊ฒฐ๊ณผ๋ฅผ ์ดํ•ดํ•˜๋Š” ๋ฐ ์ค‘์š”ํ•ฉ๋‹ˆ๋‹ค.

  • Force-only ์ž…๋ ฅ: ๊ทธ๋ฆฝ ํž˜๋งŒ ์ฃผ๊ณ , ๋ชจ๋ธ์ด ์†๊ฐ€๋ฝ ์ด๋™๋Ÿ‰(translation)๊ณผ ๋ณ€ํ˜•์„ ๋‘˜ ๋‹ค ์˜ˆ์ธกํ•ด์•ผ ํ•ฉ๋‹ˆ๋‹ค. โ†’ ๋” ์–ด๋ ค์šด ๊ณผ์ œ.
  • Translation ์ž…๋ ฅ: ์†๊ฐ€๋ฝ ์ด๋™๋Ÿ‰์„ ๋ฏธ๋ฆฌ ์•Œ๋ ค์ฃผ๊ณ , ๋ชจ๋ธ์€ ๋ณ€ํ˜•๋งŒ ์˜ˆ์ธกํ•ฉ๋‹ˆ๋‹ค. โ†’ ๋” ์‰ฌ์šด ๊ณผ์ œ.

์ง๊ด€์ ์œผ๋กœ, ์†๊ฐ€๋ฝ์ด ์–ผ๋งˆ๋‚˜ ๋“ค์–ด๊ฐ”๋Š”์ง€๋ฅผ ์•Œ๋ ค์ฃผ๋ฉด โ€œ๊ทธ ๊ฒฐ๊ณผ ์ ค์ด ์–ด๋–ป๊ฒŒ ๋ˆŒ๋ฆฌ๋‚˜โ€๋งŒ ํ’€๋ฉด ๋˜๋‹ˆ ํ•™์Šต์ด ๋‹จ์ˆœํ•ด์ง‘๋‹ˆ๋‹ค. ๊ฒŒ๋‹ค๊ฐ€ ์‹ค์ œ ๋กœ๋ด‡ ์ œ์–ด์—์„œ๋Š” ์œ„์น˜ ๋ช…๋ น์„ ์ฃผ๋Š” ๊ฒฝ์šฐ๊ฐ€ ๋งŽ์œผ๋ฏ€๋กœ, translation ์ž…๋ ฅ ๋ฐฉ์‹์ด sim-to-real ์ „์ด์—๋„ ๋” ์œ ๋ฆฌํ•˜๋‹ค๊ณ  ์ €์ž๋Š” ์ฃผ์žฅํ•ฉ๋‹ˆ๋‹ค.

์‹คํ—˜

์„ค์ •

  • ๋ฐ์ดํ„ฐ: 10์ข… ๋ฌผ์ฒด ร— 100 ๊ทธ๋ฆฝ ์ž์„ธ, ์‹คํ–‰๋‹น 50ํ”„๋ ˆ์ž„.
  • ์ •๋‹ต(ground truth): Isaac Gym FEM ์‹œ๋ฎฌ๋ ˆ์ด์…˜.
  • ํ‰๊ฐ€ ์ง€ํ‘œ: ๋ณ€ํ˜• MAE(Mean Deformation MAE)์™€ ์‘๋ ฅ MAE(Mean Stress MAE). MAE๋Š” ํ‰๊ท  ์ ˆ๋Œ€ ์˜ค์ฐจ๋กœ, ์˜ˆ์ธก๊ณผ ์ •๋‹ต์˜ ์ฐจ์ด๋ฅผ ์ ˆ๋Œ“๊ฐ’ ํ‰๊ท ํ•œ ๊ฐ’์ด๋ฉฐ ์ž‘์„์ˆ˜๋ก ์ข‹์Šต๋‹ˆ๋‹ค.
  • ์‹คํ—˜ ๋‹จ๊ณ„: (1) ๋‹จ์ผ ๋ฌผ์ฒด ํ•™์Šต โ†’ ๊ฐ™์€ ํ˜•์ƒ์˜ ๋ณด์ง€ ๋ชปํ•œ ๊ทธ๋ฆฝ ์ž์„ธ๋กœ ์ผ๋ฐ˜ํ™” ๊ฒ€์ฆ, (2) ๋‹ค์ค‘ ๋ฌผ์ฒด ํ•™์Šต โ†’ ๋ฌผ์ฒด๋‹น ๊ทธ๋ฆฝ ์ž์„ธ์˜ 80%๋กœ ํ•™์Šตํ•˜๊ณ  ๋‚˜๋จธ์ง€ 20%๋กœ ํ…Œ์ŠคํŠธ.

๊ฒฐ๊ณผ 1: ๋‹จ์ผ ๋ฌผ์ฒด โ€” ๋ณด์ง€ ๋ชปํ•œ ์ž์„ธ๋กœ ์ผ๋ฐ˜ํ™”

Class Translation ์ž…๋ ฅ ๋ณ€ํ˜• MAE ์‘๋ ฅ MAE
potato True 6.57e-05 372.7
potato False 2.92e-04 382.8
apple True 7.20e-05 370.5
apple False 2.97e-04 427.9
lemon True 5.40e-05 212.1
lemon False 2.38e-04 265.6

(๊ตต์€ ๊ธ€์”จ๊ฐ€ ๋” ์ข‹์€ ์„ฑ๋Šฅ. ์ถœ์ฒ˜: ๋…ผ๋ฌธ Table I.)

ํ•ด์„: translation ์ž…๋ ฅ์„ ์ฃผ๋ฉด ๋ณ€ํ˜• MAE๊ฐ€ ํ•œ ์ž๋ฆฟ์ˆ˜(์•ฝ 4~5๋ฐฐ) ์ค„์–ด๋“ญ๋‹ˆ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด ๊ฐ์ž๋Š” 2.92e-04 โ†’ 6.57e-05. ์‘๋ ฅ ์˜ค์ฐจ๋„ ์ค„์ง€๋งŒ ๋ณ€ํ˜•๋งŒํผ ๊ทน์ ์ด์ง€๋Š” ์•Š์Šต๋‹ˆ๋‹ค. ์ฆ‰ โ€œ์ด๋™๋Ÿ‰์„ ์•Œ๋ ค์ฃผ๋Š” ๊ฒƒโ€์ด ๋ณ€ํ˜• ์˜ˆ์ธก ๋‚œ์ด๋„๋ฅผ ํฌ๊ฒŒ ๋‚ฎ์ถ˜๋‹ค๋Š” ์ง๊ด€์ด ์ˆ˜์น˜๋กœ ํ™•์ธ๋ฉ๋‹ˆ๋‹ค. ํ•ต์‹ฌ์€, ๊ฐ™์€ ๋ฌผ์ฒด๋ผ๋„ ํ•™์Šต ๋•Œ ๋ณด์ง€ ๋ชปํ•œ ๊ทธ๋ฆฝ ์ž์„ธ์—์„œ ์ž˜ ์ž‘๋™ํ–ˆ๋‹ค๋Š” ์ ์ž…๋‹ˆ๋‹ค.

๊ฒฐ๊ณผ 2: ๋‹ค์ค‘ ๋ฌผ์ฒด โ€” ํ˜•์ƒ ๊ฐ„ ์ผ๋ฐ˜ํ™”

Class Translation ์ž…๋ ฅ ๋ณ€ํ˜• MAE ์‘๋ ฅ MAE
Average True 6.30e-05 360.3
Average False 2.69e-04 420.2

(10๊ฐœ ๋ฌผ์ฒด ํ‰๊ท . ์ถœ์ฒ˜: ๋…ผ๋ฌธ Table II.)

ํ•ด์„: 10์ข… ๋ฌผ์ฒด๋กœ ํ•œ๊บผ๋ฒˆ์— ํ•™์Šตํ–ˆ์„ ๋•Œ์˜ ํ‰๊ท  ์˜ค์ฐจ๊ฐ€ ๋‹จ์ผ ๋ฌผ์ฒด ํ•™์Šต๊ณผ ๊ฑฐ์˜ ๊ฐ™์€ ๋ฒ”์œ„์— ๋จธ๋ญ…๋‹ˆ๋‹ค(๋ณ€ํ˜• 6.30e-05 vs. ๋‹จ์ผ ํ‰๊ท  6e-05๋Œ€). ์ฆ‰ ๋ฌผ์ฒด ์ข…๋ฅ˜๋ฅผ ๋Š˜๋ ค๋„ ์„ฑ๋Šฅ์ด ๋ฌด๋„ˆ์ง€์ง€ ์•Š๊ณ , ์—ฌ๋Ÿฌ ํ˜•์ƒ์— ๊ฑธ์ณ ์ผ๋ฐ˜ํ™”๋œ๋‹ค๋Š” ๋œป์ž…๋‹ˆ๋‹ค. ์—ฌ๊ธฐ์„œ๋„ translation ์ž…๋ ฅ์ด ์ผ๊ด€๋˜๊ฒŒ ๋” ์ข‹์Šต๋‹ˆ๋‹ค.

์†๋„

๊ฐ€์žฅ ์ธ์ƒ์ ์ธ ๊ฒฐ๊ณผ๋Š” ์†๋„์ž…๋‹ˆ๋‹ค. GNN์€ FEM ๋Œ€๋น„ 10^2~10^3๋ฐฐ ๋น ๋ฆ…๋‹ˆ๋‹ค. ์ •ํ™•๋„๋Š” FEM ์ •๋‹ต์— ๊ฒฌ์ค„ ๋งŒํ•œ ์ˆ˜์ค€์„ ์œ ์ง€ํ•˜๋ฉด์„œ์š”. ์ด ์ •๋„๋ฉด ๊ฐ•ํ™”ํ•™์Šต์šฉ ๋Œ€๊ทœ๋ชจ ๋ฐ์ดํ„ฐ ์ƒ์„ฑ, ์‹ค์‹œ๊ฐ„ ํ๋ฃจํ”„(closed-loop) ์ด‰๊ฐ ์ œ์–ด, ๊ทธ๋ฆฝ ๊ณ„ํš(grasp planning) ๊ฐ™์€ ๊ณณ์— ์‹ค์ œ๋กœ ์“ธ ์ˆ˜ ์žˆ๋Š” ์˜์—ญ์— ๋“ค์–ด์˜ต๋‹ˆ๋‹ค.

๊ทธ๋ฆผ 1(๋…ผ๋ฌธ)์— ๋Œ€ํ•œ ์„ค๋ช…: ์œ—์ค„์€ FEM์ด ๊ณ„์‚ฐํ•œ ์ •๋‹ต ์‘๋ ฅยท๋ณ€ํ˜•, ์•„๋žซ์ค„์€ GNN ์˜ˆ์ธก์„ ์—ฌ๋Ÿฌ ๊ทธ๋ฆฝ ์ž์„ธ์—์„œ ๋‚˜๋ž€ํžˆ ๋ณด์—ฌ ์ค๋‹ˆ๋‹ค. ์ƒ‰์œผ๋กœ ์น ํ•ด์ง„ ์‘๋ ฅ ๋ถ„ํฌ์˜ ์œ„์น˜์™€ ๊ฐ•๋„๊ฐ€ ๋‘ ์ค„์—์„œ ์‹œ๊ฐ์ ์œผ๋กœ ๊ฑฐ์˜ ์ผ์น˜ํ•ฉ๋‹ˆ๋‹ค. โ€œ๋น ๋ฅด๊ฒŒ ๋ฝ‘์€ ์˜ˆ์ธก์ด ์ •๋ฐ€ ์‹œ๋ฎฌ๋ ˆ์ด์…˜๊ณผ ๋ˆˆ์œผ๋กœ ๋ด๋„ ๋น„์Šทํ•˜๋‹คโ€๋Š” ๊ฒƒ์ด ํ•ต์‹ฌ ๋ฉ”์‹œ์ง€์ž…๋‹ˆ๋‹ค.

๋น„ํŒ์  ๊ณ ์ฐฐ

๊ฐ•์ 

  • ๋ช…ํ™•ํ•œ ๋ฌธ์ œ ์ •์˜์™€ ๊น”๋”ํ•œ ๋ฐœ์ƒ: โ€œFEM ๋ฉ”์‹œ = ๊ทธ๋ž˜ํ”„, ํž˜ ์ „ํŒŒ = ๋ฉ”์‹œ์ง€ ์ „๋‹ฌโ€์ด๋ผ๋Š” ๋Œ€์‘์€ ์šฐ์•„ํ•˜๊ณ , ๋ฌผ๋ฆฌ ๊ตฌ์กฐ๋ฅผ ๋ชจ๋ธ์— ๋ฐ•์•„ ๋„ฃ๋Š”(๊ท€๋‚ฉ ํŽธํ–ฅ) ๋ฐฉํ–ฅ์ด ์˜ณ์Šต๋‹ˆ๋‹ค. ๋ฐ์ดํ„ฐ์—์„œ ๋ชจ๋“  ๊ฒƒ์„ ์งœ๋‚ด๋ ค๋Š” ๋ฌด์ง€์„ฑ ์ ‘๊ทผ๋ณด๋‹ค ํ‘œ๋ณธ ํšจ์œจ์ด ์ข‹์„ ๊ฐ€๋Šฅ์„ฑ์ด ํฝ๋‹ˆ๋‹ค.
  • ๋ณ€ํ˜•๊ณผ ์‘๋ ฅ์„ ๋™์‹œ์— ์˜ˆ์ธก: ๋‹จ์ˆœ ํ˜•์ƒ ๋ณ€ํ˜•๋ฟ ์•„๋‹ˆ๋ผ ์‘๋ ฅ ๋ถ„ํฌ๊นŒ์ง€ ๋‚ด๋†“๋Š”๋‹ค๋Š” ์ ์ด ์ด‰๊ฐ ์‘์šฉ์—์„œ ํŠนํžˆ ์ค‘์š”ํ•ฉ๋‹ˆ๋‹ค. ์‘๋ ฅ์€ ๋ฏธ๋„๋Ÿฌ์ง ์˜ˆ์ธก์ด๋‚˜ ํŒŒ์† ๋ฐฉ์ง€(์˜ˆ: ๊ณผ์ผ์ด ๋ฉ๋“œ๋Š”์ง€)์™€ ์ง๊ฒฐ๋ฉ๋‹ˆ๋‹ค.
  • ์‹ค์šฉ์  ์ž…๋ ฅ ์„ค๊ณ„: translation ์ž…๋ ฅ ๋ฐฉ์‹์ด sim-to-real ์ •๋ ฌ๊ณผ ํ•™์Šต ๋‹จ์ˆœํ™”๋ฅผ ๋™์‹œ์— ์žก๋Š”๋‹ค๋Š” ํ†ต์ฐฐ์€ ์‹ค๋ฌด์ ์œผ๋กœ ๊ฐ€์น˜๊ฐ€ ์žˆ์Šต๋‹ˆ๋‹ค.
  • ์žฌํ˜„์„ฑ: ์ฝ”๋“œ์™€ ๋ฐ์ดํ„ฐ ์ƒ์„ฑ ํŒŒ์ดํ”„๋ผ์ธ์„ ๊ณต๊ฐœํ–ˆ์Šต๋‹ˆ๋‹ค.

์•ฝ์ ยทํ•œ๊ณ„

  • ์ƒˆ ๋ฌผ์ฒดยท์ƒˆ ์„ผ์„œ๋กœ์˜ ์ผ๋ฐ˜ํ™”๋Š” ๋ฏธ๊ฒ€์ฆ: ์ €์ž ์Šค์Šค๋กœ ์ฒซ ๋ฒˆ์งธ ํ•œ๊ณ„๋กœ ์ธ์ •ํ•ฉ๋‹ˆ๋‹ค. ์‹คํ—˜์€ ์‹๋ฃŒํ’ˆ๋ฅ˜ ํ˜•์ƒ 10์ข…์— ํ•œ์ •๋˜๋ฉฐ, ํ˜•์ƒ์ด๋‚˜ ์žฌ์งˆ์ด ํฌ๊ฒŒ ๋‹ค๋ฅธ ๋ฌผ์ฒด, ํ˜น์€ GelSight Mini๊ฐ€ ์•„๋‹Œ ๋‹ค๋ฅธ ์„ผ์„œ(DIGIT ๋“ฑ)๋กœ์˜ ์ „์ด๋Š” ๋ณด์—ฌ ์ฃผ์ง€ ๋ชปํ–ˆ์Šต๋‹ˆ๋‹ค.
  • ์ค‘๋ ฅ ๋ฌด์‹œ: ํ˜„์žฌ ๊ตฌํ˜„์€ ์ค‘๋ ฅ ํšจ๊ณผ๋ฅผ ๋ฌด์‹œํ•ฉ๋‹ˆ๋‹ค. ๋™์  ๊ทธ๋ฆฝ์—์„œ ๋ฌผ์ฒด ๋ฌด๊ฒŒ์— ๋”ฐ๋ฅธ ํž˜ ๋ถ„ํฌ ๋ณ€ํ™”๊ฐ€ ์˜ˆ์ธก์— ์˜ํ–ฅ์„ ์ค„ ์ˆ˜ ์žˆ๋Š”๋ฐ, ์ด๊ฒƒ์ด ๋ฐ˜์˜๋˜์ง€ ์•Š์•˜์Šต๋‹ˆ๋‹ค.
  • ์›Œํฌ์ˆ ๋‹จ๊ณ„์˜ ์ œํ•œ๋œ ํ‰๊ฐ€: ๋ฒ ์ด์Šค๋ผ์ธ ๋น„๊ต๊ฐ€ self-baseline(์ž์ฒด ์ž…๋ ฅ ๊ตฌ์„ฑ ablation) ์ค‘์‹ฌ์ด๊ณ , ์™ธ๋ถ€ ๋ฐฉ๋ฒ•(์˜ˆ: ๊ฐ•์ฒด ์‹œ๋ฎฌ๋ ˆ์ด์…˜, ๋‹ค๋ฅธ ํ•™์Šต ๊ธฐ๋ฐ˜ ์ด‰๊ฐ ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ)๊ณผ์˜ ์ง์ ‘์  ์ •๋Ÿ‰ ๋น„๊ต๋Š” ๋ถ€์กฑํ•ฉ๋‹ˆ๋‹ค. ์†๋„ 10^2~10^3๋ฐฐ ์—ญ์‹œ ์ธก์ • ์กฐ๊ฑด(๋ฉ”์‹œ ํ•ด์ƒ๋„, ํ•˜๋“œ์›จ์–ด, FEM ์†”๋ฒ„ ์„ค์ •)์ด ๋ณธ๋ฌธ์— ์ •๋ฐ€ํ•˜๊ฒŒ ๋ช…์‹œ๋˜์ง€ ์•Š์•˜์Šต๋‹ˆ๋‹ค. (์ถ”์ธก) ๊ฒฐ๊ณผ์˜ ์ผ๋ฐ˜์„ฑ์„ ๋‹จ์ •ํ•˜๊ธฐ์—๋Š” ํ‘œ๋ณธ๊ณผ ๋น„๊ต๊ตฐ์ด ์ข์Šต๋‹ˆ๋‹ค.
  • ์‘๋ ฅ MAE ์ ˆ๋Œ€๊ฐ’์˜ ํ•ด์„ ์—ฌ์ง€: ๋ณ€ํ˜• MAE๋Š” translation ์ž…๋ ฅ์œผ๋กœ ํฌ๊ฒŒ ๊ฐœ์„ ๋˜์ง€๋งŒ ์‘๋ ฅ MAE ๊ฐœ์„ ํญ์€ ์ƒ๋Œ€์ ์œผ๋กœ ์ž‘์Šต๋‹ˆ๋‹ค. ์‘๋ ฅ์ด ๋ณ€ํ˜•๋ณด๋‹ค ๋ณธ์งˆ์ ์œผ๋กœ ํ•™์Šต์ด ์–ด๋ ต๋‹ค๋Š” ์‹ ํ˜ธ์ผ ์ˆ˜ ์žˆ์œผ๋ฉฐ, ์‘๋ ฅ ์ •ํ™•๋„๊ฐ€ ์ถฉ๋ถ„ํ•œ์ง€๋Š” ๋‹ค์šด์ŠคํŠธ๋ฆผ ๊ณผ์ œ(๋ฏธ๋„๋Ÿฌ์ง ๊ฐ์ง€ ๋“ฑ)์—์„œ์˜ ๊ฒ€์ฆ์ด ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.
  • ๋ Œ๋”๋ง ๋ฏธํฌํ•จ: โ€œ์ด‰๊ฐ ์ด๋ฏธ์ง€โ€๊นŒ์ง€ ๋งŒ๋“ค์–ด์•ผ ์ง„์งœ ์‹œ๊ฐ ์ด‰๊ฐ ์„ผ์„œ ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ์ž…๋‹ˆ๋‹ค. ๋ณธ ์—ฐ๊ตฌ๋Š” ๋ฌผ๋ฆฌ ์ธต๊นŒ์ง€๋งŒ์ด๋ฏ€๋กœ, ์‹ค์ œ GelSight ์ด๋ฏธ์ง€๋ฅผ ์“ฐ๋Š” ์ •์ฑ…์— ๋ฐ”๋กœ ๊ฝ‚์œผ๋ ค๋ฉด ์ถ”๊ฐ€ ๋ Œ๋”๋ง ๋‹จ๊ณ„๊ฐ€ ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.

์š”์•ฝ ๋ฐ ๊ฒฐ๋ก 

์ด ๋…ผ๋ฌธ์€ โ€œFEM์€ ์ •ํ™•ํ•˜์ง€๋งŒ ๋„ˆ๋ฌด ๋А๋ฆฌ๋‹คโ€๋Š” ์‹œ๊ฐ ์ด‰๊ฐ ์„ผ์„œ ์‹œ๋ฎฌ๋ ˆ์ด์…˜์˜ ๊ณ ์งˆ์  ๋ณ‘๋ชฉ์„, ๊ทธ๋ž˜ํ”„ ์‹ ๊ฒฝ๋ง์œผ๋กœ FEM์„ ๋ชจ์‚ฌํ•ด ํ‘ธ๋Š” ๊น”๋”ํ•œ ์ ‘๊ทผ์„ ๋ณด์—ฌ ์ค๋‹ˆ๋‹ค. FEM ๋ฉ”์‹œ๋ฅผ ๊ทธ๋ž˜ํ”„๋กœ ๋ณด๊ณ , ๋ฉ”์‹œ์ง€ ์ „๋‹ฌ์„ ํž˜ ์ „ํŒŒ์— ๋Œ€์‘์‹œํ‚ค๋ฉฐ, ๋ฉ”์‹œ ์—ฃ์ง€/์ ‘์ด‰ ์—ฃ์ง€ ๋ถ„๋ฆฌ์™€ ์‚ฌ๋ฉด์ฒด ๋‹จ์œ„ ์‘๋ ฅ ์˜ˆ์ธก ๊ฐ™์€ ๋ฌผ๋ฆฌ์  ๊ท€๋‚ฉ ํŽธํ–ฅ์„ ์‹ฌ์€ ๊ฒƒ์ด ํ•ต์‹ฌ์ž…๋‹ˆ๋‹ค.

๊ฒฐ๊ณผ์ ์œผ๋กœ FEM ๋Œ€๋น„ 10^2~10^3๋ฐฐ ๋น ๋ฅด๋ฉด์„œ ๋ณ€ํ˜•๊ณผ ์‘๋ ฅ์„ ๋™์‹œ์— ์˜ˆ์ธกํ•˜๊ณ , ๋ณด์ง€ ๋ชปํ•œ ๊ทธ๋ฆฝ ์ž์„ธ์™€ ์—ฌ๋Ÿฌ ๋ฌผ์ฒด ํ˜•์ƒ์— ์ผ๋ฐ˜ํ™”ํ•ฉ๋‹ˆ๋‹ค. ์†๊ฐ€๋ฝ ์ด๋™๋Ÿ‰์„ ์ž…๋ ฅ์œผ๋กœ ์ฃผ๋Š” ๋ฐฉ์‹์ด ํ•™์Šต์„ ๋‹จ์ˆœํ™”ํ•˜๊ณ  sim-to-real์—๋„ ์œ ๋ฆฌํ•˜๋‹ค๋Š” ์‹ค์šฉ์  ๊ด€์ฐฐ๋„ ์ธ์ƒ์ ์ž…๋‹ˆ๋‹ค.

๋‹ค๋งŒ ์ƒˆ ๋ฌผ์ฒดยท์ƒˆ ์„ผ์„œ ์ผ๋ฐ˜ํ™”, ์ค‘๋ ฅ ํšจ๊ณผ, ์™ธ๋ถ€ ๋ฒ ์ด์Šค๋ผ์ธ๊ณผ์˜ ๋น„๊ต, ๊ทธ๋ฆฌ๊ณ  ๋ Œ๋”๋ง๊นŒ์ง€ ์ด์–ด์ง€๋Š” end-to-end ํŒŒ์ดํ”„๋ผ์ธ์€ ์•ž์œผ๋กœ์˜ ์ˆ™์ œ์ž…๋‹ˆ๋‹ค. ๊ทธ๋Ÿผ์—๋„ ์‹ค์‹œ๊ฐ„ ์ด‰๊ฐ ๊ธฐ๋ฐ˜ ํ๋ฃจํ”„ ์ œ์–ด์™€ ๋Œ€๊ทœ๋ชจ ๋ฐ์ดํ„ฐ ์ƒ์„ฑ์ด๋ผ๋Š”, ๊ทธ๋™์•ˆ FEM ์†๋„์— ๋ง‰ํ˜€ ์žˆ๋˜ ์‘์šฉ์˜ ๋ฌธ์„ ์—ฌ๋Š” ์˜๋ฏธ ์žˆ๋Š” ํ•œ ๊ฑธ์Œ์ž…๋‹ˆ๋‹ค.

๊ด€๋ จ ์—ฐ๊ตฌ์™€์˜ ์œ„์น˜

  • DefGraspNets / DefGraspSim (Huang et al.): 3D ๋ณ€ํ˜• ๋ฌผ์ฒด์˜ ๊ทธ๋ฆฝ ๊ฒฐ๊ณผ๋ฅผ GNN/FEM์œผ๋กœ ๋‹ค๋ฃฌ ์„ ํ–‰ ์—ฐ๊ตฌ. ๋ณธ ๋…ผ๋ฌธ์€ ๊ทธ ๋ฐœ์ƒ์„ ์ด‰๊ฐ ์„ผ์„œ(์ ค)๋กœ ํ™•์žฅ.
  • Inductive Biases GNN (Heller et al., ๋ณธ ๋…ผ๋ฌธ์˜ ์ง์ ‘์  baseline): ์‚ฌ๋ฉด์ฒด ํŠน์ง•๊ณผ ์ „์—ญ ํŠน์ง• ์ง€๋ฆ„๊ธธ์„ ๋„์ž…ํ•ด ๋ณ€ํ˜•/์‘๋ ฅ ์˜ˆ์ธก์„ ๊ฐœ์„ .
  • TacSL / Tactile Gym 2.0: ๊ฐ•ํ™”ํ•™์Šต์šฉ ๋น ๋ฅธ(์ฃผ๋กœ ๊ฐ•์ฒด ๊ธฐ๋ฐ˜) ์ด‰๊ฐ ์‹œ๋ฎฌ๋ ˆ์ด์…˜. ์†๋„๋Š” ๋น ๋ฅด๋‚˜ ์—ฐ์ฒด ํ˜„์‹ค๊ฐ์€ ๋ถ€์กฑ โ€” ๋ณธ ๋…ผ๋ฌธ์ด ๋…ธ๋ฆฐ ๋นˆํ‹ˆ.
  • TacFlex: ์‹œ๊ฐ ์ด‰๊ฐ ์„ผ์„œ์˜ ์ด‰๊ฐ ์ด๋ฏธ์ง€(imprint) ์‹œ๋ฎฌ๋ ˆ์ด์…˜ โ€” ๋ณธ ๋…ผ๋ฌธ์ด ๋‹ค๋ฃจ์ง€ ์•Š์€ ๋ Œ๋”๋ง ์ชฝ์„ ๋ณด์™„ํ•˜๋Š” ์œ„์น˜.

Copyright 2026, JungYeon Lee