Curieux.JY
  • JungYeon Lee
  • Post
  • Note

On this page

  • ๐Ÿ” Ping Review

๐Ÿ“ƒDexNDM ๋ฆฌ๋ทฐ

dexterous manipulation
sim2real
Closing the Reality Gap for Dexterous In-Hand Rotation via Joint-Wise Neural Dynamics Model
Published

March 21, 2026

  • Paper Link

  • Project Link

  • Video

  • Xueyi Liu, He Wang, Li Yi

  1. ๐Ÿค– ๋ณธ ์—ฐ๊ตฌ๋Š” sim-to-real reality gap์œผ๋กœ ์ธํ•ด ์–ด๋ ค์›€์ด ํฐ dexterous in-hand rotation์—์„œ ์ „๋ก€ ์—†๋Š” ์ผ๋ฐ˜ํ™” ๋Šฅ๋ ฅ์„ ๋‹ฌ์„ฑํ•˜๋Š” ๊ฒƒ์„ ๋ชฉํ‘œ๋กœ ํ•ฉ๋‹ˆ๋‹ค.
  2. ๐Ÿฆพ ์ด๋ฅผ ์œ„ํ•ด, limited real-world data๋ฅผ ํšจ๊ณผ์ ์œผ๋กœ ํ•™์Šตํ•˜๊ณ  sim policy์˜ actions๋ฅผ ์กฐ์ •ํ•˜๋Š” joint-wise neural dynamics model๊ณผ autonomous data collection ์ „๋žต์„ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค.
  3. โœจ DexNDM์€ ๋‹จ์ผ policy๋กœ ๋ณต์žกํ•œ ํ˜•์ƒ, ๋†’์€ aspect ratio, ๋‹ค์–‘ํ•œ wrist orientation์„ ๊ฐ€์ง„ ๋ฌผ์ฒด๋ฅผ ํ˜„์‹ค ์„ธ๊ณ„์—์„œ ์„ฑ๊ณต์ ์œผ๋กœ ์กฐ์ž‘ํ•˜์—ฌ, teleoperation๊ณผ ๊ฐ™์€ complex dexterous tasks๋ฅผ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ” Ping Review

๐Ÿ” Ping โ€” A light tap on the surface. Get the gist in seconds.

DEXNDM: CLOSING THE REALITY GAP FOR DEXTEROUS IN-HAND ROTATION VIA JOINT-WISENEURAL DYNAMICS MODEL ๋…ผ๋ฌธ์€ dexterous in-hand rotation์—์„œ ๋ฐœ์ƒํ•˜๋Š” sim-to-real gap์„ ์ขํžˆ๊ธฐ ์œ„ํ•ด joint-wise neural dynamics model์„ ํ™œ์šฉํ•˜๋Š” ์ƒˆ๋กœ์šด ์ ‘๊ทผ ๋ฐฉ์‹์ธ DexNDM์„ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค.

๋กœ๋ด‡ ๊ณตํ•™์—์„œ ์ผ๋ฐ˜ํ™”๋œ ์†์•ˆ ๊ฐ์ฒด ํšŒ์ „์„ ๋‹ฌ์„ฑํ•˜๋Š” ๊ฒƒ์€ ์—ฌ์ „ํžˆ ์ค‘๋Œ€ํ•œ ๋„์ „ ๊ณผ์ œ์ž…๋‹ˆ๋‹ค. ๊ธฐ์กด ๋ฐฉ๋ฒ•๋“ค์€ ๋‹จ์ˆœํ•œ ๊ธฐํ•˜ํ•™์  ํ˜•ํƒœ, ์ œํ•œ๋œ ๊ฐ์ฒด ํฌ๊ธฐ, ๊ณ ์ •๋œ ์†๋ชฉ ์ž์„ธ, ๋งž์ถคํ˜• ํ•˜๋“œ์›จ์–ด ๋“ฑ ์ œ์•ฝ๋œ ์‹œ๋‚˜๋ฆฌ์˜ค์— ๊ตญํ•œ๋˜์–ด ์™”์Šต๋‹ˆ๋‹ค. ์ด๋Ÿฌํ•œ ํ•œ๊ณ„์˜ ์ฃผ๋œ ์›์ธ์€ ๋ณต์žกํ•˜๊ณ  ์ ‘์ด‰์ด ๋งŽ์€ ๋™์—ญํ•™์œผ๋กœ ์ธํ•ด ๋ฐœ์ƒํ•˜๋Š” โ€œํ˜„์‹ค-์‹œ๋ฎฌ๋ ˆ์ด์…˜(sim-to-real) ๊ฐ„๊ทนโ€์ž…๋‹ˆ๋‹ค. ํŠนํžˆ, dexterous manipulation์—์„œ๋Š” ๋ถ„ํฌ ๊ด€๋ จ ๋ฐ์ดํ„ฐ์˜ ๋Œ€๋Ÿ‰ ์ˆ˜์ง‘์ด ์–ด๋ ต๊ณ , ์น˜๋ช…์ ์ธ ์‹คํŒจ(์˜ˆ: ๊ฐ์ฒด ๋‚™ํ•˜) ์‹œ ๋นˆ๋ฒˆํ•œ ์ธ๊ฐ„ ๊ฐœ์ž…์ด ํ•„์š”ํ•˜๋ฉฐ, ์†์œผ๋กœ ์ธํ•œ ๊ฐ€๋ฆผ์œผ๋กœ ๊ฐ์ฒด ์ƒํƒœ ์ถ”์ •์ด ์–ด๋ ต๋‹ค๋Š” ๋ฌธ์ œ๊ฐ€ ์žˆ์Šต๋‹ˆ๋‹ค.

DexNDM์€ ์ด ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด ๋‘ ๊ฐ€์ง€ ํ•ต์‹ฌ ์„ค๊ณ„๋ฅผ ๋„์ž…ํ•ฉ๋‹ˆ๋‹ค.

  1. ์ „๋ฌธ๊ฐ€-์ผ๋ฐ˜์ฃผ์˜์ž(Specialist-to-Generalist) ์ •์ฑ… ํ›ˆ๋ จ: ๋จผ์ €, ๋‹ค์–‘ํ•œ ๊ฐ์ฒด ์นดํ…Œ๊ณ ๋ฆฌ(์›ํ†ต, ์ง์œก๋ฉด์ฒด, ๋ณต์žกํ•œ ํ˜•์ƒ ๋“ฑ)์— ๊ฑธ์ณ RL(Reinforcement Learning)์„ ํ†ตํ•ด oracle policy๋ฅผ ํ›ˆ๋ จํ•ฉ๋‹ˆ๋‹ค. ์ด oracle policy๋“ค์€ ํ’๋ถ€ํ•œ privileged observation์„ ํ™œ์šฉํ•ฉ๋‹ˆ๋‹ค. ์ดํ›„, ์„ฑ๊ณต์ ์ธ oracle ๊ถค์ ๋งŒ์„ ์ง‘๊ณ„ํ•˜์—ฌ Behavior Cloning (BC)์„ ํ†ตํ•ด ๋‹จ์ผ generalist policy๋ฅผ ํ›ˆ๋ จํ•ฉ๋‹ˆ๋‹ค. generalist policy์˜ ๊ด€์ธก์น˜ o_{gen_t}๋Š” proprioception history, ์†๋ชฉ ๋ฐฉํ–ฅ, ํšŒ์ „ ์ถ• ์ •๋ณด๋ฅผ ํฌํ•จํ•ฉ๋‹ˆ๋‹ค. ์ด ๋ฐฉ์‹์€ ์–ด๋ ค์šด ์ž‘์—…์—์„œ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ์ตœ์ ํ™” ์‹คํŒจ๋‚˜ ์‹ค์ œ ํ™˜๊ฒฝ์—์„œ์˜ ์„ฑ๋Šฅ ์ €ํ•˜ ๋ฌธ์ œ๋ฅผ ํ”ผํ•˜๋ฉด์„œ ๋†’์€ ํ’ˆ์งˆ์˜ oracle behavior๋ฅผ ๋ชจ๋ฐฉํ•˜์—ฌ ์‹ค์ œ ํ™˜๊ฒฝ์— ๋ฐฐํฌ ๊ฐ€๋Šฅํ•œ ์ •์ฑ…์„ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค.
  2. ์กฐ์ธํŠธ๋ณ„ ์‹ ๊ฒฝ ๋™์—ญํ•™ ๋ชจ๋ธ (Joint-Wise Neural Dynamics Model): ์ด ๋ชจ๋ธ์€ ํ˜„์‹ค-์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๊ฐ„๊ทน์„ ๋ฉ”์šฐ๋Š” ํ•ต์‹ฌ ์š”์†Œ์ž…๋‹ˆ๋‹ค.
    • ๋ชจ๋ธ ์„ค๊ณ„: ๊ธฐ์กด์˜ โ€œ์ „์ฒด ์†(whole-hand)โ€ ๋ชจ๋ธ๊ณผ ๋‹ฌ๋ฆฌ, ๊ฐ ์กฐ์ธํŠธ i์˜ ๋™์—ญํ•™์„ ๊ฐœ๋ณ„์ ์œผ๋กœ ๋ชจ๋ธ๋งํ•ฉ๋‹ˆ๋‹ค. ๊ฐ ์กฐ์ธํŠธ์˜ ๋‹ค์Œ ์ƒํƒœ q^i_{t+1}๋Š” ์˜ค์ง ํ•ด๋‹น ์กฐ์ธํŠธ์˜ W ์Šคํ… ์ƒํƒœ-์•ก์…˜ ์ด๋ ฅ h^i_t = \{q^i_j, a^i_j\}_{j=t-W+1}^t๋กœ๋ถ€ํ„ฐ ์˜ˆ์ธก๋ฉ๋‹ˆ๋‹ค. ์ด๋Š” q^i_{t+1} = f_{\psi^i}(h^i_t)์™€ ๊ฐ™์ด ํ‘œํ˜„๋ฉ๋‹ˆ๋‹ค. ์ด ์„ค๊ณ„๋Š” ๊ณ ์ฐจ์›์ ์ธ ์‹œ์Šคํ…œ ์ „๋ฐ˜์˜ ์˜ํ–ฅ(์˜ˆ: ์กฐ์ธํŠธ ๊ฐ„ ์ปคํ”Œ๋ง, ์ž‘๋™, ๊ฐ์ฒด ์œ ๋ฐœ ํšจ๊ณผ)์„ ์ €์ฐจ์›์˜ โ€œ์œ ํšจํ•œ(effective)โ€ ๋ณ€์ˆ˜๋กœ ์ฆ๋ฅ˜ํ•˜์—ฌ ๊ฐ ์กฐ์ธํŠธ์˜ ๋™์—ญํ•™์  ํ”„๋กœํ•„๋กœ๋ถ€ํ„ฐ ๊ทธ ์ง„ํ™”๋ฅผ ์•”์‹œ์ ์œผ๋กœ ํฌ์ฐฉํ•˜๊ฒŒ ํ•ฉ๋‹ˆ๋‹ค.
    • ์ด๋ก ์  ๊ทผ๊ฑฐ (์ •๋ณด ์ˆ˜์ถ•์„ ํ†ตํ•œ ์ผ๋ฐ˜ํ™”): ์ด ๋ชจ๋ธ์˜ ํ•ต์‹ฌ ๊ฐ•์ ์€ ์ •๋ณด ์ˆ˜์ถ•(Information Contraction)์„ ํ†ตํ•ด ์ผ๋ฐ˜ํ™” ์„ฑ๋Šฅ์„ ํ–ฅ์ƒ์‹œํ‚จ๋‹ค๋Š” ๊ฒƒ์ž…๋‹ˆ๋‹ค.
      • ๋ฐ์ดํ„ฐ ์ฒ˜๋ฆฌ ๋ถ€๋“ฑ์‹ (Data Processing Inequality for KL divergence, Theorem 3.1): ์ „์ฒด ์‹œ์Šคํ…œ ์ƒํƒœ X = H_t์™€ ์กฐ์ธํŠธ๋ณ„ ์ƒํƒœ Y = h^i_t ๊ฐ„์˜ ๋งคํ•‘ g: X \to Y๊ฐ€ ์ฃผ์–ด์กŒ์„ ๋•Œ, KL(P\|Q) \ge KL(g(P)\|g(Q))์ด ์„ฑ๋ฆฝํ•ฉ๋‹ˆ๋‹ค. ์—ฌ๊ธฐ์„œ P๋Š” ์‹ค์ œ ํ™˜๊ฒฝ ๋ถ„ํฌ, Q๋Š” ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๋˜๋Š” ์ˆ˜์ง‘๋œ ๋ฐ์ดํ„ฐ ๋ถ„ํฌ์ž…๋‹ˆ๋‹ค. ํŠนํžˆ, g๊ฐ€ P์™€ Q๊ฐ€ ๋‹ค๋ฅธ ์ƒ๋Œ€์  ๊ตฌ์กฐ๋ฅผ ๊ฐ€์ง€๋Š” ์ง€์ ๋“ค์„ ๋ณ‘ํ•ฉํ•˜๋Š” ๋ฐฉ์‹์œผ๋กœ ๋น„๋‹จ์‚ฌ์ (non-injective)์ด๋ฉด, ์ด ๋ถ€๋“ฑ์‹์€ ์—„๋ฐ€ํ•˜๊ฒŒ ์„ฑ๋ฆฝํ•ฉ๋‹ˆ๋‹ค (>). ์ด๋Š” ๊ณ ์ฐจ์› ์ •๋ณด๋ฅผ ์ €์ฐจ์›์œผ๋กœ ์ถ•์†Œํ•  ๋•Œ, ๋‘ ๋ถ„ํฌ ๊ฐ„์˜ KL ๋ฐœ์‚ฐ์ด ์ค„์–ด๋“ค์–ด ๋ถ„ํฌ ๋ณ€ํ™”(distribution shift)๊ฐ€ ์™„ํ™”๋จ์„ ์˜๋ฏธํ•ฉ๋‹ˆ๋‹ค.
      • ์ผ๋ฐ˜ํ™” ๊ฐ„๊ทน ์ˆ˜์ถ• (Generalization Gap Contraction, Theorem 3.2): KL(g(P)\|g(Q)) < KL(P\|Q)์ธ ๊ฒฝ์šฐ, ์กฐ์ธํŠธ๋ณ„ ๋ชจ๋ธ f_2 \circ g_X์˜ generalization gap์ด ์ „์ฒด ์† ๋ชจ๋ธ f_1์˜ generalization gap๋ณด๋‹ค ์ž‘์•„์ง‘๋‹ˆ๋‹ค. ์ฆ‰, ์ถ•์†Œ๋œ ์ •๋ณด๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ๋ชจ๋ธ์ด ํ˜„์‹ค-์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๊ฐ„๊ทน๊ณผ ๊ฐ™์€ ๋ถ„ํฌ ๋ณ€ํ™” ์ƒํ™ฉ์—์„œ ๋” ์ž˜ ์ผ๋ฐ˜ํ™”๋ฉ๋‹ˆ๋‹ค.
    • ์ž์œจ ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘ (Autonomous Data Collection): โ€œ์นด์˜ค์Šค ๋ฐ•์Šค(Chaos Box)โ€๋ผ๋Š” ์ €๋น„์šฉ์˜ ์ž์œจ ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘ ์ „๋žต์„ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค. ๋กœ๋ด‡ ์†์€ ์†Œํ”„ํŠธ๋ณผ์ด ๊ฐ€๋“ ์ฐฌ ์ปจํ…Œ์ด๋„ˆ์— ๋ฐฐ์น˜๋˜๋ฉฐ, ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ํ›ˆ๋ จ๋œ ๊ธฐ๋ณธ ์ •์ฑ…์˜ ์•ก์…˜์„ open-loop์œผ๋กœ ์žฌ์ƒํ•˜๊ณ  ๊ฐ ์•ก์…˜์— ๊ฐ€์šฐ์‹œ์•ˆ ๋…ธ์ด์ฆˆ(\sigma=0.01)๋ฅผ ์ถ”๊ฐ€ํ•˜์—ฌ ๋‹ค์–‘ํ•œ ๋ฌด์ž‘์œ„ ๋ถ€ํ•˜(randomized loads)๋ฅผ ๊ฐ€ํ•ฉ๋‹ˆ๋‹ค. ์ด ๊ณผ์ •์€ ์™„์ „ํžˆ ์ž์œจ์ ์ด๊ณ  ํ•˜๋“œ์›จ์–ด ์•ˆ์ „ํ•˜๋ฉฐ, ๊ฐ์ฒด ๋‚™ํ•˜ ์‹œ์˜ ์ธ๊ฐ„ ๊ฐœ์ž…์ด๋‚˜ ๋ฆฌ์…‹์ด ํ•„์š” ์—†์–ด ๋Œ€๊ทœ๋ชจ ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘์ด ๊ฐ€๋Šฅํ•ฉ๋‹ˆ๋‹ค.
    • ์ž”์—ฌ ์ •์ฑ… (Residual Policy): ํ•™์Šต๋œ ์กฐ์ธํŠธ๋ณ„ ๋™์—ญํ•™ ๋ชจ๋ธ f_\psi๋ฅผ ํ™œ์šฉํ•˜์—ฌ ๊ธฐ๋ณธ ์ •์ฑ…์˜ ์•ก์…˜์„ ๋ณด์ƒํ•˜๋Š” ์ž”์—ฌ ์ •์ฑ… \pi_{res}๋ฅผ ํ›ˆ๋ จํ•ฉ๋‹ˆ๋‹ค. ๊ธฐ๋ณธ ์ •์ฑ…์˜ ๊ด€์ธก์น˜ o_{gen_t}์™€ ๊ธฐ๋ณธ ์•ก์…˜ a_t๊ฐ€ ์ฃผ์–ด์ง€๋ฉด, \pi_{res}๋Š” ๋ณด์ •์น˜ a_{res,t}๋ฅผ ์ถœ๋ ฅํ•˜๋ฉฐ, ์‹ค์ œ ๋ฐฐํฌ ์‹œ์—๋Š” a_t + a_{res,t}๊ฐ€ ์‹คํ–‰๋ฉ๋‹ˆ๋‹ค. ์ด ๋ฐฉ์‹์€ ๊ธฐ์กด ์ •์ฑ…์˜ ๋™์ž‘์„ ํฌ๊ฒŒ ๋ณ€๊ฒฝํ•˜์ง€ ์•Š์œผ๋ฉด์„œ ์‹ค์ œ ํ™˜๊ฒฝ์˜ ๋™์—ญํ•™์  ํŠน์„ฑ์„ ๋ฐ˜์˜ํ•˜๋„๋ก ๋ฏธ์„ธ ์กฐ์ •ํ•˜๋Š” ํšจ๊ณผ๋ฅผ ๊ฐ€์ง‘๋‹ˆ๋‹ค.

์‹คํ—˜ ๊ฒฐ๊ณผ:

์‹œ๋ฎฌ๋ ˆ์ด์…˜ ํ‰๊ฐ€์—์„œ DexNDM์˜ generalist policy๋Š” ๋ฏธ๊ณต๊ฐœ ๊ฐ์ฒด์— ๋Œ€ํ•ด ๊ธฐ์กด AnyRotate ๊ตฌํ˜„๋ณด๋‹ค 37%~81% ์šฐ์ˆ˜ํ•œ ์„ฑ๋Šฅ์„ ๋ณด์˜€์Šต๋‹ˆ๋‹ค. ์‹ค์ œ ํ™˜๊ฒฝ์—์„œ DexNDM์€ ์ „๋ก€ ์—†๋Š” dexterity๋ฅผ ๋ณด์—ฌ์ฃผ์—ˆ์Šต๋‹ˆ๋‹ค. ๋ณต์žกํ•œ ํ˜•์ƒ(๋™๋ฌผ ๋ชจ๋ธ), ๋†’์€ ์ข…ํšก๋น„(์ตœ๋Œ€ 5.33), ์ž‘์€ ํฌ๊ธฐ ๊ฐ์ฒด์— ๋Œ€ํ•ด ๋‹ค์–‘ํ•œ ์†๋ชฉ ๋ฐฉํ–ฅ ๋ฐ ํšŒ์ „ ์ถ•์—์„œ ์„ฑ๊ณต์ ์ธ ๊ณต์ค‘ ํšŒ์ „์„ ๋‹ฌ์„ฑํ–ˆ์Šต๋‹ˆ๋‹ค. ํŠนํžˆ, 10-16cm ๊ธธ์ด์˜ ๊ธด ๊ฐ์ฒด๋ฅผ palm-down ๊ตฌ์„ฑ์—์„œ ๊ณต์ค‘์—์„œ ๊ฑฐ์˜ ํ•œ ๋ฐ”ํ€ด ํšŒ์ „์‹œํ‚ค๋Š” ๋Šฅ๋ ฅ์„ ๋ณด์—ฌ์ฃผ์—ˆ๋Š”๋ฐ, ์ด๋Š” ๊ธฐ์กด ์—ฐ๊ตฌ์—์„œ ์‹œ๋„๋˜์ง€ ์•Š์•˜๊ฑฐ๋‚˜ ์–ด๋ ค์› ๋˜ ๋ถ€๋ถ„์ž…๋‹ˆ๋‹ค. Visual Dexterity ๋ฐ AnyRotate์™€ ๋น„๊ตํ•˜์—ฌ ํƒ์›”ํ•œ ์„ฑ๋Šฅ๊ณผ ๊ด‘๋ฒ”์œ„ํ•œ ๊ฐ์ฒด ๋ฐ ์กฐ๊ฑด์— ๋Œ€ํ•œ ์ผ๋ฐ˜ํ™” ๋Šฅ๋ ฅ์„ ์ž…์ฆํ–ˆ์Šต๋‹ˆ๋‹ค. Whole-Hand Neural Dynamics Model๊ณผ์˜ ๋น„๊ต๋ฅผ ํ†ตํ•ด, DexNDM์˜ joint-wise model์ด ๋ฐ์ดํ„ฐ๊ฐ€ ์ œํ•œ์ ์ด๊ฑฐ๋‚˜ train-test distribution shift๊ฐ€ ์žˆ๋Š” ํ™˜๊ฒฝ์—์„œ ํ›จ์”ฌ ๋” ๋†’์€ ์ƒ˜ํ”Œ ํšจ์œจ์„ฑ๊ณผ ์ผ๋ฐ˜ํ™” ๋Šฅ๋ ฅ์„ ๊ฐ€์ง์„ ํ™•์ธํ–ˆ์Šต๋‹ˆ๋‹ค. ๋ฐ˜๋ฉด ASAP ๋ฐ UAN๊ณผ ๊ฐ™์€ ๊ธฐ์กด sim-to-real ๋ฐฉ๋ฒ•๋“ค์€ object-loaded ์ƒํ˜ธ์ž‘์šฉ ๋™์—ญํ•™์— ๋Œ€ํ•œ generalization์ด ๋ถ€์กฑํ•˜์—ฌ ์‹ค์ œ ํ™˜๊ฒฝ์—์„œ ์‹คํŒจํ–ˆ์Šต๋‹ˆ๋‹ค. DexNDM์€ tool-using ๋ฐ ์กฐ๋ฆฝ๊ณผ ๊ฐ™์€ ๋ณต์žกํ•œ dexterous task๋ฅผ ์œ„ํ•œ teleoperation ์‹œ์Šคํ…œ์— ์„ฑ๊ณต์ ์œผ๋กœ ์ ์šฉ๋  ์ˆ˜ ์žˆ์Œ์„ ์‹œ์—ฐํ–ˆ์Šต๋‹ˆ๋‹ค.

๊ฒฐ๋ก :

DexNDM์€ joint-wise neural dynamics model๊ณผ ์ž์œจ ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘ ์ „๋žต์„ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•˜๋Š” ์ƒˆ๋กœ์šด sim-to-real framework๋ฅผ ์ œ๊ณตํ•˜์—ฌ ์ „๋ก€ ์—†๋Š” ์†์•ˆ ๊ฐ์ฒด ํšŒ์ „ ๋Šฅ๋ ฅ์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•ฉ๋‹ˆ๋‹ค. ์ด ์—ฐ๊ตฌ๋Š” dexterous manipulation์˜ โ€œํ˜„์‹ค-์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๊ฐ„๊ทนโ€์„ ์ขํžˆ๋Š” ๋ฐ ์ค‘์š”ํ•œ ์ง„์ „์„ ์ด๋ฃจ์—ˆ์œผ๋ฉฐ, ํ–ฅํ›„ ์ด‰๊ฐ ์„ผ์„œ ๋ฐ ๋” ํ’๋ถ€ํ•œ ์‹ ํ˜ธ ํ†ตํ•ฉ์„ ํ†ตํ•ด ๋ชจ๋ธ์˜ ํ•œ๊ณ„๋ฅผ ๊ทน๋ณตํ•˜๋Š” ๊ฒƒ์„ ๋ชฉํ‘œ๋กœ ํ•ฉ๋‹ˆ๋‹ค. # ๐Ÿ”” Ring Review

๐Ÿ”” Ring โ€” An idea that echoes. Grasp the core and its value.

  • ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ์ •์ฑ…์˜ action์„ ์‹ค์ œ ํ™˜๊ฒฝ ์กฐ๊ฑด์— ๋งž๊ฒŒ ๋ณ€ํ™˜ํ•˜๋Š” joint-wise dynamics model์ด ํ•ต์‹ฌ
  • ์ตœ์†Œํ•œ์˜ ์ธ๊ฐ„ ๊ฐœ์ž…์œผ๋กœ ์™„์ „ ์ž์œจ์ ์ธ ์‹ค์ œ ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘ ์ „๋žต ์ œ์‹œ
  • ๋ณต์žกํ•œ ํ˜•์ƒ์˜ ๋ฌผ์ฒด, ๋†’์€ ์ข…ํšก๋น„(์ตœ๋Œ€ 5.33), ์†Œํ˜• ๋ฌผ์ฒด, ๋‹ค์–‘ํ•œ ์†๋ชฉ ๋ฐฉํ–ฅ์—์„œ ์„ฑ๊ณต์ ์ธ ํšŒ์ „ ์‹œ์—ฐ
  • ํ…”๋ ˆ์˜คํผ๋ ˆ์ด์…˜ ์‘์šฉ์—์„œ๋„ ์œ ํšจ์„ฑ ๊ฒ€์ฆ

Copyright 2026, JungYeon Lee