π¦[Review] Efficient Data Collection for Robotic Manipulation via Compositional Generalization
Meta
author:: Jensen Gao,Β Annie Xie,Β Ted Xiao,Β Chelsea Finn,Β Dorsa Sadigh
[λ Όλ¬Έ 리뷰] κ°λκ·
Reviewed by Kade Kang (devkade12@gmail.com)
Reviewed:: 2025-12-22, 52week
First-Pass
Data collection has become an increasingly important problem in robotic manipulation, yet there still lacks much understanding of how to effectively collect data to facilitate broad generalization. Recent works on large-scale robotic data collection typically vary many environmental factors of variation (e.g., object types, table textures) during data collection, to cover a diverse range of scenarios. However, they do not explicitly account for the possible compositional abilities of policies trained on the data. If robot policies can compose environmental factors from their data to succeed when encountering unseen factor combinations, we can exploit this to avoid collecting data for situations that composition would address. To investigate this possibility, we conduct thorough empirical studies both in simulation and on a real robot that compare data collection strategies and assess whether visual imitation learning policies can compose environmental factors. We find that policies do exhibit composition, although leveraging prior robotic datasets is critical for this on a real robot. We use these insights to propose better in-domain data collection strategies that exploit composition, which can induce better generalization than naive approaches for the same amount of effort during data collection. We further demonstrate that a real robot policy trained on data from such a strategy achieves a success rate of 77.5% when transferred to entirely new environments that encompass unseen combinations of environmental factors, whereas policies trained using data collected without accounting for environmental variation fail to transfer effectively, with a success rate of only 2.5%. We provide videos atΒ this http URL.
Second-Pass
Third-Pass
- ν΄λΉ λ Όλ¬Έμ μ μ± μ΄ λ€μν νκ²½ μμΈμ λν μ‘°ν©μ μΌλ°νκ° κ°λ₯ν¨μ 보μΈλ€. κ·Έλ λ€λ©΄ μ΄ μ μ± μ μμΈλ€κ°μ κ΄κ³λ₯Ό κ°μ§κ³ μμκΉ?(e.g. A κ° B μμ μλ€) λ§μ½ κ°μ§κ³ μλ€λ©΄, κ΄κ³λ₯Ό λͺ μμ μΌλ‘ μΆμΆνκ±°λ λΆμν μ μλ λ°©λ²μ 무μμΈκ°? λͺ μμ μΌλ‘ μΆμΆν μ μλ€λ©΄, μλ‘μ΄ μμΈ μ‘°ν©, μλ‘μ΄ ννμ κ΄κ³λ μΌλ°νν μ μμ§ μμκΉ?
- Prior data κ° compositional generalization μ κ²°μ μ μΈ μν μ νλλ°, μ΄λ prior data κ° λ‘λ΄ μ μ± μ λΆμ¬ν μ΄μ μ΄ λ¬΄μμΌκΉ? μΌλ°νκ° λ μ μλλ‘ μ§μμ μ μ΄νλ€λ©΄, μ΄λ€ μ§μμ μ μ΄ν κΉ?
- κ³ λ €νμ§ λͺ»ν μμΈ (λ°©ν΄λ¬Ό, μ‘°λͺ λ³ν λ±) μ λν΄ μ ν λ°μ΄ν°κ° κ°κ±΄μ±μ λΆμ¬νλ€κ³ λ Όλ¬Έμμ μΈκΈνλλ°, λ‘λ΄μ΄ μ΄μ μ μ ν μ μλμ§ μκ³ μλ‘κ² λ°μνλ νκ²½ μμΈμ λν΄μλ compositional generalization μ λ°νν μ μμκΉ?