We make the first attempt to seamlessly integrate camera geometry into a unified multimodal model, introducing a camera-centric framework, i.e., Puffin, to advance ...
Hello, your work is excellent. However, I have a small problem. When I use scripts/demo_collision_free_grasps.py to infer my own data, there's a strange phenomenon: it can only determine the grab pose ...