Discussion about this post

User's avatar
Jonathan Michaux's avatar

Excellent post. For the open-source component, will you only be comparing what you call end-to-end methods? Or will you also have some classical/hybrid baselines as well?

Bharath Suresh's avatar

This is a great post, I especially liked the clear hardware mapping for each example.

This may be an oversimplification, but from what I understand, there seems to be a clear split between two layers - planning and control.

Planning seems to be "heavy compute, low frequency" - like running VLMs or a World model on a GPU. Control seems to be more lightweight, but needs higher frequency, hence mostly running on CPUs.

So is it realistic to expect a hybrid architecture with some, or all of the planning to move to "the cloud." I know that some applications cannot tolerate the unpredictability of cloud latency, but there could be many applications where it could be fine - like robots in a controlled factory environment. If this is realistic, then the hybrid architecture might be better economically (by sharing compute for a group of robots) and might also make the robots lighter + power efficient.

Do you think that's a viable direction for robotics?

11 more comments...

No posts

Ready for more?