

I get the impression this is a video-only thing because you need multiple vantage points of the scene. You can still extract a single frame in the end of course (like the article itself does), but you’ll need to shift around meaningful distances, like attack submarines do with Target Motion Analysis.
I love news titles that are basically “guess which word’s the verb” games.