Abstract: In multi-split computing, to achieve low inference latency and avoid significant accuracy degradation, it is advantageous to distribute sub-models to computing nodes that can exchange ...