https://asplos.dev/wordpress/2023/02/02/failure-tolerant-training-with-persistent-memory-d/
Failure Tolerant Training with Persistent Memory Disaggregation over CXL