nanobind: tiny and efficient C++/Python bindings
Implement a Pytorch-like DL library in C++ from scratch, step by step