This is all WIP; please enjoy these videos.
This shows stochastic gradient descent performing linear fitting:
This shows the linear fitting in phase space:
This is an example of performing backprop through an ODE integrator, and then SGD to fit parameters of a mass-spring-damper system: