The Elements of Differentiable Programming

Book

Book draft available on arXiv.

Abstract

Artificial intelligence has recently experienced remarkable advances, fueled by large models, vast datasets, accelerated hardware, and, last but not least, the transformative power of differentiable programming. This new programming paradigm enables end-to-end differentiation of complex computer programs (including those with control flows and data structures), making gradient-based optimization of program parameters possible. As an emerging paradigm, differentiable programming builds upon several areas of computer science and applied mathematics, including automatic differentiation, graphical models, optimization and statistics. This book presents a comprehensive review of the fundamental concepts useful for differentiable programming. We adopt two main perspectives, that of optimization and that of probability, with clear analogies between the two. Differentiable programming is not merely the differentiation of programs, but also the thoughtful design of programs intended for differentiation. By making programs differentiable, we inherently introduce probability distributions over their execution, providing a means to quantify the uncertainty associated with program outputs.

1. Introduction
I. Fundamentals

2. Differentiation
3. Probabilistic learning

II. Differentiable programs

4. Parameterized programs
5. Control flows
6. Data structures

III. Differentiating through programs

7. Finite differences
8. Automatic differentiation
9. Second-order automatic differentiation
10. Inference in graphical models as differentiation
11. Differentiating through optimization
12. Differentiating through integration

IV. Smoothing programs

13. Smoothing by optimization
14. Smoothing by integration

V. Optimizing differentiable programs

15. Optimization basics
16. First-order optimization
17. Second-order optimization
18. Duality

How to cite

@article{edpbook,
title={The {E}lements of {D}ifferentiable {P}rogramming},
author={Blondel, Mathieu and Roulet, Vincent},
journal={arXiv preprint arXiv:2403.14606},
year={2024}
}

Code

Python code accompanying the book is available in this github repository.

Teaching materials

Some slides covering a subset of the book are available here.

Authors

Mathieu Blondel, Google DeepMind, mblondel@google.com
Vincent Roulet, Google DeepMind, vroulet@google.com

Feel free to email us with suggestions, mistakes, typos. We are grateful for any feedback!