Mathematical Foundations of Deep Learning
Interactive Visualizations
Animate optimization steps
Compare approximations
Explore contour plots
Convex vs non-convex
Level sets Q(x) = xα΅Ax
Classify matrices
Spectral vs Frobenius
Principal components
Low-rank approximation
Document similarity
Bell curve & 68-95-99.7 rule
Contour ellipses & eigenvectors
Bernoulli, Binomial, Categorical
Law of Large Numbers
Shannon entropy for discrete distributions
Classification loss with softmax
Gaussian & discrete, forward vs reverse
Perceptron β Logic Gates β MLP β Cybenko β Deep Networks
Sigmoid, Tanh, Heaviside & derivatives comparison
Loss surface, learning rate effects, convergence
Step-by-step through a 1β2β1 MLP