An accessible, authoritative, and up-to-date computer vision textbook offering a comprehensive introduction to the foundations of the field that incorporates the latest deep learning advances.
Machine learning has revolutionized computer vision, but the methods of today have deep roots in the history of the field. Providing a much-needed modern treatment, this accessible and up-to-date textbook comprehensively introduces the foundations of computer vision while incorporating the latest deep learning advances. Taking a holistic approach that goes beyond machine learning, it addresses fundamental issues in the task of vision and the relationship of machine vision to human perception. Foundations of Computer Vision covers topics not standard in other texts, including transformers, diffusion models, statistical image models, issues of fairness and ethics, and the research process. To emphasize intuitive learning, concepts are presented in short, lucid chapters alongside extensive illustrations, questions, and examples. Written by leaders in the field and honed by a decade of classroom experience, this engaging and highly teachable book offers an essential next-generation view of computer vision.
• Up-to-date treatment integrates classic computer vision and deep learning • Accessible approach emphasizes fundamentals and assumes little background knowledge • Student-friendly presentation features extensive examples and images • Proven in the classroom • Instructor resources include slides, solutions, and source code
Antonio Torralba is Professor and Head of the AI+D faculty at the Department of Electrical Engineering and Computer Science at MIT, where he is a member of the Computer Science and Artificial Intelligence Laboratory (CSAIL).
Phillip Isola is Associate Professor of Electrical Engineering and Computer Science at MIT, where he is a member of the Computer Science and Artificial Intelligence Laboratory (CSAIL).
William T. Freeman is Thomas and Gerd Perkins Professor of Electrical Engineering and Computer Science at MIT, where he is a member of the Computer Science and Artificial Intelligence Laboratory (CSAIL). He is also a research manager at Google Research in Cambridge, Massachusetts.
Written by some of the most creative and beloved researchers in the field, this book tells the story of computer vision with exceptional clarity, deep insight, and infectious enthusiasm. It is as if Bach, Mozart, and Chopin were to collaborate on a music textbook.
Alexei Efros, Professor of Electrical Engineering and Computer Science, University of California, Berkeley
This is a book of tremendous scope written by three masters, and indeed three generations, of the computer vision field. The writing is clear, engaging, and full of insights. The figures wonderfully illustrate the foundational ideas.
Andrew Zisserman, Professor of Computer Vision Engineering, University of Oxford