Skip to content
MIT Press
  • MIT Press
  • Books
    • Column
      • View all subjects
      • New releases
      • Catalogs
      • Textbooks
      • Series
    • Column
      • Authors
      • Distributed presses
      • The MIT Press Reader
      • Podcasts
    • Column
      • MIT Press Direct

        MIT Press Direct is a distinctive collection of influential MIT Press books curated for scholars and libraries worldwide.

        • Learn more
  • Journals
    • column
      • Journals all topics
      • Economics
      • International Affairs, History, & Political Science
    • column
      • Arts & Humanities
      • Science & Technology
      • Open access
    • column
      • MIT Press journals

        MIT Press began publishing journals in 1970 with the first volumes of Linguistic Inquiry and the Journal of Interdisciplinary History. Today we publish over 30 titles in the arts and humanities, social sciences, and science and technology.

        • Learn more
  • Open Access
    • column
      • Open access at the MIT Press
      • Open access books
      • Open access journals
    • column
      • Direct to Open
      • MIT Open Publishing Services
      • MIT Press Open on PubPub
    • Column
      • Open access

        The MIT Press has been a leader in open access book publishing for over two decades, beginning in 1995 with the publication of William Mitchell’s City of Bits, which appeared simultaneously in print and in a dynamic, open web edition.

        • Learn more
  • Resources
    • column
      • Current authors
      • Prospective authors
      • Instructors
    • column
      • Media inquiries
      • Booksellers
      • Rights and permissions
    • column
      • Resources

        Collaborating with authors, instructors, booksellers, librarians, and the media is at the heart of what we do as a scholarly publisher. If you can’t find the resource you need here, visit our contact page to get in touch.

        • Learn more
  • Give
  • About
    • Column
      • About
      • Jobs
      • Internships
      • MIT Press Editorial Board
      • MIT Press Management Board
    • Column
      • Catalogs
      • News
      • Events
      • Conferences
      • Bookstore
    • Column
      • The MIT Press

        The MIT Press has been a leader in open access book publishing for two decades, beginning in 1995 with the publication of William Mitchell’s City of Bits. Most recently, we have pioneered a new open access model, Direct to Open, that has enabled us to publish more than 80 open access titles in a single year.

        • Learn more
  • Contact Us
Newsletter
MIT Press
  • Home
  • MIT Lincoln Laboratory Series
  • mathematics
  • computers
  • Mathematics of Big Data
Mathematics of Big Data

MIT Lincoln Laboratory Series

Mathematics of Big Data

Spreadsheets, Databases, Matrices, and Graphs

by Jeremy Kepner and Hayden Jananthan

Foreword by Charles E. Leiserson

  • $80.00 Hardcover
  • Rent eTextbook

448 pp., 7 x 9 in, 99 figures

  • Hardcover
  • 9780262038393
  • Published: July 17, 2018

$80.00

For Professors: Request permissions

  • Penguin Random House
  • Amazon
  • Barnes and Noble
  • Bookshop.org
  • Indiebound
  • Indigo
  • Books a Million

Other Retailers:

  • Penguin Random House
  • Amazon
  • Barnes and Noble
  • Bookshop.org
  • Indiebound
  • Indigo
  • Books a Million
  • Amazon.co.uk
  • Blackwells
  • Bookshop.org
  • Foyles
  • Hive
  • Waterstones
  • Request exam/desk copy
  • Request instructor ancillaries
  • Request permissions
  • Description
  • Author(s)
  • Praise
  • Resources

The first book to present the common mathematical foundations of big data analysis across a range of applications and technologies.

Today, the volume, velocity, and variety of data are increasing rapidly across a range of fields, including Internet search, healthcare, finance, social media, wireless devices, and cybersecurity. Indeed, these data are growing at a rate beyond our capacity to analyze them. The tools—including spreadsheets, databases, matrices, and graphs—developed to address this challenge all reflect the need to store and operate on data as whole sets rather than as individual elements. This book presents the common mathematical foundations of these data sets that apply across many applications and technologies. Associative arrays unify and simplify data, allowing readers to look past the differences among the various tools and leverage their mathematical similarities in order to solve the hardest big data challenges.

The book first introduces the concept of the associative array in practical terms, presents the associative array manipulation system D4M (Dynamic Distributed Dimensional Data Model), and describes the application of associative arrays to graph analysis and machine learning. It provides a mathematically rigorous definition of associative arrays and describes the properties of associative arrays that arise from this definition. Finally, the book shows how concepts of linearity can be extended to encompass associative arrays. Mathematics of Big Data can be used as a textbook or reference by engineers, scientists, mathematicians, computer scientists, and software engineers who analyze big data.

Jeremy Kepner is an MIT Lincoln Laboratory Fellow, Founder and Head of the MIT Lincoln Laboratory Supercomputing Center, and Research Affiliate in MIT's Mathematics Department.

Hayden Jananthan is a PhD candidate in the Department of Mathematics at Vanderbilt University.

Charles E. Leiserson is Edwin Sibley Webster Professor in Electrical Engineering and Computer Science at MIT.

Mathematics of Big Data presents a sophisticated view of matrices, graphs, databases, and spreadsheets, with many examples to help the discussion. The authors present the topic in three parts—applications and practice, mathematical foundations, and linear systems—with self-contained chapters to allow for easy reference and browsing. The algorithms are expressed in D4M, with execution possible in Matlab, Octave, and Julia. With exercises at the end of each section, the book can be used as a supplemental or primary text for a class on big data, algorithms, data structures, data analytics, linear algebra, or abstract algebra.

Jack Dongarra, Professor, University of Tennessee, Oak Ridge National Laboratory, and University of Manchester; coauthor of MPI: The Complete Reference, second edition, volume 1

In this era of big data, new methods for gaining insights promise to improve all aspects of our lives. This new textbook from Kepner and Jananthan is a fantastic resource for data scientists to understand the unifying mathematics for big data problems that covers everything from databases to graph analytics.

David A. Bader, Professor and Chair, School of Computational Science and Engineering, Georgia Institute of Technology

Software and Examples

OpenCourseWare with Video Lectures

Selected Solutions

Table of Contents

Selected Solutions

Table of Contents

Related Books

Category Theory for the Sciences
Decision Making Under Uncertainty
logo
  • Column 1
    • Books
    • Journals
    • The MIT Press Reader
    • Podcasts
    • Imprints
  • Column 2
    • The MIT Press
      • About
      • Bookstore
      • Catalogs
      • Conferences
      • Press Editorial Board
      • Jobs
      • Internships
      • Press Management Board
      • News
      • Staff
      • Code of Conduct
      • Give
  • Column 3
    • Site Help
      • Accessibility
      • FAQ
      • Our eBooks
      • Privacy Policy
      • Terms of Use
  • Column 4
    • Resources
      • Current Authors
      • Prospective Authors
      • Booksellers
      • Instructors
      • Rights and Permissions
      • Media Inquiries
      • MIT Discounts
  • Column 5
    • Digital
      • CogNet
      • Digital Partners and Products
      • Knowledge Futures Group
      • MIT Press Direct
  • US

    One Broadway 12th Floor Cambridge, MA 02142

  • UK

    Unit 57710 PO Box 6945 London W1A 6US UK

  • Contact

Connect

© 2022 MIT Press. All Rights Reserved.

Powered by Supadu