A Data Ecosystem for Individuals, Teams and People

What is DataHub?

For End Users:

  • A way to store your data centrally, without having to set up your own database
  • A way to collaborate with others
  • A way to seamlessly share your data with friends and colleagues
  • A suite of tools to process your data

For Developers:

  • A database-agnostic, language-agnostic platform for your mobile and web apps, integrated with MITREid (OpenID + OAuth2)
  • A web client for easy manipulation of your data
  • An application ecosystem for data processing, including ingestion, curation, integration, discovery, query, analytics, visualization, and machine learning
  • An open-source project from MIT CSAIL's Living Lab


DataHub is hosted at the MIT Computer Science & Artificial Intelligence Lab (CSAIL), with collaborators from the University of Maryland, College Park and the University of Illinois at Urbana-Champaign.


Anant Bhardwaj
PhD Student, MIT CSAIL
Aaron Elmore
Asst. Professor, University of Chicago
Sam Madden
Professor, MIT CSAIL
David Karger
Professor, MIT CSAIL
Aditya Parameswaran
Asst. Professor, UIUC
Amol Deshpande
Assoc. Professor, UMD
Souvik Bhattacherjee
PhD Student, UMD
Amit Chavan
PhD Student, UMD
Silu Huang
PhD Student, UIUC
Stephen C. Buckley
Project Manager, MIT Big Data Initiative
Justin Anderson
Programmer, MIT Big Data Initiative
Albert Carter
Programmer, MIT Big Data Initiative
Denis Babani
Volunteer Programmer, MIT Big Data Initiative


Elizabeth Bruce
Former Executive Director, MIT Big Data Initiative
Eugene Wu
Asst. Professor, Columbia University


  • API Docs: API documentation
  • Example Code: sample code in various programming languages, including C++, Java, Go, Python, and JavaScript
  • GitHub Repo: the DataHub source code repository on GitHub


Please email us, subscribe to our discussion list, or view the list archives.