github.com/prestodb/presto

The official home of the Presto distributed SQL query engine for big data

Open this visualization on its own page →

Contributors

525

Lines of Code

133,155

From

2012-08-08

To

2020-12-28

About prestodb/presto

Presto is a distributed SQL query engine designed for querying large-scale datasets across big data systems. It enables users to run SQL queries against multiple data sources including Hadoop, Hive, and modern lakehouse architectures. The engine is written in Java and provides both a server component and a command-line interface for interactive querying.

The project includes a comprehensive web-based console built with React for monitoring and managing queries, along with multiple pluggable connectors that allow Presto to interface with different data storage systems. It also features Presto native, a C++ rewrite of the worker component that leverages Velox, a C++ database library, to provide enhanced performance for data processing workloads.

Development is straightforward for Java developers, using standard Maven build processes and supporting popular IDEs like IntelliJ IDEA. The project requires Java 17 and includes extensive testing infrastructure, detailed documentation, and active community support through Slack channels for developers seeking assistance.

Share this video