Do you ❤️ Trino? Give us a 🌟 on GitHub

Trino users

Learn about organizations who use Trino, help with development, and are generally part of our community!

 
AdLibertas

AdLibertas

AdLibertas relies on Trino to power comprehensive user-level Audience Reporting for app developers. Trino allows our customers to create easy-to-use and visual user LTV reports across their large app user datasets. Since the scale and scope of these reports can widely vary, Trino’s ability to quickly scale on distributed systems and query across our data lake makes access fast and affordable for all sizes of clients without messy and expensive ETL.

AWS

AWS

Amazon Web Services (AWS) is widely used for deploying and running Trino. Amazon Athena or Amazon EMR embed Trino for your usage.

Amazon Athena is a serverless, interactive analytics service built on open-source frameworks, supporting open-table and file formats. Athena provides a simplified, flexible way to analyze petabytes of data where it lives. Analyze data or build applications from an Amazon Simple Storage Service (S3) data lake and 30 data sources, including on-premises data sources or other cloud systems using SQL or Python. Athena is built on open-source Trino and Presto engines and Apache Spark frameworks, with no provisioning or configuration effort required.

Amazon EMR provides a managed Hadoop framework that makes it easy, fast, and cost-effective to process vast amounts of data across dynamically scalable Amazon EC2 instances. With EMR, you can launch a large Trino cluster in minutes. You don’t need to worry about node provisioning, cluster setup or tuning. Using Trino on EMR provides these benefits to customers:

  • Elasticity: With Amazon EMR, you can provision one, hundreds, or thousands of compute instances to process data at any scale. You can easily increase or decrease the number of instances manually or with auto scaling, and you only pay for what you use.
  • Simple and predictable pricing: You pay a per-second rate for every second used, with a one-minute minimum charge.
Ampool

Ampool

Ampool is powered by Trino to provide SQL query federation across 200+ data sources. Combined with Ampool’s high-performance in-memory distributed caching engine, a turbocharged Trino query engine can speedup BI & ad-hoc analytical workloads by an order of magnitude. Ampool is built from the ground up to operate seamlessly in hybrid and multi-cloud environments.

Bazaar Technologies

Bazaar Technologies

At Bazaar, we are working to digitize Pakistan’s retail sector. Trino is the engine that powers the Bazaar Analytics Platform (aka Buraq). Buraq handles petabytes of data from a diverse set of data stores from Hive, S3 Lakes, RDS and Redshift. Trino allows us to build a scalable lake house architecture, that drives real time insights and machine learning workloads helping us to provide best in class customer experience.

BestSecret

BestSecret

BestSecret is a leading European members-only online destination for premium and luxury off-price fashion. It currently employs around 1,700 international fashion and technology enthusiasts from more than 80 nations.

Trino and Trino Gateway are part of the data platform used at BestSecret.

Chasing Securities

Chasing Securities

The Chasing Securities Data Center uses Trino as the crucial general data query and data processing engine. Thousands of jobs per day use Trino to complete data processing from the ODS layer to the EDW layer, and they use Trino as their Ad-Hoc query service for the company. They also use Trino as the backend query engine for the data service API provided to other departments.

Chasing Securities also uses Trino to connect to nearly 100 data sources as the federal query engine, including Oracle, SQL Server, MySQL, HBase, and many more, and have developed hundreds of security industry-related UDF functions on Trino, which are used to add to meet various business needs.

Cloud Chef Labs

Cloud Chef Labs

Cloud Chef Labs’s DataRoaster provides data platforms running on Kubernetes to build a data lake and AI-based analytics platform. Trino is one of the components provided by DataRoaster as an interactive query engine to query the data lake. Users run Trino in DataRoaster as a cost-effective alternative to serverless services provided by other cloud providers.

Datappeal

Datappeal

The Data Appeal Company is a data provider that analyzes geospatial, business information, and customer perception data for any point of interest worldwide. Data Appeal leverages Trino high-performance capabilities to power its data analysis platform, providing an interactive location intelligence visualization for customers. Trino also enables our data scientists to query our various datasources with complete transparency, which in turn yields applicable, data-driven insights.

datasapiens

datasapiens

At datasapiens, we help your company to become a data-driven organization. Trino is at the core of that endeavour and is our primary data retrieval engine. We use it on top of various data sources from diverse industries including retail, gastronomy, travel and others. Trino unifies a diverse set of data stores such as Apache Hive, PostgreSQL, Apache Pinot, and Redshift. It allows us to build a scalable lakehouse architecture providing insights to thousands of satisfied users.

DiDi

DiDi

With its excellent data processing capability and high performance, Trino has a wide range of applications in DiDi. Currently, Trino processes over one million queries every day.

Dune

Dune

Dune is a web-based platform that allows you to query public blockchain data and aggregate it into beautiful dashboards. DuneSQL, based on the open-source Trino engine, is the custom-built query engine for efficient analysis of blockchain data.

Flippa

Flippa

Flippa.com is the #1 marketplace to buy and sell online businesses such as e-commerce stores, blogs, affiliate sites, SaaS businesses, apps, and more. Established in 2009, we have over 15 years of historical data we are able to use for baseline comparisons in our insights reports, and to power our AI buyer matching functionality, among many other things. Trino sits between our various data sources and our business analytics, reporting, and ML infrastructure, enabling us to work at speed across vast amounts of data with minimal complexity and operational overhead. Switching to Trino from our old, traditional data lake solution has saved us thousands of dollars per month, and we’ve been using Trino in production successfully since 2022. We continue to double down on our usage and adoption of Trino and would absolutely recommend it!

ForePaaS

ForePaaS

ForePaaS leverages Trino as its main query engine technology. ForePaaS provides a lakehouse platform that can run on multiple open-source storage engines. The Trino-powered query engine allows users to run unified analytics and BI at petabyte scale on their lakehouse and directly-queryable data sources at lightning speeds. Trino also enforces the automated data access control so that analytics end-users only see the data relevant to them.

Gett

Gett

As the leading ground travel platform for businesses, Gett uses Trino as the core SQL engine for all data pipelines. Extracting data from heterogeneous sources, transforming it with the rich analytical functions and loading results to the visualisation tools to get wiser business decisions.

Google Cloud

Google Cloud

Google Cloud is widely used for deploying and running Trino. Dataproc supports embedding Trino for your usage in combination with other systems on Google Cloud. Trino also supports connecting to numerous data sources on Google Cloud including Google BigQuery.

Index Exchange

Index Exchange

Index Exchange is a global advertising marketplace that generates and processes trillions of events per day. We are leveraging Trino to complement our reporting infrastructure in a few ways self-service data analysis of aggregated data, internal BI reporting, and as an access point for investigation of event-level records. As the exchange grows, so too will Trino as one of our critical architectural solutions.

Jampp

Jampp

Jampp employs Trino as the main interface between its Data Warehouse and the rest of the company, using it to feed their predictive algorithms as well as performing analytic queries, ETLs and monitoring data quality.

JioSaavn

JioSaavn

At JioSaavn, Trino plays a pivotal role in the data engineering stack and is the central integrated hub for all the data needs. It is used in ETL processing since its performance is much better than Hive and Spark for certain use cases. Trino analyzes billions of events logged everyday and generates insights with adhoc analytical queries. It is also very useful to query data from various databases like MongoDB, Cassandra, and MySQL. Trino is also extensively used to power BI tools and dashboards from both ETL pipeline data in ORC and Parquet files and real-time data using the Pinot and Druid connector.

LINE

LINE

Trino is a core piece of LINE’s BI tools and interactive analysis workflow. It is being actively used in areas such as KPI and system metrics monitoring, as well as data analytics for large datasets. Trino is the preferred tool for our ML engineers and data scientists to better understand the data on our datalake through fast, interactive queries.

LinkedIn

LinkedIn

Trino is the de-facto ad-hoc and interactive analytics engine at LinkedIn that enables fast exploration and analysis of data stored in massive data lake. Every month thousands of users from across the company issue millions of queries scanning 100s of petabytes of data from a host of clients, including programmatic access from internal applications.

Microsoft Azure

Microsoft Azure

Microsoft Azure is widely used for deploying and running Trino. Azure HDInsights on AKS embeds Trino for your usage in combination with other systems on Azure. Trino also supports connecting to numerous data sources on Azure.

Myntra

Myntra

Myntra is among India’s leading e-commerce platforms for fashion, beauty, and lifestyle. An integral part of the Flipkart Group, Myntra brings together technology and fashion to create the best experience for its fashion-forward customers, helping them look good. In alignment with our data-first organizational culture, Trino has been an incredible partner in our efforts to democratize data. Our teams run thousands of queries every day on petabytes of data right out of our in-house datalake. It has grown to be a critical component within our Myntra Data Platform (MDP) powering a wide array of workloads ranging from ad-hoc to batch queries.

Naver

Naver

Naver Corporation is a global ICT company, providing South Korea’s number one search portal “NAVER”. Naver uses Trino for data analytics and business intelligence (BI) purposes. Trino provides data scientists with quick access to their data lake, via a standard SQL interface. Trino process massive amounts of data from different sources, such as user logs and clickstream data, in real-time.

Nielsen

Nielsen

Nielsen Media is the market leader for measuring audiences, predicting outcomes and powering contents. Nielsen uses Trino as a query engine over its petabyte-scale data lake, serving batch and interactive use cases for products, modelling, research and testing. Trino’s open source nature also allows Nielsen to customize it to handle a wide spectrum of business needs.

NTT Communications

NTT Communications

NTT Communications’ telecom and internet services are powered by analysis on Trino. Trino enables interactive analytics over tables of terabytes in size which drive BI dashboards. The seamless feature to connect multiple data sources and query interactiveness are some of the driving factors that make Trino a pivotal technology of data analysis at NTT Communications.

Peaka

Peaka

Peaka is a Zero-ETL data integration platform. It integrates more than 300 SaaS tools, relational and NoSQL databases, and allows users to query them. Peaka includes the ability to query this variety of data sources for a single source of truth without moving it out of its original location. It supports caching on Iceberg, streaming ingestion and ability to create instant replicas of data sources on Iceberg.

Trino is at the heart of Peaka and enables many of these features.

Plural

Plural

Plural creates the future of open-source infrastructure. Plural is an open-source, self-hosting framework that allows you to deploy Trino on your own cloud infrastructure.

Plural provides a flexible, scalable solution to application delivery that gives you the autonomy you need to build and reconfigure the software you use freely. Plural includes support for Trino. It provides automated upgrade delivery, Terraform/Helm management, interactive runbooks, autoscaling, and all the features you’d expect out of a managed service for free. Additionally, Plural manages all the complexities with deployment on Kubernetes, providing sane defaults and interfaces for first-time K8s users and experienced users alike. Your access to your infrastructure as code is completely transparent, so all your configuration is fully ejectable from the Plural platform.

Raft

Raft

Trino provides Raft with a distributed query engine that can be deployed to the most secure networks of the Department of Defense. We work in an environment which data is highly siloed by nature. Trino provides a scalable solution to drive decisions from data coming from the edge which can connect to virtually any source. It is key to our strategy of deploying a Data Fabric, reducing the time to insights for our customers, and in return lower their operating costs.

Rapido

Rapido

As India’s largest bike taxi service, Rapido uses Trino to provide SQL query federation across various data sources, and as SQL engine in various data pipelines. It is being actively used in areas such as KPI and system metrics monitoring, as well as data analytics for large datasets. Trino is a core piece of Rapido’s data platform to provide interactive analysis workflow and visualisation with Metabase and Superset.

Razorpay

Razorpay

We chose Trino because it is fast, scalable, and integrates well within our ecosystem. We use Apache Spark and Hudi to replicate data from OLTP databases and ingest events emitted by microservices in our AWS S3 data lake while using Hive Metastore to store table definitions and metadata. Trino made it easy to query these raw tables and derive insights from them as our users were already comfortable with SQL.

Resurface

Resurface

Resurface uses Trino to turn every API call into a durable, observable transaction. Resurface makes it easy to create a complete system of record for your API calls, to speed escalations, improve CX, and enable data science, without any third-party data transfer. Trino gives us the flexibility to support very small or very large installations, and integrate with other customer systems, behind a consistent SQL interface. We’re proud to be developing purpose-built connectors for Trino that deliver consistently fast performance at any scale and budget.

Shopify

Shopify

At Shopify, Trino provides their data scientists with quick access to their data lake, via an industry standard SQL interface that joins and aggregates data across heterogeneous data sources. Shopify Trino clusters handle 15 Gbps and over 300 million rows of data per second.

Stackable

Stackable

Stackable provides you with a curated selection of the best open source data apps like Apache Kafka, Apache Druid, Trino and Apache Spark. Store, process and visualize your data with the latest versions. All data apps work together seamlessly, and can be added or removed in no time. Based on Kubernetes, it runs everywhere – on prem or in the cloud.

Use the Stackable data platform to create unique and enterprise-class data architectures. For example, it supports modern data warehouses, data lakes, event streaming, machine learning or data mesh use cases.

Stackable offers a Trino operator for Kubernetes to easily spin up, scale, and maintain one or many Trino clusters that are configured to be secure and performant out of the box.

Starburst

Starburst

Starburst offers a full-featured data lake analytics platform, built on open source Trino. The platform includes the capabilities needed to discover, organize, and consume data without the need for time-consuming and costly migrations. It supports the lake as the center of gravity, and additionally accesses data outside the lake when needed. Starburst founders and employess include the Trino co-creators, numerous maintainers, contributors and other experts, ready to support you.

Starburst Galaxy is a fully managed data lake analytics platform designed for large and complex data sets in and around your cloud data lake. It is the easiest and fastest way for you to start running queries at interactive speeds across data sources using the business intelligence and analytics tools you already know.

Starburst Galaxy takes just minutes to set up and takes care of the heavy lifting of designing, provisioning, maintaining, and securing your Trino infrastructure. In addition, Galaxy offers proprietary features such as fully managed connectors, global search, schema discovery, monitoring and metrics, and data sharing with Data Products that allow your data teams to focus on generating unique insights from your data – not managing and building analytics infrastructure.

Starburst Enterprise is a data lake analytics platform that allows organizations to access and analyze data on the lake and additional connected sources, including cloud data warehouses, legacy and on-premise databases, and modern streaming and NoSQL sources, all through a single, unified interface.

It is a fully supported, production-tested and enterprise-grade distribution of the popular open-source project Trino. Starburst Enterprise adds additional features, such as enterprise-grade security, access controls, a variety of supported connectors, improved performance, and a user-friendly interface.

With Starburst Enterprise, users can run ad-hoc and batch queries, build interactive dashboards, and perform data exploration on a variety of data sources. The platform is designed to be scalable and can handle large and complex data sets that are distributed across multiple systems.

Swrve

Swrve

Swrve uses Trino to power our behavioral insights system, allowing our customers to capture detailed data about user behavior in real time, in order to trigger personalized marketing campaigns. We needed a fast, reliable, SQL-based system supporting the level of fine-grained discovery of high-cardinality user behaviours that our customers demand, and Trino provides exactly that.

Treasure Data

Treasure Data

Trino has been at the core of the Customer Data Platform (CDP) of Treasure Data, and more than millions of SQL queries running every day with Trino are helping our customers for analyzing their business-critical datasets and making data-driven decisions.

Valerian

Valerian

Valerian uses Trino as its top choice for a distributed querying engine and has evolved the technology to shift seamlessly between large data structures/platforms and as a replacement for standard databases such as MySQL, PostgreSQL, and MongoDB. Valerian works with companies and organizations to conduct applied Research and Development (R&D) and apply modern technology to solve complex engineering challenges, even advancing the state of the art. Using Trino, Valerian was able to reduce millions in cloud spend on relational databases for clients by augmenting Trino’s use to function as a relational database on top of Object Storage with near-infinite scalability. By using Trino as both a distributed querying engine on top of a unified data lake for AI/ML and Data Analytics as well as the ACID-based transactional database layer of applications, Valerian has been able to harness Trino to create the future of Data Fabric solutions.

Are you a user too?

We are happy to include your testimonial! Contact us on the community chat, and we will get you added.