Learn about organizations who use Trino, help with development, and are generally part of our community!
AdLibertas relies on Trino to power comprehensive user-level Audience Reporting for app developers. Trino allows our customers to create easy-to-use and visual user LTV reports across their large app user datasets. Since the scale and scope of these reports can widely vary, Trino’s ability to quickly scale on distributed systems and query across our data lake makes access fast and affordable for all sizes of clients without messy and expensive ETL.
Amazon Web Services (AWS) is widely used for deploying and running Trino. Amazon Athena or Amazon EMR embed Trino for your usage.
Amazon Athena is a serverless, interactive analytics service built on open-source frameworks, supporting open-table and file formats. Athena provides a simplified, flexible way to analyze petabytes of data where it lives. Analyze data or build applications from an Amazon Simple Storage Service (S3) data lake and 30 data sources, including on-premises data sources or other cloud systems using SQL or Python. Athena is built on open-source Trino and Presto engines and Apache Spark frameworks, with no provisioning or configuration effort required.
Amazon EMR provides a managed Hadoop framework that makes it easy, fast, and cost-effective to process vast amounts of data across dynamically scalable Amazon EC2 instances. With EMR, you can launch a large Trino cluster in minutes. You don’t need to worry about node provisioning, cluster setup or tuning. Using Trino on EMR provides these benefits to customers:
Ampool is powered by Trino to provide SQL query federation across 200+ data sources. Combined with Ampool’s high-performance in-memory distributed caching engine, a turbocharged Trino query engine can speedup BI & ad-hoc analytical workloads by an order of magnitude. Ampool is built from the ground up to operate seamlessly in hybrid and multi-cloud environments.
At Bazaar, we are working to digitize Pakistan’s retail sector. Trino is the engine that powers the Bazaar Analytics Platform (aka Buraq). Buraq handles petabytes of data from a diverse set of data stores from Hive, S3 Lakes, RDS and Redshift. Trino allows us to build a scalable lake house architecture, that drives real time insights and machine learning workloads helping us to provide best in class customer experience.
The Chasing Securities Data Center uses Trino as the crucial general data query and data processing engine. Thousands of jobs per day use Trino to complete data processing from the ODS layer to the EDW layer, and they use Trino as their Ad-Hoc query service for the company. They also use Trino as the backend query engine for the data service API provided to other departments.
Chasing Securities also uses Trino to connect to nearly 100 data sources as the federal query engine, including Oracle, SQL Server, MySQL, HBase, and many more, and have developed hundreds of security industry-related UDF functions on Trino, which are used to add to meet various business needs.
Cloud Chef Labs’s DataRoaster provides data platforms running on Kubernetes to build a data lake and AI-based analytics platform. Trino is one of the components provided by DataRoaster as an interactive query engine to query the data lake. Users run Trino in DataRoaster as a cost-effective alternative to serverless services provided by other cloud providers.
The Data Appeal Company is a data provider that analyzes geospatial, business information, and customer perception data for any point of interest worldwide. Data Appeal leverages Trino high-performance capabilities to power its data analysis platform, providing an interactive location intelligence visualization for customers. Trino also enables our data scientists to query our various datasources with complete transparency, which in turn yields applicable, data-driven insights.
At datasapiens, we help your company to become a data-driven organization. Trino is at the core of that endeavour and is our primary data retrieval engine. We use it on top of various data sources from diverse industries including retail, gastronomy, travel and others. Trino unifies a diverse set of data stores such as Apache Hive, PostgreSQL, Apache Pinot, and Redshift. It allows us to build a scalable lakehouse architecture providing insights to thousands of satisfied users.
With its excellent data processing capability and high performance, Trino has a wide range of applications in DiDi. Currently, Trino processes over one million queries every day.
Dune is a web-based platform that allows you to query public blockchain data and aggregate it into beautiful dashboards. DuneSQL, based on the open-source Trino engine, is the custom-built query engine for efficient analysis of blockchain data.
ForePaaS leverages Trino as its main query engine technology. ForePaaS provides a lakehouse platform that can run on multiple open-source storage engines. The Trino-powered query engine allows users to run unified analytics and BI at petabyte scale on their lakehouse and directly-queryable data sources at lightning speeds. Trino also enforces the automated data access control so that analytics end-users only see the data relevant to them.
As the leading ground travel platform for businesses, Gett uses Trino as the core SQL engine for all data pipelines. Extracting data from heterogeneous sources, transforming it with the rich analytical functions and loading results to the visualisation tools to get wiser business decisions.
Google Cloud is widely used for deploying and running Trino. Dataproc supports embedding Trino for your usage in combination with other systems on Google Cloud. Trino also supports connecting to numerous data sources on Google Cloud including Google BigQuery.
Jampp employs Trino as the main interface between its Data Warehouse and the rest of the company, using it to feed their predictive algorithms as well as performing analytic queries, ETLs and monitoring data quality.
At JioSaavn, Trino plays a pivotal role in the data engineering stack and is the central integrated hub for all the data needs. It is used in ETL processing since its performance is much better than Hive and Spark for certain use cases. Trino analyzes billions of events logged everyday and generates insights with adhoc analytical queries. It is also very useful to query data from various databases like MongoDB, Cassandra, and MySQL. Trino is also extensively used to power BI tools and dashboards from both ETL pipeline data in ORC and Parquet files and real-time data using the Pinot and Druid connector.
Trino is a core piece of LINE’s BI tools and interactive analysis workflow. It is being actively used in areas such as KPI and system metrics monitoring, as well as data analytics for large datasets. Trino is the preferred tool for our ML engineers and data scientists to better understand the data on our datalake through fast, interactive queries.
Trino is the de-facto ad-hoc and interactive analytics engine at LinkedIn that enables fast exploration and analysis of data stored in massive data lake. Every month thousands of users from across the company issue millions of queries scanning 100s of petabytes of data from a host of clients, including programmatic access from internal applications.
Microsoft Azure is widely used for deploying and running Trino. Azure HDInsights on AKS embeds Trino for your usage in combination with other systems on Azure. Trino also supports connecting to numerous data sources on Azure.
Myntra is among India’s leading e-commerce platforms for fashion, beauty, and lifestyle. An integral part of the Flipkart Group, Myntra brings together technology and fashion to create the best experience for its fashion-forward customers, helping them look good. In alignment with our data-first organizational culture, Trino has been an incredible partner in our efforts to democratize data. Our teams run thousands of queries every day on petabytes of data right out of our in-house datalake. It has grown to be a critical component within our Myntra Data Platform (MDP) powering a wide array of workloads ranging from ad-hoc to batch queries.
Naver Corporation is a global ICT company, providing South Korea’s number one search portal “NAVER”. Naver uses Trino for data analytics and business intelligence (BI) purposes. Trino provides data scientists with quick access to their data lake, via a standard SQL interface. Trino process massive amounts of data from different sources, such as user logs and clickstream data, in real-time.
Nielsen Media is the market leader for measuring audiences, predicting outcomes and powering contents. Nielsen uses Trino as a query engine over its petabyte-scale data lake, serving batch and interactive use cases for products, modelling, research and testing. Trino’s open source nature also allows Nielsen to customize it to handle a wide spectrum of business needs.
NTT Communications’ telecom and internet services are powered by analysis on Trino. Trino enables interactive analytics over tables of terabytes in size which drive BI dashboards. The seamless feature to connect multiple data sources and query interactiveness are some of the driving factors that make Trino a pivotal technology of data analysis at NTT Communications.
Plural creates the future of open-source infrastructure. Plural is an open-source, self-hosting framework that allows you to deploy Trino on your own cloud infrastructure.
Plural provides a flexible, scalable solution to application delivery that gives you the autonomy you need to build and reconfigure the software you use freely. Plural includes support for Trino. It provides automated upgrade delivery, Terraform/Helm management, interactive runbooks, autoscaling, and all the features you’d expect out of a managed service for free. Additionally, Plural manages all the complexities with deployment on Kubernetes, providing sane defaults and interfaces for first-time K8s users and experienced users alike. Your access to your infrastructure as code is completely transparent, so all your configuration is fully ejectable from the Plural platform.
Trino provides Raft with a distributed query engine that can be deployed to the most secure networks of the Department of Defense. We work in an environment which data is highly siloed by nature. Trino provides a scalable solution to drive decisions from data coming from the edge which can connect to virtually any source. It is key to our strategy of deploying a Data Fabric, reducing the time to insights for our customers, and in return lower their operating costs.
As India’s largest bike taxi service, Rapido uses Trino to provide SQL query federation across various data sources, and as SQL engine in various data pipelines. It is being actively used in areas such as KPI and system metrics monitoring, as well as data analytics for large datasets. Trino is a core piece of Rapido’s data platform to provide interactive analysis workflow and visualisation with Metabase and Superset.
We chose Trino because it is fast, scalable, and integrates well within our ecosystem. We use Apache Spark and Hudi to replicate data from OLTP databases and ingest events emitted by microservices in our AWS S3 data lake while using Hive Metastore to store table definitions and metadata. Trino made it easy to query these raw tables and derive insights from them as our users were already comfortable with SQL.
Resurface uses Trino to turn every API call into a durable, observable transaction. Resurface makes it easy to create a complete system of record for your API calls, to speed escalations, improve CX, and enable data science, without any third-party data transfer. Trino gives us the flexibility to support very small or very large installations, and integrate with other customer systems, behind a consistent SQL interface. We’re proud to be developing purpose-built connectors for Trino that deliver consistently fast performance at any scale and budget.
At Shopify, Trino provides their data scientists with quick access to their data lake, via an industry standard SQL interface that joins and aggregates data across heterogeneous data sources. Shopify Trino clusters handle 15 Gbps and over 300 million rows of data per second.
Starburst offers a full-featured data lake analytics platform, built on open source Trino. The platform includes the capabilities needed to discover, organize, and consume data without the need for time-consuming and costly migrations. It supports the lake as the center of gravity, and additionally accesses data outside the lake when needed. Starburst founders and employess include the Trino co-creators, numerous maintainers, contributors and other experts, ready to support you.
Starburst Galaxy is a fully managed data lake analytics platform designed for large and complex data sets in and around your cloud data lake. It is the easiest and fastest way for you to start running queries at interactive speeds across data sources using the business intelligence and analytics tools you already know.
Starburst Galaxy takes just minutes to set up and takes care of the heavy lifting of designing, provisioning, maintaining, and securing your Trino infrastructure. In addition, Galaxy offers proprietary features such as fully managed connectors, global search, schema discovery, monitoring and metrics, and data sharing with Data Products that allow your data teams to focus on generating unique insights from your data – not managing and building analytics infrastructure.
Starburst Enterprise is a data lake analytics platform that allows organizations to access and analyze data on the lake and additional connected sources, including cloud data warehouses, legacy and on-premise databases, and modern streaming and NoSQL sources, all through a single, unified interface.
It is a fully supported, production-tested and enterprise-grade distribution of the popular open-source project Trino. Starburst Enterprise adds additional features, such as enterprise-grade security, access controls, a variety of supported connectors, improved performance, and a user-friendly interface.
With Starburst Enterprise, users can run ad-hoc and batch queries, build interactive dashboards, and perform data exploration on a variety of data sources. The platform is designed to be scalable and can handle large and complex data sets that are distributed across multiple systems.
Swrve uses Trino to power our behavioral insights system, allowing our customers to capture detailed data about user behavior in real time, in order to trigger personalized marketing campaigns. We needed a fast, reliable, SQL-based system supporting the level of fine-grained discovery of high-cardinality user behaviours that our customers demand, and Trino provides exactly that.
Trino has been at the core of the Customer Data Platform (CDP) of Treasure Data, and more than millions of SQL queries running every day with Trino are helping our customers for analyzing their business-critical datasets and making data-driven decisions.
We are happy to include your testimonial! Contact us on the community chat, and we will get you added.