Trino - A Distributed SQL Query Engine for Big Data (github.com/trinodb)

Today I'd like to introduce Trino, a distributed SQL query engine for big data.

On GitHub, the project describes itself as follows:

Trino is a fast distributed SQL query engine for big data analytics.

In short, just remember it as a distributed SQL query engine for big data.

Here are the build requirements:

Build requirements

  • Mac OS X or Linux
  • Java 11.0.11+, 64-bit
  • Docker
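
To make the Java requirement concrete, here is a minimal sketch that checks a `java -version` banner against the 11.0.11 minimum. The function names are my own, not part of Trino, and it only compares version numbers; it does not check the 64-bit requirement.

```python
import re

# Minimum Java version taken from the build requirements above.
MIN_JAVA_VERSION = (11, 0, 11)


def parse_java_version(version_output):
    """Return (major, minor, patch) parsed from `java -version` output."""
    match = re.search(r'version "(\d+)(?:\.(\d+))?(?:\.(\d+))?', version_output)
    if match is None:
        raise ValueError("no Java version string found")
    # Missing components (for example a bare "17") are treated as zero.
    return tuple(int(part or 0) for part in match.groups())


def meets_build_requirement(version_output, minimum=MIN_JAVA_VERSION):
    """Return True if the reported Java version is at least `minimum`."""
    return parse_java_version(version_output) >= minimum
```

Note that `java -version` writes its banner to stderr, so on a real machine you would pass in something like `subprocess.run(["java", "-version"], capture_output=True, text=True).stderr`.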

Next is the section on building Trino.

Building Trino

Trino is a standard Maven project. Simply run the following command from the project root directory:

./mvnw clean install -DskipTests

On the first build, Maven downloads all the dependencies from the internet and caches them in the local repository (~/.m2/repository), which can take a while, depending on your connection speed. Subsequent builds are faster.

Trino has a comprehensive set of tests that take a considerable amount of time to run, and are thus disabled by the above command. These tests are run by the CI system when you submit a pull request. We recommend only running tests locally for the areas of code that you change.
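
One way to follow the "only test what you changed" advice is to map changed files to their Maven modules and pass those to `-pl`. The sketch below is just an illustration under my own assumptions: the helper names are hypothetical, a module is taken to be the nearest parent directory containing a `pom.xml`, and files that sit directly in the repository root are ignored.

```python
from pathlib import Path


def modules_for_changes(repo_root, changed_files):
    """Map changed files to the nearest enclosing Maven module directory."""
    root = Path(repo_root).resolve()
    modules = set()
    for changed in changed_files:
        directory = (root / changed).resolve().parent
        # Walk upward until a directory containing pom.xml is found.
        # Stops at the repository root, so root-level files are skipped.
        while directory != root and root in directory.parents:
            if (directory / "pom.xml").is_file():
                modules.add(directory.relative_to(root).as_posix())
                break
            directory = directory.parent
    return sorted(modules)


def maven_test_command(modules):
    """Build a ./mvnw invocation that tests only the given modules."""
    if not modules:
        raise ValueError("no modules to test")
    # -pl takes a comma-separated list of module paths.
    return ["./mvnw", "test", "-pl", ",".join(modules)]
```

The list of changed files could come from something like `git diff --name-only`, and the resulting command can be handed to `subprocess.run` from the project root.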

And here is the section on running Trino.

Running Trino in your IDE

Overview

After building Trino for the first time, you can load the project into your IDE and run the server. We recommend using IntelliJ IDEA. Because Trino is a standard Maven project, you can easily import it into your IDE. In IntelliJ, choose Open Project from the Quick Start box or choose Open from the File menu and select the root pom.xml file.

After opening the project in IntelliJ, double check that the Java SDK is properly configured for the project:

  • Open the File menu and select Project Structure
  • In the SDKs section, ensure that JDK 11 is selected (create one if none exist)
  • In the Project section, ensure the Project language level is set to 11

Running a testing server

The simplest way to run Trino for development is to run the TpchQueryRunner class. It will start a development version of the server that is configured with the TPCH connector. You can then use the CLI to execute queries against this server. Many other connectors have their own *QueryRunner class that you can use when working on a specific connector.

Running the full server

Trino comes with sample configuration that should work out-of-the-box for development. Use the following options to create a run configuration:

  • Main Class: io.trino.server.DevelopmentServer
  • VM Options: -ea -Dconfig=etc/config.properties -Dlog.levels-file=etc/log.properties -Djdk.attach.allowAttachSelf=true
  • Working directory: $MODULE_DIR$
  • Use classpath of module: trino-server-dev

The working directory should be the trino-server-dev subdirectory. In IntelliJ, using $MODULE_DIR$ accomplishes this automatically.

If VM options doesn't exist in the dialog, you need to select Modify options and enable Add VM options.

Running the CLI

Start the CLI to connect to the server and run SQL queries:

client/trino-cli/target/trino-cli-*-executable.jar

Run a query to see the nodes in the cluster:

SELECT * FROM system.runtime.nodes;

Run a query against the TPCH connector:

SELECT * FROM tpch.tiny.region;
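
If you would rather run these queries from a script than type them interactively, the CLI can also be called non-interactively. Below is a rough sketch that wraps the locally built CLI jar. The helper names are mine, and it assumes the development server is listening on localhost:8080 and that the jar from the build step is executable.

```python
import csv
import glob
import io
import subprocess

# Path pattern of the CLI jar produced by the build, as shown above.
CLI_JAR_PATTERN = "client/trino-cli/target/trino-cli-*-executable.jar"


def cli_command(cli_path, sql, server="localhost:8080"):
    """Build the argument list for a one-shot CLI query with CSV output."""
    return [
        cli_path,
        "--server", server,
        "--output-format", "CSV_HEADER",
        "--execute", sql,
    ]


def parse_csv_rows(output):
    """Turn CSV_HEADER output into a list of dicts keyed by column name."""
    return list(csv.DictReader(io.StringIO(output)))


def run_query(sql, server="localhost:8080"):
    """Run `sql` through the locally built CLI and return the rows."""
    matches = sorted(glob.glob(CLI_JAR_PATTERN))
    if not matches:
        raise FileNotFoundError("CLI jar not found; build the project first")
    result = subprocess.run(
        cli_command(matches[0], sql, server),
        capture_output=True, text=True, check=True,
    )
    return parse_csv_rows(result.stdout)
```

With the server running, `run_query("SELECT * FROM tpch.tiny.region")` called from the project root should return one dict per region row, with all values as strings.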

 

Finally, here is a quick summary of Trino's key characteristics:

- The new name for PrestoSQL  
- Speed, Scale, Simplicity, Versatility  
- In-place Analysis  
- Query Federation  
- Runs Everywhere: On-prem, AWS, Azure, GCP, and more

For more details, please see the project's GitHub page (github.com/trinodb).

That's it for today's post.
Thank you, as always, for reading!
