Introducing Trino, a distributed SQL query engine for big data.
The project's GitHub page describes it as follows:
Trino is a fast distributed SQL query engine for big data analytics.
In short, just remember it as a distributed SQL query engine for big data.
The build requirements are as follows:
Build requirements
- Mac OS X or Linux
- Java 11.0.11+, 64-bit
- Docker
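The requirements above can be checked before starting a build. The following is a minimal sketch, not part of the Trino repository: the helper name `version_at_least` is hypothetical, it only compares the first three numeric fields of the version string, and it does not verify that the JVM is 64-bit.

```shell
#!/bin/sh
# Sketch: check the local machine against the listed build requirements.
# version_at_least is a hypothetical helper, not something Trino ships.

# version_at_least VERSION MINIMUM
# Succeeds when VERSION >= MINIMUM, comparing up to three dot-separated fields.
version_at_least() {
  awk -v v="$1" -v m="$2" 'BEGIN {
    split(v, a, "."); split(m, b, ".")
    for (i = 1; i <= 3; i++) {
      x = a[i] + 0; y = b[i] + 0   # missing or non-numeric fields count as 0
      if (x > y) exit 0
      if (x < y) exit 1
    }
    exit 0
  }'
}

# Report on Java: extract the quoted version from `java -version`.
if command -v java >/dev/null 2>&1; then
  found=$(java -version 2>&1 | sed -n 's/.* version "\([^"]*\)".*/\1/p' | head -n 1)
  if version_at_least "$found" 11.0.11; then
    echo "java $found: ok"
  else
    echo "java $found: older than 11.0.11"
  fi
else
  echo "java: not found"
fi

# Report on Docker: only checks that the command is on PATH.
if command -v docker >/dev/null 2>&1; then
  echo "docker: ok"
else
  echo "docker: not found"
fi
```

The script only prints a report and never exits with an error, so it can be pasted into a terminal safely.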
Next is the section on building Trino.
Building Trino
Trino is a standard Maven project. Simply run the following command from the project root directory:
./mvnw clean install -DskipTests
On the first build, Maven downloads all the dependencies from the internet and caches them in the local repository (~/.m2/repository), which can take a while, depending on your connection speed. Subsequent builds are faster.
Trino has a comprehensive set of tests that take a considerable amount of time to run, and are thus disabled by the above command. These tests are run by the CI system when you submit a pull request. We recommend only running tests locally for the areas of code that you change.
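To run tests only for the area you changed, Maven's standard `-pl` (project list) and Surefire's `-Dtest` options can narrow the run to one module or one test class. The sketch below only prints the command so it can be reviewed first. The module path `plugin/trino-tpch` and the class name `TestExample` are illustrative placeholders, and `build_test_cmd` is a hypothetical helper name.

```shell
#!/bin/sh
# Sketch: print the Maven command that runs tests for a single module.
# Module path and test class are placeholders; substitute your own.

# build_test_cmd MODULE [TEST_CLASS]
build_test_cmd() {
  module="$1"          # module directory relative to the project root
  test_class="${2:-}"  # optional: limit the run to one test class
  if [ -n "$test_class" ]; then
    printf './mvnw test -pl %s -Dtest=%s\n' "$module" "$test_class"
  else
    printf './mvnw test -pl %s\n' "$module"
  fi
}

# Print the command for review; run it from the project root.
build_test_cmd plugin/trino-tpch TestExample
```

This assumes the full `./mvnw clean install -DskipTests` build has already completed, so the module's dependencies are available in the local repository.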
Next is the section on running Trino.
Running Trino in your IDE
Overview
After building Trino for the first time, you can load the project into your IDE and run the server. We recommend using IntelliJ IDEA. Because Trino is a standard Maven project, you easily can import it into your IDE. In IntelliJ, choose Open Project from the Quick Start box or choose Open from the File menu and select the root pom.xml file.
After opening the project in IntelliJ, double check that the Java SDK is properly configured for the project:
- Open the File menu and select Project Structure
- In the SDKs section, ensure that JDK 11 is selected (create one if none exist)
- In the Project section, ensure the Project language level is set to 11
Running a testing server
The simplest way to run Trino for development is to run the TpchQueryRunner class. It will start a development version of the server that is configured with the TPCH connector. You can then use the CLI to execute queries against this server. Many other connectors have their own *QueryRunner class that you can use when working on a specific connector.
Running the full server
Trino comes with sample configuration that should work out-of-the-box for development. Use the following options to create a run configuration:
- Main Class: io.trino.server.DevelopmentServer
- VM Options: -ea -Dconfig=etc/config.properties -Dlog.levels-file=etc/log.properties -Djdk.attach.allowAttachSelf=true
- Working directory: $MODULE_DIR$
- Use classpath of module: trino-server-dev
The working directory should be the trino-server-dev subdirectory. In IntelliJ, using $MODULE_DIR$ accomplishes this automatically.
If VM options doesn't exist in the dialog, you need to select Modify options and enable Add VM options.
Running the CLI
Start the CLI to connect to the server and run SQL queries:
client/trino-cli/target/trino-cli-*-executable.jar
Run a query to see the nodes in the cluster:
SELECT * FROM system.runtime.nodes;
Run a query against the TPCH connector:
SELECT * FROM tpch.tiny.region;
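The same queries can also be run without the interactive prompt by using the CLI's `--server` and `--execute` options. The following is a rough sketch: `trino_query` is a hypothetical wrapper, and the address `localhost:8080` is an assumption about the development configuration, so check the port in `etc/config.properties` before relying on it.

```shell
#!/bin/sh
# Sketch: run one SQL statement through the CLI jar produced by the build.
# trino_query is a hypothetical wrapper; localhost:8080 is an assumed address.

# trino_query SQL
trino_query() {
  sql="$1"
  server="${TRINO_SERVER:-localhost:8080}"
  # Expand the glob for the built jar; "$1" becomes the first match.
  set -- client/trino-cli/target/trino-cli-*-executable.jar
  if [ ! -f "$1" ]; then
    echo "CLI jar not found; build the project first" >&2
    return 1
  fi
  "$1" --server "$server" --execute "$sql"
}

# Example, run from the project root with the server already started:
#   trino_query 'SELECT * FROM tpch.tiny.region'
```

Set the `TRINO_SERVER` environment variable to point the wrapper at a different host or port.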
Finally, Trino's key characteristics can be summarized as follows:
- The new name for PrestoSQL
- Speed, Scale, Simplicity, Versatile
- In-place Analysis
- Query Federation
- Runs Everywhere: On-prem, AWS, Azure, GCP, and more
For more details, please refer to the Trino GitHub page.
That's all for today's post.
Thank you, as always, for reading.