English | 中文
Linkis helps easily connect to various back-end computation/storage engines(Spark, Python, HBase ...), exposes various interfaces(REST, WebSocket, Java ...), with multi-tenancy, high performance, and resource control.
Linkis connects with computation/storage engines(Spark, Hive, Python and HBase), exposes REST/WS interface, and executes multi-language jobs(SQL, Pyspark, HiveQL and Scala), as a data middleware.
Based on the microservices architecture, Linkis provides enterprise-level features of financial-level multi-tenant isolation, resource management and access control. It also offers convenient support to manage unified variables, UDFs, functions and resource files. it is also guaranteed with sophisticated task/job lifecycle management capabilities under high-concurrency, high-performance and high-availability scenarios.
Unified Job Execution Services: A distributed REST/WebSocket service for processing scripts execution requests from user.
Available computation engines so far: Spark, Python, TiSpark, Hive and Shell.
Available languages so far: SparkSQL, Spark Scala, PySpark, R, Python, HQL and Shell.
Resource Management Services: Available for real-time control/limit of resource usage from both perspectives of amount and load for both systems and users. With dynamic charts of resource statistics, it is convenient to monitor and manage resource usage for systems and users.
Available resource types so far: Yarn queue resources, server(CPU and memory), number of concurrent instances per user.
Application Management Services: Manages global user applications, including offline batch applications, interactive query applications and real-time streaming applications. Also provides powerful reusability especially for offline and interactive applications, with complete lifecycle management which automatically releases idle applications for users.
Unified Storage Services: The generic IO architecture can quickly integrate with various storage systems and provide a unified invokable entrance. It is also highly integrated with most common data formats and easy to use.
Unified Context Services: Unite resources files of users and systems (JAR, ZIP, Properties). With unified management of arguments/variables for users, systems and engines, it is achieved that modification in random place will reflect in all the other places automatically.
Material Library: System and user-level material management, capable of sharing, transferring materials and automatic lifecycle management.
Metadata Services: Real-time display of dataset table structure and partitions.
Compared with similar systems
Linkis is under the Apache 2.0 license. See the LICENSE file for details.