Sclera is an open platform that can be extended, as needed, with extensions that connect Sclera with multiple data sources, platforms and analytics libraries. The components documentation describes the pre-built extensions that are packaged with the Sclera installation.

These custom connectors can be built using the Sclera Extensions Software Development Kit (SDK). Sclera provides separate Scala and Java variants of the SDK – the Scala SDK is provided natively by the sclera-core component, while the Java SDK is a wrapper over the native Scala SDK that takes care of Java-Scala interoperability.

The SDK documentation, with illustrative source code for sample extensions, is publicly available on GitHub:

The following type of components are covered:

Data Access Connectors

These connectors enable ingestion of data from arbitrary sources in a ScleraSQL query. You only need to format the data as rows of a table, and Sclera will take care of evaluating streaming SQL queries on the same – these queries can include transforming, filtering and aggregating this data, as well as joining this data with data ingested from other connectors, or with data in tables stored in other data stores.

Sclera - Stock Ticker Connector, Sclera - CSV Connector, and Sclera - Text Files Connector are built using this SDK. For examples of how these connectors are used in Sclera, please refer to the SQL documentation.

To learn more about building data access connectors, please see the Sclera Datasource Extensions SDK documentation.

Database System Connectors

These connectors interface Sclera with an arbitrary database system, relational or non-relational, providing access to the underlying data, and also enable Sclera to push down computations in relevant parts user queries and commands on the interfaced database system.

Sclera - Oracle Connector, Sclera - MySQL Connector, Sclera - PostgreSQL Connector, and Sclera - Apache HBase Connector are built using this SDK. For examples of how these connectors are used in Sclera, please refer to the data platform connection reference documentation.

To learn more about building database system connectors, please see the Sclera Database System Extensions SDK documentation.

Machine Learning Library Connectors

These connectors interface Sclera with an arbitrary machine learning libraries, which provide implementations of classification, clustering and/or association rules mining. These connectors handle the invocation of the underlying library for training the models, and using them to label data in the query processing pipeline.

Sclera - Weka Connector and Sclera - Apache Mahout Connector are built using this SDK. For examples of how these connectors are used in Sclera, please refer to the SQL documentation.

To learn more about building machine learning library connectors, please see the Sclera Machine Learning Library Extensions SDK documentation.

Text Analytics Library Connectors

These connectors interface Sclera with an arbitrary text analytics libraries, to perform specific text analytics tasks. These connectors handle the invocation of the underlying library to process text data in table columns, in the query processing pipeline.

Sclera - Apache OpenNLP Connector is built using this SDK. For examples on how these connectors are used in Sclera, please refer to the SQL documentation.

To learn more about building text analytics library connectors, please see the Sclera Text Analytics Library Extensions SDK documentation.


Related Documentation