-
admin
MemSQL Training Course
Overview MemSQL is an in-memory, distributed, SQL database management system for cloud and on-premises deployments. It’s a real-time data warehouse that immediately delivers insights from live and historical data. In this instructor-led, live training, participants will learn the essentials of MemSQL for development and administration. By the end of this training, participants will be able to: […]
-
admin
Apache Druid for Real-Time Data Analysis Training Course
Overview Apache Druid is an open-source, column-oriented, distributed data store written in Java. It was designed to quickly ingest massive quantities of event data and execute low-latency OLAP queries on that data. Druid is commonly used in business intelligence applications to analyze high volumes of real-time and historical data. It is also well suited for […]
-
admin
Spark Streaming with Python and Kafka Training Course
Overview Apache Spark Streaming is a scalable, open source stream processing system that allows users to process real-time data from supported sources. Spark Streaming enables fault-tolerant processing of data streams. This instructor-led, live training (online or onsite) is aimed at data engineers, data scientists, and programmers who wish to use Spark Streaming features in processing and […]
-
admin
Confluent KSQL Training Course
Overview Confluent KSQL is a stream processing framework built on top of Apache Kafka. It enables real-time data processing using SQL operations. This instructor-led, live training (online or onsite) is aimed at developers who wish to implement Apache Kafka stream processing without writing code. By the end of this training, participants will be able to: […]
-
admin
Apache Ignite for Developers Training Course
Overview Apache Ignite is an in-memory computing platform that sits between the application and data layer to improve speed, scale, and availability. In this instructor-led, live training, participants will learn the principles behind persistent and pure in-memory storage as they step through the creation of a sample in-memory computing project. By the end of this […]
-
admin
Unified Batch and Stream Processing with Apache Beam Training Course
Overview Apache Beam is an open source, unified programming model for defining and executing parallel data processing pipelines. Its power lies in its ability to run both batch and streaming pipelines, with execution being carried out by one of Beam’s supported distributed processing back-ends: Apache Apex, Apache Flink, Apache Spark, and Google Cloud Dataflow. Apache […]
-
admin
Apache Apex: Processing Big Data-in-Motion Training Course
Overview Apache Apex is a YARN-native platform that unifies stream and batch processing. It processes big data-in-motion in a way that is scalable, performant, fault-tolerant, stateful, secure, distributed, and easily operable. This instructor-led, live training introduces Apache Apex’s unified stream processing architecture, and walks participants through the creation of a distributed application using Apex on […]
-
admin
Apache Flink Fundamentals Training Course
Overview Apache Flink is an open-source framework for scalable stream and batch data processing. This instructor-led, live training introduces the principles and approaches behind distributed stream and batch data processing, and walks participants through the creation of a real-time data streaming application in Apache Flink. By the end of this training, participants will be able […]
-
admin
Apache Kafka for Python Programmers Training Course
Overview Apache Kafka is an open-source stream-processing platform that offers fast, reliable, low-latency handling of real-time data analytics. Apache Kafka can be integrated with available programming languages such as Python. This instructor-led, live training (online or onsite) is aimed at data engineers, data scientists, and programmers who wish to use Apache Kafka […]
-
admin
Building Kafka Solutions with Confluent Training Course
Overview This instructor-led, live training (online or onsite) is aimed at engineers who wish to use Confluent (a distribution of Kafka) to build and manage a real-time data processing platform for their applications. By the end of this training, participants will be able to: Install and configure Confluent Platform. Use Confluent’s management tools and services […]
