-
admin
Stream Processing with Kafka Streams Training Course
Overview Kafka Streams is a client-side library for building applications and microservices whose data is passed to and from a Kafka messaging system. Traditionally, Apache Kafka has relied on Apache Spark or Apache Storm to process data between message producers and consumers. By calling the Kafka Streams API from within an application, data can be […]
-
admin
Apache Ambari: Efficiently Manage Hadoop Clusters Training Course
Overview Apache Ambari is an open-source management platform for provisioning, managing, monitoring and securing Apache Hadoop clusters. In this instructor-led live training participants will learn the management tools and practices provided by Ambari to successfully manage Hadoop clusters. By the end of this training, participants will be able to: Set up a live Big Data […]
-
admin
Impala for Business Intelligence Training Course
Overview Cloudera Impala is an open source massively parallel processing (MPP) SQL query engine for Apache Hadoop clusters. Impala enables users to issue low-latency SQL queries to data stored in Hadoop Distributed File System and Apache Hbase without requiring data movement or transformation. Audience This course is aimed at analysts and data scientists performing analysis on data stored in Hadoop […]
-
admin
Moving Data from MySQL to Hadoop with Sqoop Training Course
Overview Sqoop is an open source software tool for transfering data between Hadoop and relational databases or mainframes. It can be used to import data from a relational database management system (RDBMS) such as MySQL or Oracle or a mainframe into the Hadoop Distributed File System (HDFS). Thereafter, the data can be transformed in Hadoop […]
-
admin
Hadoop with Python Training Course
Overview Hadoop is a popular Big Data processing framework. Python is a high-level programming language famous for its clear syntax and code readibility. In this instructor-led, live training, participants will learn how to work with Hadoop, MapReduce, Pig, and Spark using Python as they step through multiple examples and use cases. By the end of […]
-
admin
Apache NiFi for Developers Training Course
Overview Apache NiFi (Hortonworks DataFlow) is a real-time integrated data logistics and simple event processing platform that enables the moving, tracking and automation of data between systems. It is written using flow-based programming and provides a web-based user interface to manage dataflows in real time. In this instructor-led, live training, participants will learn the fundamentals […]
-
admin
Apache NiFi for Administrators Training Course
Overview Apache NiFi (Hortonworks DataFlow) is a real-time integrated data logistics and simple event processing platform that enables the moving, tracking and automation of data between systems. It is written using flow-based programming and provides a web-based user interface to manage dataflows in real time. In this instructor-led, live training (onsite or remote), participants will […]
-
admin
Tigon: Real-time Streaming for the Real World Training Course
Overview Tigon is an open-source, real-time, low-latency, high-throughput, native YARN, stream processing framework that sits on top of HDFS and HBase for persistence. Tigon applications address use cases such as network intrusion detection and analytics, social media market analysis, location analytics, and real-time recommendations to users. This instructor-led, live training introduces Tigon’s approach to blending […]
-
admin
Samza for Stream Processing Training Course
Overview Apache Samza is an open-source near-realtime, asynchronous computational framework for stream processing. It uses Apache Kafka for messaging, and Apache Hadoop YARN for fault tolerance, processor isolation, security, and resource management. This instructor-led, live training introduces the principles behind messaging systems and distributed stream processing, while walking participants through the creation of a sample […]
-
admin
Hadoop for Project Managers Training Course
Overview As more and more software and IT projects migrate from local processing and data management to distributed processing and big data storage, Project Managers are finding the need to upgrade their knowledge and skills to grasp the concepts and practices relevant to Big Data projects and opportunities. This course introduces Project Managers to the […]
