Tag: Python Programming Course Wan Chai

  • Apache Hadoop: Manipulation and Transformation of Data Performance Training Course

    Overview This course is intended for developers, architects, data scientists or any profile that requires access to data either intensively or on a regular basis. The major focus of the course is data manipulation and transformation. Among the tools in the Hadoop ecosystem this course includes the use of Pig and Hive both of which […]

    Read More

  • Apache Avro: Data Serialization for Distributed Applications Training Course

    Overview Audience Developers Format of the Course Lectures, hands-on practice, small tests along the way to gauge understanding Requirements A general familiarity with distributed computing. Course Outline Introduction Principles of Distributed Computing Apache Spark Hadoop Principles of Data Serialization How data object is passed over the network Serialization of objects Serialization approaches Thrift Protocol Buffers […]

    Read More

  • Hadoop for Business Analysts Training Course

    Overview Apache Hadoop is the most popular framework for processing Big Data. Hadoop provides rich and deep analytics capability, and it is making in-roads in to tradional BI analytics world. This course will introduce an analyst to the core components of Hadoop eco system and its analytics Audience Business Analysts Duration three days Format Lectures and […]

    Read More

  • HBase for Developers Training Course

    Overview This course introduces HBase – a NoSQL store on top of Hadoop.  The course is intended for developers who will be using HBase to develop applications,  and administrators who will manage HBase clusters. We will walk a developer through HBase architecture and data modelling and application development on HBase. It will also discuss using […]

    Read More

  • Advanced Hadoop for Developers Training Course

    Overview Apache Hadoop is one of the most popular frameworks for processing Big Data on clusters of servers. This course delves into data management in HDFS, advanced Pig, Hive, and HBase.  These advanced programming techniques will be beneficial to experienced Hadoop developers. Audience: developers Duration: three days Format: lectures (50%) and hands-on labs (50%). Requirements comfortable with […]

    Read More

  • Hadoop for Developers (4 days) Training Course

    Overview Apache Hadoop is the most popular framework for processing Big Data on clusters of servers. This course will introduce a developer to various components (HDFS, MapReduce, Pig, Hive and HBase) Hadoop ecosystem. Requirements comfortable with Java programming language (most programming exercises are in java) comfortable in Linux environment (be able to navigate Linux command […]

    Read More

  • Hadoop Administration on MapR Training Course

    Overview Audience: This course is intended to demystify big data/hadoop technology and to show it is not difficult to understand. Requirements Basic knowledge of Linux FS Basic Java Knowledge of Apache Hadoop (recommended) Course Outline Big Data Overview: What is Big Data Why Big Data is gaining popularity Big Data Case Studies Big Data Characteristics […]

    Read More

  • Administrator Training for Apache Hadoop Training Course

    Overview Audience: The course is intended for IT specialists looking for a solution to store and process large data sets in a distributed system environment Goal: Deep knowledge on Hadoop cluster administration. Requirements Basic Linux administration skills Basic programming skills Course Outline 1: HDFS (17%) Describe the function of HDFS Daemons Describe the normal operation […]

    Read More

  • Process Mining Training Course

    Overview Process mining, or Automated Business Process Discovery (ABPD), is a technique that applies algorithms to event logs for the purpose of analyzing business processes. Process mining goes beyond data storage and data analysis; it bridges data with processes and provides insights into the trends and patterns that affect process efficiency. Format of the Course […]

    Read More

  • Cluster Analysis with R and SAS Training Course

    Overview R is a programming language and software environment for statistical computing. SAS is a statistical software platform for predictive analysis, data management, advanced analytics, and more. With R in SAS, users can find natural groups of data for cluster analysis that are essential to data mining. This instructor-led, live training (online or onsite) is aimed […]

    Read More