JanusGraph is a graph database for storing and querying graphs containing hundreds of billions of vertices and edges distributed across a multi-machine cluster.
This instructor-led, live training (online or onsite) is aimed at engineers who wish to use JanusGraph to process very large graphs that require abnormal storage and computational capacity.
By the end of this training, participants will be able to:
- Install and configure JanusGraph.
- Integrate JanusGraph with multiple backend storage systems (Cassandra, HBase, etc.) and multiple indexing software (Elasticsearch, Solr, etc.).
- Configure multiples machines into a cluster for use by JanusGraph.
- Query the database using the Gremlin query language.
- Process graph data at scale, beyond what a single machine can provide.
- Support thousands of concurrent users traversing graph data in real time.
- Query graph data for analysis.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
- An understanding of databases
- Experience with the Linux command line
- System administrators
- Engineers that deal with data
Overview of JanusGraph Features and Architecture
Setting up the Environment
How JanusGraph Stores and Distributes Data
Planning the Database in Accordance with CAP Theorem (C=Consistency, A=Availability, P=Partitionability)
Installing and Configuring the JanusGraph Server
Integrating JanusGraph with Casandra
Integrating JanusGraph with HBase
Connecting with the Server (gremlin console, gremlin python and graphexp).
Querying the Database
Analyzing a Graph Using the Gremlin Graph Traversal Language
Modeling a Domain as a Graph
Enabling Search with Elasticsearch, Apache Solr or Apache Lucene
Integrating with Apache Spark to analyze Global Graph Data (OLAP)
Integrating JanusGraph with 3rd Party Storage Adapters
Summary and Conclusion