Data Infrastructure with Apache Cassandra E-Learning
In the Data Infrastructure with Apache Cassandra LearningKit, you will explore Apache Cassandra, create Cassandra clusters, and learn the Cassandra Query Language (CQL). You will install Cassandra and run it with Docker, create Cassandra clusters and inspect them using the nodetool utility, and work with datacenters in Cassandra.
You will also learn to use CQL to organize tables into keyspaces, specify replication strategies and configurations, create tables, and insert and update data. In addition, you will work with CSV and JSON data, use partition and clustering key columns, perform aggregations on the data, and write user-defined functions.
This LearningKit, with more than 7 hours of learning, is divided into three tracks:
Course content
Track 1: Intro to Data Infrastructure
In this track, the focus will be on data infrastructure in an organization, data mesh architecture, data tools, messaging platforms, and data stores.
Courses (½ hour +):
Setting up the Data Infrastructure in an Organization
Course: 46 Minutes
- Course Overview
- Data Infrastructure in an Organization
- Data Mesh Architecture
- Tools for Data Management
- Messaging Platforms
- Data Stores
- Course Summary
Track 2: Apache Cassandra
In this track, the focus will be on working with Apache Cassandra. Apache Cassandra is a decentralized, distributed, wide-column store that performs well at petabyte scale for specific types of data and operations. Cassandra is well suited to data that is accessed via unique keys and where rows can have very different sets of columns.
Courses (7 hours +):
Working with Apache Cassandra: Getting Started
Course: 1 Hour, 25 Minutes
- Course Overview
- The Benefits of Wide-column Stores
- The Apache Cassandra Architecture and Features
- Installing Apache Cassandra
- Starting Apache Cassandra
- Using Apache Cassandra with Docker
- Creating a Multi-node Cassandra Cluster with Docker
- Creating a Multi-node Cluster Using Docker Compose
- Changing the Snitch in Cassandra
- Creating a Cluster with Multiple Datacenters
- Course Summary
Working with Apache Cassandra: Using Replication, Consistency, & CQL
Course: 1 Hour, 49 Minutes
- Course Overview
- Using Replicas in Apache Cassandra
- Performing Replication with Multi-DC Clusters
- Creating Replicas in Multi-DC Clusters
- Apache Cassandra Consistency
- Read Consistency
- Viewing the Consistency of Cassandra Clusters
- Executing Basic CQL Commands
- Understanding Cassandra Table Settings
- Creating and Modifying Tables in CQL
- Executing Queries on Cassandra Tables
- Performing Updates on Cassandra Tables
- Tracing Query Information
- Course Summary
Working with Apache Cassandra: Using Collection & User-defined Fields
Course: 1 Hour, 5 Minutes
- Course Overview
- Using Set Fields in Cassandra
- Querying and Modifying Set Fields in Cassandra
- Using List Fields in Cassandra
- Storing Key-value Pairs in Maps
- Using Nested Collections
- Using User-defined Types
- Using Counter Fields
- Reading and Writing Data from CSV Files
- Using Data in JSON Format
- Course Summary
Working with Apache Cassandra: Partitioning & Clustering Keys
Course: 1 Hour, 42 Minutes
- Course Overview
- Partitioning Data
- Data Modeling in Apache Cassandra
- Features of Apache Cassandra
- Understanding Partitioning and Clustering Keys
- Viewing the Partitions in Tables
- Using Partitioning and Clustering Keys
- Partitioning and Clustering Data
- Using Multiple Partition Keys
- Adding Rows to Tables with Primary Keys
- Performing ORDER BY Operations
- Querying Tables with Primary and Clustering Keys
- Querying Tables with Primary Keys
- Course Summary
Working with Apache Cassandra: Indexes, Aggregations, & Programmatic Access
Course: 1 Hour, 2 Minutes
- Course Overview
- Indexes on Columns
- Querying Tables with Indexes
- Performing Grouping and Aggregations
- Creating User-defined Functions (UDFs)
- Running Atomic Queries
- Connecting to Cassandra from Programmatic APIs
- Modifying Cassandra Tables from Java
- Updating and Deleting Data from Java
- Course Summary
Assessment:
- Final Exam: Apache Cassandra