
Building Scalable Data Pipelines with Kafka

This course brings any Big Data newcomer up to speed on one of the most widely used open-source platforms for building data pipelines: Apache Kafka.

Beginner

62 Lessons

3h

Certificate of Completion

AI-POWERED

Explanations

This course includes

8 Playgrounds

Course Overview

If you’re interested in Big Data, then Apache Kafka is a must-know tool. What started as an internal LinkedIn project to streamline data transmission and propagation among services has quickly grown into a mainstay platform for building highly scalable data pipelines. Kafka is now used to build pipelines for diverse use cases, ranging from chronologically tracking user activity on a website to implementing publish-subscribe feeds. This course introduces you to Kafka theory and ...

What You'll Learn

Learn the theory behind Kafka
Interact with a Kafka cluster running in the browser-terminal

Course Content

1.

Basics

7 Lessons

Step through the fundamentals of Kafka, distributed systems, messaging patterns, and core components.

2.

Kafka Producer

4 Lessons

Unpack the core of Kafka Producers, message sending methods, configurations, and serialization techniques.

4.

Kafka Internals

7 Lessons

Break down the complex ideas behind Kafka's replication, controller, request processing, and reliability.

5.

Conclusion

1 Lesson

Compare Kafka's scalability, throughput, and real-time processing with other messaging systems.

6.

Appendix

3 Lessons

Explore ZooKeeper, practical API use, and solutions to common distributed-systems problems.

8.

Reference: Partitioning

4 Lessons

Learn how to use partitioning strategies to enhance scalability and optimize data pipelines.

9.

Reference: Transactions

9 Lessons

Discover the logic behind managing data transactions, isolation levels, and concurrent write challenges.

10.

Reference: Issues in Distributed Systems

4 Lessons

Examine the challenges in developing and maintaining distributed systems, including networking, time synchronization, and handling failures.
