•   Pune: +91 82 82 82 9806

Big Data - Spark & Kafka QuickStart (Webinar)

Course Name : Big Data - Spark & Kafka QuickStart (Webinar)

Batch Schedule : 18-Jul-2020   To   26-Jul-2020

Schedule : Weekend - (Saturday-Sunday)

Duration : 4 Days

Timings : 9:00 AM  To  12:30 PM

Fees : Rs. 1500/- (Inc. 18% GST)

  • Students
  • Freshers
  • Working Professionals
Click to Register
  • Apache Spark 2
    • Spark concepts
    • Distributed Computing Challenges
    • Spark Architecture & Components
    • Spark Installation & Deployment
  • PySpark
    • PySpark Shell
    • PySpark installation
    • Executing Spark Python programs
    • Spark Web UI
    • Spark in Pycharm IDE
    • Spark on Databricks cloud
  • Apache Spark 2 - Spark Core
    • Spark RDD, Transformations & Actions, Data Load & Save
    • RDD characteristics & execution
    • Accumulators & Broadcast variables
    • RDD Internals: Distributed/Partitions, Lineage, Persistence
    • Implementing & Submitting Spark Job
    • Execution of Spark Job (RDD)
    • DAG visualization
  • Apache Spark 2 - Spark SQL
    • Spark SQL Introduction
    • Architecture
    • SQLContext & SparkSession
    • Data Frames & Datasets
    • Data Frame Columns & Expressions
    • Implementing & Executing Spark SQL job
    • User Defined Functions
    • File Formats & Loading data
    • Spark SQL data types & schema
    • Spark SQL functions
    • Global/Temporary views
    • Partitioning & Bucketing
    • SQLContext & HiveContext
  • Apache Spark 2 - Spark Streaming
    • Streaming concepts
    • Microbatches vs Continuous job
    • Spark Streaming concepts
    • Spark Structured Streaming concepts
    • Triggers, Event time-based processing & Watermark
    • Windowing Concept, Window Operations
    • Input sources & output sinks
    • Structured Streaming application execution
  • Apache Kafka
    • Kafka Architecture
    • Kafka Cluster Components & Configuration
    • Kafka Applications
    • Kafka Python client
    • Kafka Spark Source & Sink
Click to Register
  • Linux commands familiarity
  • Any RDBMS (like Oracle or MySQL)
  • Python3 programming skills
  • Hive architecture - Join Hive webinar to learn
  • XML awareness

 

Click to Register
  • This course is designed for Spark developers and covers developer machine installation. Spark administration and cluster setup is beyond scope of this course.
  • This course doesn’t include machine learning or data science using Spark.
  • Considering details to learn and participant's queries we may need to extend timings every day.
  • You need to have a Hive setup installed on your machine before the session. You may use the downloaded VM or follow setup instructions in your enrolment mail. You may take help from the instructor before the session.
Click to Register
  • Understand Spark architecture and components
  • Gain confidence in Spark dataframes and Spark SQL
  • Spark structured streaming with Apache Kafka
Click to Register
  • Core i3 (64-bit) and above
  • RAM Min 8 GB. Recommended: 16 GB+.
  • 64-bit Linux – Ubuntu.
  • You may use VM with the above configuration (available for download).
Click to Register
Sr.No Batch Code Start Date End Date Time
1 BD Spark & Kafka- O-01 18-Jul-2020 26-Jul-2020 9:00 AM  To  12:30 PM

Schedule : Weekend - (Saturday-Sunday)

Click to Register

Contact us

Sunbeam Market Yard Pune

'Sunbeam Chambers', Plot No.R/2, Market Yard Road, Behind Hotel Fulora, Gultekdi,    Pune - 411 037. MH-INDIA.

+91 82 82 82 9806
Sunbeam Hinjawadi Pune

"Sunbeam IT Park", Second Floor, Phase 2 of Rajiv Gandhi Infotech Park,Hinjawadi, Pune - 411057, MH-INDIA

+91 82 82 82 9806