Sumit Mittal - PokeTube

Sumit Mittal @UCbTggJVf0NDTfWX-C_gUGSg@youtube.com

None subscribers - no pronouns set

Hey! I'm Sumit Mittal - Founder & CEO of TrendyTech. I tr

Videos Shorts Community Playlists

Data Governance implementation | Azure Cloud | Cloud Data Engineer Interview Questions #interview

Advantages of PARQUET FILE FORMAT in Apache Spark | Data Engineer Interview Questions #interview

Using Partitioning for Optimizing the Query Performance | Big Data Interview Questions #interview

Understanding Apache Spark Architecture | Common Big Data Interview Questions #interview

What is DAG | DAGs in Spark provide a framework for optimized and fault-tolerant execution of tasks

Partitioning Vs Bucketing | Apache Spark Optimization Techniques #interview #question

Client Mode and Cluster Mode in Apache Spark Explained in 60 second #interview #question

Spark Application | Driver and Executor Role in Task Execution | DAG #interview #question

How is Apache Spark In-Memory Computation Different from MapReduce Processing | Highly Performant

Must Know Big Data Interview Questions | Row Based Vs Column Based File Format #interviewn#question

Real-time Big Data Project Common Scenarios | How are Duplicates handled in PySpark #interview

Understanding ACID Properties Under 60 seconds #interview #question

Understanding How Apache Spark Works Underneath the Hood | MapReduce Processing Big Data #interview

Important Types in Data Modeling | Interview Questions for Big Data Interviews #interview #bigdata

Important Big Data Terminologies | Name Node & Data Node | Spark architecture #interview

Understanding the Working of Azure Data Factory Service in Simple Terms #interview #question

DSA Question | Analytical Problem Solving | Logic building | Puzzle #interview #question

Understanding Azure Data Factory Service | Data Factory Vs Databricks #interview #question

Learning the Difference between Spark Session and Spark Context | Big Data #interview #question

Explanation on Handling Duplicate Rows in PySpark | dropDuplicates() | dropDuplicates(column_name)

Understanding the Different Types of Fact Tables in ETL | Extract Transform & Load #interview #sql

Understanding the Difference Between Repartition and Coalesce in Spark | Spark Optimization Strategy

Finally Youtube Silver Play Button

Medallion Architecture Explained in 60 Seconds |Azure Cloud - BRONZE, SILVER & GOLD Layer#interview

Understanding How to Handle Data Skewness in PySpark #interview

Understanding the Working of Apache Spark's Catalyst Optimizer in Improving the Query Performance

Understanding Slowly Changing Dimension SCD in ETL under 60 Seconds #interview #question

Ensuring Uniquness of Incoming Data in Incremental Load | Spark #interview

Understanding Apache Spark's Adaptive Query Execution - AQE| Spark Optimization Strategy #interview

Understanding the Use-Case of Snowflake Schema : Transactional or Reporting Database #interview

Explanation of Normal Forms | 1NF, 2NF, 3NF, 4NF #sql #interview

Exploring an Important feature of Spark Session #interview #bigdata

Understanding the Evaluation of Jobs, Stages and Tasks in a Spark Job #interview #question

Understanding Apache Spark's Optimizations Strategy | Lazy Evaluation for Performance Optimization

Understanding Change Data Capture CDC in Databricks #interview #question

Understanding Normalization under 60 Seconds | Data Engineering Interview Questions

Amazon Redshift's Sortkey Vs Distkey Explanation | AWS Services | #interview #question

Understanding Left Join | Left Join Vs Left Anti-Join | Spark #interview #question

Understanding How to Handle Cold Start Issue in Lambda Functions #interview #tips

Exploring AWS Services | Understanding Amazon Redshift Spectrum & Glue Catalog #interview #question

Spark Vs MapReduce Explained under 60Seconds | Exploring Advantages & Unlocking Efficiency of Spark

Explaining the Hash Function Employed in Bucketing Optimization within Apache Spark #interview

Partitions and CPU Core Allocation in Apache Spark

Ensuring Data Quality in Apache Spark | Best Practices for High Quality Data #interview #question

Understanding Slowly Changing Dimensions - SCD Type 2 in Apache Spark #interview #question

Understanding how to Optimize PySpark Job | Cache | Broadcast Join | Shuffle Hash Join #interview

Partitioning Vs Bucketing in Apache Spark under 60 Seconds #inteview #question

Understanding the Different Join Strategies in PySpark | Broadcast Join & Sort Merge Join #interview