Raja's Data Engineering @UCv4yD1y59pf69GurCYceT9Q@youtube.com

23K subscribers

Welcome to Raja's Data Engineering! Are you ready to embark


14:16
66. Databricks | Pyspark | Delta: Z-Order Command
15:32
65. Databricks | Pyspark | Delta Lake: Vacuum Command
13:16
64. Databricks | Pyspark | Delta Lake: Optimize Command - File Compaction
07:28
63. Databricks | Pyspark| Delta Lake: Restore Command
08:47
62. Databricks | Pyspark | Delta Lake: Time Travel
20:03
61. Databricks | Pyspark | Delta Lake : Slowly Changing Dimension (SCD Type2)
07:49
58. Databricks | Pyspark | Delta Lake : Update Delta Table
08:44
57. Databricks| Pyspark| Delta Lake: Different Approaches to Delete Data from Delta Table
10:47
56. Databricks| Pyspark | Delta Lake: Different Approaches to Insert Data Into Delta Table
11:29
55. Databricks| Pyspark| Delta Lake: Delta Table Instance
07:53
53. Databricks| Pyspark| Delta Lake: Solution Architecture
30:13
52. Databricks| Pyspark| Delta Lake Architecture: Internal Working Mechanism
10:27
51. Databricks | Pyspark | Delta Lake: Introduction to Delta Lake
11:56
54. Databricks | Delta Lake| Pyspark: Create Delta Table Using Various Methods
10:47
70. Databricks| Pyspark| Input_File_Name: Identify Input File Name of Corrupt Record
07:02
69. Databricks | Spark | Pyspark | Data Skewness| Interview Question: SPARK_PARTITION_ID
10:30
38. Databricks | Pyspark | Interview Question | Compression Methods: Snappy vs Gzip
06:56
50. Databricks | Pyspark: Greatest vs Least vs Max vs Min
08:14
37. Databricks | Pyspark: Dataframe Checkpoint
08:09
68. Databricks | Pyspark | Dataframe InsertInto Delta Table
06:00
36. Databricks: Autoscaling | Optimized Autoscaling
06:01
49. Databricks & Spark: Interview Question (Scenario Based) - How many spark jobs get created?
05:52
35. Databricks & Spark: Interview Question - Shuffle Partition
17:04
19. Databricks & Pyspark: Real Time ETL Pipeline Azure SQL to ADLS
14:43
17. Databricks & Pyspark: Azure Data Lake Storage Integration with Databricks
09:32
20. Databricks & Pyspark: Azure Key Vault Integration
12:08
18. Databricks & Pyspark: Ingest Data from Azure SQL Database
17:12
60. Databricks & Pyspark: Delta Lake Audit Log Table with Operation Metrics
11:30
59. Databricks Pyspark: Slowly Changing Dimension | SCD Type1 | Merge using Pyspark and Spark SQL
09:08
48. Databricks - Pyspark: Find Top or Bottom N Rows per Group
15:03
34. Databricks - Spark: Data Skew Optimization
02:59
47. Databricks | Spark | Pyspark | Null Count of Each Column in Dataframe
05:53
46. Databricks | Spark | Pyspark | Number of Records per Partition in Dataframe
07:24
16. Databricks | Spark | Pyspark | Bad Records Handling | Permissive;DropMalformed;FailFast
10:08
33. Databricks | Spark | Pyspark | UDF
13:09
45. Databricks | Spark | Pyspark | PartitionBy
07:24
44. Databricks | Spark | Python Functions| Join
11:56
04. On-Heap vs Off-Heap| Databricks | Spark | Interview Question | Performance Tuning
10:41
39. Databricks | Spark | Pyspark Functions| Split
04:12
43. Databricks | Spark | Pyspark Functions| Part 4 : Array_Sort
17:14
26. Databricks | Spark | Adaptive Query Execution| Interview Question | Performance Tuning
13:33
25. Databricks | Spark | Broadcast Variable| Interview Question | Performance Tuning
04:48
42. Databricks | Spark | Pyspark Functions| Part 3 : Array_Except
05:11
41. Databricks | Spark | Pyspark Functions| Part 2 : Array_Intersect
17:14
40. Databricks | Spark | Pyspark Functions| Arrays_zip
18:51
30. Azure DataWarehouse | Architecture | Distribution methods | Hash | Round-Robin | Replicated
12:51
29. Azure Synapse Analytics| ADW Architecture | MPP | Part 1
18:56
23. Databricks | Spark | Cache vs Persist | Interview Question | Performance Tuning
19:42
24. Databricks| Spark | Interview Questions| Catalyst Optimizer
09:35
15. Databricks| Spark | Pyspark | Read Json| Flatten Json
09:18
28. Azure Synapse Analytics | Spark Pool Deployment
11:24
27. Azure Synapse Analytics| Spark Pool| Read Csv data from ADLS
18:12
21. Databricks| Spark Streaming
21:11
22. Databricks| Spark | Performance Optimization | Repartition vs Coalesce
14:17
32. Databricks| Pyspark| Handling Null Part 2
10:05
31. Databricks Pyspark: Handling Null - Part1
15:08
05. Databricks | Pyspark: Cluster Deployment
41:34
01. Databricks: Spark Architecture & Internal Working Mechanism
11:33
14. Databricks | Pyspark: Pivot & Unpivot
10:25
13. Databricks | Pyspark: Union & UnionAll