Channel Avatar

Soumil Shah @UC_eOodxvwS_H7x2uLQa-svw@youtube.com

None subscribers - no pronouns set

I earned a Bachelor of Science in Electronic Engineering and


08:23
Learn How to Use Apache Hudi Streamer with DataHUB An Open Source Metadata Platform
06:19
Getting Started with X-Table and Unity Catalog | Universal Datalakes | Hands on Labs
08:15
Hudi Using Spark SQL on AWS S3: Insert, Update, Deletes, Stored Procedures on AWS Glue Notebooks
07:03
How to Use Hudi Streamer on New EMR 7.1.0 Spark 3.5.1 and Hudi 0.14.1 | Hands-on Labs
04:29
How to Use Hudi Streamer with Hudi version 0.15.0 | Hands on Guide |
03:10
How to Execute Postgres Stored procedures in Spark | Hands on Guide
06:47
Learn How to Ingest Data from Hudi Incrementally hudi table changes into Postgres Using Spark
08:17
Universal Datalakes: Interoperability with Hudi, Iceberg, and Delta Tables with AWS Glue Notebooks
03:38
4 Different Ways to fetch Apache Hudi Commit time in Python and PySpark
05:41
OneTable to translate a Hudi table to Iceberg format and sync with Glue Catalog
04:30
Learn How to Run Apache X Table Sync Command on AWS Cloud Shell | Interoperate Hudi Iceberg delta
06:33
Learn How to Ingest XML files with AWS Glue into Hudi Datalakes | Step by Step guide
08:56
Hudi with Spark SQL for Beginners | Insert| Updates | Delete | incremental Query | Stored procedures
05:19
How we Utilized Hudi's Time Travel Query to Investigate Bid and Spend | Going Back in Time with Hudi
05:55
Hudi Cleaning Process | hoodie.keep.min.commits and hoodie.keep.max.commits Explained
02:58
AWS Glue Tutorial: How to Filter and Exclude S3 Files while reading as Glue Dynamic Frame
03:58
How to Read S3 Partitioned Data as Columns in AWS Glue DF
08:00
Multiple Spark Writers to Hudi tables | Hands on Labs
05:20
Learn How to Ingest data from pulsar Topic into Hudi with DeltaStreamer | Hands on Labs
03:46
Build Hudi Date Dimension in Minutes with Spark SQL Minio and Query with Trino
24:00
Hudi Streamer implementing Slowly Changing Dimension Type 2 and Query Real Time Trino | Hands on
04:43
Demo Video : Hudi Delta Streamer Implementing Slowly Changing Dimension and Query that using Trino
06:13
DeltaStreamer with incremental ETL and Broadcast Joins for Faster ETL
07:24
Learn How to use Cloudwatch metrics with Hudi AWS Glue Jobs
02:16
Tips to Feel Valued at Work: Overcoming Unappreciation
04:37
How to Use Spark 3.5.1 on Kubernetes running locally | Step by Step Guide using Helm
05:57
Learn how to Spinup Trino on Kubernetes running Locally on Windows | Mac machine | Simple Guide
01:04
Mastering ETL and Data Warehousing with AWS Glue
01:27
Mastering Elasticsearch Your Comprehensive Guide to Shards, Performance Tuning, and More
06:51
Unleashing the Power of Serverless: Serving Gold Hudi Tables with AWS Lambda
01:19
#1 Stay Motivated and Learn: Strategies and Tips to Keep Going
03:00
#1 Unlocking the Future of Data Management: Introducing OneTable by OneHouse
01:01
#2 Apache Hudi: Unveiling Copy-on-Write and Merge-on-Read Tables | Exploring Core Concepts
01:01
#7 Understanding Global and Non-Global Index Strategies
01:05
#2 Supercharge Your Data Processing with Apache Hudi!
01:30
#1 Revolutionizing Data Management: The Remarkable Story Behind Apache Hudi & Uber
08:25
How to configure Trino with Hudi and Hive metastore with MINIO Object Store
02:48
How to read Hudi Dataset Using AWS Glue Ray and Glue Notebooks (withouth Spark)
03:51
Daily Habits That TRANSFORMED My Life
08:38
Develop Full Text Search(Semantics Search) with Postgres (PgVector) and Python Hands on Labs
04:51
Learn How to Display Data From Hudi Tables to your Frontend with Flask and Daft (NO SPARK NEEDED)
02:58
How to Query Apache Hudi Tables with Python Using Daft: A Spark-Free Approach
03:07
How an empty S3 bucket can make your AWS bill explode | Medium Blog
05:24
How to use Flink with Hive MetaStore and Minio and Create Iceberg tables | Docker | Hands on Labs
06:27
Developer Guide: Getting Started with Flink PyFlink and Hudi Setting Up Your Local Environment and
08:04
Hudi with Kyuubi, a distributed & multi-tenant gateway, to provide serverless SQL on lakehouses
05:18
Learn How to read Data from Kafka and insert New Data into Postgres Using Trino
04:33
Learn How to Query Kafka Topics RealTime with trino
02:03
Gratitude Overflowing: A Huge Thank You to All 40,000 Subscribers for Your Unwavering Love & Support
17:16
Bringing Data Frm MySQL to Kafka Using Debezium, Joining Kafka Topics with Flink and ingest data
07:55
Build Universal Data lake with MySQL + Debezium+Kafka+DeltaSTreamer + Minio+HiveMetastore+Trino
27:58
Build Universal Data lake with Posgres + Debezium+Kafka+DeltaSTreamer + Minio+HiveMetastore+Trino
04:05
Upcoming Next Video: Real World Data Engineering Project|with Kafka + Debezium+ deltaStreamer+ Trino
08:55
Reading Data from Hudi INC & Joining with Delta Tables using HudiStreamer & SQL-Based Transformer
10:38
Building a Universal Data Lakehouse with Apache XTable, MinIO, and Trino (hudi | Iceberg| Delta)
23:10
Building DataLakeHouse: XTable, MinIO, StarRocks, DeltaStreamer - Interoperating Hudi, IceBerg,Delta
02:00
Universal DataLakehouse: Unlocking Data at Scale for the World's Most Sophisticated Organization|4/9
04:19
How to Get Email Alerts When AWS DMS Task Fails in an Event-Driven Fashion
03:48
A Simple Config-Driven Python Template for Rapid DMS to S3 Task | Single Task per Table Strategy
11:49
Dynamically Build and Schedule DeltaStreamer Jobs to EMR Serverless and Airflow Dag Creation