Beginner's Guides
118 videos • 583 views • by The Data Guy
1
What is Change Data Capture and How Do You Do it? Beginner's Guide to Change Data Capture
The Data Guy
Download
2
What are Slowly Changing Dimensions? Beginner's Guide to SCD's
The Data Guy
Download
3
How to Use Redshift with Airflow!
The Data Guy
Download
4
How to Become an Analytics Engineer! Beginner's Guide to Starting a Career in Analytics Engineering
The Data Guy
Download
5
Beginner's Guide to Using Streamlit: Introduction to Building Simple Python Web Apps!
The Data Guy
Download
6
How to Handle Schema Evolution: Best Practices for Adapting Your Database!
The Data Guy
Download
7
How to Build an ELT Pipeline with CrateDB and Airflow!
The Data Guy
Download
8
How to Create a Streamlit Reporting Dashboard Using Snowflake Data!
The Data Guy
Download
9
Data Analyst Vs. Analytics Engineer Vs. Data Engineer: What Career is Right for You?
The Data Guy
Download
10
Beginner Machine Learning: How to Use a Weaviate for Sentiment Analysis Using Airflow!
The Data Guy
Download
11
What is a Data Scientist? | What Do Data Scientists Do?
The Data Guy
Download
12
How to Create a Data Streaming Pipeline with Databricks Spark, Apache Kafka, and Airflow!
The Data Guy
Download
13
What is in the Modern Data Stack? Layer by Layer Breakdown!
The Data Guy
Download
14
How to Perform Data Quality Checks on a Postgres Database
The Data Guy
Download
15
How to Use Azure Data Factory! Beginner's Guide to Azure Data Factory
The Data Guy
Download
16
Beginner's Guide to Structured Query Language(SQL)! Basic SQL Tips and Tricks!
The Data Guy
Download
17
How to Build a Machine Learning Prediction Pipeline with Databricks! Beginner ML with Databricks
The Data Guy
Download
18
Beginner's Guide to CI/CD! Continuous Integration and Deployment Explained!
The Data Guy
Download
19
How to Build an ETL Pipeline with Postgres, Azure Data Factory, and Airflow!
The Data Guy
Download
20
How to Create a Personalized AI Assistant! OpenAI Assistant Update Breakdown!
The Data Guy
Download
21
What is Big Data? Big Data Explained!
The Data Guy
Download
22
How to Use Llama Index for Machine Learning! Beginner's Guide to Machine Learning
The Data Guy
Download
23
How to Create an Analytics Database with Apache Druid! Apache Druid Explained for Beginners!
The Data Guy
Download
24
Beginner's Guide to Python! Introduction to Python 101
The Data Guy
Download
25
Machine Learning/AI Buzzwords Explained! Common AI/ML Terms Broken Down for Beginners!
The Data Guy
Download
26
What is Apache Flink? Apache Flink Explained!
The Data Guy
Download
27
Top 10 Python Libraries for Machine Learning!
The Data Guy
Download
28
Intro to Node.js and Node Package Manager(NPM)! Backend Development for Beginners!
The Data Guy
Download
29
Introduction to Airbyte! Beginner's Guide to Airbyte
The Data Guy
Download
30
How to Create a Streaming Pipeline with Apache Flink! Apache Flink for Beginners
The Data Guy
Download
31
SQL Database Tier List! Popular SQL Databases Ranked
The Data Guy
Download
32
What is a Data Mesh Architecture? Data Mesh Advantages and Disadvantages!
The Data Guy
Download
33
What is a Data Fabric Architecture? Data Fabric Advantages and Downsides Explained!
The Data Guy
Download
34
How to Use Kubernetes Locally with Airflow!
The Data Guy
Download
35
Beginner's Guide to Looker! How to Use Looker for Data Visualizations!
The Data Guy
Download
36
What is Data Lineage and Why is it Important? Beginner's Guide to Data Lineage
The Data Guy
Download
37
How to Get Started with LangChain for LLM/AI Development! Beginner's Guide to LangChain
The Data Guy
Download
38
Apache Flink Vs. Airbyte: Which is the Right Tool for Your Use Case?
The Data Guy
Download
39
How to Choose the Right Data Model for Your Use Case! Every Type of Data Model Explained
The Data Guy
Download
40
Beginner's Guide to Using TensorFlow for Machine Learning with Neural Networks! Tensor Flow Intro
The Data Guy
Download
41
End to End Image Recognition Data Pipeline using Snowflake, Snowpark, Pytorch, and Streamlit!
The Data Guy
Download
42
How to Build a AI Agent that can Use the Internet with LangChain & Tavily! LangChain Guide Pt. 2
The Data Guy
Download
43
How to Run AWS Glue Jobs with Airflow! Use Airflow to Manage Amazon Glue Jobs
The Data Guy
Download
44
Top 3 Cryptocurrencies Explained for Beginners! Bitcoin, Ethereum, and Solana Explained!
The Data Guy
Download
45
How to Choose the Right CICD Tool for Your Use Case! Top CICD Tools explained!
The Data Guy
Download
46
How to Run AWS Lambda Functions Using Airflow! How to Get More out of Lambda Functions with Airflow!
The Data Guy
Download
47
Every Type of Web API Explained! How to Choose the Right API for Your Use Case
The Data Guy
Download
48
How to Run Spark and PySpark Locally! Spark & PySpark Local Development for Beginners
The Data Guy
Download
49
How to Create a Data Pipeline with BigQuery and Airflow! Beginner's Guide to BigQuery + Airflow
The Data Guy
Download
50
How to Do Machine Learning on a Local Spark Cluster with PySpark! PySpark ML for Beginners!
The Data Guy
Download
51
How to Create an AI Application with LangChain and LangServe! LangServe for Beginners!
The Data Guy
Download
52
How to Choose the Right Streaming Tool for You! Flink, Kafka, and More Compared!
The Data Guy
Download
53
How to Do Incremental Data Loading and Data Validation with PySpark and Spark! Spark Basics!
The Data Guy
Download
54
Getting Started with Starburst for Data Analytics! Starburst Managed Trino for Beginners!
The Data Guy
Download
55
What's the Right Data Quality Tool for You? Top Data Quality Tools Reviewed and Explained!
The Data Guy
Download
56
End to End AI Agent Application Project Using AgentKit by BCG!
The Data Guy
Download
57
How to Run a Spark Cluster with Multiple Workers Locally Using Docker
The Data Guy
Download
58
How to Add Microsoft Teams Alerts to Your Airflow Pipeline!
The Data Guy
Download
59
Should You Become a Data Scientist? Data Science Career's Pro's and Con's Explained
The Data Guy
Download
60
How to Transform Streaming Data with Spark! Spark Script for Monitoring Wordcount
The Data Guy
Download
61
What is an ML/AI Engineer? ML/AI Engineering Explained!
The Data Guy
Download
62
End to End Parallel Data Quality Check Pipeline Project with Airflow and Snowflake!
The Data Guy
Download
63
How to Use Every Single Data Format with Spark! Using CSV, XML, JSON files and more with PySpark!
The Data Guy
Download
64
How to Do Backfilling and Clean-Up Operations with Spark! Spark Essentials
The Data Guy
Download
65
Most Popular Data Engineering Terms Explained! Data Engineering Vocab 101
The Data Guy
Download
66
How to Sync Data Between S3 Buckets Using AWS DataSync and Airflow!
The Data Guy
Download
67
Beginner's Guide to Cloud Computing! Cloud Computing Explained!
The Data Guy
Download
68
How to Use Spark for Extract, Transform, Load (ETL) Workflows! PySpark for ETL Pipelines
The Data Guy
Download
69
How to Back-Up/Migrate Data from AWS DynamoDB to an S3 Bucket with Airflow!
The Data Guy
Download
70
How to Batch Process a Kafka Stream with Apache Spark! PySpark Batch Processing for Beginners
The Data Guy
Download
71
Graph Data Models Explained! Everything You Need to Know About Graph Data Models
The Data Guy
Download
72
How to Manage Google DataFlow Workflows with Apache Airflow!
The Data Guy
Download
73
How to Query Big Data Lightning Fast with StarRocks! Run StarRocks Locally Using Docker!
The Data Guy
Download
74
Data Partitioning Vs. Data Sharding! Data Partitioning and Data Sharding Explained and Compared!
The Data Guy
Download
75
How to Perform Sentiment Analysis with Apache Spark and Weaviate Vector Database!
The Data Guy
Download
76
How to Run Apache Kafka Locally Using Docker! Apache Kafka Quickstart Guide
The Data Guy
Download
77
Synchronous and Asynchronous Networks Explained! Synchronous Vs. Asynchronous Networks!
The Data Guy
Download
78
How to Use Kafka for ETL (Extract, Transform, Load) Pipelines! Apache Kafka for ETL Explained!
The Data Guy
Download
79
Data Architecture Terms and Principles Explained!
The Data Guy
Download
80
How to Optimize Your SQL Queries! SQL Query Optimization Guide!
The Data Guy
Download
81
What is Linearizability? Linearizability in Distributed Systems Explained!
The Data Guy
Download
82
End-to-End Automated Real Time Batch Processing Pipeline with Apache Kafka and Spark
The Data Guy
Download
83
Machine Learning Operations Best Practices! MLOp's Best Practices Explained
The Data Guy
Download
84
Big Data Architecture Options Compared and Contrasted! Big Data Architecture's Explained!
The Data Guy
Download
85
Beginner's Guide to MariaDB! Intro to The Open Source MySQL Competitor MariaDB
The Data Guy
Download
86
Beginner's Guide to Neo4j Graph Databases! Neo4j Graph Database Explained!
The Data Guy
Download
87
Micro-Service Best Practices! How to Build a Scalable Micro-Service Architecture!
The Data Guy
Download
88
How to Build Reliable, Scalable, and Maintainable Data Applications! Data Application Best Practices
The Data Guy
Download
89
How to Build a Real-Time Fraud Detection Pipeline with Apache Flink!
The Data Guy
Download
90
How to Keep Your Job as a Data Engineer! Best Practices for Data Engineering Job Security!
The Data Guy
Download
91
How to Aggregate Logs with Apache Kafka! Apache Kafka Log Aggregation Project
The Data Guy
Download
92
Downsides to Being a Data Engineer! Data Engineering Career Pitfalls and How to Avoid Them!
The Data Guy
Download
93
How to Run Airbyte Locally with abctl!
The Data Guy
Download
94
Python Basics For Data Engineers! Python Data Engineering Basics!
The Data Guy
Download
95
How to Detect Cyber-Security Threats with Apache Spark! Apache Spark Threat Detection Script!
The Data Guy
Download
96
Different Types of Data Engineers Explained! What Data Engineering Role is Right for You?
The Data Guy
Download
97
End-to-End Data Processing Project with Apache Kafka, Apache Spark, and Apache Airflow!
The Data Guy
Download
98
Data Partitioning Explained! Everything You Need to Know About Data Partitioning!
The Data Guy
Download
99
Data Engineering Crash Course Guide! How to Learn Data Engineering in 2024!
The Data Guy
Download
100
What's the Role of a Data Engineer in a Business? Data Engineer's Role in an Organization Explained!
The Data Guy
Download
101
How to Build a Fraud Detection Pipeline with PyFlink! Apache Flink Python Script for Fraud Detection
The Data Guy
Download
102
Project Set-Up Best Practices! How to Set Up Your Data Engineering Project File Structure and CICD!
The Data Guy
Download
103
How to Migrate Airflow DAG Metadata from One Airflow Environment to Another Using Airflow!
The Data Guy
Download
104
How to Migrate Data from a Redshift Database to Any SQL Database Using Airflow!
The Data Guy
Download
105
How to Optimize Your SQL Queries with Real World Examples! SQL Query Optimization for Business!
The Data Guy
Download
106
Top Python Utility Scripts for Data Engineers and Data Scientists!
The Data Guy
Download
107
Intro to Apache Iceberg! Apache Iceberg Explained for Beginners!
The Data Guy
Download
108
How to Migrate Data from a Postgres SQL Database to a Snowflake Database Using Apache Airflow!
The Data Guy
Download
109
End-to-End Postgres to Data Lake Data Processing Pipeline with Apache Spark, Airflow, and Minio!
The Data Guy
Download
110
Data Modeling Best Practices! Best Practices for Designing Your Data Models!
The Data Guy
Download
111
Apache Hudi Vs Apache Iceberg! Apache Hudi and Iceberg Comparison!
The Data Guy
Download
112
What is a Delta Lake? Delta Lakes Explained!
The Data Guy
Download
113
End to End Micro-Batching Pipeline With Apache Airflow and Kafka!
The Data Guy
Download
114
Best Practices for dbt With Real World Examples!
The Data Guy
Download
115
End-to-End Parallel ETL Pipeline with Airflow, Snowflake, and S3 Buckets!
The Data Guy
Download
116
How to Build a BigQuery Ingestion Pipeline from API's, SFTP Servers, and Pub/Sub with Airflow!
The Data Guy
Download
117
Data Engineering Interview Guide! How to Get a Data Engineering Job!
The Data Guy
Download
118
How to Get Started with Soda for Data Quality Checks!
The Data Guy
Download