How-To Guides
114 videos • 324 views • by The Data Guy
1
How to Create a Snowflake Stage in S3!
The Data Guy
Download
2
How to Do Model Prediction Using MLFlow and Airflow!
The Data Guy
Download
3
How to Manage Azure Data Factory Pipelines Using Airflow!
The Data Guy
Download
4
How to Create an End-to-End ETL Analytics Pipeline with Airflow and Databricks
The Data Guy
Download
5
How to Use Soda for Data Quality Checks with Airflow!
The Data Guy
Download
6
How to Use Great Expectations for Data Quality Checks with Airflow
The Data Guy
Download
7
Database vs. Data Warehouse vs. Data Lake: Ultimate Guide to Data Storage Solutions
The Data Guy
Download
8
How to Use MongoDB with Airflow!
The Data Guy
Download
9
Why Abaddon is a TERRIBLE Antagonist
Majorkill
Download
10
End to End ELT Pipeline Project for Financial Data Analytics!
The Data Guy
Download
11
How to Create an ELT Pipeline Using Airflow, Snowflake, and dbt!
The Data Guy
Download
12
How to Become a Data Engineer! Roadmap for Total Beginners with No Experience!
The Data Guy
Download
13
How to Run Airflow Locally without Docker Desktop using Podman!
The Data Guy
Download
14
How to Build an ETL Pipeline from Spark to a Cassandra Database using Airflow!
The Data Guy
Download
15
What is Data Modelling? Beginner's Guide to Data Models and Data Modelling
The Data Guy
Download
16
Introduction to Machine Learning with Airflow and Snowflake/Snowpark!
The Data Guy
Download
17
What is Databricks? Beginner's Guide to Databricks!
The Data Guy
Download
18
How to Migrate Data from MSSQL Server to Snowflake using Airflow!
The Data Guy
Download
19
How to Become an Analytics Engineer! Beginner's Guide to Starting a Career in Analytics Engineering
The Data Guy
Download
20
How to Migrate Data from an Oracle Database to a Postgres Database Using Airflow!
The Data Guy
Download
21
Beginner's Guide to Using Streamlit: Introduction to Building Simple Python Web Apps!
The Data Guy
Download
22
How to Handle Schema Evolution: Best Practices for Adapting Your Database!
The Data Guy
Download
23
How to Migrate Data from MongoDB to a MySql Database with Airflow!
The Data Guy
Download
24
How to Build an ELT Pipeline with CrateDB and Airflow!
The Data Guy
Download
25
How to Create a Streamlit Reporting Dashboard Using Snowflake Data!
The Data Guy
Download
26
Submitting PySpark Jobs to Databricks: ETL Workflows with Airflow Explained!
The Data Guy
Download
27
How to Create a Cloud Hosted Airflow Environment in 5 Minutes!
The Data Guy
Download
28
Data Analyst Vs. Analytics Engineer Vs. Data Engineer: What Career is Right for You?
The Data Guy
Download
29
End to End Data Processing Pipeline for Customer Analytics and Machine Learning!
The Data Guy
Download
30
Beginner Machine Learning: How to Use a Weaviate for Sentiment Analysis Using Airflow!
The Data Guy
Download
31
How to Perform Data Quality Checks on a Postgres Database
The Data Guy
Download
32
How to Use Azure Data Factory! Beginner's Guide to Azure Data Factory
The Data Guy
Download
33
Beginner's Guide to Structured Query Language(SQL)! Basic SQL Tips and Tricks!
The Data Guy
Download
34
How to Build a Machine Learning Prediction Pipeline with Databricks! Beginner ML with Databricks
The Data Guy
Download
35
Beginner's Guide to CI/CD! Continuous Integration and Deployment Explained!
The Data Guy
Download
36
How to Use Llama Index for Machine Learning! Beginner's Guide to Machine Learning
The Data Guy
Download
37
How to Create an Analytics Database with Apache Druid! Apache Druid Explained for Beginners!
The Data Guy
Download
38
Beginner's Guide to Python! Introduction to Python 101
The Data Guy
Download
39
How to Choose the Right Database for Your Use Case! Choosing the Right Database!
The Data Guy
Download
40
How to Create an ML Prediction Pipeline using Pinecone & Airflow!
The Data Guy
Download
41
Top 10 Python Libraries for Machine Learning!
The Data Guy
Download
42
Intro to Node.js and Node Package Manager(NPM)! Backend Development for Beginners!
The Data Guy
Download
43
Introduction to Airbyte! Beginner's Guide to Airbyte
The Data Guy
Download
44
How to Create a Streaming Pipeline with Apache Flink! Apache Flink for Beginners
The Data Guy
Download
45
Beginner's Guide to Looker! How to Use Looker for Data Visualizations!
The Data Guy
Download
46
Beginner's Guide to Tableau! How to Use Tableau for Data Visualization
The Data Guy
Download
47
How to Get Started with LangChain for LLM/AI Development! Beginner's Guide to LangChain
The Data Guy
Download
48
Beginner's Guide to Using TensorFlow for Machine Learning with Neural Networks! Tensor Flow Intro
The Data Guy
Download
49
End to End Image Recognition Data Pipeline using Snowflake, Snowpark, Pytorch, and Streamlit!
The Data Guy
Download
50
How to Build a AI Agent that can Use the Internet with LangChain & Tavily! LangChain Guide Pt. 2
The Data Guy
Download
51
End to End ML Location Recommendation Data Pipeline Using Snowflake's Snowpark!
The Data Guy
Download
52
How to Pull Information from the Google Rest API and Store it in AWS S3 with Airflow!
The Data Guy
Download
53
How to Run AWS Lambda Functions Using Airflow! How to Get More out of Lambda Functions with Airflow!
The Data Guy
Download
54
End-to-End Data Project: Create a Sentiment Analysis Engine with Snowpark, Streamlit, and Scikit NLP
The Data Guy
Download
55
How to Run Spark and PySpark Locally! Spark & PySpark Local Development for Beginners
The Data Guy
Download
56
How to Create a Data Pipeline with BigQuery and Airflow! Beginner's Guide to BigQuery + Airflow
The Data Guy
Download
57
How to Do Machine Learning on a Local Spark Cluster with PySpark! PySpark ML for Beginners!
The Data Guy
Download
58
How to Create an AI Application with LangChain and LangServe! LangServe for Beginners!
The Data Guy
Download
59
How to Do Incremental Data Loading and Data Validation with PySpark and Spark! Spark Basics!
The Data Guy
Download
60
Introducing Tembo, The Postgres Database for Everything: New Tool Test Drive
The Data Guy
Download
61
End-to-End Data Pipeline Project with Change Data Capture Using Postgres, Airflow, and Snowflake!
The Data Guy
Download
62
Getting Started with Starburst for Data Analytics! Starburst Managed Trino for Beginners!
The Data Guy
Download
63
How to Run a Spark Cluster with Multiple Workers Locally Using Docker
The Data Guy
Download
64
Introducing Tabular, the Managed Apache Iceberg Service! New Tool Test Drive
The Data Guy
Download
65
How to Add Microsoft Teams Alerts to Your Airflow Pipeline!
The Data Guy
Download
66
End to End AI Recommendation Pipeline with Cohere and Airflow!
The Data Guy
Download
67
How to Transform Streaming Data with Spark! Spark Script for Monitoring Wordcount
The Data Guy
Download
68
End to End Parallel Data Quality Check Pipeline Project with Airflow and Snowflake!
The Data Guy
Download
69
How to Do Backfilling and Clean-Up Operations with Spark! Spark Essentials
The Data Guy
Download
70
How to Sync Data Between S3 Buckets Using AWS DataSync and Airflow!
The Data Guy
Download
71
ETL vs ELT: Which Data Transformation Pipeline Type is Best for Your Use Case?
The Data Guy
Download
72
How to Use Spark for Extract, Transform, Load (ETL) Workflows! PySpark for ETL Pipelines
The Data Guy
Download
73
How to Rank URL Relevance Using PySpark and Google's PageRank Algorithm!
The Data Guy
Download
74
How to Query Big Data Lightning Fast with StarRocks! Run StarRocks Locally Using Docker!
The Data Guy
Download
75
How to Perform Sentiment Analysis with Apache Spark and Weaviate Vector Database!
The Data Guy
Download
76
How to Run Apache Kafka Locally Using Docker! Apache Kafka Quickstart Guide
The Data Guy
Download
77
How to Use Kafka for ETL (Extract, Transform, Load) Pipelines! Apache Kafka for ETL Explained!
The Data Guy
Download
78
Principles of Good Data Architecture! 9 Principles for Data Engineers to Follow!
The Data Guy
Download
79
The Data Engineering Life Cycle Explained! Data Engineering Lifecycle Breakdown
The Data Guy
Download
80
End-to-End MLOp's Pipeline Example with Airflow, Weaviate, and OpenAI!
The Data Guy
Download
81
How to Optimize Your SQL Queries! SQL Query Optimization Guide!
The Data Guy
Download
82
End-to-End Automated Real Time Batch Processing Pipeline with Apache Kafka and Spark
The Data Guy
Download
83
Beginner's Guide to Neo4j Graph Databases! Neo4j Graph Database Explained!
The Data Guy
Download
84
How to Build Reliable, Scalable, and Maintainable Data Applications! Data Application Best Practices
The Data Guy
Download
85
How to Build a Real-Time Fraud Detection Pipeline with Apache Flink!
The Data Guy
Download
86
How to Do Machine Learning Operations in Production with Airflow! MLOp's Production Pipeline Example
The Data Guy
Download
87
5 Data Projects to Help You Get a Data Engineering Job! 5 Solo Data Engineering Project Ideas
The Data Guy
Download
88
How to Detect Cyber-Security Threats with Apache Spark! Apache Spark Threat Detection Script!
The Data Guy
Download
89
End-to-End Data Processing Project with Apache Kafka, Apache Spark, and Apache Airflow!
The Data Guy
Download
90
Data Partitioning Explained! Everything You Need to Know About Data Partitioning!
The Data Guy
Download
91
Data Engineering Crash Course Guide! How to Learn Data Engineering in 2024!
The Data Guy
Download
92
Apache NiFi: Master Data Flows in Seconds
The Data Guy
Download
93
How to Build a Fraud Detection Pipeline with PyFlink! Apache Flink Python Script for Fraud Detection
The Data Guy
Download
94
Project Set-Up Best Practices! How to Set Up Your Data Engineering Project File Structure and CICD!
The Data Guy
Download
95
How to Migrate Airflow DAG Metadata from One Airflow Environment to Another Using Airflow!
The Data Guy
Download
96
Common Data Engineering Career Challenges and How to Overcome Them!
The Data Guy
Download
97
How to Migrate Data from a Redshift Database to Any SQL Database Using Airflow!
The Data Guy
Download
98
How to Optimize Your SQL Queries with Real World Examples! SQL Query Optimization for Business!
The Data Guy
Download
99
How to Use Index-All to Make Your Database and SQL Queries Hum!
The Data Guy
Download
100
Top Python Utility Scripts for Data Engineers and Data Scientists!
The Data Guy
Download
101
Intro to Apache Iceberg! Apache Iceberg Explained for Beginners!
The Data Guy
Download
102
How to Migrate Data from a Postgres SQL Database to a Snowflake Database Using Apache Airflow!
The Data Guy
Download
103
How to Batch Run Machine Learning Models with Apache Flink's PyFlink!
The Data Guy
Download
104
Introduction to Mesop, an Open-Source Streamlit Competitor for Creating Easy AI Applications!
The Data Guy
Download
105
End to End Incremental Data Lake Data Loading Pipeline with Apache Hudi, Trino, and Spark!
The Data Guy
Download
106
How to Build a Retrieval Augmented Generation Pipeline with Meta's LLAMA 3 Large Language Model!
The Data Guy
Download
107
Best Practices for Building Data Pipelines! How to Build the Best Data Pipelines
The Data Guy
Download
108
End-to-End Postgres to Data Lake Data Processing Pipeline with Apache Spark, Airflow, and Minio!
The Data Guy
Download
109
Data Modeling Best Practices! Best Practices for Designing Your Data Models!
The Data Guy
Download
110
End to End Micro-Batching Pipeline With Apache Airflow and Kafka!
The Data Guy
Download
111
Best Practices for dbt With Real World Examples!
The Data Guy
Download
112
How to Build a BigQuery Ingestion Pipeline from API's, SFTP Servers, and Pub/Sub with Airflow!
The Data Guy
Download
113
Data Engineering Interview Guide! How to Get a Data Engineering Job!
The Data Guy
Download
114
How to Get Started with Soda for Data Quality Checks!
The Data Guy
Download