Airflow for Beginners: Build AI models ETL Job in 20 mins

Опубликовано: 03 Май 2025
на канале: Luke J Byrne
388
20

Skip the setup pain—learn Airflow properly here: https://datacamp.pxf.io/Wygvoe

Build an Apache Airflow ETL Pipeline with Docker – From Scratch

In this tutorial, you’ll learn how to set up Apache Airflow using Docker and create a full ETL pipeline that pulls model data from the Hugging Face API, transforms it, and loads it into a PostgreSQL database using pgAdmin.

No fluff. Just a clear, practical walkthrough.

What you’ll learn:
• Set up Apache Airflow with Docker
• Create and run your first DAG
• Extract data from the Hugging Face API
• Clean and transform data in Python
• Load data into PostgreSQL using pgAdmin
• Connect Airflow to a database using hooks
• Pass data between tasks with xcom

Tech stack used:
Apache Airflow, Docker & Docker Compose, Python, PostgreSQL + pgAdmin, Hugging Face Transformers API

--------------------

📊 DataCamp Data Engineer Certification (50% OFF):
https://datacamp.pxf.io/POmAMN

👉 Join the Applied AI Community:
https://lukejb.short.gy/EVrRaI

📮 Join the Newsletter:
https://lukejbyrne.com/subscribe

🔗 Follow me on Linkedin:
  / lukejbyrne  

--------------------

🏅 RECOMMENDED COURSES:
(All under one subscription)

Top AI Courses:
AI Engineer for Developers Course - https://datacamp.pxf.io/gO6WRB
AI Engineer for Data Scientists Course - https://datacamp.pxf.io/XmnX4a
Developing AI Applications Course - https://datacamp.pxf.io/Bny25L

Top Data Courses:
Data Engineer in Python - https://datacamp.pxf.io/aOAWNZ
Associate Data Scientist in Python - https://datacamp.pxf.io/AP3Eg1
Data Analyst with Python - https://datacamp.pxf.io/GKVZbk
Associate Data Analyst in SQL - https://datacamp.pxf.io/Z6KyV1

--------------------

0:00 Introduction to Airflow & ETL Pipeline
0:45 Understanding the Architecture (Docker, Airflow, PostgreSQL, PGAdmin)
10:00 Customizing Docker for Dependencies (Dockerfile, requirements.txt)
15:45 Setting up pgAdmin and Database Connection
19:25 Coding the Airflow DAG (Extract, Transform, Load tasks)
24:13 Running the DAG & Verifying Data in pgAdmin

--------------------

Business inquiries: [email protected]