Hands-On Exercise: Databricks Pipeline
1. Bronze Ingestion
Run 00_setup.py to create schemas
Ingest Parquet files from ADLS Gen2 into Bronze
Verify ~3M rows in Unity Catalog
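
A minimal sketch of the Bronze step, assuming a Databricks notebook where the `spark` session already exists. The container, storage account, path, and table names are hypothetical placeholders, and the 10% tolerance around the ~3M row target is an assumption for illustration, not part of the exercise.

```python
def abfss_path(container: str, account: str, path: str) -> str:
    """Build an ADLS Gen2 URI in the abfss:// form used by Spark."""
    return f"abfss://{container}@{account}.dfs.core.windows.net/{path.lstrip('/')}"


def ingest_bronze(spark, source_path: str, target_table: str) -> int:
    """Read raw Parquet and save it unchanged as a Delta table.

    `spark` is the SparkSession that a Databricks notebook provides.
    `target_table` is a Unity Catalog name such as catalog.schema.table.
    Returns the row count of the written table.
    """
    raw_df = spark.read.parquet(source_path)
    raw_df.write.format("delta").mode("overwrite").saveAsTable(target_table)
    return spark.table(target_table).count()


def row_count_ok(actual: int, expected: int = 3_000_000, tolerance: float = 0.10) -> bool:
    """Check that the row count is within `tolerance` of the expected ~3M rows.

    The tolerance is an assumed value; the exercise only says "~3M".
    """
    return abs(actual - expected) <= expected * tolerance


# Example notebook usage (all names below are hypothetical):
# path = abfss_path("raw", "mystorageaccount", "trips/")
# rows = ingest_bronze(spark, path, "main.bronze.trips")
# assert row_count_ok(rows), f"unexpected Bronze row count: {rows}"
```

Passing `spark` in as a parameter keeps the helpers importable outside a notebook, which makes the path and row-count checks easy to test on their own.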
2. Silver Cleaning
Quality filters: ~5-15% row reduction
Derived columns & zone enrichment
Delta Lake time travel & history
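
The Silver step could look roughly like the sketch below. All column names (`trip_distance`, `fare_amount`, `pickup_ts`, `dropoff_ts`, `pickup_location_id`) and the specific filter rules are assumptions chosen for illustration; the actual schema and rules come from the exercise notebooks. The time-travel helpers use the Delta Lake SQL statements `DESCRIBE HISTORY` and `VERSION AS OF`.

```python
# Hypothetical quality rules, written as SQL strings for DataFrame.filter.
QUALITY_FILTERS = [
    "trip_distance > 0",
    "fare_amount >= 0",
    "dropoff_ts > pickup_ts",
]


def clean_trips(bronze_df, zones_df):
    """Apply quality filters, add a derived column, and enrich with zones.

    Assumes `zones_df` has a `pickup_location_id` column to join on.
    """
    cleaned = bronze_df
    for condition in QUALITY_FILTERS:
        cleaned = cleaned.filter(condition)
    # Derived column: trip duration in minutes.
    cleaned = cleaned.selectExpr(
        "*",
        "(unix_timestamp(dropoff_ts) - unix_timestamp(pickup_ts)) / 60.0 AS trip_minutes",
    )
    # Left join keeps trips even when the zone lookup has no match.
    return cleaned.join(zones_df, on="pickup_location_id", how="left")


def reduction_pct(bronze_rows: int, silver_rows: int) -> float:
    """Percentage of rows removed between Bronze and Silver."""
    return 100.0 * (bronze_rows - silver_rows) / bronze_rows


def in_expected_range(pct: float, low: float = 5.0, high: float = 15.0) -> bool:
    """Sanity check against the approximate 5-15% reduction in the exercise."""
    return low <= pct <= high


def table_history(spark, table: str):
    """Return the Delta transaction log for `table` as a DataFrame."""
    return spark.sql(f"DESCRIBE HISTORY {table}")


def read_version(spark, table: str, version: int):
    """Time travel: read `table` as it was at a given Delta version."""
    return spark.sql(f"SELECT * FROM {table} VERSION AS OF {int(version)}")
```

A reduction far outside the expected range usually means a filter is too strict or the Bronze load was incomplete, so it is worth checking before building Gold tables.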
3. Gold KPIs
12 business aggregation tables
Trips by hour, day, zone, borough
Dashboard-ready for Priya's KPIs
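
One way to sketch the trip-count aggregations is shown below. It covers only the four dimensions named above, not all 12 Gold tables, and the source table, column names, and output table names are hypothetical.

```python
# Maps each dimension to a Spark SQL expression over assumed Silver columns.
DIMENSIONS = {
    "hour": "hour(pickup_ts)",
    "day": "to_date(pickup_ts)",
    "zone": "pickup_zone",
    "borough": "pickup_borough",
}


def kpi_query(dimension: str, source_table: str = "silver.trips") -> str:
    """Build a trip-count aggregation query for one dimension."""
    expr = DIMENSIONS[dimension]
    return (
        f"SELECT {expr} AS trip_{dimension}, COUNT(*) AS trip_count "
        f"FROM {source_table} GROUP BY {expr}"
    )


def build_gold_table(spark, dimension: str, source_table: str = "silver.trips") -> str:
    """Run the aggregation and save it as a Delta table for dashboards.

    `spark` is the SparkSession provided by the Databricks notebook.
    Returns the (hypothetical) name of the table that was written.
    """
    target = f"gold.trips_by_{dimension}"
    result = spark.sql(kpi_query(dimension, source_table))
    result.write.format("delta").mode("overwrite").saveAsTable(target)
    return target


# Example notebook usage:
# for dim in DIMENSIONS:
#     build_gold_table(spark, dim)
```

Keeping each KPI as its own small table means a dashboard can read pre-aggregated rows instead of scanning the full Silver table on every refresh.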