Databricks Personas – Spark vs. SQL Users

Advanced User (Spark-centric)

Primary Language: PySpark / Scala

Typical Work: Complex pipelines, streaming, ML

Output: Reusable libraries, high-scale data flows

Best fit: Flexibility and performance for complex needs

Basic User (SQL-centric)

Primary Language: SQL

Typical Work: Clean tables for reporting, aggregations

Output: BI-ready datasets and data marts

Best fit: Repeatable, governed SQL transformations

dbt bridges the gap — giving SQL users the same engineering rigor as Spark users