Data engineers and data scientists benefit from using best practices learned from years of software development. This video walks through 3 of the most important practices to build quality analytics solutions. It is meant to be an overview of what following these practices looks like for a Databricks developer.
This video covers:
Version control basics and demo of Git integration with Databricks workspace
Automated tests with pytest for unit testing and Databricks Workflows for integration testing
CI/CD including running tests prior to deployment with GitHub Actions
All thoughts and opinions are my own , though for this video influenced by Databricks SMEs *
Intro video that discusses development process and full list of best practices is available here: • 7 Best Practices for Development and ...
Blog post for Developer Best Practices on Databricks: https://dustinvannoy.com/2025/01/05/b...
More from Dustin:
Website: https://dustinvannoy.com
LinkedIn: / dustinvannoy
Github: https://github.com/datakickstart
CHAPTERS
0:00 Intro
0:31 Version Control (Git)
7:57 Unit Tests + Integration Tests
28:00 Automated Deploy
36:35 Outro