Prod data in dev? Creating dummy data and masking sensitive data using Python & PySpark with Faker fast. Filip PastuszkaBest Practices, Data Engineering
Is DBT the new solution to all your data problems? Filip PastuszkaBest Practices, Data Engineering, Databricks
Mastering Databricks #1: 3 Foolproof Ways to Test Your Code Like a Pro Filip PastuszkaData Engineering, Databricks, Design Principles
Dive In: Beyond Swimming – Decoding Data Lakes, Delta Lakes, Lakehouses, and More! Filip PastuszkaData Storage
Relational vs distributed storage – which one is right for you? Filip Pastuszka3 CommentsData Storage