Data Engineering
-
Google Cloud Functions: How to Automate Data Ingestion Pipeline Within GCP
Introduction In the world of data, getting your hands on the data is the first, most critical step. Manually downloading…
Read More » -
Pandas vs PySpark: How to Choose the Right Tool for Your Data Ingestion Pipeline
Introduction Moving data from source to sink is the first technical step in a data pipeline. This process, called data…
Read More » -
How to Clean Text Data in Excel: A Beginner’s Guide
Introduction Clean text data in Excel is essential for accurate, reliable analysis.When working with text data on Excel, dirty data…
Read More » -
Steps and Best Practices To Transform Data For Analysis
Data transformation is the backbone of modern analytics. It involves converting raw data into actionable insights by cleansing, structuring, and…
Read More » -
Easily Load Data Into Google Cloud Storage Using Python
While Google Cloud Storage, Amazon S3 and Azure Blob all made it easy to create buckets and load files into…
Read More » -
How To Efficiently Load Data Into Postgres Using Python
To efficiently load data into postgres database using python, I will recommend two methods in this article. They are efficient,…
Read More » -
Easily Transform Data Using Pandas and Regular Expression
Introduction Let’s Easily transform this dataset using pandas and regular expression in line with the problem statement. In the extraction…
Read More » -
Beginner Friendly ETL Project with Python, Postgres, Power BI
As my workflow and learning process has always been. Learn ===> Apply ===> Grow. You can easily break into this…
Read More »