ETL

Copy SQL Databases between Windows 10 and CentOS using SQL Server data tools

October 19, 2022 by Nisarg Upadhyay

In this article, we are going to learn how to copy SQL databases between different instances of SQL Server. This is the fourth article in the Manage SQL Server on CentOS series. In my previous article, Copy SQL Databases between Windows 10 and CentOS using the SQL Server import-export wizard, we learned how to copy SQL database data between Windows 10 and CentOS Linux using the wizard.

Copy SQL Databases between Windows 10 and CentOS using the SQL Server import-export wizard

September 21, 2022 by Nisarg Upadhyay

In this article, we will learn how to back up and restore a SQL database in CentOS. This is the third article in the Manage SQL Server on CentOS series. In my previous article, we learned how to copy SQL database data between Windows 10 and CentOS Linux using SQL Server Management Studio.

Understanding Transfer database objects in SSDT 2017

March 2, 2022 by Nisarg Upadhyay

In this article, we are going to learn how to copy database objects between two databases on different instances of SQL Server. Sometimes, we receive a request to copy specific database objects to another database. To fulfill this requirement, instead of generating a T-SQL script for each object and exporting the data with the import-export task, we can use the Transfer SQL Server Objects task of SSDT 2017. In this article, we will see how to transfer the database objects of a specific schema to another database.

Migrate tables with a foreign key to memory-optimized tables of SQL database

February 23, 2022 by Nisarg Upadhyay

In this article, we are going to learn how we can migrate the tables with a foreign key to memory-optimized tables of the SQL database.

Configure Transfer Database Task in SSDT 2017

January 27, 2022 by Nisarg Upadhyay

This article explains the Transfer Database task in SSDT 2017. After reading this article, you will be able to transfer databases within the same server or between different servers.

Configure ODBC drivers for Azure Database for PostgreSQL

January 24, 2022 by Nisarg Upadhyay

This article helps you learn how to configure an ODBC driver to connect to Azure Database for PostgreSQL. In my previous article, Configure ODBC drivers for PostgreSQL, you learned the step-by-step process to download, install, and configure the ODBC driver for PostgreSQL. We also learned how to create a DSN that is used to connect to PostgreSQL and fetch data from it.

Incremental Data Extraction for ETL using Database Snapshots

January 10, 2022 by Dinesh Asanka

This article explains how to perform ETL using database snapshots and how to use them for incremental data extraction from an operational database to a data warehouse.
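As a rough illustration of the idea, the sketch below compares two point-in-time copies of a table to find the rows that changed between extractions. It is only an assumption about how snapshot-based incremental extraction can work, not the method used in the article: the function name diff_snapshots and the dictionary representation (primary key mapped to row values) are hypothetical, and a real implementation would compare SQL Server database snapshots with T-SQL queries.

```python
def diff_snapshots(previous, current):
    """Compare two snapshots of a table, each a dict of primary key -> row.

    Returns the rows that were inserted, updated, and deleted between the
    previous snapshot and the current one.
    """
    inserted = {k: v for k, v in current.items() if k not in previous}
    deleted = {k: v for k, v in previous.items() if k not in current}
    updated = {
        k: v
        for k, v in current.items()
        if k in previous and previous[k] != v
    }
    return inserted, updated, deleted


# Hypothetical snapshots of a small customer table, keyed by customer id.
old_snapshot = {1: ("Alice", "Paris"), 2: ("Bob", "Rome"), 3: ("Carol", "Oslo")}
new_snapshot = {1: ("Alice", "Paris"), 2: ("Bob", "Milan"), 4: ("Dan", "Cairo")}

inserted, updated, deleted = diff_snapshots(old_snapshot, new_snapshot)
```

Only the rows in inserted, updated, and deleted would need to be sent to the data warehouse, instead of reloading the whole table.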

How to move tables to another filegroup of a SQL database

December 14, 2021 by Nisarg Upadhyay

This article explains the different methods to move tables between the filegroups of a SQL database.

Transfer error messages between SQL Server instances using SSDT 2017

November 29, 2021 by Nisarg Upadhyay

This is the fourth article in the series Migrating SQL Server Objects using SSDT 2017. In the third article, we learned how to transfer SQL logins between servers. In this article, we will learn how to copy error messages to another instance using the Transfer Error Messages task of SSDT 2017.

SSIS term extraction vs. term lookup

November 25, 2021 by Hadi Fadlallah

This article explains both the SSIS term extraction and term lookup components. It also illustrates the use cases of each.

SSIS Script task vs. Script Component

November 19, 2021 by Hadi Fadlallah

This article explains the SSIS Script task and Script component, the scripting tools available in SQL Server Integration Services (SSIS).

Transfer SQL Logins between SQL Server instances using SSDT 2017

November 16, 2021 by Nisarg Upadhyay

This is the third article in the series Migrating SQL Server Objects using SSDT 2017. In the second article, we learned how we can transfer the stored procedure of the master database. In this article, we will learn how to copy SQL logins to another instance using SSDT 2017.

Transfer SQL Jobs between SQL Server instances using SSDT 2017

November 5, 2021 by Nisarg Upadhyay

In this series of articles, we will learn how to clone a SQL Server instance using SSDT 2017. In my previous article, we learned how to install and configure SSDT 2017 on Windows 10. In this series, we will migrate the following SQL Server components to another server using the SSIS tools provided in SSDT 2017:

Use Replication to improve the ETL process in SQL Server

November 4, 2021 by Dinesh Asanka

This article describes how to improve the ETL process in SQL Server by using its native replication feature.

Data Access Modes in SSIS OLE DB Destination: SQL Command vs. Table or View

October 5, 2021 by Hadi Fadlallah

This article compares the “SQL command” and “Table or View” data access modes of the SSIS OLE DB Destination. In a previous article in this series, SSIS OLE DB Destination vs SQL Server Destination, we explained the OLE DB Destination component in SSIS and illustrated how it differs from the SQL Server Destination component. So, in this article, we will focus on the OLE DB Destination data access modes, not the component itself.

SSIS Merge Join vs. Merge Transformation

September 28, 2021 by Hadi Fadlallah

SSIS Merge Join and Merge transformations are two similar components used to combine two input data sources into one output. This article explains both transformations and illustrates when and how to use each one of them.
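To make the distinction concrete, here is a minimal Python sketch of the two ideas. It is not SSIS code: the function names merge_sorted and inner_merge_join and the sample rows are hypothetical. Merge interleaves the rows of two sorted inputs (similar to a sorted UNION ALL), while Merge Join matches rows from two sorted inputs on a key and combines their columns. Only the inner join case is sketched here.

```python
import heapq


def merge_sorted(left, right, key):
    """Interleave the rows of two inputs that are already sorted on `key`."""
    return list(heapq.merge(left, right, key=key))


def inner_merge_join(left, right, key):
    """Inner-join two inputs that are already sorted on `key`."""
    out, i, j = [], 0, 0
    while i < len(left) and j < len(right):
        lk, rk = key(left[i]), key(right[j])
        if lk < rk:
            i += 1
        elif lk > rk:
            j += 1
        else:
            # Find the run of right-hand rows that share this key.
            j_end = j
            while j_end < len(right) and key(right[j_end]) == lk:
                j_end += 1
            # Pair every left-hand row with that key against the whole run.
            while i < len(left) and key(left[i]) == lk:
                for r in right[j:j_end]:
                    out.append({**left[i], **r})
                i += 1
            j = j_end
    return out


def by_id(row):
    return row["id"]


# Hypothetical inputs, both sorted on "id".
names = [{"id": 1, "name": "a"}, {"id": 3, "name": "c"}]
cities = [{"id": 1, "city": "x"}, {"id": 2, "city": "y"}]

merged = merge_sorted(names, cities, by_id)
joined = inner_merge_join(names, cities, by_id)
```

Here merged holds all four rows in id order, while joined holds a single combined row for id 1, the only key present in both inputs.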

SSIS Pivot transformation vs. Unpivot transformation

September 8, 2021 by Hadi Fadlallah

This article explains the SSIS pivot transformation and how it differs from the unpivot transformation.
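As a quick illustration of the two operations outside SSIS, the Python sketch below turns rows into columns (pivot) and columns back into rows (unpivot). The function names, parameters, and the sample sales data are hypothetical and are not taken from the article.

```python
def pivot(rows, key, column, value):
    """Turn one row per (key, column) pair into one wide row per key."""
    out = {}
    for row in rows:
        out.setdefault(row[key], {key: row[key]})[row[column]] = row[value]
    return list(out.values())


def unpivot(rows, key, column, value):
    """Turn each non-key column of a wide row into its own narrow row."""
    return [
        {key: row[key], column: name, value: cell}
        for row in rows
        for name, cell in row.items()
        if name != key
    ]


# Hypothetical sales data: one row per product and quarter.
long_rows = [
    {"product": "A", "quarter": "Q1", "sales": 10},
    {"product": "A", "quarter": "Q2", "sales": 20},
    {"product": "B", "quarter": "Q1", "sales": 5},
]

wide_rows = pivot(long_rows, "product", "quarter", "sales")
round_trip = unpivot(wide_rows, "product", "quarter", "sales")
```

In this sketch, unpivoting the pivoted rows returns the original narrow rows, which shows that the two operations are inverses of each other for this data.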

Develop ETL templates for Data Warehousing using Azure Data Factory

August 26, 2021 by Gauri Mahajan

In this article, we will learn how to develop ETL templates for data warehousing using Azure Data Factory.

Integrating APIs into data pipelines with Azure Data Factory

August 9, 2021 by Gauri Mahajan

In this article, we will learn how to invoke an API and process the data returned by the API using Azure Data Factory.

SSIS Lookup transformation vs. Fuzzy Lookup transformation

July 28, 2021 by Hadi Fadlallah

This article will explain how to use the SSIS lookup transformation and how it differs from the fuzzy lookup. This is a continuation of the SSIS features face-to-face series that we published previously on this website.

Introduction to Apache Spark

April 12, 2021 by Aveek Das

In this article, I am going to discuss Apache Spark and how to create robust ETL pipelines for transforming big data. I will start from the very basics of Spark and then provide details on how to install Spark and start building the pipelines. In the later part of the article, I will also discuss how to use the Spark APIs to transform data and load it into Spark data frames and SQL for further analysis.

Implementing a Modular ETL in SSIS

November 24, 2020 by Aveek Das

In this article, I am going to demonstrate how to implement a modular ETL in SSIS in practice. In my previous article, Designing a Modular ETL Architecture, I explained in theory what a modular ETL solution is and how to design one. We also covered the concepts behind a modular ETL solution and its benefits in the world of data warehousing, and we related the concept of microservices architecture in software development to that of the modular ETL solution.

Designing a Modular ETL Architecture

November 17, 2020 by Aveek Das

In this article, I am going to explain the Modular ETL Architecture in detail. ETL is a vast concept that describes the methodology of moving data from various sources to destinations while transforming it along the way. This is an advanced article that assumes the reader has a substantial understanding of how ETL is implemented using tools like SSIS, of the underlying working principle, and of how to deploy multiple packages using SSIS. It is extremely important to implement a well-designed ETL architecture for your organization’s workload. Otherwise, you might face performance degradation along with other challenges. To keep things simple, I will only explain the Modular ETL Architecture in this article, which will be followed by a detailed hands-on tutorial in the next article, “Implementing Modular Architecture in ETL using SSIS”.

An introduction to SSIS Data Lineage concepts

September 3, 2020 by Aveek Das

In this article, I am going to discuss SSIS data lineage concepts, which are often used while designing ETL workloads on a data warehouse. Although this article focuses on implementing data lineage using SSIS, the concepts are not confined to SSIS and apply to any ETL tool that moves data from a source to a destination. In my previous article, Understanding Data Lineage in ETL, I discussed the general importance of data lineage concepts for any ETL tool. I would definitely suggest you have a look at it if you want to understand in general how data lineage helps to track the source of a single record in the warehouse.

Understanding Data Lineage in ETL

September 3, 2020 by Aveek Das

In this article, I am going to explain what data lineage in ETL is and how to implement it. In this modern world, where companies deal with a huge amount of data every day, managing and monitoring this data efficiently is a challenge. Some systems generate data every second, and that data is processed and sent to a final reporting or monitoring tool for analysis. To process this data, we use a variety of ETL tools, which make it possible to transform the data in a managed way.