Hadi Fadlallah
SSIS XML Source expressions

SSIS XML Source vs. XML task

November 3, 2021 by

This article explains the SSIS XML Source and the SSIS XML task, which are the XML-related components in the SQL Server Integration Services.

This article is the sixteenth of the SSIS features face-to-face series that aims to compare similar SSIS components in order to remove any confusion for the data integration developers that use this technology.

Introduction

XML stands for Extensible Markup Language. It consists of codes, or tags, that describe the text in a digital document. XML was designed to store and transport data and to be both human- and machine-readable.

XML is a semi-structured file format that contains tags or other markers to separate semantic elements and enforce hierarchies of records and fields within the data. In XML, schema can be defined within the XML file (called inline schema) or stored within a separate XML schema definition file (XSD).

XML is a generic data storage format that comes bundled with several tools and technologies. It is a standard data format that becomes widely used in data transfer worldwide, making it frequently used in the data warehousing process.

In SQL Server Integration Services (SSIS), one task and one component are provided, allowing developers to perform several XML-related operations.

This article will explain the SSIS XML source component and how to use it to import data from XML files. Besides, we will explain the XML task and what kind of operation it performs.

Unfortunately, SSIS does not have a destination component for storing data within XML files. But there are several workarounds and third-party components that can be used to achieve that. We have previously illustrated them in the following article: SSIS XML Destination.

Integration Services (SSIS), XML

About Hadi Fadlallah

Hadi is an SQL Server professional with more than 10 years of experience. His main expertise is in data integration. He's one of the top ETL and SQL Server Integration Services contributors at Stackoverflow.com . Also, he published several article series about Biml, SSIS features, Search engines, Hadoop, and many other technologies. Besides working with SQL Server, he worked with different data technologies such as NoSQL databases, Hadoop, Apache Spark. He is a Neo4j and ArangoDB certified professional. On the academic level, Hadi holds two master's degrees in computer science and business computing. Currently, he is a Ph.D. candidate in data science focusing on Big Data quality assessment techniques. Hadi really enjoys learning new things everyday and sharing his knowledge. You can reach him on his personal website. View all posts by Hadi Fadlallah

150 Views