Optimizing ETL Pipelines with Informatica: Performance, Scalability, and Governance

Authors

  • Raghuvaran Kendyala University of Illinois at Springfield, Illinois, USA Author
  • Sandeep Batchu Western Kentucky University, Kentucky, USA Author
  • Vivek Sheetal Dhaduvai Texas A&M University - Kingsville, TX - USA Author
  • Kendyala Srinivasulu Harshavardhan University of Illinois at Springfield, Illinois, USA Author

Keywords:

ETL, Informatica PowerCenter, Data Quality

Abstract

The purpose of this paper is to explore the optimization of ETL (Extract, Transform, Load) pipeline using Informatica tools which focuses on performance, scalability, and governance. This paper’s objective is to explore the best strategies in the design, development, and administration of data integration by leveraging industry standard tools like Informatica PowerCenter, Informatica Test Data Management, Informatica Data Quality, Informatica Master Data Management, and Informatica Data Management Cloud (IDMC). Using these tools optimization of data transformation efficiency through efficient data processing techniques, workflow automation, and robust scripting, mainly using Unix shell scripting.

Readership Data

🌐

Refreshing Cached Analytics Data

The cached analytics data has become stale and www.thesciencebrigade.com is making a fresh request to fetch the latest data from Google Analytics. This may take 20-30 seconds depending on the server response time from Google Analytics. Please do not close the browser during this time. We appreciate your patience.

Downloads

Download data is not yet available.

Downloads

Published

27-11-2020

How to Cite

“Optimizing ETL Pipelines With Informatica: Performance, Scalability, and Governance ”. Journal of Science & Technology, vol. 1, no. 1, Nov. 2020, pp. 809-46, https://www.thesciencebrigade.com/jst/article/view/586.

Plaudit