Data Versioning and Its Impact on Machine Learning Models

Thirupurasundari Chandrasekaran; Sreenivasulu Ramisetty; Vamsi Krishna Eruvaram; Mohan Raja Pulicharla

doi:10.55662/JST.2024.5101

Data Versioning and Its Impact on Machine Learning Models

Authors

Thirupurasundari Chandrasekaran Sr. Project Manager, Phoenix, AZ USA Author
Sreenivasulu Ramisetty Data Architect, Conduent Services Inc Georgia, USA Author
Vamsi Krishna Eruvaram Sr. Data Engineer, Lowe's, USA Author
Mohan Raja Pulicharla Data Engineer, Maryland USA Author

PlumX DOI based Article Level Metrics

DOI:

https://doi.org/10.55662/JST.2024.5101

Keywords:

Machine Learning Models, Data Versioning, ML pipeline

Abstract

Data versioning in machine learning is of paramount importance as it ensures the reproducibility, transparency, and reliability of ML models. In the dynamic landscape of ML research, where models heavily rely on diverse datasets, data versioning plays a crucial role in maintaining consistency throughout the ML pipeline. By tracking changes in datasets over time and aligning machine learning models with specific versions of data, researchers can reproduce experiments, verify results, and address challenges related to data quality, collaboration, and model training. Effective data versioning practices contribute to the robustness of ML workflows, fostering trust in model outcomes and supporting advancements in the field.

Readership Data

−

🌐

Refreshing Cached Analytics Data

The cached analytics data has become stale and www.thesciencebrigade.com is making a fresh request to fetch the latest data from Google Analytics. This may take 20-30 seconds depending on the server response time from Google Analytics. Please do not close the browser during this time. We appreciate your patience.

Downloads

Download data is not yet available.

Citation Metrics

Downloads

Published

29-01-2024

Issue

Vol. 5 No. 1 (2024): Journal of Science & Technology

Section

Review Articles

License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

License Terms

Ownership and Licensing:

Authors of this research paper submitted to the journal owned and operated by The Science Brigade Group retain the copyright of their work while granting the journal certain rights. Authors maintain ownership of the copyright and have granted the journal a right of first publication. Simultaneously, authors agreed to license their research papers under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) License.

License Permissions:

Under the CC BY-NC-SA 4.0 License, others are permitted to share and adapt the work, as long as proper attribution is given to the authors and acknowledgement is made of the initial publication in the Journal. This license allows for the broad dissemination and utilization of research papers.

Additional Distribution Arrangements:

Authors are free to enter into separate contractual arrangements for the non-exclusive distribution of the journal's published version of the work. This may include posting the work to institutional repositories, publishing it in journals or books, or other forms of dissemination. In such cases, authors are requested to acknowledge the initial publication of the work in this Journal.

Online Posting:

Authors are encouraged to share their work online, including in institutional repositories, disciplinary repositories, or on their personal websites. This permission applies both prior to and during the submission process to the Journal. Online sharing enhances the visibility and accessibility of the research papers.

Responsibility and Liability:

Authors are responsible for ensuring that their research papers do not infringe upon the copyright, privacy, or other rights of any third party. The Science Brigade Publishers disclaim any liability or responsibility for any copyright infringement or violation of third-party rights in the research papers.

How to Cite

“Data Versioning and Its Impact on Machine Learning Models”. Journal of Science & Technology, vol. 5, no. 1, Jan. 2024, pp. 22-37, https://doi.org/10.55662/JST.2024.5101.

Download Citation

Data Versioning and Its Impact on Machine Learning Models

Authors

PlumX DOI based Article Level Metrics

DOI:

Keywords:

Abstract

Readership Data

TOTAL COUNTRIES

TOTAL ABS. VIEWS

TOTAL PDF VIEWS

📊 Engagement Timeline

🏆 Competitive Performance

Downloads

Citation Metrics

Downloads

Published

Issue

Section

License

License Terms

How to Cite

Plaudit

Journal Snapshot

Readership Insights

Make a Submission

License Terms

Data Versioning and Its Impact on Machine Learning Models

Authors

PlumX DOI based Article Level Metrics

DOI:

Keywords:

Abstract

Readership Data

TOTAL COUNTRIES

TOTAL ABS. VIEWS

TOTAL PDF VIEWS

📈 Trending

📊 Engagement Timeline

🏆 Competitive Performance

Downloads

Citation Metrics

Downloads

Published

Issue

Section

License

License Terms

How to Cite

Plaudit

Journal Snapshot

Readership Insights

Make a Submission

License Terms