Systematic Review of Advancing Machine Learning Through Cross-Domain Analysis of Unlabeled Data

Authors

  • Yue Zhu Georgia Institute of Technology, GA USA Author
  • Johnathan Crowell Independent Researcher, USA Author

PlumX DOI based Article Level Metrics

DOI:

https://doi.org/10.55662/JST.2023.4104

Keywords:

Self-Supervised Learning, Pretext Tasks, Representation Learning, Contrastive Learning, Generative Models, Masked Language Modeling, Transfer Learning, Domain Adaptation, Multi-Modal Learning, Data Efficiency

Abstract

Self-supervised learning (SSL) has become a transformative approach in the field of machine learning, offering a powerful means to harness the vast amounts of unlabeled data available across various domains. By creating auxiliary tasks that generate supervisory signals directly from the data, SSL mitigates the dependency on large, labeled datasets, thereby expanding the applicability of machine learning models. This paper provides a comprehensive exploration of SSL techniques applied to diverse data types, including images, text, audio, and time-series data. We delve into the underlying principles that drive SSL, examine common methodologies, and highlight specific algorithms tailored to each data type. Additionally, we address the unique challenges encountered in applying SSL across different domains and propose future research directions that could further enhance the capabilities and effectiveness of SSL. Through this analysis, we underscore SSL's potential to significantly advance the development of robust, generalizable models capable of tackling complex real-world problems.

Readership Data

🌐

Refreshing Cached Analytics Data

The cached analytics data has become stale and www.thesciencebrigade.com is making a fresh request to fetch the latest data from Google Analytics. This may take 20-30 seconds depending on the server response time from Google Analytics. Please do not close the browser during this time. We appreciate your patience.

Downloads

Download data is not yet available.

References

van den Oord, A., Li, Y., & Vinyals, O. (2018). Representation learning with contrastive predictive coding. arXiv preprint arXiv:1807.03748.

Jaiswal, A., Babu, A. R., Zadeh, M. Z., Banerjee, D., & Makedon, F. (2020). A survey on contrastive self-supervised learning. Technologies, 9(1), 2. https://doi.org/10.3390/technologies9010002

Jin, W., Liu, L., Cai, Y., Zhou, J., Feng, X., Liu, J., ... & Tang, J. (2020). Self-supervised learning on graphs: Deep insights and new direction. arXiv preprint arXiv:2006.10141.

Krishnan, R., Rajpurkar, P., & Topol, E. J. (2022). Self-supervised learning in medicine and healthcare. Nature Biomedical Engineering, 6(12), 1346-1352. https://doi.org/10.1038/s41551-022-00911-4

Wang, Y., Albrecht, C. M., Braham, N. A. A., Mou, L., & Zhu, X. X. (2022). Self-supervised learning in remote sensing: A review. IEEE Geoscience and Remote Sensing Magazine, 10(4), 213-247. https://doi.org/10.1109/MGRS.2022.3182134

Tung, H.-Y., Tung, H.-W., Yumer, E., & Fragkiadaki, K. (2017). Self-supervised learning of motion capture. Advances in Neural Information Processing Systems, 30, 5234-5245.

Radford, A., Narasimhan, K., Salimans, T., & Sutskever, I. (2018). Improving language understanding by generative pre-training. arXiv preprint arXiv:1810.04805.

Schneider, S., Baevski, A., Collobert, R., & Auli, M. (2019). wav2vec: Unsupervised pre-training for speech recognition. arXiv preprint arXiv:1904.05862.

Chen, T., Kornblith, S., Norouzi, M., & Hinton, G. (2020). A simple framework for contrastive learning of visual representations. In International Conference on Machine Learning (pp. 1597-1607). PMLR.

Chen, X., & He, K. (2021). Exploring simple siamese representation learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 15750-15758).

He, K., Fan, H., Wu, Y., Xie, S., & Girshick, R. (2020). Momentum contrast for unsupervised visual representation learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 9729-9738).

Grill, J.-B., Strub, F., Altché, F., Tallec, C., Richemond, P. H., Buchatskaya, E., ... & Munos, R. (2020). Bootstrap your own latent-a new approach to self-supervised learning. Advances in Neural Information Processing Systems, 33, 21271-21284.

Devlin, J., Chang, M.-W., Lee, K., & Toutanova, L. (2019). BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the NAACL-HLT (Vol. 1, p. 2). https://doi.org/10.18653/v1/N19-1423

Baevski, A., Zhou, Y., Mohamed, A., & Auli, M. (2020). wav2vec 2.0: A framework for self-supervised learning of speech representations. Advances in Neural Information Processing Systems, 33, 12449-12460.

Liu, A. T., Li, S.-W., & Lee, H.-Y. (2021). TERA: Self-supervised learning of transformer encoder representation for speech. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 29, 2351-2366. https://doi.org/10.1109/TASLP.2021.3090991

Baevski, A., Auli, M., & Mohamed, A. (2019). Effectiveness of self-supervised pre-training for speech recognition. arXiv preprint arXiv:1911.03912.

Mohamed, A., Hsu, W.-N., Xiong, W., Pino, J., Wang, Y., Song, Z., ... & Auli, M. (2022). Self-supervised speech representation learning: A review. IEEE Journal of Selected Topics in Signal Processing, 16(6), 1179-1210. https://doi.org/10.1109/JSTSP.2022.3210287

Eldele, E., Ragab, M., Chen, Z., Wu, M., Koaik, R., Dong, L., ... & Gao, W. (2021). Time-series representation learning via temporal and contextual contrasting. arXiv preprint arXiv:2106.14112.

Franceschi, J.-Y., Dieuleveut, A., & Jaggi, M. (2019). Unsupervised scalable representation learning for multivariate time series. Advances in Neural Information Processing Systems, 32, 84-94.

Bhattacharjee, A., Karami, M., & Liu, H. (2022). Text transformations in contrastive self-supervised learning: A review. arXiv preprint arXiv:2203.12000.

Noroozi, M., Vinjimoor, A., Favaro, P., & Pirsiavash, H. (2018). Boosting self-supervised learning via knowledge transfer. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 9359-9367).

Kumar, P., Rawat, P., & Chauhan, S. (2022). Contrastive self-supervised learning: Review, progress, challenges and future research directions. International Journal of Multimedia Information Retrieval, 11(4), 461-488. https://doi.org/10.1007/s13735-022-00231-8

Baevski, A., Hsu, W.-N., Xu, Q., Babu, A., Gu, J., & Auli, M. (2022). Data2vec: A general framework for self-supervised learning in speech, vision, and language. In International Conference on Machine Learning (pp. 1298-1312). PMLR.

Reyes-Ortiz, J., Anguita, D., Ghio, A., Oneto, L., & Parra, X. (2012). Human activity recognition using smartphones. UCI Machine Learning Repository. https://doi.org/10.24432/C54S4K

Citation Metrics

Downloads

Published

20-01-2023

How to Cite

“Systematic Review of Advancing Machine Learning Through Cross-Domain Analysis of Unlabeled Data”. Journal of Science & Technology, vol. 4, no. 1, Jan. 2023, pp. 136-55, https://doi.org/10.55662/JST.2023.4104.

Plaudit