Maria Magdalena Balos

Data Scientist
Cambridge, UK · +44 7899 866 210 · mariabalos16@gmail.com · linkedin.com/in/mariabalos · github.com/mbalos16

Summary

Data scientist with a strong focus on applied machine learning, NLP, and generative AI, complemented by a background in UX research. Experienced in training and deploying speech models in production, building RAG systems, and implementing deep learning architectures from scratch. Combines deep ML expertise with a UX research background, enabling technical depth and user-centred thinking, with a pragmatic, committed approach to problem solving, delivering results independently and communicating findings clearly across technical and non-technical audiences.

Experience

Data Scientist — Vocality.ai

Jan 2025 – Present · Remote

Research Assistant — Museum of Archaeology and Anthropology

Aug 2023 – Jan 2024 · Cambridge, UK

UX Researcher — Singer Instruments

Sep 2022 – Mar 2023 · Remote

Projects

Manifold HyperConnections for Computer Vision In progress
arXiv:2512.24880

CON(e)VOLUTION – From LeNet to Vision Transformers
GitHub · Article

RAG-Driven Educational Assistant · Master's Dissertation
GitHub

Ryanair Timecapsule
GitHub

K-Means for Colour Palette Generation
Article

Education

Master's in Deep Learning & Generative AI · Datamecum, Madrid

Oct 2024 – Jul 2025

Intensive Program in Data Science · Datamecum, Madrid

Oct 2023 – May 2024

Master's in Interaction Design & UX · Universitat Oberta de Catalunya

Sep 2021 – Mar 2023

Bachelor's in Graphic Design & Digital Creations · Universitat Oberta de Catalunya

2018 – 2021

Skills

Machine Learning & Artificial Intelligence (ML & AI): Deep Learning, Natural Language Processing (NLP), Speech Models (TTS, ASR), RAG, LLMs, Computer Vision, CNNs, Transformers, Generative AI, Decision Trees, Gradient Boosting (XGBoost), Linear Regression

Frameworks & Tools: PyTorch, Hugging Face, LangChain, ChromaDB, Scikit-learn, Pandas, NumPy, Streamlit, Flask

Cloud & Infra: Google Cloud (GCP), MLOps, Version Control Systems (Git), Docker

Programming Skills: Python, SQL, Bash

Other: Data Analytics, Data Visualisation, Speaker Diarization, Whisper, UX Research, Agile, Teamwork, Unit Testing

Certifications & Continuous Learning

Languages

English (full professional) · Spanish (native) · Romanian (native) · French (basic)