Dr. Samuel Shamiri
📌 Professional Summary
Innovative and results-oriented Senior Data Scientist with a PhD in Statistics and over 15 years of experience in advanced statistical modeling, machine learning, large language models (LLMs), and cloud-based analytics. Proven ability to translate complex data into actionable insights for government, public policy, finance, and commercial sectors.
Deep expertise in NLP, generative AI, and Azure-based data science platforms. Track record of building and deploying scalable solutions using Python, R, Spark, Databricks, and MLflow.
🧰 Technical Proficiencies
Category | Skills |
---|---|
Cloud Platforms | Azure (ML, Databricks, Data Factory), Hadoop, Spark, Hive, AWS |
LLM & Deep Learning | BERT, T5, RNN (LSTM), CNN, GAN, RAG, Prompt Engineering, TensorFlow, PyTorch, MLflow |
Machine Learning & Statistics | Classification, Regression, Clustering, Forecasting, Anomaly Detection, Bayesian modelling, Bayesian Network, Monte Carlo Simulation |
Languages & Tools | Python, R, Scala, SQL, Git |
Frameworks | Hugging Face, MLOps, Docker, Spark-NLP |
💼 Professional Experience
Senior Data Scientist | Jobs and Skills Australia (May 2018 – Present)
- Successfully built and deployed an ensemble of machine learning models.
- Deployed an NLP pipeline using Transformer models on Azure Databricks to analyse large, unstructured datasets.
- Built system to match skills with qualifications.
- Integrated MLflow to manage model lifecycle — reducing deployment time by 30%.
- Advised on Azure-based scalable data infrastructure.
- Mentored teams on ML, Python, and Spark.
Associate Director | Analytic Partners (Jan 2017 – Mar 2018)
- Led ML initiatives; developed roadmap and increased client revenue by 15%.
- Built market mix and digital attribution models — optimizing spend by over $5M.
Data Scientist & Senior Analyst | Equifax & Starcom (2012 – 2016)
- Created credit risk models with XGBoost on Azure ML — boosting accuracy by 25%.
- Designed automated Python/R/SQL workflows — increasing team productivity.
- Delivered social network and econometric models for stakeholder intelligence.
Statistician | Forethought Market Research (2011 – 2012)
- Led training on R for Marketing Science team.
- Developed Hierarchical Bayes models (CBC, MaxDiff, MCMC, Gibbs).
- Built GUIs in R for client-side solutions.
Senior Lecturer | University of Malaya (2009 – 2010)
- Taught actuarial and financial math courses.
- Supervised final-year actuarial projects and ran applied stats workshops.
🎓 Education
- PhD in Statistics
- Master in Financial Economics (Econometrics)
- Degree in Statistics
🏅 Certifications & Publications
- Machine Learning (Stanford / Coursera)
- Apache Spark (MapR Technologies)
- Published 10+ peer-reviewed articles on statistical modelling & financial risk
- [Publication list available upon request]
🏆 Key Achievements
- LLM Deployment: Reduced manual review by 50% via Azure BERT pipeline.
- Azure Migration: Cut ML cost by 35% via Databricks refactor.
- Award Finalist: ADMA 2014, Cross-Channel Analytics.
- POEM Model: Developed unified MMM framework across continents.
- Prophecy Method: Helped develop AMSRS-award-winning market model.