Passion Fuels Purpose
ABOUT
Hi, I'm Farhan, a Master's student in Data Science with a strong passion for leveraging technology to solve real-world problems. With expertise in machine learning, data visualization, and building automated pipelines, I enjoy uncovering insights from data and crafting efficient solutions.
I also bring a year of experience in full-stack development and am proficient in React, Node.js, and Next.js. My skills extend to deploying scalable applications, with hands-on knowledge of cloud platforms like AWS and Azure.
Whether I'm developing user-centric web applications, optimizing machine learning models, or building end-to-end data pipelines, I am committed to delivering high-quality, impactful work.

Skills
Experience
Data Analyst @Decision Neuroscience Lab
Nov 2022 – May 2024 | Ontario, Canada
• Developed ETL pipelines in Python & SQL to clean large textual survey datasets, improving operational efficiency by 60%.
• Utilized Python and GPT-4 embedding models to encode textual responses, enabling deeper investigation into human goal-setting behaviors.
• Enhanced data quality through imputation using KNN, linear regression, and logistic regression models, ensuring more accurate analysis.
• Created data visualizations with ggplot2 and Matplotlib to present complex analysis results.
• Fine-tuned transformer models (BERT) with PyTorch to label qualitative data with 82% accuracy.
Co-op – Developer Intern @Baycrest Hospital
Aug 2022 – Dec 2022 | Toronto, ON, Canada
• Programmed two web-based interactive study paradigms using JavaScript (PsychoJS, NeuroBS) for neuroimaging and language research.
• Analyzed neuroimaging datasets via pandas and scikit-learn, identifying trends between structural damage and language impairments, contributing to automated aphasia diagnostic tools.
• Developed and deployed simple data analysis dashboards in Excel & Tableau, making insights accessible to non-technical users.
• Implemented Linux tcsh/bash & C++ scripts for batch processing neuroimaging data on remote servers.
• Streamlined healthcare administration by managing MySQL databases and automating data processing tasks, reducing admin work by 30%.
Education
Master's in Data Science
2024-2025 | University of British Columbia | cGPA: 4.00
Relevant courses included Intro to Databases (PostgreSQL), Supervised Learning, Data Visualization, & Data Pipelines.
HBSc in Computer Science & Neuroscience
2020-2024 | University of Toronto | cGPA: 3.94
Relevant courses included Data Structures and Algorithms, Software Engineering, Intro to ML, and Software Architecture.