Passion Fuels Purpose 

ABOUT

Hi, I'm Farhan, a Master's student in Data Science with a strong passion for leveraging technology to solve real-world problems. With expertise in machine learning, data visualization, and building automated pipelines, I enjoy uncovering insights from data and crafting efficient solutions.

I also bring a year of experience in full-stack development, proficient in React, Node.js, and Next.js. My skills extend to deploying scalable applications, with hands-on knowledge of cloud platforms like AWS and Azure.

Whether I'm developing user-centric web applications, optimizing machine learning models, or building end-to-end data pipelines, I am committed to delivering high-quality, impactful work.

_blank
+

satisfied clients

+

projects completed

+

Years of experience

Skills

Web
HTML, CSS, JavaScript
Pandas, Numpy, Dask
Scikit-Learn, Pytorch
NextJS, ReactJS
Altair, ggplot
NodeJS
Azure Cloud
AWS
Python
Machine Learning

Experience

  • Data Analyst @Decision Neuroscience Lab

    Nov 2022 – May 2024 | Ontario, Canada
    • Developed ETL pipelines in Python & SQL to clean large textual survey datasets, improving operational efficiency by 60%.

    • Utilized Python and GPT-4 embedding models to encode textual responses, enabling deeper investigation into human goal-setting behaviors.

    • Enhanced data quality through imputation using KNN, linear regression, and logistic regression models, ensuring more accurate analysis.

    • Created data visualizations with ggplot2 and Matplotlib, to present complex analysis results.

    • Fine-tuned transformer models (BERT) with PyTorch to label qualitative data with 82% accuracy.

  • Coop – Developer Intern @Baycrest Hospital

    Aug 2022 – Dec 2022 | Toronto, ON, Canada
    • Programmed two web based interactive study paradigms using JavaScript (PsychoJS, NeuroBS) for neuroimaging and language research.

    • Analyzed neuroimaging datasets via pandas and scikit-learn, identifying trends between structural damage and language impairments, contributing to automated aphasia diagnostic tools.

    • Developed and deployed simple data analysis dashboards in Excel & Tableau for non-technical users, simplifying insights for layman audiences.

    • Implemented Linux tcsh/bash & C++ scripts for batch processing neuroimaging data on remote servers.

    • Streamlined healthcare administration by managing MySQL databases and automating data processing tasks, reducing admin work by 30%. .

Education

  • Masters in Data Science

    2024-2025 | University of British Columbia | cGPA: 4.00

    Relevant courses included Into to Databases (PostgreSQL), Supervised Learning, Data Vasualization, & Data Pipelines

  • HBSc in Computer Science & Neuroscience

    2020-2024 | University of Toronto | cGPA: 3.94

    Relevant courses included Data Structures and Algorithms, Software Engineering, Intro to ML, and Software Architecture.