Arham Anwar

Hello! Glad to see you here. This is a collection of my personal projects made during free time, and open source projects from my official work at McGill University as a master in management analytics student and as a data scientist at Aldo Group.

  • All Categories
  • Visualization
  • Predictive
  • Prescriptive
  • Descriptive

GraphRAG with Neo4j integration

(Talk to the famous Sapiens Novel with graph rag optimized queries, visualize topic clusters and communities with neo4j to see how LLM works)

Strava/Spotify ML Pipeline

(Leveraging Spotify API, Strava API, Weather API to orchestrate an ELT that drives analytics & an ML pipeline to predict kudos on the latest activity)

Multi-Class Prediction

(Apache Kafka, MLflow, Multiclass Error Analysis, SHAP Dashboard, Docker, FastAPI, AzureML, AutoGluon, Blue-Green Deployment)

Tableau Portfolio

Anonymized views of some of the tableau dashboards created for clients such as Nike, Lehigh Hanson, USLS, and more

Academic Darwanism

(GPA & Network: Homophily Tests, Network Features as predictors, HITS algorithm)

Gurobi Projects

(LP, MIP, NLP, Uncertainty, Recent NP Hard Problems, Heuristics)

YOLOv9 Implementation Guide

(Demos featuring pinterest browsing tracking, and real world tracking)

Fuel Sourcing Optimization

(Description of the project, Tools used, Year)

Distilling a crime scene based on data

(PCA, Loadings, KMedians, Technical Report, RStudio, Cluster Interpretation, Advance Data Preprocessing)

Particle Swarm Optimization on UAVs for SAR missions

(Published in Springer "Journal of Brazilian Society of Mechanical Sciences & Engineering")

Multi-class Text Classification with Transformers

(20 Newsgroups data leveraging a pre-trained GPT-2 tokenizer & custom Transformer blocks)

Text generation with RNN

(Recurrent Neural Network character level/word level text generation tinyshakespeare dataset)

CNN Exploration on CIFAR-10

(Vanilla CNN, ResNet, EfficientNet, MobileNet, YOLOv3)

McGill Connect

(DBMS App Mock Social Media Project for McGill Stakeholders for the purposes of educational networking)

Apache Kafka Batch & Stream Data

(IMDB Movie rating prediction using batch and stream data for testing)

about me

I am a data scientist and machine learning enthusiast. Currently pursuing my Masters in Management Analytics at McGill University and working at Aldo Group.

Besides data science and machine learning, I also enjoy music production, trail running, and exploring new cafes and bars.

I have lived in multiple countries and enjoy learning about different cultures and languages.

I have worked on various projects and have a keen interest in deep learning, data visualization, and causal inference.