Vaibhav Shakkarwal
Welcome to my Digital Portfolio!
A Technology Enthusiasts
About Me
& What I do
AI Engineer · Software Engineer · Data Engineer
7+ years building intelligent systems across the full stack: LLM applications, RAG pipelines, data platforms, cloud infrastructure, and analytics. I like working on problems where good engineering actually matters, and turning complex, messy data into systems that hold up.
AI & Generative AI
⚡ Building production RAG pipelines, LLM agents, and fine-tuned models.
⚡ Designing GenAI applications from prototype to production-ready systems.
⚡ Comfortable across the full LLM toolchain: embeddings, vector stores, multi-agent orchestration, and prompt engineering.
Data Engineering
⚡ Building scalable pipelines and lakehouse architectures from the ground up.
⚡ Processing large volumes of data with distributed compute and streaming frameworks.
⚡ Getting reliable, well-governed data to the teams and systems that need it.
Software Engineering
⚡ Building backend services, REST APIs, and full-stack applications that scale.
⚡ Shipping production-grade code with clean architecture and CI/CD baked in.
⚡ Comfortable owning the whole stack, from database schema to deployed service.
Cloud, Analytics & Design
⚡ Deploying and running services on AWS, Azure, and GCP.
⚡ Turning data into dashboards and analytics products people actually use.
⚡ Designing interfaces where clarity and function go hand in hand.
Education
Dalhousie University
Aug 2021 – Dec 2022
Master of Digital Innovation
Specialization in Data Science and Digital Business
Faculty of Computer Science | GPA: 4.20 / 4.30
Guru Gobind Singh Indraprastha University
June 2015 – Jan 2019
Bachelor of Technology
Major in Computer Science and Electronics
GPA: 8.0 / 10.0 | Top 5 rank throughout
Experience
Fortinet
Jan 2023 – Present
Software and Data Engineer
Designed and shipped a production RAG platform for internal security intelligence, cutting analyst research time by 60% by surfacing relevant threat context from large-scale data in real time.
Built multi-agent LLM pipelines using LangChain and LangGraph, integrating OpenAI, Azure ML, and fine-tuned BERT models for classification, summarization, and threat triage.
Built data engineering pipelines with PySpark, Kafka, and Airflow on Databricks, processing millions of security events daily at sub-second latency.
Delivered $1M+ in measurable business impact through ML-driven automation, predictive analytics, and intelligent alerting across Security Operations and Product teams.
Owned the Databricks lakehouse: data modelling, governance, and the analytics layer behind 10+ executive dashboards in Power BI and Tableau.
KPMG Canada
May 2022 – Jan 2023
Digital Consultant
Software and data consultant at Tax Transformation and Technology, where AI-powered automation work contributed to a 35% improvement in team efficiency.
Built intelligent automation and DevOps pipelines on Azure; developed ML models for decision support and data-driven client advisory.
Designed Tableau and Power BI dashboards that translated complex tax data into clear, actionable insights for enterprise clients.
Served as Scrum Master across engineering, QA, and analytics teams to deliver project milestones on schedule.
Atlantic Canada
Opportunities Agency
Aug 2022 – Jan 2023
Data Scientist, Internship
Built NLP document classification models using BERT and transformer architectures to automate intake and routing of government grant applications.
Managed ML experiments end-to-end with MLflow and Azure ML, enabling reproducible model versioning across iteration cycles.
Delivered analytics reports and predictive models to inform regional economic development decisions for federal stakeholders.
InvenioLSI
Jan 2019 – Aug 2021
Associate Consultant
Software and data consultant delivering end-to-end digital tax system implementations for government clients across four countries, leading a team of 7.
Developed 100+ Tax Revenue and Management business processes, resulting in a 75% improvement in processing efficiency for government tax agencies.
Applied machine learning, NLP, and advanced analytics to modernize tax compliance workflows and surface fraud patterns.
Clients and Projects:
-
ZATCA – Saudi Arabia -
FTA – Qatar -
FTA – Dubai -
FRCS – Fiji
Projects and Research Papers
Data Mining on COVID-19
Analyzed daily COVID-19 case trends and applied Simple Linear
Regression and Support Vector Machine models to predict infection rates.
Produced visualizations to surface key inflection points in the pandemic data.
Fitbit Data Analysis
Analyzed FitBit Fitness Tracker data to uncover behavioral
trends in consumer health and activity patterns, delivering insights to inform
product and marketing decisions.
Smart Traffic System
Smart Traffic Management is a system where
centrally-controlled traffic signals and sensors regulate the flow of traffic
through the city in response to demand.