Hi there, I'm

Swastik De

Data Scientist

Transforming raw data into actionable insights with 3+ years of experience in machine learning, statistical modeling, and data storytelling.

portfolio.py
import pandas as pd
import numpy as np
from sklearn.ensemble import RandomForestClassifier

# Meet Swastik De
profile = {
    "role": "Data Scientist",
    "experience": "3+ years",
    "passion": "ML & AI",
    "status": "Open to work"
}

print("Building amazing things"
      " with data! 🚀")

About Me

3+ Years Experience
15+ Projects Completed
5+ ML Models Deployed

I'm a passionate Data Scientist with over 3 years of experience turning complex datasets into meaningful insights. I specialize in building end-to-end machine learning pipelines, predictive models, and data-driven solutions that make a real business impact.

My work spans industries including finance, healthcare, and e-commerce, where I've applied techniques like natural language processing, computer vision, and time-series forecasting to solve real-world problems.

When I'm not wrangling data, I enjoy contributing to open-source projects, writing technical blog posts, and staying up to date with the latest research in AI/ML.

My Resume

View or download my professional resume

Technical Skills

Technologies and tools I work with every day

๐Ÿ Languages

Python, SQL, R, Power BI, Tableau, AWS QuickSight, Excel

🤖 Machine Learning

scikit-learn, Classification, Regression, XGBoost, LightGBM, Keras

📊 Data & Analytics

Pandas, NumPy, Matplotlib, Seaborn, Plotly, Power BI, Tableau

โ˜๏ธ Cloud & MLOps

AWS Azure ML Docker MLflow Airflow FastAPI

๐Ÿ—„๏ธ Databases

PostgreSQL MySQL MongoDB Redis BigQuery

🔧 Tools & Workflow

Git, Jupyter, VS Code, DVC, GitHub Actions

Featured Projects

A selection of my most impactful data science work

📈

Time Series Sales Forecasting

Built a hybrid forecasting model combining ARIMA and LSTM for retail sales prediction. Reduced MAPE by 18% compared to the existing baseline, helping optimize inventory management.

Python, LSTM, ARIMA, TensorFlow
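
For readers less familiar with the metric, MAPE (mean absolute percentage error) averages the absolute forecast error relative to the actual value. The sketch below is illustrative only: the function name `mape`, the sample numbers, and the reading of the improvement as a relative reduction are assumptions, not details taken from the project.

```python
def mape(actual, forecast):
    """Mean absolute percentage error, in percent.

    Points where the actual value is zero are skipped, because the
    percentage error is undefined there.
    """
    pairs = [(a, f) for a, f in zip(actual, forecast) if a != 0]
    if not pairs:
        raise ValueError("MAPE is undefined when every actual value is zero")
    return 100.0 * sum(abs((a - f) / a) for a, f in pairs) / len(pairs)


# Hypothetical sales figures and forecasts (made-up numbers, not project data).
actual = [100.0, 200.0, 400.0]
baseline = [110.0, 180.0, 440.0]
improved = [105.0, 190.0, 420.0]

baseline_mape = mape(actual, baseline)
improved_mape = mape(actual, improved)

# Relative reduction in MAPE between the two forecasts.
relative_reduction = (baseline_mape - improved_mape) / baseline_mape
```

Because MAPE divides by the actual value, it is usually paired with another metric when the series contains zeros or very small values.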
๐Ÿฅ

Medical Image Classification

Developed a CNN model using transfer learning (ResNet-50) to classify chest X-rays for pneumonia detection. Achieved 94% AUC-ROC with an explainability layer using Grad-CAM.

PyTorch, ResNet-50, Grad-CAM, OpenCV

🛒

Recommendation System

Built a collaborative + content-based hybrid recommendation engine for an e-commerce platform. Improved click-through rate by 23% and increased average order value using matrix factorization and embeddings.

Python, Collaborative Filtering, Embeddings, Spark
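
One common way to combine collaborative and content-based signals is a weighted blend of the two scores. The sketch below assumes that approach purely for illustration; the description above does not say how the two parts were combined, and the names `cosine`, `hybrid_score`, and `alpha` are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def hybrid_score(user_factors, item_factors, user_profile, item_features, alpha=0.7):
    """Blend a collaborative score with a content-based score.

    collaborative: dot product of latent factors, as produced by matrix factorization
    content:       cosine similarity between a user profile and item features
    alpha:         weight on the collaborative part (assumed value, tune per dataset)
    """
    collaborative = sum(p * q for p, q in zip(user_factors, item_factors))
    content = cosine(user_profile, item_features)
    return alpha * collaborative + (1.0 - alpha) * content


# Hypothetical vectors for one user-item pair.
score = hybrid_score(
    user_factors=[1.0, 0.0],
    item_factors=[0.5, 2.0],
    user_profile=[1.0, 0.0],
    item_features=[1.0, 0.0],
    alpha=0.5,
)
```

A blend like this lets the content-based term carry new items that have little interaction history, while the collaborative term dominates once enough behavior data exists.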

💰

Fraud Detection System

Designed a real-time fraud detection pipeline for financial transactions using ensemble models and anomaly detection. Reduced false positive rates by 35% while maintaining high recall.

Python, Isolation Forest, LightGBM, Kafka
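
The two metrics quoted above pull in opposite directions, so they are normally tracked together. As a minimal sketch, assuming binary labels where 1 marks a fraudulent transaction, they can be computed as follows; the helper name `recall_and_fpr` and the sample labels are hypothetical.

```python
def recall_and_fpr(y_true, y_pred):
    """Return (recall, false positive rate) for binary labels, where 1 = fraud."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)

    # Recall: share of actual fraud cases that were flagged.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # False positive rate: share of legitimate transactions that were flagged.
    fpr = fp / (fp + tn) if (fp + tn) else 0.0
    return recall, fpr


# Made-up labels for six transactions (not project data).
y_true = [1, 1, 0, 0, 0, 0]
y_pred = [1, 0, 1, 0, 0, 0]
recall, fpr = recall_and_fpr(y_true, y_pred)
```

Lowering the decision threshold raises recall but also the false positive rate, which is why an improvement in one is only meaningful when the other is reported alongside it.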

Work Experience

Senior Data Scientist

2023 – Present

Tech Company · Full-time

  • Led a team of 4 data scientists to build predictive models that drove $2M in additional revenue
  • Architected end-to-end MLOps pipelines using MLflow, Airflow, and AWS SageMaker
  • Mentored junior data scientists and established best practices for model development
  • Collaborated with product and engineering teams to deploy ML models serving 500K+ users
Python, AWS, MLflow, TensorFlow

Data Scientist

2022 – 2023

Analytics Firm · Full-time

  • Built NLP models for document classification and information extraction from unstructured data
  • Developed A/B testing framework and statistical analysis pipelines for product experiments
  • Created interactive dashboards using Plotly and Power BI for stakeholder reporting
NLP, PyTorch, Power BI, PostgreSQL

Junior Data Analyst

2021 – 2022

Startup · Full-time

  • Performed exploratory data analysis and statistical modeling on customer behavior data
  • Built automated ETL pipelines to consolidate data from multiple sources
  • Delivered weekly data-driven insights to the business team through clear visualizations
Python, SQL, Pandas, Tableau

Education

🎓

Post-Graduate Diploma in Statistical Methods and Analytics

Indian Statistical Institute · 2021–22

📜

M.Sc in Statistics & Computing

Visva-Bharati Central University · 2019–21

📜

Winter School on Deep Learning

Indian Statistical Institute · 2021

Get In Touch

I'm currently open to new opportunities. Whether you have a project in mind, want to collaborate, or just want to say hi, my inbox is always open!