DATA SCIENTIST · ML ENGINEER · AI · 
LOADING
0%
Available for new roles

Ashish
Shiwlani

Data Scientist
ML & AI Engineer · Chicago, IL

M.S. in Data Science. Passionate about AI in healthcare and building ML solutions that actually matter. Python, TensorFlow, PyTorch, AWS, Azure, GCP.

About Me

Who I Am

I'm a Data Scientist and ML Engineer who genuinely loves digging into data and figuring out what it's trying to say. I hold an M.S. in Data Science and I'm at an early stage of my career where I'm excited to keep growing, take on new challenges, and work alongside people who care about doing good work.

My biggest passion right now is AI in healthcare. I truly believe machine learning has a huge role to play in early diagnosis, clinical decision support, and making healthcare more accessible to people who need it most. It's the kind of problem that keeps me up at night in the best way possible.

I've had the chance to work on real problems at places like the U.S. Treasury, Microsoft, and Vanguard, where I got hands-on with everything from building data pipelines to training classifiers and analyzing large datasets. I'm looking for a role where I can keep learning, contribute meaningfully, and work on things that actually make a difference.

Chicago, Illinois, United States
Top 4%
AI Researcher Globally
140+
Peer Reviews Completed
10TB+
Data Processed
3
Cloud Platforms
Technical Expertise

Skills & Tools

Programming Languages
PythonSQLRBash
ML & Deep Learning
Scikit-learnTensorFlowPyTorchKerasXGBoostLightGBMRandom ForestNLPComputer VisionLLMsHugging FaceTransformersGenerative AI
Data Analysis & Visualization
PandasNumPyExcelPower BITableauMatplotlibSeabornEDAA/B TestingHypothesis TestingStatistical ModelingPivot Tables
Data Science
Feature EngineeringModel EvaluationTime-SeriesAnomaly DetectionForecastingCross-ValidationSHAPRegressionClassificationClustering
Cloud & MLOps
AWS SageMakerAzure MLGCP Vertex AIMLflowDockerKubernetesCI/CDFastAPIAWS LambdaAzure Synapse
Data Engineering & Databases
DatabricksApache SparkPostgreSQLMongoDBBigQuerySnowflakeETL PipelinesGit / GitHubJupyter
Healthcare AI
Medical ImagingDICOMCNNClinical NLPSHAP ExplainabilityRisk ScoringECG AnalysisSymptom ClassificationPredictive Diagnostics
Tools I Work With

Tech Stack

Move your mouse to interact
Work History

Experience

Data Engineer
Aug 2024 — Present
U.S. Department of the Treasury · Washington, D.C.
  • Analyzed 10TB+ federal datasets using Python, SQL, and statistical modeling to identify data quality issues and surface patterns relevant to IRS audit preparation.
  • Built ML-based anomaly detection pipelines that helped reduce manual compliance review effort by around 30%, freeing up analyst time for higher-value work.
  • Created automated ETL workflows with built-in data quality checks, making the reporting process more consistent and reducing turnaround time for the team.
  • Collaborated with cross-functional teams to translate complex data findings into clear summaries and dashboards that non-technical stakeholders could act on.
  • Documented data pipelines and model workflows to ensure reproducibility and support smooth knowledge transfer across the team.
Data Engineer
Jan 2023 — Aug 2024
Microsoft · Redmond, WA
  • Worked with device telemetry data to build classification models (Random Forest, Gradient Boosting) that predicted firmware failures with around 90% precision across 5,000+ devices.
  • Cleaned and transformed large-scale sensor data into structured feature sets, making it ready for analysis and model training in Azure ML.
  • Supported A/B experiments using Apache Spark to compare model versions and identify which configurations worked best in real-world conditions.
  • Built time-series forecasting models to flag early signs of device degradation, helping engineering teams act before issues became outages.
  • Wrote clear documentation and contributed to team knowledge-sharing sessions on data preprocessing techniques and model evaluation methods.
Data Engineer
Apr 2022 — Dec 2022
Vanguard · Malvern, PA
  • Analyzed over 1 million daily financial transactions using clustering and anomaly detection techniques to surface irregular patterns and data quality issues.
  • Built data pipelines on AWS and Azure that streamlined how data moved from raw ingestion to analysis-ready format, cutting processing time significantly.
  • Used SQL and Python to pull, clean, and explore financial datasets, producing summaries and trend reports for internal stakeholders.
  • Helped automate parts of the reporting workflow, reducing the manual effort required to prepare regular performance and compliance reports.
  • Worked alongside senior engineers to understand financial data requirements and translate them into reliable, well-documented pipeline logic.
Data Scientist Intern
Jan 2022 — Apr 2022
CCC Intelligent Solutions · Chicago, IL
  • Researched adversarial machine learning techniques and implemented detection methods for data poisoning and input manipulation attacks using PyTorch.
  • Built a Computer Vision and NLP pipeline to flag potentially manipulated training data, which was later reviewed and incorporated into internal AI security practices.
  • Ran experiments, logged results in structured formats, and presented findings to the team in weekly check-ins.
  • Explored publicly available research papers on ML robustness and summarized key techniques into an internal reference document used by the broader team.
  • Gained hands-on experience working in a collaborative, fast-moving environment where clean code, documentation, and reproducibility were taken seriously.
Academic Background

Education

Graduate
Master of Science in Data Science
Illinois Institute of Technology
Chicago, Illinois · 2022
Undergraduate
Bachelor of Science in Computer Science
Shaheed Zulfikar Ali Bhutto Institute of Science and Technology
Karachi, Pakistan · 2020
Beyond the Classroom

Leadership & Global Experience

Social Intern — Cancer Awareness Volunteer
Summer 2018
AIESEC International · Kocaeli / Izmit, Turkey
  • Selected for a competitive social internship through AIESEC International and traveled to Kocaeli, Turkey, where I worked alongside local volunteers and healthcare professionals on a community health initiative.
  • Helped design and deliver educational sessions at local hospitals and community centers focused on early warning signs of cancer in children, reaching families across multiple neighborhoods in the Izmit region.
  • Organized cultural exchange events and social gatherings that brought together international volunteers and Turkish locals, fostering connections and mutual understanding across language barriers.
  • Collaborated with Turkish medical staff to translate and present clinical information in a way that was accessible and meaningful to non-specialist audiences.
  • The experience taught me a lot about cross-cultural communication, grassroots health education, and what it means to do meaningful work outside your comfort zone.
Head Delegate — Representing the Republic of Benin
February 2019
Harvard Model United Nations (HMUN) · Boston, MA
  • Represented the Republic of Benin at the Harvard Model United Nations Conference, one of the oldest and most prestigious MUN conferences in the world, as Head Delegate.
  • Prepared comprehensive position papers on global health infrastructure and sustainable development, grounding arguments in Benin's socioeconomic context and international policy frameworks.
  • Actively participated in committee debates, delivered formal speeches, and engaged in back-channel negotiations with delegates from over 100 countries to build consensus on draft resolutions.
  • Co-sponsored a resolution addressing public health access in West Africa, which was debated and passed in committee after multiple rounds of amendment and discussion.
  • The experience sharpened my ability to think on my feet, argue with evidence, and find common ground in high-pressure, fast-moving environments.
Conference Attendee
2018
Lead Co Conference · Istanbul, Turkey
  • Attended the Lead Co Conference in Istanbul, a leadership development event bringing together young professionals and emerging leaders from across Europe, Asia, and the Middle East.
  • Participated in interactive workshops on purpose-driven leadership, team dynamics, communication under pressure, and building lasting professional networks.
  • Connected with peers from diverse backgrounds and industries, gaining perspectives on leadership styles and organizational cultures very different from my own.
  • Left with a clearer sense of how to lead with intention and how cross-sector collaboration can drive meaningful impact at scale.
Open Source · GitHub

Things I've Built

Real projects — from healthcare AI to clinical decision tools. Each one started with a problem I wanted to actually solve.

zsh — ashishshiwlani
$
Connected · GitHub · ashishshiwlani
Featured
🫀
CardioChrono AI

Predicts your cardiovascular risk trajectory over the next 20 years. You upload an ECG scan, punch in your cholesterol numbers and lifestyle habits — CardioChrono's CNN reads the waveform, weighs it against your clinical profile, and generates a personalized PDF report with risk charts and preventive recommendations. It's the kind of tool that should be in every GP's office.

Python PyTorch CNN ECG Analysis Healthcare AI Jupyter
🧠
SmartSymptom AI

Type your symptoms the way you'd describe them to a friend — SmartSymptom figures out what you actually mean. It auto-corrects spelling, fuzzy-matches against a clinical symptom library, estimates how serious the situation is, and produces a health summary PDF you can hand to a doctor. Built on Random Forest classification with SpaCy NLP doing the heavy lifting on natural language input.

Python SpaCy NLP Random Forest Fuzzy Matching Healthcare
🔬
Breast Cancer Prediction from Mammograms

A multimodal neural network that combines DICOM mammogram images with patient risk factors — age, BMI, smoking history, family history — to classify tumors as benign or malignant. The model uses a CNN for image features and merges them with clinical inputs for a more complete picture. SHAP values explain which factors drove each prediction, and a desktop GUI lets clinicians upload scans and download a full PDF risk report per patient.

Python TensorFlow CNN DICOM SHAP Computer Vision Tkinter
📊
Breast Cancer Diagnosis — Classic ML

A rigorous ML study on the UCI Breast Cancer Wisconsin dataset. 569 patients, 30 cell nucleus measurements, one question: benign or malignant? The Random Forest model is tuned with GridSearchCV, cross-validated for consistency, and the most diagnostic features — like radius mean and perimeter — are surfaced with importance plots. Clean, documented, reproducible. Exactly what you'd want before trusting a model in a medical context.

Python scikit-learn Random Forest GridSearchCV UCI Dataset EDA
Academic Output

Research & Publications

Google Scholar

hover & click to reveal research data

📄
0
Publications
📈
0
Citations
🏆
0
h-index
🔬
0
i10-index
Machine Learning Deep Learning Healthcare AI Cardiovascular Disease Computer Vision NLP Renewable Energies Clinical Decision Support Neuroscience
2026

Edge-AI Meets the Heart: Real-Time Cardiovascular Monitoring with Cloud-Connected Wearables

Advances in Artificial Intelligence and Machine Learning
Edge AICardiologyCloud
2025

A Systematic Review of AI-based Clinical Decision Support Systems: From Development and Implementation to Applications

Pakistan Journal of Life and Social Sciences
AIHealthcareCDSS
2025

AI-Augmented Precision Lifestyle Interventions for Type 2 Diabetes Remission: A Systematic Review

Advances in Knowledge Base Systems, Data Science & Cybersecurity
MLDiabetes
2024

AI in Neuroeducation: A Systematic Review Aligned with Neuroscience Principles for Optimizing Learning

Journal of Development and Social Sciences
AINeuroscience
2024

Advances in AI and ML for Neurodegenerative Disease: A Literature Review

Peer-Reviewed Journal
Deep LearningAlzheimer's
2024

Analysis of Multi-modal Data Through Deep Learning to Diagnose CVDs: A Review

International Journal of Membrane Science and Technology
Deep LearningCVDMulti-modal
2024

Revolutionizing Healthcare: The Impact of AI on Patient Care, Diagnosis, and Treatment

JURIHUM: Jurnal Inovasi dan Humaniora
AIHealthcare
2024

Role of Cloud-Deployed Graph Neural Networks in Mapping Coronary Artery Disease Progression

Peer-Reviewed Journal
GNNsCloudCAD
Let's Connect

Get in Touch

Whether you're looking to collaborate on a research project, explore an idea that's been sitting in the back of your mind, talk about a role, or just connect, my inbox is always open. I genuinely enjoy conversations that go somewhere unexpected, so don't hesitate to reach out about anything.

🏥 AI Healthcare 🤖 Machine Learning 📊 Data Analytics 🧠 Deep Learning ☁️ Cloud AI 🔬 Clinical AI
Open to new opportunities

Based in Chicago, IL. Excited about AI healthcare, ML, data analytics, and any team building things that genuinely matter. Open to remote, hybrid, or on-site.