Analytics Portfolio
Soumyadipta Das
Analytics, AI, Forecasting, & Data Products.
Analytics & AI Manager with over five years of hands-on experience across data science, statistics, R programming, Shiny apps, PySpark, Azure Databricks, and AI-enabled products.
Forecasting engine - statistics - automation - AI
Portfolio Summary
Practical analytics delivery with a statistics-first core.About
As a dedicated data science professional with over five years of hands-on experience, I thrive at the intersection of data, technology, and business. My skill set spans statistics, data science, R programming, interactive applications using RShiny, and artificial intelligence techniques for meaningful insights and strategic decision-making.
Throughout my career, I have solved complex problems, optimized processes, uncovered actionable insights, built robust data models, developed predictive analytics, and visualized data to communicate findings effectively.
I am committed to continuous learning and professional development, bringing innovative solutions to rapidly evolving data science and AI challenges.
Top Skills
- Data Science
- Statistical Modeling
- R Programming
- Python Programming
- Artificial Intelligence
- Azure Databricks
- PySpark
- SQL
- Time Series Analysis
- Machine Learning
Capability Map
Core Strengths
AI-Assisted Products
Full-stack forecasting products with custom ML models, AI chatbots, visualization, exports, and planning workflows.
Databricks Engineering
PySpark performance optimization, automated pipelines, Lakehouse migration, high-volume ETL, and data quality controls.
Technical Leadership
Cross-functional leadership, client translation, MLOps practices, architecture standards, and mentoring data teams.
Client analytics - operations - forecasting
Experience
Roles centered on analytics delivery, automation, and business planning.Analytics & AI Manager
EXL
- Architected and developed a full-stack Capacity Forecasting application using Shiny on Databricks Apps from scratch, integrating custom ML models and an AI-powered chatbot to streamline resource planning.
- Optimized backend performance using PySpark within Databricks, reducing plan creation runtime by 94%, from 1.5 hours to 5 minutes.
- Commercialized internal AI assets by deploying a Code Translation solution to multiple external enterprise clients.
- Led a unified data migration framework for complex legacy ecosystems including SAS, Salesforce, and SAP into a modern Databricks Lakehouse architecture.
- Designed automated Databricks pipelines with custom error detection and alerting for high data integrity with minimal manual intervention.
- Orchestrated large-scale ETL workflows using PySpark for high-volume data ingestion and transformation while maintaining strict data quality standards.
- Provided technical leadership across code quality, architecture, MLOps practices, and stakeholder translation.
- Mentored junior data scientists and engineers across Python, PySpark, and cloud architecture practices.
Lead Assistant Manager
EXL
- Developed an AI-powered chatbot inside a Shiny-based ML forecasting platform, streamlining FTE planning and enabling 70% faster scenario-based decision-making through self-service analytics.
- Led a team of data scientists to upgrade demand forecasting models with complex feature engineering, clustered weather patterns, and market indicators.
- Collaborated with clients across utilities, finance, and ecommerce to design and execute strategic operational plans.
- Automated reporting workflows to improve insight generation and reduce manual effort.
- Developed and deployed digital and AI-driven forecasting solutions that contributed to new revenue streams.
Assistant Manager
EXL
- Led migration of legacy SAS workflows for long-term capacity planning to Azure Databricks using PySpark.
- Built scalable machine learning pipelines for demand planning and inventory optimization.
- Enhanced data engineering operations with automated ETL pipelines in Databricks and Azure.
- Conducted root-cause analysis on demand anomalies using statistical techniques and time-series decomposition.
Statistical Trainee
Indian Statistical Institute, Kolkata · Full-time
- Devised a method for index-number base-year optimization to improve analytical consistency in economic datasets.
- Compared computational efficiency and time complexity for algorithms fitting multiple regressions simultaneously.
- Evaluated non-parametric smoothing methods to identify techniques with minimum execution time.
- Prepared findings on computational efficiencies for non-parametric smoothing methods.
- Used R, Python, MS Excel, and STATA.
Research Intern
Indian Statistical Institute, Kolkata · Internship
- Modeled and forecasted Index of Industrial Production time-series data.
- Completed the project "Modelling and Forecasting for IIP Time Series Data".
- Used R and Python for statistical modeling and forecast evaluation.
Data Scientist
InstaDataHelp Analytics Services · Freelance
- Performed statistical consulting for diverse industry clients, generating actionable insights through customized analytical pipelines.
- Applied advanced statistical modeling and hypothesis testing to support business reporting and operations.
- Delivered statistical analysis for client projects across varied datasets.
- Used R, Stata, SAS, MS Excel, Jamovi, XLMiner, XLSTAT, MATLAB, Minitab, and Octave.
Data Scientist
InstaDataHelp Analytics Services · Internship
- Worked on varied data science projects across statistical analysis and reporting workflows.
- Built early-stage statistical pipelines for project analysis and insight generation.
- Used R, Stata, SAS, MS Excel, Jamovi, XLMiner, and XLSTAT.
Student Member
Royal Statistical Society · Part-time
GitHub highlights - Shiny - PySpark - AI apps
Featured Projects
Selected work from GitHub, automatically ordered by latest commit date when the page loads.Python + Shiny
Airport Resource Optimization
Resource Optimization Airport resource optimization Shiny app focused on operational planning and allocation workflows.Python + PyShiny
Forecasting Tool
Strategic Planning Forecasting app for planning workflows, built around PyShiny and forecasting interactions.Python + Streamlit
Airport Resource Management
Dashboard for Planners Interactive dashboard and resource planning application designed for airport operational planners.R Shiny + AI
AI Forecasting Tool
Forecasting Workflow Forecasting app for time-series and non-time-series data, combining visualization, modeling, and AI support.Python + Flask
Smart Meter Journey
Smart Meter Scorecard Flask application for tracking smart-meter journey workflows, scorecard views, and operational meter insights.Python + Flask
Metering Capacity and Demand Planning
Capacity Planning Planning application for smart-meter capacity, demand forecasting, and resource planning workflows.Data science blogs - travel blogs
Writing
Technical explainers and Himalayan travel guides.Data Science Blogs
Travel Blogs
Statistics - mathematics - recognition
Education
Academic foundation in statistics, probability, inference, and applied modeling.University of Calcutta
M.Sc., Statistics
Grade: CGPA 7.129 out of 10
Specialized in Advanced Non-parametric Methods and Advanced Probability Theory. Also studied Pure Mathematics and Applied Mathematics as CBCS.
M.Sc. project: Analysis of WPI Time Series Data.
Coursework included Advanced Regression Analysis, Statistical Learning, Applied Multivariate Analysis, Statistical Inference, Stochastic Process, Sequential Analysis, Design of Experiments, Sample Survey, Advanced Probability Theory, and Advanced Non-parametric Methods.
University of Calcutta
B.Sc. Hons., Statistics
Grade: 1st class
Coursework included Regression Analysis, Multivariate Analysis, Statistical Inference, Demography, Probability Theory, and Exploratory Data Analysis.
Tools learned: C, Minitab, and MS Excel for statistical analysis.
Belur High School
H.S., 12th standard - Statistics, Mathematics, Physics, Chemistry
Uttarpara Children's Own Home
Madhyamik, 10th standard
Honors - awards - recognition
Honors & Awards
Professional and academic recognition.The Genius Award - H1 - 2024
Associated with EXL.
Topper in B.Sc. Statistics Honours from College
Ranked first in B.Sc. Statistics Honours in the Department of Statistics of Surendranath College.
Mathematical Competence Test (A.I.M.T.)
Earned 15th rank in the mathematics competition.