Saroj Rimal
the Data Analyst

An aspiring and detail-focused Data Analyst interested in Data Science, Visualization, Storytelling, Machine Learning, and Big Data with problem understanding, solving capabilities, and translating them into data-driven insights to support business goals

My Portfolio

Here are some of my expertise

Data Analytics

Analyzing data to get insights and helping businesses in data-driven decision making

Data Engineering

Connecting multiple data sources and Managing data pipelines, workflows, and ETL processes

Statistical Analysis

Leveraging statistical techniques to understand the data and extract business insights

Data Reporting & Visualization

Creating interactive reports and dashboards to help make informed data decisions

Machine Learning & Deep Learning

Building algorithms to identify trends and patterns and reduce human error

Business Knowledge

Leveraging data to help stakeholders make data-driven business decisions

Sentiment Analysis of London AirBnb

Sentiment Analyis of comments and reviews of AirBnb inside London area and provide potential suggestions to improve the negative reviews.

Platform Used:

Python; Jupyter Notebook

Libraries Used

Pandas, NumPy, Matplotlib, NLTK

Tomato Diseases Identification using CNN (Deep Learning)

Research and develop algorithm for processing unstructured data using Data Augmentation and Image Data Generator techniques with accuracy of 97%. Train multiple Neural Networks image data and compare them based on accuracy, size of data, and time taken. Delivered an Open-source Mobile Application platform.

Tools used

Python (Tensorflow, Tensorboard, Numpy), Deep Learning Models (CNN), Android Mobile App (Java)

Predictive Data Analysis on
Daily Bike Rentals

The main goal is to build a predictive model using Machine-Learning to predict the expected number of rentals in a given day.

Platform and Model used

Python (Pands,NumPy, Matplotlib, Seaborn) ML Model (Liner Regression) R-squared

Detection of Malware using Machine Learning and Deep Learning algorithms

Perform descriptive analysis and summary statistics on Malware data Prepared the raw data and trained using different algorithms and compared them based on accuracy, precision, f1-score, recall, size of data, and time taken Build ML and DL models to identify the malwares

Platform and Model used

Python (Pands,NumPy, Matplotlib, Seaborn), ML & DL Models (Logistic Regression, Decision Tree, Random Forest, Neural Networks)