Handling Trees in Data Science Algorithmic Interview

Handling Trees in Data Science Algorithmic Interview

Algorithms and data structures are an integral part of data science. While most of us data scientists don’t take a proper algorithms course while studying, they are crucial all the same. Many companies ask data structures and algorithms as part of their interview process for hiring data scientists. Now the question that many people ask here is what is the use of asking a data scientist such questions. The way I like to describe it is that a data structure question may be thought of as a coding aptitude test.
A simple introduction to Linked Lists for Data Scientists

A simple introduction to Linked Lists for Data Scientists

Algorithms and data structures are an integral part of data science. While most of us data scientists don’t take a proper algorithms course while studying, they are important all the same. Many companies ask data structures and algorithms as part of their interview process for hiring data scientists. Now the question that many people ask here is what is the use of asking a data scientist such questions. The way I like to describe it is that a data structure question may be thought of as a coding aptitude test.
Dynamic Programming for Data Scientists

Dynamic Programming for Data Scientists

Algorithms and data structures are an integral part of data science. While most of us data scientists don’t take a proper algorithms course while studying, they are important all the same. Many companies ask data structures and algorithms as part of their interview process for hiring data scientists. Now the question that many people ask here is what is the use of asking a data scientist such questions. The way I like to describe it is that a data structure question may be thought of as a coding aptitude test.
The 5 most useful Techniques to Handle Imbalanced datasets

The 5 most useful Techniques to Handle Imbalanced datasets

Have you ever faced an issue where you have such a small sample for the positive class in your dataset that the model is unable to learn? In such cases, you get a pretty high accuracy just by predicting the majority class, but you fail to capture the minority class, which is most often the point of creating the model in the first place. Such datasets are a pretty common occurrence and are called as an imbalanced dataset.
3 Industries That Benefit from Data Science

3 Industries That Benefit from Data Science

Collecting and analysing data, including but not limited to text, images, and video formats, is a huge part of various industries. It can be an incredibly complex process to sift through massive amounts of data and leverage it to benefit your business by discovering key patterns. Many people who begin learning data science or are considering taking it up are often employed in other industries, to begin with. They may be afraid that pursuing this new area will leave them high and dry with few prospects, and considering how taking up data science requires a good background of Probability and Statistics, they may not think it’s worth the effort.
Using Gradient Boosting for Time Series prediction tasks

Using Gradient Boosting for Time Series prediction tasks

Time series prediction problems are pretty frequent in the retail domain. Companies like Walmart and Target need to keep track of how much product should be shipped from Distribution Centres to stores. Even a small improvement in such a demand forecasting system can help save a lot of dollars in term of workforce management, inventory cost and out of stock loss. While there are many techniques to solve this particular problem like ARIMA, Prophet, and LSTMs, we can also treat such a problem as a regression problem too and use trees to solve it.
Take your Machine Learning Models to Production with these 5 simple steps

Take your Machine Learning Models to Production with these 5 simple steps

Creating a great machine learning system is an art. There are a lot of things to consider while building a great machine learning system. But often it happens that we as data scientists only worry about certain parts of the project. But do we ever think about how we will deploy our models once we have them? I have seen a lot of ML projects, and a lot of them are doomed to fail as they don’t have a set plan for production from the onset.