Data Science

How to Deploy a Streamlit App using an Amazon Free ec2 instance?

How to Deploy a Streamlit App using an Amazon Free ec2 instance?

A Machine Learning project is never really complete if we don’t have a good way to showcase it.

While in the past, a well-made visualization or a small PPT used to be enough for showcasing a data science project, with the advent of dashboarding tools like RShiny and Dash, a good data scientist needs to have a fair bit of knowledge of web frameworks to get along.

And Web frameworks are hard to learn. I still get confused in all that HTML, CSS, and Javascript with all the hit and trials, for something seemingly simple to do.

Not to mention the many ways to do the same thing, making it confusing for us data science folks for whom web development is a secondary skill.

This is where StreamLit comes in and delivers on its promise to create web apps just using Python.

In my last post on Streamlit , I talked about how to write Web apps using simple Python for Data Scientists.

But still, a major complaint, if you would check out the comment section of that post, was regarding the inability to deploy Streamlit apps over the web.

And it was a valid complaint.

A developer can’t show up with his laptop every time the client wanted to use the app. What is the use of such an app?

So in this post, we will go one step further deploy our Streamlit app over the Web using an Amazon Free ec2 instance.


Setting up the Amazon Instance

Before we start with using the amazon ec2 instance, we need to set one up. You might need to sign up with your email ID and set up the payment information on the AWS website . Works just like a simple sign-on. From here, I will assume that you have an AWS account and so I am going to explain the next essential parts so you can follow through.

  • Go to AWS Management Console using https://us-west-2.console.aws.amazon.com/console .

  • On the AWS Management Console, you can select “Launch a Virtual Machine”. Here we are trying to set up the machine where we will deploy our Streamlit app.

  • In the first step, you need to choose the AMI template for the machine. I select the 18.04 Ubuntu Server since it is applicable for the Free Tier. And Ubuntu.

MLWhiz: Data Science, Machine Learning, Artificial Intelligence

  • In the second step, I select the t2.micro instance as again it is the one which is eligible for the free tier. As you can see t2.micro is just a single CPU instance with 512 MB RAM. You can opt for a bigger machine if you are dealing with a powerful model or are willing to pay.

MLWhiz: Data Science, Machine Learning, Artificial Intelligence

  • Keep pressing Next until you reach the “6. Configure Security Group” tab. You will need to add a rule with Type: “Custom TCP Rule”, Port Range:8501, and Source: Anywhere. We use the port 8501 here since it is the custom port used by Streamlit.

MLWhiz: Data Science, Machine Learning, Artificial Intelligence

  • You can click on “Review and Launch” and finally on the “Launch” button to launch the instance. Once you click on Launch you might need to create a new key pair. Here I am creating a new key pair named streamlit and downloading that using the “Download Key Pair” button. Keep this key safe as it would be required every time you need to login to this particular machine. Click on “Launch Instance” after downloading the key pair

MLWhiz: Data Science, Machine Learning, Artificial Intelligence

  • You can now go to your instances to see if your instance has started. Hint: See the Instance state, it should be showing “Running”

MLWhiz: Data Science, Machine Learning, Artificial Intelligence

  • Select your instance, and copy the Public DNS(IPv4) Address from the description. It should be something starting with ec2.

  • Once you have that run the following commands in the folder you saved the streamlit.pem file. I have masked some of the information here.

    chmod 400 streamlit.pem

    ssh -i "streamlit.pem" ubuntu@<Your Public DNS(IPv4) Address>

MLWhiz: Data Science, Machine Learning, Artificial Intelligence


Installing Required Libraries

Whoa, that was a handful. After all the above steps you should be able to see the ubuntu prompt for the virtual machine. We will need to set up this machine to run our app. I am going to be using the same streamlit_football_demo app that I used in my previous post .

We start by installing miniconda and adding its path to the environment variable.

sudo apt-get update

wget https://repo.continuum.io/miniconda/Miniconda3-latest-Linux-x86_64.sh -O ~/miniconda.sh

bash ~/miniconda.sh -b -p ~/miniconda

echo "PATH=$PATH:$HOME/miniconda/bin" >> ~/.bashrc

source ~/.bashrc

We then install additional dependencies for our app to run. That means I install streamlit and plotly_express .

pip install streamlit
pip install plotly_express

And our machine is now prepped and ready to run.


Running Streamlit on Amazon ec2

As I am set up with the instance, I can get the code for my demo app from Github . Or you can choose to create or copy another app as you wish.

git clone https://github.com/MLWhiz/streamlit_football_demo.git

cd streamlit_football_demo
streamlit run helloworld.py

MLWhiz: Data Science, Machine Learning, Artificial Intelligence

Now you can go to a browser and type the external URL to access your app. In my case the address is http://35.167.158.251:8501 . Here is the output. This app will be up right now if you want to play with it.

MLWhiz: Data Science, Machine Learning, Artificial Intelligence


A Very Small Problem Though

We are up and running with our app for the world to see. But whenever you are going to close the SSH terminal window the process will stop and so will your app.

So what do we do?

TMUX to the rescue. TMUX allows us to keep running our sessions even after we leave the terminal window. It also helps with a lot of other things but I will just go through the steps we need.

First, we stop our app using Ctrl+C and install tmux

sudo apt-get install tmux

We start a new tmux session using the below command. We keep the name of our session as StreamSession. You could use any name here.

tmux new -s StreamSession

MLWhiz: Data Science, Machine Learning, Artificial Intelligence

You can see that the session name is “StreamSession” at the bottom of the screen. You can now start running streamlit in the tmux session.

streamlit run helloworld.py

MLWhiz: Data Science, Machine Learning, Artificial Intelligence

You will be able to see your app at the External URL . The next step is to detach our TMUX session so that it continues running in the background when you leave the SSH shell. To do this just press Ctrl+B and then D (Don’t press Ctrl when pressing D)

MLWhiz: Data Science, Machine Learning, Artificial Intelligence

You can now close your SSH session and the app will continue running at the External URL.

And Voila! We are up and running.

Pro TMUX Tip: You can reattach to the same session by using the attach command below. The best part is that you can close your SSH shell and then maybe come back after some hours and reattach to a session and keep working from wherever you were when you closed the SSH shell.

tmux attach -t StreamSession

Simple Troubleshooting:

If your app is not hosting at 8501, it means that an instance of streamlit app is already running on your system and you will need to stop that. You can do so by first finding the process ID

ps aux | grep streamlit

You will see something like:

ubuntu   20927  2.4 18.8 713780 189580 pts/3   Sl+  19:55   0:26 /home/ubuntu/miniconda/bin/python /home/ubuntu/miniconda/bin/streamlit run helloworld.py

You will need to kill this process. You can do this simply by

kill -9 20947

Conclusion

Our Final App

Streamlit has democratized the whole process to create apps, and I couldn’t recommend it more. If you want to learn more about how to create awesome web apps with Streamlit then read up my last post.

In this post, we deployed a simple web app on AWS using amazon ec2.

In the process of doing this, we created our own Amazon ec2 instance, logged into the SSH shell, installed miniconda and dependencies, ran our Streamlit application and learned about TMUX. Enough learning for a day?

So go and show on these Mad skills. To end on a lighter note, as Sten Sootla says in his satire piece which I thoroughly enjoyed:

The secret: it’s not what you know, it’s what you show.

If you want to learn more about how to structure a Machine Learning project and the best practices, I would like to call out his excellent third course named Structuring Machine learning projects in the Coursera Deep Learning Specialization . Do check it out.

Thanks for the read. I am going to be writing more beginner-friendly posts in the future too. Follow me up at Medium or Subscribe to my blog

Also, a small disclaimer — There might be some affiliate links in this post to relevant resources, as sharing knowledge is never a bad idea.

Start your future with a Data Analysis Certificate.
comments powered by Disqus