A step-by-step introduction to getting started with data analysis in Julia.

Photo by Pixabay from Pexels

Getting started with Julia is pretty straightforward, especially when you are familiar with Python. For this walk-through we will be using data on Covid-19 as provided by the Center for Systems Science and Engineering at Johns Hopkins University in their GitHub repository.

Getting Started

For our data analysis we will use just a few packages to keep things simple: CSV, DataFrames, Dates and Plots. Simply type using followed by the name of the package and you are ready to go.

If a package has not yet been added to your project environment, you can add it easily.

Reading Data

Reading…


Powerful Python Packages to Boost Your Data Analytics Workflow

Photo by David Clode on Unsplash

Sometimes you discover powerful Python packages that boost your data analytics or data science workflow and wish you had known about them sooner. In this story I share three of my favorites.

1. Pandas Profiling

While the pandas df.describe() method is great for exploratory data analysis, you will most likely want a deeper understanding of your data. This is where the pandas profiling package comes in handy! You can install it with the pip package manager by running:

pip install pandas-profiling[notebook]

For our example we will use the Iris dataset and start with describing the data using the pandas df.describe() method.
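As a minimal sketch of that first step, the snippet below calls df.describe() on a small DataFrame. It assumes a hand-typed sample of the first five Iris rows instead of the full dataset, and the column names are chosen here for illustration; the story's own loading code is not shown in this preview.

```python
import pandas as pd

# Assumption: five hand-typed Iris rows stand in for the full dataset;
# the column names are illustrative, not taken from the story.
iris_sample = pd.DataFrame(
    {
        "sepal_length": [5.1, 4.9, 4.7, 4.6, 5.0],
        "sepal_width": [3.5, 3.0, 3.2, 3.1, 3.6],
        "petal_length": [1.4, 1.4, 1.3, 1.5, 1.4],
        "petal_width": [0.2, 0.2, 0.2, 0.2, 0.2],
        "species": ["setosa"] * 5,
    }
)

# By default describe() summarizes only the numeric columns:
# count, mean, std, min, quartiles and max.
summary = iris_sample.describe()
print(summary)
```

Note that the text column species is left out of the summary, which is one reason a fuller profiling report can be useful.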


Powerful pandas functions explained to boost your data analytics workflow.

Photo by Sid Balachandran on Unsplash

When using pandas in your data science or data analytics projects, you sometimes discover powerful new functions you wish you had known about sooner. Here is my personal top 5.

1. Web Scraping

Pandas has a powerful method read_html() for scraping data tables from webpages.

Let’s assume we need data on gross national income. It is available in a data table on Wikipedia.
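A rough sketch of how read_html() behaves is shown below. To keep it runnable offline, it parses a tiny inline HTML table instead of the Wikipedia page; the country names and values are placeholders, not real income figures. It also assumes an HTML parser such as lxml is installed, which read_html() needs.

```python
from io import StringIO

import pandas as pd

# Assumption: a tiny inline table with placeholder values stands in for
# the Wikipedia page; these are not real income figures.
html = """
<table>
  <tr><th>Country</th><th>GNI</th></tr>
  <tr><td>Country A</td><td>100</td></tr>
  <tr><td>Country B</td><td>250</td></tr>
</table>
"""

# read_html() returns a list with one DataFrame per table it finds.
tables = pd.read_html(StringIO(html))
gni = tables[0]
print(gni)
```

With a real webpage you would pass its URL to read_html() and pick the table you need from the returned list.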


With a little help from an easy-to-use Python script…

Photo by 🇸🇮 Janko Ferlič on Unsplash

The topics your Medium story was curated into are easy to find when you know where to look. Zulie Rane wrote an excellent story¹ explaining the steps to find these topics manually. In this story I describe how to automate these steps.

Prerequisite

We will use the programming language Python to make a little script to automagically return the topics your story was curated into. So make sure you have Python installed on your laptop. If it isn’t preinstalled you can download and install it from python.org.

Script

The script to automate…


If you like Jupyter, you will love Pluto!

Photo by David Menidrey on Unsplash

Pluto is a lightweight and easy to use reactive notebook for the Julia language. In this story I will share my experience with Pluto, especially the five features I love most.

Getting started

Getting started with Pluto is easy. You just add the Pluto package to your project environment and you are good to go. Start Pluto by typing Pluto.run() from the Julia REPL and Pluto opens in your default web browser.

In my workflow I usually navigate to my project directory and start Pluto with the command:

julia --threads auto --project=. -e "using Pluto; Pluto.run()"

Top 5 Features

There are many things to love…


How to plot choropleth maps in Julia using the Plots package

Photo by KOBU Agency on Unsplash

For this walk-through we will use shapefiles and data published as open data by Statistics Netherlands and the National Institute for Public Health and the Environment. We are plotting a thematic map with daily Covid-19 cases using the Plots package.

Load Packages

First we will have to load a few packages.

When needed, add missing packages to your Julia project environment. For step-by-step instructions, see my previous story.

Create Directories

To keep things organized, we will create some directories in our project environment.

Create Function

For mapping actual values to colors, we need to normalize those values. …


An introduction to Julia’s built-in package manager for generating project environments and managing package dependencies.

📦 Manager (Photo by Nana Smirnova on Unsplash)

When you start to code multiple projects in Julia, it is recommended to use project-specific environments for reproducibility and to minimize package dependencies. Julia has a great built-in package manager to make things easy. In this story I share my workflow step by step.

The Package Manager

Pkg is Julia’s built-in package manager and handles operations such as adding, updating and removing packages. Pkg has its own read-evaluate-print loop (REPL). In my workflow, when I want to create a new project environment, I usually start Julia from the directory where I keep my coding projects. …

René

Senior Information Manager with a passion for all things data. Official author of Towards Data Science (TDS).
