Data Engineering

12 posts
PySpark - Concatenate String Columns of a DataFrame
Academy Membership, PySpark, Python

Introduction: In this tutorial, we show how to concatenate multiple string columns of a PySpark DataFrame into a single column, using the PySpark functions concat() and concat_ws(). Import Libraries: First, we import the following Python modules: from pyspark.sql...

PySpark - Group and Concatenate Strings in a DataFrame
Academy Membership, PySpark, Python

Introduction: In this tutorial, we show how to group and concatenate strings in a PySpark DataFrame, using the groupBy() method in combination with the PySpark functions concat_ws(), collect_list() and array_distinct(). Import Libraries: First, we import the following...

How to containerize a FastAPI Application with Docker
Academy Membership, FastAPI, Docker

Introduction: FastAPI, a high-performance Python web framework, coupled with Docker, a containerization tool, can make your development workflow more efficient. In this blog post, we walk you through setting up a FastAPI project using a Dockerfile, providing a flexible and scalable solution...

Structured vs. Semi-structured vs. Unstructured Data
Academy Membership, Data, Data Engineering

Introduction: Data comes in different forms, each with its own characteristics and challenges. There are three main categories of data: structured, semi-structured and unstructured. In this tutorial, we explore the characteristics of each kind of data and look at some examples. Structured Data: First, let's have a look...
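
To make the three categories concrete, here is a minimal sketch using only the Python standard library. It is not taken from the tutorial itself: the sample values and the variable names (structured_csv, semi_structured_json, unstructured_text) are hypothetical and chosen purely for illustration.

```python
import csv
import io
import json

# Structured: fixed columns, every row follows the same schema
# (hypothetical sample data).
structured_csv = "id,name,age\n1,Alice,30\n2,Bob,25\n"
rows = list(csv.DictReader(io.StringIO(structured_csv)))

# Semi-structured: self-describing keys, but fields can be nested
# or missing from one record to the next.
semi_structured_json = (
    '[{"id": 1, "name": "Alice", "tags": ["admin"]},'
    ' {"id": 2, "name": "Bob"}]'
)
records = json.loads(semi_structured_json)

# Unstructured: free text with no predefined fields; any structure
# has to be extracted by the consumer.
unstructured_text = "Alice joined in 2020 and later became an administrator."

print(rows[0]["name"])             # column access by a known schema
print(records[1].get("tags", []))  # optional field needs a default
print(unstructured_text.split())   # only generic tokenization applies
```

The structured rows can be read by column name without any checks, the semi-structured records need a default for the optional field, and the free text only supports generic operations such as splitting into words.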

How to set up a FastAPI Project
Academy Membership, FastAPI, Python

Introduction: FastAPI has quickly gained popularity as a modern, fast and easy-to-use Python web framework for building RESTful APIs. In this tutorial, we show you step by step how to set up a FastAPI project. Prerequisites: First of all, make sure you have Python installed on your system. Furthermore, it is recommended...
