Data Engineering

10 posts
PySpark - Group and Concatenate Strings in a DataFrame
Academy Membership, PySpark, Python

Introduction: In this tutorial, we show how to group and concatenate strings in a PySpark DataFrame. To do this, we use the groupBy() method in combination with the PySpark functions concat_ws(), collect_list() and array_distinct(). Import Libraries: First, we import the following...

How to containerize a FastAPI Application with Docker
Academy Membership, FastAPI, Docker

Introduction: FastAPI, a high-performance Python web framework, coupled with Docker, a powerful containerization tool, can significantly boost the efficiency of your development workflow. In this blog post, we'll walk you through the process of setting up a FastAPI project using a Dockerfile, providing a flexible and scalable solution...

Structured vs. Semi-structured vs. Unstructured Data
Academy Membership, Data, Data Engineering

Introduction: Data comes in different forms, each with its own characteristics and challenges. There are three main categories of data: structured, semi-structured and unstructured. In this tutorial, we explore the characteristics of each category along with some examples. Structured Data: First, let's have a look...
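To make the three categories concrete, here is a small sketch using only the Python standard library. The sample CSV, JSON and text strings are invented for illustration and do not come from the tutorial; they only show how each kind of data is typically handled.

```python
import csv
import io
import json

# Structured: tabular data with a fixed schema (every row has the same columns).
structured = "id,name,age\n1,Ada,36\n2,Linus,54\n"
rows = list(csv.DictReader(io.StringIO(structured)))

# Semi-structured: self-describing records whose fields may vary per record.
semi_structured = (
    '[{"id": 1, "name": "Ada", "tags": ["math"]}, {"id": 2, "name": "Linus"}]'
)
records = json.loads(semi_structured)

# Unstructured: free text with no schema; any structure has to be derived.
unstructured = "Ada wrote the first program. Linus started a kernel."
word_count = len(unstructured.split())

print(rows[0]["name"])        # field access by column name
print("tags" in records[1])   # the second record has no "tags" field
print(word_count)
```

The CSV rows all share the same keys, the JSON records do not, and the text offers nothing to query until it is processed, for example by counting words.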

How to set up a FastAPI Project
Academy Membership, FastAPI, Python

Introduction: FastAPI has quickly gained popularity as a modern, fast and easy-to-use Python web framework for building RESTful APIs. In this tutorial, we show you step by step how to set up a FastAPI project. Prerequisites: First of all, make sure you have Python installed on your system. Furthermore, it is recommended...

What is a Data Lakehouse?
Academy Membership, Data Engineering, Databricks

Introduction: In this tutorial, we explain the characteristics of a Data Lakehouse. To do this, we take a closer look at the key features of Data Lakes and Data Warehouses and how a Data Lakehouse combines the best of both worlds. Definition: At its core,...
