Premium

A list of premium posts for academy members containing hands-on tutorials, best practices, career advices and learning paths.

358 premium posts
How to Load API Data into DuckDB with dlt
Academy Membership dltDuckDB

How to Load API Data into DuckDB with dlt

📘 Introduction In this hands-on tutorial, you will learn how to load data from a REST API into DuckDB using dlt. This is a great first local data pipeline because you do not need a cloud warehouse, a complex setup, or production credentials. We will use dlt to fetch data...

dlt vs dbt: What is the Difference?
Academy Membership dltdbt

dlt vs dbt: What is the Difference?

📘 Introduction When you start learning modern data pipelines, you will quickly hear about both dlt and dbt. The names look similar, and both tools are used in data engineering workflows. But they solve different problems. In this beginner-friendly guide, we will explain the difference between dlt and dbt, where...

What Is n8n and Why Is It So Powerful?
Academy Membership n8nAgentic AI

What Is n8n and Why Is It So Powerful?

📘 Introduction Automation has become a central pillar of modern digital operations, but most tools either limit flexibility or lock you into rigid, predefined steps. n8n stands out because it bridges the gap between low-code usability and deep technical power. It brings together visual workflow building, full control over data,...

Load Data from a JSON File into DuckDB using Python
Academy Membership DuckDBPython

Load Data from a JSON File into DuckDB using Python

📘 Introduction Working with structured data is a core part of any data engineering or analytics workflow. DuckDB, often called the SQLite for analytics, makes it incredibly easy to query local data files—including JSON—without requiring a complex setup. In this tutorial, you’ll learn how to load JSON data...

Install n8n Locally with npm: A Step-by-Step Guide
Academy Membership n8nAgentic AI

Install n8n Locally with npm: A Step-by-Step Guide

📘 Introduction n8n is a flexible and powerful automation platform that helps you connect apps, orchestrate workflows, and even build AI-driven processes — all without writing code. Installing n8n locally with npm gives you a lightweight, customizable setup that runs directly on your system, making it perfect for development, debugging, and...

How to convert CSV to JSON in Python using Pandas
Academy Membership PandasPython

How to convert CSV to JSON in Python using Pandas

📘 Introduction JSON is one of the most widely used data formats for APIs, configurations, storage, and modern applications. Converting CSV to JSON in Python is incredibly simple using Pandas. In this tutorial, we will walk through the full process: creating a sample CSV file, loading it into a Pandas DataFrame,...

Load Data from a Parquet File into DuckDB using Python
Academy Membership DuckDBPython

Load Data from a Parquet File into DuckDB using Python

📘 Introduction In this tutorial, you’ll learn how to load data from a Parquet file into a DuckDB database using Python. DuckDB’s native Parquet support makes it fast and efficient to work with columnar data, making it ideal for analytics, ETL pipelines, and Python data projects. You’ll see...

How to convert CSV to Parquet in Python using Pandas
Academy Membership PandasPython

How to convert CSV to Parquet in Python using Pandas

📘 Introduction In modern data workflows, Parquet is a popular columnar storage format for efficient data storage and faster analytics. Converting CSV to Parquet in Python is straightforward using Pandas and PyArrow. In this tutorial, we will walk you through the complete process: from creating a sample CSV file, reading it...

Install n8n Locally with Docker Desktop: A Step-by-Step Guide
Academy Membership n8nDocker

Install n8n Locally with Docker Desktop: A Step-by-Step Guide

📘 Introduction n8n is a powerful automation platform that gives you the freedom to build workflows, integrate apps, and create AI-powered automations without writing any code. Running n8n locally with Docker Desktop provides a fast, clean, and reliable setup that isolates your environment while keeping your system clutter-free. With...

Insert Data from a CSV File into DuckDB using Python
Academy Membership DuckDBPython

Insert Data from a CSV File into DuckDB using Python

📘 Introduction In this tutorial, you will learn how to insert data from a CSV file into a DuckDB database using Python. DuckDB is a powerful in-process analytical database that makes working with structured data fast, simple, and efficient—perfect for data projects, notebooks, and local analytics. ✅ Prerequisites Before you...

Install n8n Locally with Docker Compose: A Step-by-Step Guide
Academy Membership n8nDocker

Install n8n Locally with Docker Compose: A Step-by-Step Guide

📘 Introduction n8n is a powerful workflow automation platform that lets you connect apps, automate processes, and even build AI agents — all without writing a single line of code. Running n8n locally using Docker Compose gives you a clean, reliable, and easily reproducible environment that’s perfect for development, experimentation, and...

Insert Data into DuckDB Using Python: A Practical Guide
Academy Membership DuckDBPython

Insert Data into DuckDB Using Python: A Practical Guide

📘 Introduction DuckDB has quickly become a favorite tool among data engineers and analysts because of its speed, simplicity, and ability to run analytical SQL queries directly within Python. Whether you’re prototyping data pipelines, running local analytics, or managing lightweight data storage, knowing how to insert data into DuckDB is...

How to Rename Multiple DataFrame Columns at Once in PySpark
Academy Membership PySparkPython

How to Rename Multiple DataFrame Columns at Once in PySpark

📘 Introduction Renaming columns is one of the most common transformations you’ll perform when cleaning or standardizing data in PySpark. Whether you’re aligning tables from different systems, preparing data for machine learning, or simply making column names more readable, updating many column names at once can quickly become tedious...

Query CSV files in Python using DuckDB
Academy Membership DuckDBPython

Query CSV files in Python using DuckDB

📘 Introduction One of DuckDB’s most useful features is the ability to query CSV files directly—no need to load them into a database first. This tutorial will guide you through running SQL queries on a CSV file using Python. ✅ Prerequisites Before you begin, make sure you have: 🐍☑️ Installed Python...

Query a Pandas DataFrame using DuckDB
Academy Membership DuckDBPython

Query a Pandas DataFrame using DuckDB

📘 Introduction If you enjoy working with pandas but wish you could use clean, powerful SQL at any time, then DuckDB is the right tool for you. With DuckDB, you can query your DataFrames instantly without having to set up a database, run a server, or change your workflow. ✅ Prerequisites Before...

How to install DuckDB with pip
Academy Membership DuckDBData Engineering

How to install DuckDB with pip

📘Introduction In this blog post, we’ll walk through how to install DuckDB, a fast and lightweight analytical database engine designed for modern data workflows. Whether you're working on data analysis, ETL pipelines, or experimenting with in-process SQL queries, DuckDB is incredibly easy to set up and...

PySpark coalesce() Function Explained
Academy Membership PySparkPython

PySpark coalesce() Function Explained

📘 Introduction In many real-world datasets, the same type of information can appear in more than one column. A customer may provide an email address, a phone number, or a backup contact, and different systems may populate different fields. When you want to select the first available non-null value...

Using pre-hooks in dbt: A Hands-On Guide
Academy Membership dbtData Engineering

Using pre-hooks in dbt: A Hands-On Guide

📘Introduction In this hands-on dbt tutorial, you'll learn how to use pre-hooks to automate tasks such as creating backup tables before a model runs. Pre-hooks allow you to execute SQL before your dbt model builds, which is useful for auditing, data quality checks, or preparing...

Overview of all important YAML Files in dbt
Academy Membership dbtData Engineering

Overview of all important YAML Files in dbt

📘Introduction When working with dbt (data build tool), YAML files are the backbone of your project’s configuration. They define how dbt behaves, how your models connect to data sources, and how metadata, documentation, and tests are managed. Understanding these YAML files and knowing where they are located within your...

How to Generate a Hash from Multiple Columns in PySpark
Academy Membership PySparkData Engineering

How to Generate a Hash from Multiple Columns in PySpark

📘 Introduction When processing massive datasets in PySpark, it’s often necessary to uniquely identify rows or efficiently detect changes across records. Using multiple columns as a composite key can quickly become cumbersome and inefficient — especially during joins or deduplication. A better solution is to generate a single hash value derived...

Configuring DEV and PROD Environments in dbt
Academy Membership dbtData Engineering

Configuring DEV and PROD Environments in dbt

📘Introduction In this hands-on dbt tutorial, you’ll learn how to configure separate development (DEV) and production (PROD) environments to safely build, test, and deploy your data models. We’ll walk through why environment separation matters and how to configure your profiles.yml so you can switch between environments...

Using Jinja loops for flexible Aggregations in dbt
Academy Membership dbtData Engineering

Using Jinja loops for flexible Aggregations in dbt

📘Introduction In this hands-on dbt tutorial, you’ll learn how to make your aggregations dynamic and flexible using Jinja loops inside your SQL models. Instead of writing multiple aggregation functions, you’ll see how to dynamically generate aggregation logic — saving time and reducing repetitive SQL code. 🎓 Preparing for dbt...

Making CASE Statements dynamic with Jinja Loops in dbt
Academy Membership dbtData Engineering

Making CASE Statements dynamic with Jinja Loops in dbt

📘Introduction In this hands-on dbt tutorial, you’ll learn how to make your SQL transformations dynamic and reusable by using Jinja loops inside CASE statements. This approach helps you replace repetitive SQL logic with concise, maintainable code — a valuable skill for any Data Engineer. 🎓 Preparing for dbt Analytics Engineering...

Using For Loops in your dbt models with Jinja
Academy Membership dbtData Engineering

Using For Loops in your dbt models with Jinja

📘Introduction In this hands-on dbt tutorial, you'll learn how to use Jinja for loops inside your dbt models to make your SQL code more dynamic and automated. Instead of manually repeating similar SQL logic for multiple columns, tables, or conditions — you can use for loops with Jinja...

Overwriting Project Variables at Runtime in dbt
Academy Membership dbtData Engineering

Overwriting Project Variables at Runtime in dbt

📘 Introduction In this hands-on dbt tutorial, you'll learn how to overwrite project variables at runtime — a powerful feature that lets you dynamically change your dbt model behavior without modifying your code or dbt_project.yml. This is especially useful when you need to run the same transformation...

GraphQL vs. REST: Understanding the Key Differences
Academy Membership APIData Engineering

GraphQL vs. REST: Understanding the Key Differences

📘Introduction In this tutorial, you'll learn how GraphQL and REST differ and why developers choose one over the other. Whether you’re building modern web apps or managing large datasets, understanding these differences is essential for efficient, scalable applications. 🟣 GraphQL GraphQL (Graph Query Language) is a query language...

You’ve successfully subscribed to Deep Learning Nerds | The ultimate Learning Platform for AI and Data Science
Welcome back! You’ve successfully signed in.
Great! You’ve successfully signed up.
Success! Your email is updated.
Your link has expired
Success! Check your email for magic link to sign-in.