Prerequisites and tools

This lesson covers tools and setup. Use AuraDB Professional for Neo4j and Northwind CSV files for data. You’ll use Data Importer in Module 3. The course focuses on schema analysis and modeling decisions; the source could be PostgreSQL, BigQuery, Snowflake, or any relational store.

What you need

To complete this course, you will need:

  • A Neo4j AuraDB Professional instance (no credit card required)

  • The Northwind CSV files (instructions below)

  • Basic understanding of relational databases: tables, rows, columns, foreign keys

  • Basic understanding of graph databases: nodes, relationships, properties

  • Familiarity with SQL queries

  • Basic knowledge of Cypher query language

Prerequisites

If you’re new to Neo4j or graphs, complete these first:

Neo4j Database

Create an AuraDB Professional instance for this course:

  • No credit card required - AuraDB Professional includes a free tier

  • Graph Data Science - Run algorithms (PageRank, community detection) after importing

  • Data Importer included - The visual import tool is built into the Aura console

Creating an AuraDB Professional instance

To create an AuraDB Professional instance:

  1. Go to console.neo4j.io/graphacademy

  2. Sign in or create a Neo4j account

  3. Click New Instance

  4. Select AuraDB Professional

  5. Choose a region close to you

  6. Save your connection credentials securely

Why AuraDB Professional?

Why AuraDB Professional?

AuraDB Professional provides access to Graph Data Science algorithms without requiring a credit card. After importing your data, you can run algorithms like PageRank, community detection, and pathfinding directly on your graph.

Northwind Dataset

The Northwind data is available as pre-exported CSV files in the Neo4j GitHub repository, ready to import directly into Neo4j:

Northwind CSV files: https://github.com/neo4j-graph-examples/northwind/tree/main/import

You can:

  • Download the CSV files and upload them to the Neo4j Data Importer

  • Use LOAD CSV in Cypher to import directly from the GitHub raw URLs

Quick start

The import process is covered in Module 3. Use these CSV files with the Data Importer or Cypher in your AuraDB instance.

Using the Data Importer

The Data Importer is built into the AuraDB console. For Northwind, use the CSV files from the previous section:

Open the Data Importer

  1. In Neo4j Aura, open your AuraDB Professional instance

  2. Click Import in the left sidebar

Add your data

  1. In the Data Importer, click Files and upload the Northwind CSV files from https://github.com/neo4j-graph-examples/northwind/tree/main/import

  2. Or drag and drop the CSV files into the Files panel

Map and import

After adding files to the canvas:

  1. Configure each file as a Node or use it to create Relationships

  2. Set the node labels and property mappings

  3. Define relationship types and connect nodes

Module 3 covers the mapping process in detail.

Approaches for Importing Data

Data sources and import methods:

  • Relational Database Management Systems (RDBMS)

  • Web APIs

  • Public data directories

  • BI tools

  • Excel

  • Flat files (CSV, JSON, XML)

Import factors and options

Import method depends on these factors:

  • The source of the data

  • The volume of data

  • The frequency of the import

  • The complexity of the data model

  • The transformation required

Import approaches (choose based on your data volume, update frequency, and transformation needs):

  • One-off batch import of all data

  • One-off load with a regular update

  • Continuous import of data

  • Real-time application updates

  • ETL (Extract, Transform, Load) pipelines

Options for Importing Data

Import options and tools:

A flowchart showing numerous paths and options for importing data into Neo4j. The flowchart lists import tools such as neosemantics

Use this flowchart to pick an import tool for your data source. For this course, you will use Neo4j Data Importer or Arrows.app (Module 3).

Check Your Understanding

1. What is the best solution?

True or False - Using an ETL pipeline will always be the best solution for importing data into Neo4j.

  • ❏ True

  • ✓ False

Hint

ETL is one option. Others include Data Importer, LOAD CSV, neo4j-admin, and custom apps. Choose based on data source, volume, and whether import is one-time or recurring.

Solution

False. Choose the import method based on: data source (CSV, relational database, API), data volume, update frequency (one-time vs recurring), and transformation complexity.

Summary

In this lesson, you explored some of the approaches for importing data.

You can try the optional mapping challenge to practice the relational-to-graph concepts, or continue to the next module to start designing your graph model.

Chatbot

How can I help you today?