Building recommendation systems with Neo4j

Ready to create Netflix-style recommendations that actually work? Learn how to build, optimize, and deploy smart suggestion systems with Neo4j, from basic graphs to production-ready features.

Data Science

Mateusz Jasiński

27 Feb 2025


How to build a recommendation system with Neo4j

Modern applications rely on recommendation systems to power personalized suggestions that improve the user experience. Companies like Netflix, Amazon, and Spotify have demonstrated the value of well-implemented recommendation engines; Netflix has estimated that its recommendation system saves it $1 billion annually through better customer retention. In this article, we'll explore how to use Neo4j, a leading graph database, to create a simple yet effective recommendation system.

Understanding recommendation systems

A recommendation system is designed to filter information and suggest items or content to the end user. These systems typically fall into three main categories:

Collaborative filtering

This approach analyzes user behavior patterns to suggest items that similar users have enjoyed. You've likely seen it in action in 'Customers who bought this also bought...' recommendations, which use collective user preferences to make predictions.
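
As a rough illustration, collaborative filtering maps naturally onto a graph query. The sketch below assumes User nodes linked to Movie nodes by Watched relationships (the same model used in the walkthrough later in this article). The user name, the LIMIT, and the scoring by number of overlapping viewers are illustrative choices, not a prescribed algorithm.

```cypher
// Find users who watched at least one movie that Tom also watched.
MATCH (me:User {firstName: "Tom"})-[:Watched]->(:Movie)<-[:Watched]-(peer:User)
WHERE peer <> me
// Collect what those similar users watched that Tom has not seen yet.
MATCH (peer)-[:Watched]->(candidate:Movie)
WHERE NOT (me)-[:Watched]->(candidate)
// Score each candidate by how many distinct similar users watched it.
RETURN candidate.title AS title, count(DISTINCT peer) AS similarViewers
ORDER BY similarViewers DESC, title
LIMIT 10
```

Counting distinct peers is the simplest possible score; a production system would typically weight peers by how much their history overlaps with the target user's.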

Content-based filtering

This method focuses on item characteristics rather than user behavior, matching products based on their inherent features and attributes. Netflix's 'Similar movies to what you've watched' feature is a prime example of content-based filtering in action.
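
A content-based query can be sketched the same way. The example below assumes a model that this article does not build: Genre nodes connected to movies by a hypothetical HasGenre relationship. It ranks unseen movies by how many genres they share with movies the user has already watched.

```cypher
// Hypothetical model: (:Movie)-[:HasGenre]->(:Genre)
// Collect the genres of everything Tom has watched.
MATCH (me:User {firstName: "Tom"})-[:Watched]->(:Movie)-[:HasGenre]->(genre:Genre)
// Find other movies in those genres that Tom has not watched.
MATCH (candidate:Movie)-[:HasGenre]->(genre)
WHERE NOT (me)-[:Watched]->(candidate)
// Rank candidates by the number of genres they share with Tom's history.
RETURN candidate.title AS title, count(DISTINCT genre) AS sharedGenres
ORDER BY sharedGenres DESC, title
LIMIT 10
```

Note that no other user's behavior is involved here: the recommendation depends only on item attributes and the target user's own history.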

Hybrid approaches

Modern systems combine collaborative and content-based filtering for more sophisticated recommendations. Spotify's Discover Weekly exemplifies this approach, using both listening history and song attributes to create personalized playlists that feel both familiar and fresh.

The main objective is to offer tailored suggestions based on user preferences, interests, or past behaviors. These systems are widely used across various business domains, such as social media, e-commerce, and entertainment, playing a crucial role in personalizing user experiences and boosting customer satisfaction.

Social networks

In social media, recommendation systems are vital for enhancing user engagement and content discovery. By analyzing signals such as hashtags, posts, and connections between users, these systems help users find relevant posts, articles, videos, or new friends.

Network diagram showing 6 users and their following relationships in coral on dark background

E-commerce

In e-commerce, recommendation systems are essential for improving customer satisfaction and boosting revenue. They generate highly personalized product suggestions based on user preferences, purchase history, and browsing history, encouraging cross-selling and upselling.

Understanding Neo4j and graph databases

Neo4j is a graph database management system optimized for handling connected data. Unlike traditional relational databases, Neo4j's native graph storage provides unique advantages for building efficient recommendation systems.

Key components

Neo4j's data model rests on nodes and relationships. Nodes represent entities such as users or products; they store properties as key-value pairs and can carry one or more labels for categorization. Relationships connect two nodes, always have a direction and a type, and can also carry properties. For example, a "PURCHASED" relationship might include timestamp and price data.
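
To make these building blocks concrete, here is a small sketch in Cypher (the query language introduced later in this article). The names Anna, Customer, and Product, as well as the property values, are made up for illustration.

```cypher
// A node with two labels (User and Customer) and one property.
CREATE (anna:User:Customer {firstName: "Anna"})
// A node with a single label and two properties.
CREATE (book:Product {name: "Graph Databases", price: 39.99})
// A directed, typed relationship that carries its own properties.
CREATE (anna)-[:PURCHASED {timestamp: datetime("2025-02-27T10:15:00Z"), price: 39.99}]->(book)
RETURN anna, book
```

Storing the price on the relationship rather than on the product records what this particular customer paid at that moment, independently of the product's current price.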

Performance benefits

The database's architecture uses index-free adjacency: each node stores direct references to its relationships, so following a relationship is a pointer lookup rather than an index search or a join. The cost of each hop is therefore independent of the total size of the graph, which often makes multi-hop queries faster than the equivalent multi-join queries in a relational database. Neo4j's native graph processing is also optimized for pattern matching and graph algorithms, both of which are essential for recommendation systems.
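
A multi-hop query shows where this matters. The sketch below assumes User nodes connected by Following relationships, as in the example built later in this article, and suggests users who are exactly two hops away. In a relational schema the same question would usually need a self-join per hop.

```cypher
// Users followed by the users Tom follows (two Following hops away).
MATCH (tom:User {firstName: "Tom"})-[:Following*2]->(fof:User)
// Skip Tom himself and anyone he already follows directly.
WHERE fof <> tom AND NOT (tom)-[:Following]->(fof)
RETURN DISTINCT fof.firstName AS suggestion
```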

Cypher Query Language

Cypher is Neo4j's declarative query language. Like SQL, it is used for creating, reading, and updating data, but instead of joining tables it describes patterns of nodes and relationships, written as (node)-[:RELATIONSHIP]->(node).

// Create a node with a label and properties

CREATE (node:Label {key_1: "value_1", key_2: "value_2"})

// Match nodes by label

MATCH (node:Label)
RETURN node

// Match nodes by an incoming relationship

MATCH (node:Label)<-[:RELATIONSHIP]-(n)
RETURN n
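
Updating and deleting follow the same match-then-act shape. This is a minimal sketch using placeholder names (Label, key_1, key_2) in the same spirit as the templates above; run the two statements separately.

```cypher
// Update: set or overwrite a property on every matched node.
MATCH (node:Label {key_1: "value_1"})
SET node.key_2 = "new_value"
RETURN node;

// Delete: remove matched nodes together with their relationships.
MATCH (node:Label {key_1: "value_1"})
DETACH DELETE node;
```

Plain DELETE fails on a node that still has relationships, which is why DETACH DELETE is used here.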

Building a recommendation system using Neo4j

For this article, we will build a movie recommendation system based on movies watched by followed users.

The system will allow the creation of relationships between the User and the Movie. Users should also be able to follow other users. The goal is to display movies watched by followed users.

Build our graph

We start by adding some data to our database: a few users and a few movies. Let's begin with our first query.

CREATE (john:User{firstName:"John"}), (tom:User{firstName:"Tom"}),
  (mark:User{firstName:"Mark"})

CREATE (titanic:Movie{title:"Titanic"}),(avatar:Movie{title:"Avatar"}),
  (forrest:Movie{title:"Forrest Gump"})

Neo4j graph showing users and movies nodes with their relationships in a recommendation system

Now that our initial nodes have been created, let's examine the graph. Next, we need to add a few relationships between nodes. We'll start with the movies Tom has watched and the users he follows.

// Create a User -> Movie relationship called Watched

MATCH
(avatar:Movie{title: "Avatar"}),
(tom:User{firstName:"Tom"})
CREATE (tom)-[:Watched]->(avatar)

// Create a User -> User relationship called Following

MATCH
(tom:User{firstName:"Tom"}),
(mark:User{firstName: "Mark"})
CREATE (tom)-[:Following]->(mark)
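
One caveat with CREATE: running the same statement twice produces two identical relationships. If that matters for your data, a common alternative is MERGE, which only creates the relationship when it does not exist yet. A minimal sketch:

```cypher
// MATCH both endpoints first, then MERGE only the relationship,
// so re-running the query does not add a second Following relationship.
MATCH (tom:User {firstName: "Tom"}), (mark:User {firstName: "Mark"})
MERGE (tom)-[:Following]->(mark)
```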

A few operations later, our final graph will look like this:

Neo4j movie recommendation graph showing 'Watched' and 'Following' relationships between users and movies
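
The remaining statements are not listed in this article. The following is one minimal set that is consistent with the query results shown in the next section. It is inferred from those results rather than taken from the original graph, which may contain additional relationships.

```cypher
// John watched Avatar and Forrest Gump.
MATCH (john:User {firstName: "John"}),
      (avatar:Movie {title: "Avatar"}),
      (forrest:Movie {title: "Forrest Gump"})
CREATE (john)-[:Watched]->(avatar), (john)-[:Watched]->(forrest);

// Mark watched Titanic (inferred: Titanic is recommended via a followed user, and John did not watch it).
MATCH (mark:User {firstName: "Mark"}), (titanic:Movie {title: "Titanic"})
CREATE (mark)-[:Watched]->(titanic);

// Tom also follows John.
MATCH (tom:User {firstName: "Tom"}), (john:User {firstName: "John"})
CREATE (tom)-[:Following]->(john);
```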

Let's make some queries

Once our graph is filled with data, we can start exploring it. Let's begin with an easy task: finding all nodes with the label User.

MATCH (users:User)
return users
===================
RESULT
╒═══════════════════════════╕
│users                      │
╞═══════════════════════════╡
│(:User {firstName: "John"})│
├───────────────────────────┤
│(:User {firstName: "Tom"}) │
├───────────────────────────┤
│(:User {firstName: "Mark"})│
└───────────────────────────┘

Easy, right? Now, let's try to find all the movies that John watched.

MATCH (user:User {firstName: "John"})-[:Watched]->(movies:Movie)
RETURN movies
=========================
RESULT
╒════════════════════════════════╕
│movies                          │
╞════════════════════════════════╡
│(:Movie {title: "Avatar"})      │
├────────────────────────────────┤
│(:Movie {title: "Forrest Gump"})│
└────────────────────────────────┘

Okay, this one was a bit more challenging, but hopefully, you got the concept. Next, let's find all users followed by Tom.

MATCH (user:User {firstName: "Tom"})-[:Following]->(following)
RETURN following
==========================
╒═══════════════════════════╕
│following                  │
╞═══════════════════════════╡
│(:User {firstName: "John"})│
├───────────────────────────┤
│(:User {firstName: "Mark"})│
└───────────────────────────┘

As you can see, using Cypher is very similar to SQL. Our final task is to build a query for our recommendation system: finding all movies watched by users followed by Tom.

MATCH (tom:User{firstName: "Tom"})-[:Following]->(users)-[:Watched]->(movies)
return movies
==============================
RESULT
╒════════════════════════════════╕
│movies                          │
╞════════════════════════════════╡
│(:Movie {title: "Titanic"})     │
├────────────────────────────────┤
│(:Movie {title: "Avatar"})      │
├────────────────────────────────┤
│(:Movie {title: "Forrest Gump"})│
└────────────────────────────────┘

Almost there! One more thing to do: we need to exclude movies already watched by Tom. For that, we'll use the WHERE NOT clause.

MATCH
(tom:User{firstName: "Tom"}),
(tom)-[:Following]->(users)-[:Watched]->(otherMovies)
WHERE NOT (tom)-[:Watched]->(otherMovies)
return otherMovies
=================================
RESULT
╒════════════════════════════════╕
│otherMovies                     │
╞════════════════════════════════╡
│(:Movie {title: "Titanic"})     │
├────────────────────────────────┤
│(:Movie {title: "Forrest Gump"})│
└────────────────────────────────┘
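
On a larger graph, this query returns a movie once for every followed user who watched it, and in no particular order. One possible refinement, sketched below, is to group by movie and rank by how many followed users watched it. The alias names and the LIMIT of 10 are arbitrary choices for illustration.

```cypher
MATCH (tom:User {firstName: "Tom"})-[:Following]->(friend:User)-[:Watched]->(movie:Movie)
WHERE NOT (tom)-[:Watched]->(movie)
// Aggregate per movie: each title appears once, with a popularity score.
RETURN movie.title AS title, count(DISTINCT friend) AS watchedByFollowed
ORDER BY watchedByFollowed DESC, title
LIMIT 10
```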

Developer resources: tools & libraries for Neo4j enhancement

Building a recommendation system with Neo4j offers powerful capabilities for creating personalized user experiences. The graph database structure naturally fits recommendation scenarios, making it easier to model complex relationships and query patterns.

Remember to:

Start simple

Begin with basic recommendation algorithms and gradually introduce more complex features. This allows you to establish a baseline performance and better understand your users' needs before scaling up.

Monitor performance

Track key metrics as your dataset grows, including recommendation accuracy, user engagement, and system response time. Use these insights to optimize your recommendation engine.

Maintain data quality

Implement regular data cleaning routines to remove outdated entries, fix inconsistencies, and ensure your recommendation system works with reliable, up-to-date information.

Experiment with strategies

Test different recommendation approaches to find what works best for your specific use case. A/B testing can help identify which methods drive the highest user engagement.

Gather user feedback

Collect and analyze user interactions and explicit feedback to continuously improve your recommendations. This helps ensure your system stays aligned with user preferences and expectations.
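
For the response-time part of "Monitor performance", Neo4j can show how it executes a query. The sketch below profiles the recommendation query from this article and adds an index on the property used to look up users. The index name user_first_name is a made-up example, and the index syntax shown is the form used by recent Neo4j versions; older releases may differ.

```cypher
// PROFILE runs the query and reports the operators used, rows, and database hits.
// Use EXPLAIN instead to see the plan without running the query.
PROFILE
MATCH (tom:User {firstName: "Tom"})-[:Following]->(:User)-[:Watched]->(movie:Movie)
WHERE NOT (tom)-[:Watched]->(movie)
RETURN DISTINCT movie.title AS title;

// Hypothetical index name; lets the planner find users by firstName
// without scanning every User node.
CREATE INDEX user_first_name IF NOT EXISTS
FOR (u:User) ON (u.firstName);
```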

Neo4j provides official drivers for popular programming languages, including Python, Java, JavaScript, .NET, and Go, so you can run these queries from most application stacks.
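
When a query is sent from application code, values such as the user's name are normally passed as parameters rather than concatenated into the query text. In Cypher, parameters are written with a leading $. The parameter names $firstName and $limit below are illustrative; the calling application supplies their values through the driver.

```cypher
// $firstName (string) and $limit (integer) are supplied by the caller.
MATCH (user:User {firstName: $firstName})-[:Following]->(:User)-[:Watched]->(movie:Movie)
WHERE NOT (user)-[:Watched]->(movie)
RETURN DISTINCT movie.title AS title
ORDER BY title
LIMIT $limit
```

Parameters keep user input out of the query string and let the database reuse the cached query plan across calls.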

Like what you've learned? This is just the tip of the iceberg. Our team can help you build a robust recommendation system that drives real business results. Check out our Web Development services and let's turn this potential into profit.


Mateusz Jasiński

Engineering Manager

A grown-up kid who replaced toys with tech tools - developer by day, greenkeeper by night. Bugs and weeds don’t stand a chance.
