graph database | Max De Marzi

Mar 15 2013

Java, Random

A Peek behind the Neo4j Lucene Index Curtain

Did you know you can write JavaScript in the Neo4j console to access the Neo4j API?
Try it. Open up your Neo4j Web Admin Console and type:

neo4j-sh (0)$ eval db
EmbeddedGraphDatabase [data/graph.db]

OMG! I know, Neo4j is crazy. There is so much to play with that, even after a few years at it, I haven’t dug into this area. What else can we do here?
Continue reading →

Tagged graph database, java, javascript, lucene, luke, neo4j

Feb 14 2013

6 Comments

Cypher, Deployment, Random

Neo4j and Gatling sitting in a tree, Performance T-E-S-T-ing


I was introduced to the open-source performance testing tool Gatling a few months ago by Dustin Barnes and fell in love with it. It has an easy-to-use DSL, and even though I don’t know a lick of Scala, I was able to figure out how to use it. It creates pretty awesome graphics and takes care of a lot of work for you behind the scenes. The project has great documentation and a pretty active Google group where newbies and their questions are welcome.

It ships with Scala, so all you need to do is create your tests and run them from the command line. I’ll show you how to do a few basic things: first we’ll test that everything is working, then we’ll create nodes and relationships, and then we’ll query those nodes.
Continue reading →

Tagged cypher, gatling, graph database, neo4j, nosql, performance, scala, testing

Jan 28 2013

17 Comments

Cypher, Heroku, Neography, Problems

Facebook Graph Search with Cypher and Neo4j

Update: Facebook has disabled this application

Your app is replicating core Facebook functionality.

Facebook Graph Search has given the Graph Database community a simpler way to explain what it is we do and why it matters. I wanted to drive the point home by building a proof of concept of how you could do this with Neo4j. However, I don’t have six months or much experience with NLP (natural language processing). What I do have is Cypher. Cypher is Neo4j’s graph language and it makes it easy to express what we are looking for in the graph. I needed a way to take “natural language” and create Cypher from it. This was going to be a problem.
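To make the problem concrete, here is a minimal sketch of turning one rigid phrase shape into a Cypher query. It is not the approach the full post takes; the regular expression, the `phrase_to_cypher` function, the `Person` label, the `FRIENDS` and `LIKES` relationship types, and the `$name` parameter syntax (which is from later Neo4j versions) are all assumptions I made up for illustration.

```python
import re

# Hypothetical phrase shape: "friends of <name> who like <thing>".
# Anything that does not fit this exact shape is rejected.
PATTERN = re.compile(
    r"^friends of (?P<name>.+?) who like (?P<thing>.+)$",
    re.IGNORECASE,
)


def phrase_to_cypher(phrase):
    """Return (cypher, params) for a supported phrase, or None otherwise."""
    match = PATTERN.match(phrase.strip())
    if match is None:
        return None
    # Assumed data model: (:Person)-[:FRIENDS]-(:Person)-[:LIKES]->(thing).
    cypher = (
        "MATCH (me:Person {name: $name})-[:FRIENDS]-(friend:Person)"
        "-[:LIKES]->(thing {name: $thing}) "
        "RETURN friend.name"
    )
    # Values travel as query parameters rather than being pasted into the text.
    params = {"name": match.group("name"), "thing": match.group("thing")}
    return cypher, params
```

Calling `phrase_to_cypher("Friends of Max who like Neo4j")` would give back the query text plus the parameters `name` and `thing`. A single pattern like this falls over as soon as the wording changes, which is exactly why "natural language" was going to be a problem.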
Continue reading →

Tagged cypher, facebook, github, graph database, heroku, ruby, search

Dec 14 2012

5 Comments

Deployment

Setting up a Neo4j Cluster on Amazon

There are multiple ways to set up a Neo4j Cluster on Amazon Web Services (AWS), and I want to show you one way to do it.

Overview:

Create a VPC
Launch 1 Instance
Install Neo4j HA
Clone 2 Instances
Configure the Instances
Start the Coordinators
Start the Neo4j Cluster
Create 2 Load Balancers
Next Steps

We’ll start off by logging on to Amazon Web Services and creating a Virtual Private Cloud:

Continue reading →

Tagged amazon, aws, ec2, graph database, java, load balancers, neo4j, network

Nov 27 2012

1 Comment

Java

Pathfinding with Neo4j Unmanaged Extensions

In Extending Neo4j I showed you how to create an unmanaged extension to warm up the node and relationship caches. Let’s try doing something more interesting like exposing the A* (A Star) search algorithm through the REST API. The graph we created earlier looks like this:
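If you have not met A* before, the following is a minimal, textbook sketch of the algorithm in plain Python. It is not the extension's code and not Neo4j's implementation; the adjacency-dict graph format, the `a_star` function, and the `heuristic` callback are assumptions made for illustration. The result is only guaranteed to be the cheapest path when the heuristic never overestimates the remaining cost.

```python
import heapq
import itertools


def a_star(graph, start, goal, heuristic):
    """Find a cheapest path from start to goal.

    graph: dict mapping node -> {neighbor: edge_cost}
    heuristic: function(node) -> estimated remaining cost to goal
    Returns (path, cost), or None if the goal cannot be reached.
    """
    tie_breaker = itertools.count()  # keeps heap entries comparable on ties
    frontier = [(heuristic(start), next(tie_breaker), 0.0, start, [start])]
    best_cost = {start: 0.0}
    while frontier:
        _, _, cost, node, path = heapq.heappop(frontier)
        if node == goal:
            return path, cost
        if cost > best_cost.get(node, float("inf")):
            continue  # stale entry: a cheaper route to this node was found
        for neighbor, edge_cost in graph.get(node, {}).items():
            new_cost = cost + edge_cost
            if new_cost < best_cost.get(neighbor, float("inf")):
                best_cost[neighbor] = new_cost
                priority = new_cost + heuristic(neighbor)
                heapq.heappush(
                    frontier,
                    (priority, next(tie_breaker), new_cost, neighbor, path + [neighbor]),
                )
    return None
```

Passing `lambda node: 0` as the heuristic turns this into Dijkstra's algorithm; a better estimate lets the search expand fewer nodes.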
Continue reading →

Tagged github, graph database, java, neo4j, nosql

Nov 26 2012

19 Comments

Cypher, Java

Extending Neo4j

One of the great things about Neo4j is how easy it is to extend it. You can extend Neo4j with Plugins and Unmanaged Extensions. Two great examples of plugins are the Gremlin Plugin (which lets you use the Gremlin library with Neo4j) and the Spatial Plugin (which lets you perform spatial operations like searching for data within specified regions or within a specified distance of a point of interest).

Plugins are meant to extend the capabilities of the database, nodes, or relationships. Unmanaged extensions are meant to let you do anything you want. This great power comes with great responsibility, so be careful what you do here. David Montag cooked up an unmanaged extension template for us to use on GitHub, so let’s give it a whirl. We are going to clone the project, compile it, download Neo4j, configure Neo4j to use the extension, test the extension, and tweak it a bit.
Continue reading →

Tagged cypher, extension template, github, graph database, java, mvn, neo4j, nosql, software, spatial operations, technology

Nov 14 2012

6 Comments

Cypher, Heroku, Neography, Visualization

CrunchBase on Neo4j

Neo Technology was featured on TechCrunch after raising a Series B round, and it has an entry on CrunchBase. If you look at CrunchBase closely you’ll notice it’s a graph. Who invested in what? Who co-invested? What are the common investment themes between investors? How are companies connected by board members? These are questions we can ask of the graph, and they are well suited to graph databases.
Continue reading →

Tagged cypher, github, graph database, heroku, neo4j, network, ruby, visualization

Oct 11 2012

4 Comments

Cypher, Visualization

Hubway Data Visualization Challenge with Neo4j

Michael Hunger imported the Hubway Challenge dataset into a Neo4j graph database, and made it available for us to play with.
Continue reading →

Tagged cypher, d3.js, graph database, heroku, javascript, neo4j, relationship graph, visualization

Oct 03 2012

1 Comment

Problems, Random

Hunting Trolls with Neo4j!

Allison Sparrow shared a link to Patentula, a company interested in finding better ways to explore patent data and hunt patent trolls. What caught my attention is this quote from the video below:

What we tried to do with it, is bypass any sort of keyword processing in order to find similar patents. The reason we’ve done this is to avoid the problems encountered by other systems that rely on natural language processing or semantic analysis simply because patents are built to avoid detection by similar keywords…we use network topology (specifically citation network topology) to mine the US patent database in order to predict similar documents.
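The quote does not say how the similarity is actually computed, so the following is only a sketch of one well-known topology-only measure: bibliographic coupling, scored as the Jaccard overlap of the sets of documents two patents cite. The `jaccard` and `most_similar` functions and the shape of the `citations` dict are assumptions for illustration, not Patentula's method.

```python
def jaccard(a, b):
    """Jaccard overlap of two sets: |a & b| / |a | b|, or 0.0 if both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def most_similar(citations, patent, top_n=3):
    """Rank other patents by how much their cited sets overlap with `patent`'s.

    citations: dict mapping patent id -> set of ids that patent cites.
    No keywords or text are used, only the citation structure.
    """
    target = citations[patent]
    scores = [
        (other, jaccard(target, cited))
        for other, cited in citations.items()
        if other != patent
    ]
    # Highest score first; ties are broken by patent id for stable output.
    scores.sort(key=lambda pair: (-pair[1], pair[0]))
    return scores[:top_n]
```

Two patents that never share a keyword can still score highly here if they cite the same prior art, which is the appeal of working from the network rather than the text.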

Continue reading →

Tagged graph database, network, network topology, patent, relationship graph

Sep 07 2012

1 Comment

Problems, Random

Networks, Crowds, and Markets

I’ve had “Networks, Crowds, and Markets: Reasoning About a Highly Connected World” by David Easley and Jon Kleinberg on my bookshelf for a few months now, and a conversation with a client reminded me that I hadn’t finished reading it (barely started, really). It is available from Cambridge University Press, and also on the web and in PDF format.
Continue reading →

Tagged centrality, cluster, graph, graph database, max flow, network, pagerank, six degrees of kevin bacon

Max De Marzi

Graphs, Graphs, and nothing but the Graphs

Tag Archives: graph database
