You are here

How to index Linked Data from Resource Description Framework (RDF) to Solr or Elastic Search

Import and index linked data from semantic knowledge graph for full text search, faceted search and text mining

The open source extract transform load (ETL) plugin enhance_rdf for granular indexing of Resource Description Framework (RDF) knowledge graphs is a data enrichment plugin for RDF files or RDF connector for SPARQL endpoints or triplestores.

Since files in Resource Description Framework (RDF) format are a structured graph and more like a dataset or database with multiple or many different entries than a single document, it doesn't extract and index a RDF file only as one whole single file or document like standard document files.

It is indexing the RDF data more granular by distinguishing different entities or RDF subjects:

Aggregates RDF triples to Solr or Elastic search documents

For each entity or RDF subject in all RDF triples it generates an own searchable semi-structured (for faceted search) but text based (for full text search) document with the content type knowledge graph.

Or in other words it creates for each entity or subject from the RDF graph a database row with columns for each property containing all readable and searchable text.

Therefore it index each URI/subject as Solr or Elastic Search document with its properties (RDF predicates) as facets/fields/database columns and its objects (RDF objects) as values.

Transform references like URIs to human readable and searchable labels

For full text search capabilities this Extract Transform Load (ETL) plugin transforms referenced URIs to their label(s), i.e. from properties like rdfs:label or SKOS labels from an Simple Knowledge Organisation System (SKOS) based thesaurus.

User interfaces for full text search, faceted search, fuzzy search & text analysis in linked data knowledge graphs

So you can use easy search user interfaces (UI) for full text search, interactive filters (faceted search) & text mining on Resource Description Framework (RDF) knowledge graphs

How to import and index an linked data graph from a file in Resource Description Framework (RDF)

Enable the RDF enhancer plugin (free open source software) by adding enhance_rdf to the option config['plugins'].

Then just index the RDF file by one of the following methods or tools for indexing files:

How to import and index an linked data from a graph database or triple store (SPARQL)

Please donate for implementation of Extract Transform Load (ETL) tool or connector for triple stores like Apache Jena by SPARQL queries.

Open Source tools for import RDF to Solr or Elastic Search

Alternate free software and open source tools for import of RDF data or SPARQL results to Solr or Elastic Search: