Before we kick off our posts on some of the semantic search engines available today, we felt it was important to break semantic web technology down a little and show how it actually works.
Some other folks in the sourcing world have written excellent posts on semantic search as it applies to recruiting, and rather than re-hashing what they’ve already so generously shared, we recommend reading these posts to get a good idea of how semantic search can be useful in recruiting:
But how does the semantic web actually work? We’ll try to break this rather complicated process down…
Let’s start with a visualization. Here is a diagram of how the semantic web is layered, known as the Semantic Web Stack:

Most of you are probably scratching your head at this point, saying, “I thought OWL was a type of bird.” So we’ll do our best to define these various components, with the help of The World Wide Web Consortium (W3C), Wikipedia, and a few other resources. There are specific groupings in this stack:
The Hypertext Web technologies, which are the basic foundations of the semantic web:
The Standardized Semantic Web technologies, which are the tools that have been accepted as standards for building semantic web applications:
…and the Unrealized Semantic Web technologies, which include tools that are not yet standardized or are still conceptual but are important to how the semantic web functions:
Makes perfect sense, doesn’t it? Well, not quite. So here’s how we see it, in layperson’s terms:
Each layer builds on the layers below it. It all starts with web addresses (URIs/IRIs), which identify the resources being described. Each semantic search company may draw on different sources, and those sources become the knowledge base that makes up its semantic web database.
Unicode is worked in there as a universal character encoding, so that semantic technologies can read sites no matter what language or script they were written in, and XML provides a structured format from which computers can derive information about what is written and stored on the Internet (hence, syntax).
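To make the Unicode and XML layers a little more concrete, here is a minimal Python sketch. It is only an illustration under our own assumptions: the XML snippet and its element names (`person`, `name`, `title`, `location`) are made up for this example and do not come from any real semantic search product.

```python
import xml.etree.ElementTree as ET

# The same idea written in different scripts is all just Unicode text.
names = ["recruiter", "recruteur", "リクルーター"]
encoded = [name.encode("utf-8") for name in names]  # bytes as stored or sent
decoded = [raw.decode("utf-8") for raw in encoded]  # back to text, nothing lost

# A hypothetical XML snippet describing a person (element names are made up).
xml_doc = """<person>
  <name>Jane Doe</name>
  <title>Java Developer</title>
  <location>Austin</location>
</person>"""

root = ET.fromstring(xml_doc)
# Because the structure is explicit, a program can pull out fields by name.
person = {child.tag: child.text for child in root}

print(decoded == names)
print(person["title"])
```

The point is simply that Unicode lets a program read text in any script, while XML tags tell it which piece of text is the name and which is the job title.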
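From there we get into describing information, which is the job of RDF (Resource Description Framework), your basic statement machine. RDF records each fact as a simple three-part statement made up of a subject, a predicate, and an object.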
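A rough way to picture RDF statements is as three-part tuples. The sketch below uses plain Python rather than a real RDF library, and every name in it (the `http://example.org/` addresses, `worksFor`, `hasSkill`, and the `match` helper) is hypothetical and chosen only for illustration.

```python
# Each fact is a (subject, predicate, object) triple.
# All URIs below are made up for illustration.
EX = "http://example.org/"

triples = {
    (EX + "jane", EX + "worksFor", EX + "acme"),
    (EX + "jane", EX + "hasSkill", EX + "java"),
    (EX + "raj", EX + "hasSkill", EX + "java"),
    (EX + "raj", EX + "worksFor", EX + "globex"),
}

def match(store, s=None, p=None, o=None):
    """Return the triples that fit a pattern; None acts as a wildcard."""
    return sorted(
        t for t in store
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    )

# "Who has the skill Java?" expressed as a pattern with an open subject.
java_people = [s for s, _, _ in match(triples, p=EX + "hasSkill", o=EX + "java")]
print(java_people)
```

Notice that everything, including the relationship itself, is identified by a web address. That is what lets statements from different sources be combined into one knowledge base.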
However, as anyone who studied statistics or logic in high school knows, context can change a conclusion: the surrounding words, phrases, and so forth help determine the actual meaning of the word or phrase in question. This is where RDFS, SPARQL, OWL, and RIF/SWRL come in. RDFS and OWL define the vocabulary (what kinds of things exist and how they relate to each other), SPARQL is the query language for asking questions of the data, and RIF/SWRL express rules. Together they work as a series of checks and balances based on rules, variables, known facts, and assumptions. Sort of like a science fair project.
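Here is a small sketch of how a vocabulary plus a rule can produce facts nobody wrote down explicitly. It is loosely modeled on the RDFS subclass rule and is not how any particular engine implements it. The class names, the short identifiers standing in for full web addresses, and the helper functions `infer_types` and `instances_of` are all our own assumptions.

```python
# Hypothetical facts and schema; short names stand in for full URIs.
facts = {
    ("jane", "type", "JavaDeveloper"),
    ("raj", "type", "Recruiter"),
}
schema = {
    ("JavaDeveloper", "subClassOf", "SoftwareEngineer"),
    ("SoftwareEngineer", "subClassOf", "Person"),
    ("Recruiter", "subClassOf", "Person"),
}

def infer_types(facts, schema):
    """Apply 'X type C and C subClassOf D implies X type D' until nothing new appears."""
    known = set(facts)
    changed = True
    while changed:
        changed = False
        for (x, p, c) in list(known):
            if p != "type":
                continue
            for (sub, _, sup) in schema:
                if sub == c and (x, "type", sup) not in known:
                    known.add((x, "type", sup))
                    changed = True
    return known

def instances_of(known, cls):
    """A very small stand-in for a query: who is an instance of this class?"""
    return sorted(x for (x, p, c) in known if p == "type" and c == cls)

known = infer_types(facts, schema)
print(instances_of(known, "SoftwareEngineer"))
print(instances_of(known, "Person"))
```

This is the kind of reasoning that lets a search for "software engineer" surface someone described only as a "Java developer": the match comes from the vocabulary, not from the keywords.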
Factor in unifying logic and proof to ensure your “fuzzy logic” makes the most possible sense (as far as a machine can determine), then filter through some abstract “Trust” technologies, with the Cryptography element (digital signatures on RDF statements, for example) verifying sources all along the way, and out the user interface you go!
This, of course, is a highly non-technical depiction of how the semantic web basically works. Essentially, the semantic web is like a plug-in to the existing web; it is not a replacement by any stretch of the imagination. It is a very sophisticated filtration system. And as such, semantic web tools can be developed for general search as well as proprietary use within an enterprise setting. We’ll explore a couple of these possibilities in the coming weeks.
We welcome anyone to contribute a more formal explanation than we’ve been able to provide. But hopefully this will give you a base understanding of the technology behind some of the search engines we’ll be outlining over the next month or two. We hope you will enjoy the SourceCon Semantic Search Series!