Intelligence Factory AI

SemDB: Solving the Challenges of Graph RAG

Matt Furnari, CTO • 11/21/2024

RAG • SemDB • OGAR • Buffaly

Summary & Key Insights

In the beginning there was keyword search.

Eventually word embeddings came along and we got Vector Databases and Retrieval Augmented Generation (RAG). They were good for writing blog posts about topics that sounded smart, but didn’t actually work well in the real world. Fast forward a few years and some VC-hungry individuals bolted Graph Databases onto the Vector Databases, and Graph RAG was born.

It’s still great for blog posts. Still doesn’t work well in the real world.

Enter SemDB.ai.

SemDB is an abbreviation for Semantic Database. It’s a database of “semantics” – a database of meaning. SemDB strives to go beyond mathematical tricks and triples. It stores “meaning”. It allows us to index, retrieve, and act upon data by its meaning – not just its cosine similarity.

Behind the scenes, SemDB uses Ontology-Guided Augmented Retrieval (OGAR), a leap forward that enables faster, more cost-effective, and more scalable solutions for real-world applications.

In this post we will focus on a few shortcomings of the Graph RAG approach and how SemDB solves them. For an overview of both Graph RAG and some of its problems, take a look at the article “Graph RAG Has Awesome Potential, But Currently Has Serious Flaws” by Troyusrex (Generative AI).

Advantages of Graph RAG

Graph RAG is a huge advance over traditional Vector search.

Enhanced Contextual Understanding: By leveraging graph structures, Graph RAG can capture complex relationships between entities, leading to more accurate and context-aware information retrieval. This is particularly useful for tasks requiring deep understanding and reasoning.
Improved Retrieval Precision: Graph RAG can improve retrieval precision by using graph-based indexing and retrieval methods. This ensures that the most relevant information is retrieved, even if it is buried within a large dataset.
Mitigation of Hallucination: Traditional language models sometimes generate "hallucinated" information, which is not accurate or relevant. Graph RAG helps mitigate this issue by referencing structured knowledge bases, ensuring the generated content is grounded in factual data.
Domain-Specific Knowledge: Graph RAG can be tailored to specific domains by incorporating domain-specific knowledge graphs, making it highly effective for specialized applications such as legal research, medical diagnostics, and technical documentation.


Problems with Graph RAG

But real-world Graph RAG applications have several significant problems:

Speed: Graph RAG is horrendously slow in practice, often taking minutes to respond.
Cost: Data preparation can cost many thousands of dollars for moderately sized datasets.
Scalability: The reliance on clustered communities makes scaling challenging.
Accuracy: Testing has shown little increase in search accuracy compared to traditional RAG.


SemDB to the Rescue

If the progression has been

Keyword Search → Vector Search (RAG) → Graph Search (Graph RAG)

Then let’s skip ahead a few progressions and get to the end:

Keyword Search → Vector Search (RAG) → Graph Search (Graph RAG) → ??? → OGAR (Ontology-Guided Augmented Retrieval)

You gotta admit, it’s an awesome acronym, right? OGAR... Grrr.

Vector Search and Graph RAG both attempt to let us search by meaning. Before the arrival of ChatGPT, scientists used to think about questions like “How do we represent meaning? What does it mean ‘to mean’?” There is a rich history of meaning representation that goes beyond word embeddings (vectors) and triples (graphs). Unfortunately, it’s now easier to outsource every task to a multi-hundred-gigabyte neural network than it is to write code. When all you have is an LLM, everything looks like a prompt engineering task.

In contrast to Graph RAG, Semantic Database (SemDB) is designed to handle complexity effortlessly. Its ontology-driven framework and Local Understanding solve the problems of Graph RAG.


Local Understanding

As I previously mentioned, not everything needs to be outsourced to ChatGPT. SemDB is able to understand somewhere around 80-90% of sentence inputs without the use of an LLM. That means it can do 80-90% of the processing work without paying a per-token fee.
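To make the "local first, LLM only when needed" idea concrete, here is a minimal sketch. It is not SemDB's implementation: the vocabulary, the stopword list, the coverage threshold, and the `call_llm` placeholder are all hypothetical names invented for illustration. It only shows the routing pattern, where a sentence is handled by cheap dictionary lookups when enough of it is recognized and is sent to an external model otherwise.

```python
# Hypothetical organization vocabulary: lowercase word -> concept label.
VOCAB = {
    "patient": "Patient",
    "enrolled": "Enrollment",
    "rpm": "RemotePatientMonitoring",
}

# Small illustrative stopword list; a real system would use something richer.
STOPWORDS = {"the", "a", "an", "in", "is", "was", "for", "of", "to"}


def local_parse(sentence, vocab):
    """Return (concepts, coverage) using only local dictionary lookups."""
    words = [w.strip(".,!?").lower() for w in sentence.split()]
    content = [w for w in words if w and w not in STOPWORDS]
    concepts = [vocab[w] for w in content if w in vocab]
    coverage = len(concepts) / len(content) if content else 0.0
    return concepts, coverage


def call_llm(sentence):
    """Placeholder for an external, per-token-billed model call (assumption)."""
    return ["<llm-extracted>"]


def understand(sentence, vocab, threshold=0.6):
    """Route a sentence: stay local if coverage is high enough, else use the LLM."""
    concepts, coverage = local_parse(sentence, vocab)
    if coverage >= threshold:
        return "local", concepts
    return "llm", call_llm(sentence)


route_known, concepts_known = understand("The patient was enrolled in RPM.", VOCAB)
route_unknown, _ = understand("Quarterly synergy metrics look ambiguous.", VOCAB)
print(route_known, concepts_known)
print(route_unknown)
```

In this sketch the first sentence is fully covered by the vocabulary and never leaves the machine, while the second falls through to the placeholder LLM call. The threshold is the knob that trades cost against coverage.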

One of the greatest challenges with traditional Graph RAG systems is the prohibitively high cost of entity extraction, driven by heavy reliance on LLMs. Each data chunk and cluster requires multiple LLM calls, quickly adding up to tens of thousands of dollars for large datasets. SemDB, however, does most of this work locally, without involving Big Brother OpenAI.

Why is that important?

Cost: Fewer LLM calls mean less $$$.
Accuracy: Local Understanding allows for organization-specific vocabularies.
Speed: Local Understanding means local processing… and that’s fast.
Security: Not every piece of data needs to be sent to our AI overlords so that they may use it to train their next models.
Note: OpenAI and Google both super-duper promise not to ever use your data to train their models. Seriously, they pinky-swore and everything.


Cost Advantages of Local Understanding

With Local Understanding, SemDB significantly reduces the dependency on costly LLM calls, allowing organizations to process larger datasets at a fraction of the price:

Reduced External LLM Calls:
Traditional systems require 1 LLM call per data chunk and 1 per cluster. SemDB’s Local Understanding handles these tasks algorithmically, bypassing the need for external calls entirely.
This approach slashes costs, making large-scale projects financially viable.
Scalable Data Extraction:
Because Local Understanding operates within the organization’s infrastructure, there is no incremental cost for scaling. SemDB can handle datasets with millions of entities without ballooning expenses.
For example, where traditional methods might cost $60,000 for a million records, SemDB achieves the same results at a fraction of the cost, with no ceiling on dataset size or complexity.
Optimized Processing for Domain-Specific Graphs:
By tailoring its Local Understanding capabilities to the specific needs of the organization, SemDB enables the creation of more complex, richly detailed graphs without incurring additional costs.
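The arithmetic behind these claims can be sketched in a few lines. The $60,000 per million records figure and the "one call per chunk, one per cluster" rule come from the text above. Everything else here is an assumption for illustration: the function names, the cluster count, and in particular the `local_fraction` of 0.85, which simply borrows the midpoint of the 80-90% Local Understanding range and treats it as the share of calls avoided.

```python
def implied_cost_per_record(total_cost, records):
    """Per-record cost implied by a total bill (e.g. $60,000 / 1,000,000 records)."""
    return total_cost / records


def external_llm_calls(chunks, clusters):
    """One LLM call per data chunk plus one per cluster, as described above."""
    return chunks + clusters


def estimated_cost(calls, cost_per_call, local_fraction=0.0):
    """Cost of the calls that still go to an external LLM.

    local_fraction is the (assumed) share of calls handled locally at no
    per-call fee; 0.0 models a traditional pipeline.
    """
    if not 0.0 <= local_fraction <= 1.0:
        raise ValueError("local_fraction must be between 0 and 1")
    remote_calls = calls - round(calls * local_fraction)
    return round(remote_calls * cost_per_call, 2)


per_record = implied_cost_per_record(60_000, 1_000_000)
baseline = estimated_cost(1_000_000, per_record)
mostly_local = estimated_cost(1_000_000, per_record, local_fraction=0.85)
calls_example = external_llm_calls(chunks=1_000_000, clusters=50_000)  # cluster count is made up

print(f"implied cost per record: ${per_record:.2f}")
print(f"baseline: ${baseline:,.2f}  mostly local: ${mostly_local:,.2f}")
print(f"calls for 1,000,000 chunks and 50,000 clusters: {calls_example:,}")
```

The point of the sketch is the shape of the curve, not the exact dollar amounts: cost scales linearly with the number of external calls, so every call handled locally comes straight off the bill.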


Beyond Cost Savings: Enabling Richer Graphs

SemDB’s ability to extract more data for less cost doesn’t just save money—it also empowers organizations to build bigger, more detailed, and more accurate graphs:

Incorporating Nuanced Relationships: Local Understanding allows SemDB to detect subtle, domain-specific relationships that external systems might overlook, enriching the knowledge graph with deeper insights.
Expanding Data Coverage: By lowering costs, organizations can afford to process larger datasets, capturing more entities and relationships that drive value.
Iterative Improvement: SemDB’s architecture allows for ongoing refinement of graphs as new data becomes available, further enhancing accuracy and depth.
Organization-Specific Vocabularies: Every company has its own lingo, vocabulary, and internal speak that the LLMs don’t fully understand. SemDB is able to capture that meaning, store it, and operate upon it like any other semantic nugget.

Organizations form their own vocabularies
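One way to picture an organization-specific vocabulary is as a lexicon of internal phrases attached to concepts in a small is-a hierarchy. The sketch below is an illustration only, not SemDB's data model: the `Ontology` class, its methods, and the example terms ("blue sheet", `IntakeForm`) are all hypothetical.

```python
class Ontology:
    """Toy ontology: concepts with a single parent, plus a phrase lexicon."""

    def __init__(self):
        self.parent = {}   # concept -> parent concept (None for roots)
        self.lexicon = {}  # lowercase phrase -> concept

    def add_concept(self, concept, parent=None):
        if parent is not None and parent not in self.parent:
            raise KeyError(f"unknown parent concept: {parent}")
        self.parent[concept] = parent

    def add_term(self, phrase, concept):
        """Attach an internal phrase (company lingo) to an existing concept."""
        if concept not in self.parent:
            raise KeyError(f"unknown concept: {concept}")
        self.lexicon[phrase.lower()] = concept

    def resolve(self, phrase):
        """Map a phrase to its concept, or None if the phrase is unknown."""
        return self.lexicon.get(phrase.lower())

    def ancestors(self, concept):
        """Walk the is-a chain from a concept up to its root."""
        chain = []
        current = self.parent.get(concept)
        while current is not None:
            chain.append(current)
            current = self.parent.get(current)
        return chain

    def is_a(self, concept, candidate):
        return concept == candidate or candidate in self.ancestors(concept)


onto = Ontology()
onto.add_concept("Document")
onto.add_concept("Form", parent="Document")
onto.add_concept("IntakeForm", parent="Form")
onto.add_term("blue sheet", "IntakeForm")  # hypothetical office slang

concept = onto.resolve("Blue Sheet")
print(concept, onto.ancestors(concept))
```

Once "blue sheet" resolves to `IntakeForm`, a query about documents or forms can match it through the hierarchy, which is something a generic model has no way to know about one office's slang.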

Conclusion

At Intelligence Factory we use SemDB as the backbone of our applications. It allows us to build complex graphs for various domains. Honestly, our customers don’t care one bit about the advantages of Ontologies over Graphs. Some projects we’ve built on SemDB:

HIPAA-Compliant Chat Bots: Bots that don’t hallucinate or give dieting advice to anorexics.
Sales Tools: To mine thousands of conversations for missed opportunities.

What’s most important, however, is that you can take advantage of these technologies with our consumer-focused products: FeedingFrenzy.ai and SemDB.ai. Both are built on this infrastructure and offer features that make running your business easier. For the more technical side of things, feel free to check out Buffa.ly.
