Jaepil Jeong | Blog


The Missing Layer: A Structural Gap in the AI Infrastructure Stack

by Jaepil Jeong | April 10, 2026

Between models and hardware lies an invisible structural gap — a data layer that should exist but doesn't. We examine why the current AI infrastructure stack is fragmented, why this is a structural problem rather than a transitional one, and what conditions a proper solution must satisfy. We then present UQA and the Cognica engine as one concrete response to these conditions.


Tech | 5 min read

Graph Queries in a Unified Database: From Cypher to Posting Lists

by Jaepil Jeong | March 26, 2026

Graph databases solve relationship-heavy problems elegantly, but adding a separate graph system alongside your relational database creates operational complexity. We explain how Cognica integrates graph queries into its unified algebra, enabling Cypher and SQL to compose in a single transaction without data duplication.


Research | 7 min read

Vector Scores Are Not Probabilities: Likelihood Ratio Calibration for Hybrid Search

by Jaepil Jeong | March 25, 2026

A cosine similarity of 0.85 tells you an angle, not a probability. We show how to transform vector similarity scores into calibrated relevance probabilities using distributional statistics that ANN indexes already compute — completing the probabilistic unification of text and vector retrieval.


Research | 11 min read

Why Sigmoid? The Mathematical Inevitability Behind Bayesian BM25

by Jaepil Jeong | February 23, 2026

Sigmoid is not a design choice — it is a mathematical theorem. We show why the sigmoid function is the unique valid transform for converting BM25 scores to probabilities, completing Robertson's Probability Ranking Principle after 50 years.


Tech | 15 min read

Building a Probabilistic Search Engine: Bayesian BM25 and Hybrid Search

by Jaepil Jeong | February 1, 2026

Modern search systems struggle to combine lexical matching with semantic understanding. We explore how we built a probabilistic ranking framework in Cognica Database that transforms BM25 scores into calibrated probabilities, enabling principled fusion of text and vector search results.


Tech | 18 min read

JIT Toolchain: Building a Disassembler and CPU Emulator for Database Development

by Jaepil Jeong | January 19, 2026

This post covers the essential infrastructure that makes Copy-and-Patch JIT development and debugging practical. We explore a multi-architecture disassembler for validation and a software CPU emulator for cross-platform testing and debugging.


Tech | 16 min read

Copy-and-Patch JIT: Achieving Native Code Performance with Microsecond Compilation

by Jaepil Jeong | January 17, 2026

How the Cognica Database Engine breaks the JIT compilation latency barrier. We explore Copy-and-Patch JIT compilation, a technique that achieves a 2-10x speedup over interpretation while keeping compilation time under one millisecond per kilobyte of bytecode.


Engineering | 8 min read

Why NOT Operations are Difficult in Vector Search

by Jaepil Jeong | February 3, 2025

We discuss why NOT operations are difficult in vector search.


Research | 4 min read

Momentarily Reconstructed Contexts: A New Approach to LLM Usage

by Jaepil Jeong | December 15, 2024

We propose a new approach to LLM usage by momentarily reconstructing the context.


Company | 4 min read

Starting a New Journey for the Big Ideas

by Jaepil Jeong | June 13, 2023

We know that terms like big data, data lakes, and web-scale sound fancy and attractive, but they are everyday concerns for very few of us. Most companies will never deal with petabyte-scale data. Let's be practical and stay grounded. Most companies just need a simple but powerful database system to solve real problems. We are here to build a product for most companies, not just for unicorns. Our mission is to solve the common problems associated with existing database systems and to simplify software development by keeping your software stack as simple as possible.
