Insights23 min read
The Missing Layer: A Structural Gap in the AI Infrastructure Stack
by Jaepil Jeong | April 10, 2026
Between models and hardware lies an invisible structural gap — a data layer that should exist but doesn't. We examine why the current AI infrastructure stack is fragmented, why this is a structural problem rather than a transitional one, and what conditions a proper solution must satisfy. We then present UQA and the Cognica engine as one concrete response to these conditions.
Read Post
Tech5 min read
Graph Queries in a Unified Database: From Cypher to Posting Lists
by Jaepil Jeong | March 26, 2026
Graph databases solve relationship-heavy problems elegantly, but adding a separate graph system alongside your relational database creates operational complexity. We explain how Cognica integrates graph queries into its unified algebra, enabling Cypher and SQL to compose in a single transaction without data duplication.
Read Post
Research7 min read
Vector Scores Are Not Probabilities: Likelihood Ratio Calibration for Hybrid Search
by Jaepil Jeong | March 25, 2026
A cosine similarity of 0.85 tells you an angle, not a probability. We show how to transform vector similarity scores into calibrated relevance probabilities using distributional statistics that ANN indexes already compute — completing the probabilistic unification of text and vector retrieval.
Read Post
Research11 min read
Why Sigmoid? The Mathematical Inevitability Behind Bayesian BM25
by Jaepil Jeong | February 23, 2026
Sigmoid is not a design choice — it is a mathematical theorem. We show why the sigmoid function is the unique valid transform for converting BM25 scores to probabilities, completing Robertson's Probability Ranking Principle after 50 years.
Read Post
Tech15 min read
Building a Probabilistic Search Engine: Bayesian BM25 and Hybrid Search
by Jaepil Jeong | February 1, 2026
Modern search systems struggle to combine lexical matching with semantic understanding. We explore how we built a probabilistic ranking framework in Cognica Database that transforms BM25 scores into calibrated probabilities, enabling principled fusion of text and vector search results.
Read Post
Tech18 min read
JIT Toolchain: Building a Disassembler and CPU Emulator for Database Development
by Jaepil Jeong | January 19, 2026
The essential infrastructure that makes Copy-and-Patch JIT development and debugging practical. We explore the multi-architecture disassembler for validation and software CPU emulator for cross-platform testing and debugging.
Read Post
Tech16 min read
Copy-and-Patch JIT: Achieving Native Code Performance with Microsecond Compilation
by Jaepil Jeong | January 17, 2026
How Cognica Database Engine breaks the JIT compilation latency barrier. We explore Copy-and-Patch JIT compilation, a technique that achieves 2-10x speedup over interpretation while keeping compilation time under one millisecond per kilobyte of bytecode.
Read Post
Engineering8 min read
Why NOT Operations are Difficult in Vector Search
by Jaepil Jeong | February 3, 2025
We discuss why NOT operations are difficult in vector search.
Read Post
Research4 min read
Momentarily Reconstructed Contexts: A New Approach to LLM Usage
by Jaepil Jeong | December 15, 2024
We propose a new approach to LLM usage by momentarily reconstructing the context.
Read Post
Company4 min read
Starting a New Journey for the Big Ideas
by Jaepil Jeong | June 13, 2023
We know that terms like big data, data lakes, and web-scale are fancy and attractive, but those are only everyday issues for very few of us. Most companies will never deal with the petabytes scale of the data. Let's be practical and stay on the ground. Most companies just need a simple but powerful database system to solve real problems. We are here to build a product for most companies, not just for unicorns. Our mission is to solve the common problems often associated with existing database systems and simplify software development by keeping your software stacks as simple as possible.







