Sigmoid is not a design choice — it is a mathematical theorem. We show why the sigmoid function is the unique valid transform for converting BM25 scores to probabilities, completing Robertson's Probability Ranking Principle after 50 years.

Read Post

We propose a new approach to LLM usage by momentarily reconstructing the context.

Read Post