2024-01-01

Sublinear Time Algorithms

Sublinear time algorithms represent a paradigm shift in computational efficiency, allowing processing of extremely large datasets by reading only a fraction of the input. While exact deterministic sublinear algorithms exist for some problems, most solutions require randomization and approximation techniques, with applications spanning optimization, property testing, and distribution analysis.

Original archive.is archive.ph web.archive.org

Log in to get one-click access to archived versions of this article.

read comments on news aggregators:

Related articles

Markov Chains explained visually

Markov chains are mathematical systems that model transitions between different states with associated probabilities, represented through transition matrices or diagrams. The concept finds practical applications in various fields, from weather prediction to Google's PageRank algorithm, with the ability to simulate real-world phenomena by incorporating probabilistic state transitions.

Postgres as a Graph Database: (Ab)using pgRouting

pgRouting, a Postgres extension typically used for geospatial routing, offers versatile graph functionality beyond GIS applications. The extension can handle task scheduling, resource allocation in distributed systems, and power recommendation engines, making it a lightweight alternative to specialized graph databases.

Why We Designed TigerBeetle's Docs from Scratch | TigerBeetle Blog

TigerBeetle rebuilt their documentation site from scratch, moving away from Docusaurus to achieve better performance, simplicity, and integration with their zero-dependency philosophy. The new implementation uses Zig and Pandoc, resulting in a 10x reduction in footprint while maintaining functionality and adding features like integrated search and offline capabilities.

Relaxed Radix Balanced Trees | Peter Horne-Khan

An in-depth exploration of Relaxed Radix Balanced (RRB) Trees explains their implementation for immutable vectors, focusing on efficient merging operations. The data structure combines radix search with size tables to maintain balanced trees while allowing flexible node sizes, offering improved performance over traditional Persistent Vectors.

0+0 > 0: C++ thread-local storage performance

An in-depth analysis of thread-local storage (TLS) performance in C++, examining how different implementations and contexts affect access speed. Core findings show that TLS access is fastest in executables without constructors, while shared libraries and constructors significantly degrade performance due to complex initialization and addressing mechanisms.

On word embeddings - Part 3: The secret ingredients of word2vec

An in-depth analysis reveals that word embedding models like word2vec aren't inherently superior to traditional distributional semantic methods, with hyperparameter optimization being more crucial than algorithm choice. The study demonstrates that Singular Value Decomposition (SVD) often outperforms popular embedding methods in word similarity tasks, while Skip-gram Negative Sampling (SGNS) excels in analogy tasks.

The Largest Sofa You Can Move Around a Corner | Quanta Magazine

A mathematician has proven that Gerver's sofa shape, with an area of approximately 2.2195, is the largest possible shape that can move around a 90-degree corner in a hallway, solving a 60-year-old mathematical problem without computer assistance. Jineon Baek's elegant proof introduces new mathematical techniques that could help solve other optimization problems.

Tiny Pointers

A novel data structure called 'tiny pointer' is introduced, enabling the replacement of traditional logn-bit pointers with o(logn)-bit pointers while maintaining constant-factor time overhead. The research presents optimal constructions for both fixed-size and variable-size tiny pointers, demonstrating practical applications in five classic data structure problems.

Writing My Own Dithering Algorithm in Racket

An exploration of image dithering techniques demonstrates how to convert color images to black-and-white using various algorithms, including a novel approach that balances between Atkinson and Floyd-Steinberg methods. The author developed this technique while building Guten, a custom newspaper printer that requires monochrome images, showcasing practical applications of dithering in modern technology.

Pushing to Github after a shallow clone is horribly slow

Git's shallow clones can inadvertently disable important optimizations during push operations, particularly affecting the first push in a repository. Using '--depth 2' when cloning maintains one un-mangled commit between the shallow graft point and branch tip, enabling faster initial pushes by preserving Git's optimization capabilities.