51eshare's Tech Blog
This site is dedicated to sharing IT-related technologies.
Managing Concurrent Delta Lake Writes from a Multi-User Web Application Managing Concurrent Delta Lake Writes from a Multi-User Web Application
The core challenge began with a simple, yet deceptive, requirement: allow our data science team to collaboratively clean
Building a Reproducible ML Training Pipeline with Apache Iceberg as a Versioned Dataset on DigitalOcean Building a Reproducible ML Training Pipeline with Apache Iceberg as a Versioned Dataset on DigitalOcean
The project started with a familiar, sinking feeling. We were retraining a computer vision model in Keras, built six mon
Constructing an Asynchronous NLP Pipeline for Document Similarity Alerts with Spring Boot, SNS, and ChromaDB Constructing an Asynchronous NLP Pipeline for Document Similarity Alerts with Spring Boot, SNS, and ChromaDB
The initial system was deceptively simple and fundamentally broken for any real-world load. A user would upload a docume
Implementing a Fault-Tolerant Replicated State Machine Using Raft and Embedded SQLite Implementing a Fault-Tolerant Replicated State Machine Using Raft and Embedded SQLite
The project required a fault-tolerant data store with SQL capabilities for a set of services running in a resource-const
51 / 52