51eshare's Tech Blog
This site is dedicated to sharing IT-related technologies.
Implementing an End-to-End Exactly-Once WebSocket Delivery System for Apache Spark Streaming Data Implementing an End-to-End Exactly-Once WebSocket Delivery System for Apache Spark Streaming Data
The initial system was straightforward, almost naive. A real-time operations dashboard, designed to give stakeholders a
Building a Transactional Data Lakehouse Ingest Pipeline with TiDB CDC NATS and Apache Hudi Building a Transactional Data Lakehouse Ingest Pipeline with TiDB CDC NATS and Apache Hudi
The 24-hour latency on our core analytics dataset was no longer acceptable. Our batch ETL jobs, pulling snapshots from a
Implementing a Real-Time Feature Pipeline with FastAPI Cassandra and PyTorch for Mobile Personalization Implementing a Real-Time Feature Pipeline with FastAPI Cassandra and PyTorch for Mobile Personalization
The technical pain point was latency. Our initial personalization engine ran server-side. A user action in the Flutter a
Integrating a Hadoop Batch Feature Pipeline with a Real-Time FastAPI Serving Layer Integrating a Hadoop Batch Feature Pipeline with a Real-Time FastAPI Serving Layer
The core technical pain point was training-serving skew. Our machine learning models, trained on batch features computed
Constructing an Idempotent Batching Consumer for Bridging a REST API and HBase via RabbitMQ Constructing an Idempotent Batching Consumer for Bridging a REST API and HBase via RabbitMQ
The initial system architecture was straightforward, almost deceptively so. A fleet of RESTful API endpoints, built with
4 / 7