Cloudflare recently detailed how it manages its servers’ egress traffic using “soft-unicast”. Soft-unicast allows multiple servers to share a single IPv4 address for their egress traffic while redirecting the response packets to the correct physical server. It provides a scalable, cost-efficient solution for Cloudflare to offer various products that require tagged egress IP addresses.
Uber recently published how it dramatically reduced its logging costs using Compressed Log Processor (CLP). CLP is a tool capable of losslessly compressing text logs and searching them without decompression. It achieved a 169x compression ratio on Uber’s log data, saving storage, memory, and disk/network bandwidth.
Netflix recently published how it built Timestone, a custom high-throughput, low-latency priority queueing system. Netflix built the queueing system using open-source components such as Redis, Apache Kafka, Apache Flink, and Elasticsearch. Engineers state that they built Timestone because they could not find an off-the-shelf solution that met all of their requirements.
Atlassian recently published how it exceeded 99.9999% availability with its Tenant Context Service. Atlassian achieved this high availability by implementing highly autonomous client sidecars that can proactively shield themselves from complete AWS region failures. To accomplish this, sidecars query multiple services concurrently and ensure that requests are entirely isolated internally.
Slack recently published how it implemented the Circuit Breaker pattern to improve its CI/CD pipeline availability. Before this project, engineers at Slack faced challenges because peak request volumes in internal tooling caused cascading failures in dependent systems. Since completion, engineers have seen increased service availability and fewer bad developer experiences, such as flakiness from failing services.
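For readers unfamiliar with the Circuit Breaker pattern, the following is a minimal, generic sketch of the idea and not Slack's implementation. The class name, the `failure_threshold` and `reset_timeout` parameters, and the injectable `clock` are all assumptions made for illustration. After a run of consecutive failures the breaker "opens" and rejects calls immediately, which stops a struggling dependency from being hit with more traffic; after a cool-down it lets one trial call through.

```python
import time


class CircuitOpenError(Exception):
    """Raised when a call is rejected because the circuit is open."""


class CircuitBreaker:
    """Hypothetical minimal breaker: closed -> open -> half-open -> closed."""

    def __init__(self, failure_threshold=3, reset_timeout=30.0, clock=time.monotonic):
        self.failure_threshold = failure_threshold  # consecutive failures before opening
        self.reset_timeout = reset_timeout          # seconds to wait before a trial call
        self.clock = clock                          # injectable so tests can fake time
        self.failures = 0
        self.opened_at = None                       # None means the circuit is closed

    @property
    def state(self):
        if self.opened_at is None:
            return "closed"
        if self.clock() - self.opened_at >= self.reset_timeout:
            return "half-open"
        return "open"

    def call(self, func, *args, **kwargs):
        state = self.state
        if state == "open":
            # Fail fast instead of adding load to an unhealthy dependency.
            raise CircuitOpenError("dependency unavailable, call skipped")
        try:
            result = func(*args, **kwargs)
        except Exception:
            self.failures += 1
            # A failed trial call in half-open state reopens the circuit at once.
            if state == "half-open" or self.failures >= self.failure_threshold:
                self.opened_at = self.clock()
            raise
        # Any success closes the circuit and clears the failure count.
        self.failures = 0
        self.opened_at = None
        return result
```

A caller would wrap each request to a dependency as `breaker.call(fetch_status, job_id)` (both names hypothetical) and treat `CircuitOpenError` as a signal to back off or defer the work.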
Spotify engineers recently published how they standardized architecture diagrams at the company. They defined a standard system model named the Spotify Software Model and adapted the C4 model to visualize it. This combination created a shared language used across the organization, which helps communication, aids decision-making, and supports the evolution of Spotify’s software.
Uber recently introduced Unified Signup and Login (USL), an effort to consolidate signup and login experiences across all Uber apps and services. USL lowers engineering complexity and maintenance overhead and allows faster rollout of security policies and fixes. Uber has rolled out USL over the last two years, and more than 78% of Uber’s traffic now uses it.
Lightbend recently launched Kalix, a new PaaS offering for building cloud-native, business-critical applications using any programming language with no databases. Kalix is a unified application layer that pulls together the necessary pieces for writing software and abstracts their implementation details. Lightbend intends for it to provide developers with an innovative NoOps developer experience.
Netflix engineers recently published how they built Studio Search, using Apache Kafka streams, an Apache Flink-based Data Mesh process, and Elasticsearch to manage the index. They designed the platform to take a portion of Netflix’s federated GraphQL graph and make it searchable. Today, Studio Search powers a significant portion of the user experience for many applications within the organisation.
During the past month, I enjoyed participating in InfoQ’s yearly Software Architecture & Design Trends report. The key takeaways of this report are: The complete report is available here. Along with the report, we published an accompanying podcast episode. This podcast episode is a chance to listen in on part of the editors’ conversation and hear…
Dropbox recently published how it made the camera upload process for Android faster and more reliable. Dropbox engineers removed shared Android and iOS C++ code and replaced it with a platform-native Kotlin implementation. The engineers are pleased with the decision to rewrite the process, stating that error rates went down and upload performance greatly improved….
LinkedIn recently published how it handles overload detection and remediation in its microservices. Its solution, Hodor, is adaptive and works out of the box with no configuration. It is a platform-agnostic mechanism to run overload detectors and load shedders inside the monitored process that samples load and sheds traffic from within the application’s…