Tech News

Hosting and Consuming the Media Feature Computation Model at Netflix

Netflix Uses Metaflow to Manage Hundreds of AI/ML Applications at Scale

Netflix recently published how its Machine Learning Platform (MLP) team provides an ecosystem around Metaflow, an open-source machine learning infrastructure framework. By creating various integrations for Metaflow, Netflix is able to support hundreds of Metaflow projects maintained by multiple engineering teams. Read more on InfoQ

An architecture diagram illustrating the cache-aside pattern implemented in CacheFront

Uber’s CacheFront: Powering 40M Reads per Second with Significantly Reduced Latency

Uber developed an innovative caching solution, CacheFront, for its in-house distributed database, Docstore. CacheFront enables over 40M reads per second from online storage and achieves substantial performance improvements, including a 75% reduction in P75 latency and over 67% reduction in P99.9 latency, demonstrating its effectiveness in enhancing system efficiency and scalability. Read more on InfoQ

A diagram depicting Zone-aware routing between pods, where traffic is routed between pods in the same zone

DoorDash Uses Service Mesh and Cell-Based Architecture to Significantly Reduce Data Transfer Costs

In a recent move, DoorDash has significantly optimized its cloud infrastructure costs. The company faced increased cross-AZ data transfer costs when transitioning to a microservices architecture. To substantially reduce this cost, DoorDash implemented zone-aware routing with its Envoy-based service mesh, taking advantage of its Cell-Based Architecture. Read more on InfoQ

An architecture diagram depicting how asynchronous data redaction jobs work with Kafka topics.

Privacy Engineering at Scale: DoorDash’s Journey in Geomasking and Data Protection

DoorDash recently published how it proactively embeds privacy into its products. It explains the importance of Privacy Engineering, an often overlooked software architecture practice, and provides an example of geomasking users’ address data to protect their privacy better. Read more on InfoQ

A conceptual diagram showing how Verifiable Credentials work

Revolutionizing Digital Identity: How Verifiable Credentials Offer a New Era of Privacy and Control

Auth0 recently published an in-depth explanation of Verifiable Credentials (VCs). The article emphasizes the potential of VCs to transform how identities are managed online. It highlights the limitations of current identity systems and how VCs can address these gaps, particularly in allowing identity claims to be disclosed without issuers knowing, thereby enhancing privacy and control…

The architecture diagram for Reddit's REV2 system

Reddit Unveils REV2: Modernised Rule-Execution with Kubernetes, Kafka, and Flink Stateful Functions

Reddit’s Safety Engineering team recently published how it modernised its Rule-Execution system, which detects and acts on policy-violating content in real time. The new architecture includes improvements like transitioning from legacy EC2-based systems to Kubernetes, better rule version control with Github and S3 storage, and the capability to scale more efficiently with Flink Stateful Functions. Read…

An architecture diagram of LinkedIn's Iris escalation management system with iris-message-processor.

LinkedIn’s Open-Source “iris-message-processor” Achieves 86.6x Faster Escalation Management Speeds

LinkedIn developed a new open-source service called “iris-message-processor” to enhance the performance and reliability of its existing Iris escalation management system. “iris-message-processor” significantly improves processing speeds, being ~4.6x faster under average loads and ~86.6x faster under high loads than its predecessor. Read more on InfoQ

A diagram depicting the InfluxDB 3.0 software architecture.

Inside InfluxDB 3.0: Exploring InfluxDB’s Scalable and Decoupled Architecture

InfluxData recently unveiled the system architecture for InfluxDB 3.0, its newest time-series DB. Its architecture encompasses four major components responsible for data ingestion, querying, compaction, and garbage collection and includes two main storage types. The architecture caters to operating the DB on-premise and natively on major cloud providers. Read more on InfoQ

The word "Password" in an illustration.

Implementing Application Level Encryption at Scale: Insights from Atlassian’s Use of AWS and Cryptor

Atlassian recently published how it performs Application Level Encryption at scale on AWS while utilising high cache hit rates and maintaining low costs. Atlassian’s solution runs over 12,500 instances and manages over 1,540 KMS keys. It performs over 11 billion decryptions and 811 million encryptions daily, costing $2,500 per month versus a potential $1,000,000 per…

An architecture illustration showing how LIquid is used for online recommendations

LinkedIn’s LIquid Graph Database: Scaling Real-Time Data Access for 930+ Million Members

LinkedIn recently published how LIquid, its graph database, automates the indexing and real-time access of all connections to members, schools, skills, companies, positions, jobs, events, etc. This knowledge graph, known as the Economic Graph, has 270 billion edges and growing, currently handling a workload of 2 million queries per second. Read more on InfoQ

The C4 Context diagram of StackFeed.io

StackFeed: Cloud Service Updates as a Service

In the fast-paced field of cloud services, staying current with service updates is often challenging. The recently announced StackFeed aims to address this issue, especially for software architects managing multi-cloud architectures. Architects select cloud services they’re interested in, and StackFeed generates a customised service update feed consumable using an RSS reader, Slack or Teams. Read…

An architecture diagram of message traversal in the Slack software system

Real-Time Messaging Architecture at Slack

Slack recently described how it sends millions of messages daily in real-time across the globe. The company provides a comprehensive insight into its architecture, designed to manage real-time messages at scale. It highlights the unique challenges posed by delivering real-time messages across different time zones and regions and how Slack’s engineers designed the infrastructure to…