Tech News

Netflix Uses Metaflow to Manage Hundreds of AI/ML Applications at Scale

Netflix Uses Metaflow to Manage Hundreds of AI/ML Applications at Scale

Netflix recently published how its Machine Learning Platform (MLP) team provides an ecosystem around Metaflow, an open-source machine learning infrastructure framework. By creating various integrations for Metaflow, Netflix is able to support hundreds of Metaflow projects maintained by multiple engineering teams. Read more on InfoQ

Uber’s CacheFront: Powering 40M Reads per Second with Significantly Reduced Latency

Uber’s CacheFront: Powering 40M Reads per Second with Significantly Reduced Latency

Uber developed an innovative caching solution, CacheFront, for its in-house distributed database, Docstore. CacheFront enables over 40M reads per second from online storage and achieves substantial performance improvements, including a 75% reduction in P75 latency and over 67% reduction in P99.9 latency, demonstrating its effectiveness in enhancing system efficiency and scalability. Read more on InfoQ

DoorDash Uses Service Mesh and Cell-Based Architecture to Significantly Reduce Data Transfer Costs

DoorDash Uses Service Mesh and Cell-Based Architecture to Significantly Reduce Data Transfer Costs

In a recent move, DoorDash has significantly optimized its cloud infrastructure costs. The company faced increased cross-AZ data transfer costs when transitioning to a microservices architecture. To substantially reduce this cost, DoorDash implemented zone-aware routing with its Envoy-based service mesh, taking advantage of its Cell-Based Architecture. Read more on InfoQ

Privacy Engineering at Scale: DoorDash’s Journey in Geomasking and Data Protection

Privacy Engineering at Scale: DoorDash’s Journey in Geomasking and Data Protection

DoorDash recently published how it proactively embeds privacy into its products. It explains the importance of Privacy Engineering, an often overlooked software architecture practice, and provides an example of geomasking users’ address data to protect their privacy better. Read more on InfoQ

Revolutionizing Digital Identity: How Verifiable Credentials Offer a New Era of Privacy and Control

Revolutionizing Digital Identity: How Verifiable Credentials Offer a New Era of Privacy and Control

Auth0 recently published an in-depth explanation of Verifiable Credentials (VCs). The article emphasizes the potential of VCs to transform how identities are managed online. It highlights the limitations of current identity systems and how VCs can address these gaps, particularly in allowing identity claims to be disclosed without issuers knowing, thereby enhancing privacy and control…

Reddit Unveils REV2: Modernised Rule-Execution with Kubernetes, Kafka, and Flink Stateful Functions

Reddit Unveils REV2: Modernised Rule-Execution with Kubernetes, Kafka, and Flink Stateful Functions

Reddit’s Safety Engineering team recently published how it modernised its Rule-Execution system, which detects and acts on policy-violating content in real time. The new architecture includes improvements like transitioning from legacy EC2-based systems to Kubernetes, better rule version control with Github and S3 storage, and the capability to scale more efficiently with Flink Stateful Functions. Read…

LinkedIn’s Open-Source “iris-message-processor” Achieves 86.6x Faster Escalation Management Speeds

LinkedIn’s Open-Source “iris-message-processor” Achieves 86.6x Faster Escalation Management Speeds

LinkedIn developed a new open-source service called “iris-message-processor” to enhance the performance and reliability of its existing Iris escalation management system. “iris-message-processor” significantly improves processing speeds, being ~4.6x faster under average loads and ~86.6x faster under high loads than its predecessor. Read more on InfoQ

Inside InfluxDB 3.0: Exploring InfluxDB’s Scalable and Decoupled Architecture

Inside InfluxDB 3.0: Exploring InfluxDB’s Scalable and Decoupled Architecture

InfluxData recently unveiled the system architecture for InfluxDB 3.0, its newest time-series DB. Its architecture encompasses four major components responsible for data ingestion, querying, compaction, and garbage collection and includes two main storage types. The architecture caters to operating the DB on-premise and natively on major cloud providers. Read more on InfoQ

Implementing Application Level Encryption at Scale: Insights from Atlassian’s Use of AWS and Cryptor

Implementing Application Level Encryption at Scale: Insights from Atlassian’s Use of AWS and Cryptor

Atlassian recently published how it performs Application Level Encryption at scale on AWS while utilising high cache hit rates and maintaining low costs. Atlassian’s solution runs over 12,500 instances and manages over 1,540 KMS keys. It performs over 11 billion decryptions and 811 million encryptions daily, costing $2,500 per month versus a potential $1,000,000 per…

LinkedIn’s LIquid Graph Database: Scaling Real-Time Data Access for 930+ Million Members

LinkedIn’s LIquid Graph Database: Scaling Real-Time Data Access for 930+ Million Members

LinkedIn recently published how LIquid, its graph database, automates the indexing and real-time access of all connections to members, schools, skills, companies, positions, jobs, events, etc. This knowledge graph, known as the Economic Graph, has 270 billion edges and growing, currently handling a workload of 2 million queries per second. Read more on InfoQ

StackFeed: Cloud Service Updates as a Service

StackFeed: Cloud Service Updates as a Service

In the fast-paced field of cloud services, staying current with service updates is often challenging. The recently announced StackFeed aims to address this issue, especially for software architects managing multi-cloud architectures. Architects select cloud services they’re interested in, and StackFeed generates a customised service update feed consumable using an RSS reader, Slack or Teams. Read…

Real-Time Messaging Architecture at Slack

Real-Time Messaging Architecture at Slack

Slack recently described how it sends millions of messages daily in real-time across the globe. The company provides a comprehensive insight into its architecture, designed to manage real-time messages at scale. It highlights the unique challenges posed by delivering real-time messages across different time zones and regions and how Slack’s engineers designed the infrastructure to…