Engineering.salesforce.com

3 metrics relevant to every service, always

WebIn Salesforce Security Engineering, our teams retrospected on these similarities and identified 3 positive “tells” for service health. Now, our engineering efforts explicitly invest …

Actived: 1 days ago

URL: https://engineering.salesforce.com/3-metrics-relevant-to-every-service-always-9f2620f4adec/

READS: Service Health Metrics

WebHealth Indicators. The handy acronym READS describes the minimal set of indicators for every service: Request rate, errors, availability, duration/latency, and saturation. Every …

Category:  Health Go Health

5 Design Patterns for Building Observable Services

Web1. The Outside-In Health Check. In this pattern, you ping your service endpoint using using a health-check service or a synthetic testing tool. We use a tool we built in-house, but …

Category:  Health Go Health

6 Ways We Deliver on Our Promise of Availability and Performance

WebIn a climate where the health of our employees is less predictable, this is even more important. Here are a few ways we remain ready: Site Reliability Engineering coverage. …

Category:  Health Go Health

How, Not Why: An Alternative to the Five Whys for Post-Mortems

WebDefending against failure is a big part of our job. We defend systems against failure with techniques like redundancy, auto-scaling load, balancing load, shedding monitoring, …

Category:  Health Go Health

Autonomous Monitoring and Healing Networks

WebThe need of the hour is a robust, self-reliant automated monitoring tool that provides great insight and a lesser degree of manual intervention. We need autonomous interventions …

Category:  Health Go Health

Managing Availability in Service Based Deployments with …

WebThe Problem. At Salesforce, trust is our number one value. What this equates to is that our customers need to trust us; trust us to safeguard their data, trust that we will keep our …

Category:  Health Go Health

The Power of AI: Strengthening Application Security by Eliminating

WebAST encountered two main challenges for eliminating secrets in code: Attribution: With ~120,000 repositories that were created over long periods of time, combined with issues …

Category:  Health Go Health

Building a Fault-Tolerant Data Pipeline for Chatbots

WebMontgomery Scott. Apache Kafka is a durable and fault-tolerant publish-subscribe messaging system. While it is highly scalable, it is not just for high throughput use-cases. …

Category:  Health Go Health

Hadoop/HBase on Kubernetes and Public Cloud (Part I)

WebAt Salesforce, we run a large number of HBase and HDFS clusters in our own data centers. More recently, we have started deploying our clusters on Public Cloud infrastructure to …

Category:  Health Go Health

Building Resilience: Delivering Your Best Under Pressure

WebLast fall, we hosted psychologist, researcher, and CEO of Kintla, Bill Redmon, for a talk on building resilience.As many of us find ourselves working under unprecedented …

Category:  Health Go Health

Scaling an Alerting Service Using AWS Lambda Functions

WebScaling an Alerting Service Using AWS Lambda Functions. Sanyogita ranade. Nov 17 - 9 min read. Anyone who takes monitoring their services seriously knows that alerting is an …

Category:  Health Go Health

Setting up a Web API for Success

WebSetting up a Web API for Success. Scott Walker. Mar 29 - 10 min read. In the current technology landscape, web APIs are used everywhere. In fact, almost every major …

Category:  Health Go Health

How Salesforce Optimizes Performance in China

WebDuring Dreamforce 2017, Jeff Cheng and Rob Mercado gave a talk entitled Performance Optimization: Greater China Region and Beyond (see the full talk, and refer to the …

Category:  Health Go Health

How is Salesforce Einstein Optimizing AI Classification Model …

WebMLOP uses mathematical formulas based on a confusion matrix to determine a model’s overall performance. MLOP also validates models by subjecting them to tests using …

Category:  Health Go Health

Transforming Service Reliability Through an SLOs-Driven Culture

WebAt Salesforce, Trust is our number-one value, and it has its own special meaning to each part of the company. In our Technology, Marketing, & Products (TMP) organization, a big …

Category:  Health Go Health

Take A Moment To Refocus

WebIn Refocus, an Aspect represents each of the different characteristics of a monitored subject. An Aspect can be anything that is measurable including things like “request latency”, …

Category:  Health Go Health

The Unified Infrastructure Platform Behind Salesforce Hyperforce

WebThe Unified Infrastructure Platform Behind Salesforce Hyperforce. If you’re paying attention to Salesforce technology at all, you’ve no doubt heard about Hyperforce, our new …

Category:  Health Go Health

Automation at Scale: Migrating 200K Machines from CentOS 7 to …

WebAs CentOS 7 races to the end of its operational runway, Salesforce Engineering tackles its new OS upgrade task head-on. This time around, the team faces a much bigger hurdle: …

Category:  Health Go Health