An observability solution that is down when you need it provides zero value.
That’s why at Chronosphere, we offer a 99.9% uptime SLA and then strive to overachieve on that number. Last year, we were proud to deliver greater than 99.99% uptime to our customers. That’s less than 1 hour of downtime for the entire year!
Chronosphere achieves more than 99.9% uptime
Uptime is more than an approach. Chronosphere’s cloud native observability platform was built from the ground-up for cloud native scale and complexity, which means greater reliability.
Single tenant deployment on dedicated storage infrastructure. Unlike many other SaaS solutions, Chronosphere is single tenant.
This means no noisy neighbors, industry leading SLAs, and individual customer status pages that let you see exactly what’s happening in your tenant.
We store 3x data copies in diverse availability zones.
We ensure strong consistency on quorum writes and reads.
Geographically dispersed regions enable highly consistent data, collected across three availability zones.
(Built on the same front-end serving infrastructure that powers Google)
All of this enables us to deliver a minimum of 99.99% uptime to our customers. Going from three nines to four nines is an exponentially more difficult task, but the result is well worth the effort — reducing customer downtime from over eight hours of downtime a year, to under one hour per year.
We have a more rigorous approach to how we define uptime and our SLAs
We continuously monitor that read and write paths are available, to ensure that data is persisting as expected and that no data is lost.
Other vendors rely on a simple endpoint ping.
We start counting downtime as soon as we detect non-availability of the Chronosphere Covered Services.
Other vendors use the timestamp of the alert in the vendor’s monitoring systems or the timestamp of the customer’s ticket.
Every customer gets a dedicated status page so that you can always check on the availability of your Chronosphere instance.
No filing a support ticket!
When your observability solution is highly reliable, it allows you to deliver the best customer experiences.
Before: DoorDash experienced constant data loss and full system outages. Any change made by developers could easily impact the monitoring tool’s ability to do its job.
Challenge: In a competitive market, user experience is everything and it is directly tied to revenue. App downtime led to poor customer experiences and risked customer satisfaction, retention, growth, and, ultimately, revenue.
Now: DoorDash experiences 99.99% of reliability across ingest, console, and query and experienced approximately one minute of downtime in a six month period.
Before: Constant metrics outages plagued the infrastructure team tasked with managing observability. Also, resource limitations impacted triaging and management.
Challenge: Abnormal’s AI-based email threat detection software took off during the pandemic. The company needed to find an observability solution that could keep pace with over 300% business growth while achieving its targeted 99.9% SLA.
Now: With Chronosphere, Abnormal improved reliability and stability of its metrics system. The company is leveraging Chronosphere’s Control Plane to improve overall stability to greater than 99.9% uptime.
Before: The company provides 99.9% SLA to its customers, but it was only receiving 90% of availability with its previous observability vendor and often experiences outages during peak seasonal events.
Challenge: Downtime posed major risk due to its economic impact on merchants, customers, and engineers.
Now: Chronosphere has improved the FinTech company’s observability stack availability to at least 99.9% and enabled them to seamlessly scale even during peak demand.