{"id":47215,"date":"2022-08-09T15:12:17","date_gmt":"2022-08-09T15:12:17","guid":{"rendered":"https:\/\/www.techrepublic.com\/?p=3989024"},"modified":"2022-08-09T15:12:17","modified_gmt":"2022-08-09T15:12:17","slug":"on-call-cloud-operations-cost-organizations-an-average-of-2-5-million-per-year","status":"publish","type":"post","link":"https:\/\/cloudnewshub.com\/?p=47215","title":{"rendered":"On-call cloud operations cost organizations an average of $2.5 million per year"},"content":{"rendered":"<div id>\n<p> Ticketing data is key to gaining insight into on-call operations and uncovering opportunities to improve productivity, according to a new report from Dimensional Research and Shorline.io. <\/p>\n<\/div>\n<div id>\n<figure id=\"attachment_3989039\" aria-describedby=\"caption-attachment-3989039\" class=\"wp-caption alignnone\"><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-3989039\" src=\"http:\/\/cloudnewshub.com\/wp-content\/uploads\/2022\/08\/on-call-cloud-operations-cost-organizations-an-average-of-2-5-million-per-year.jpg\" alt=\"Hexagon structure with gears and cloud on the gray background. Eps 10 vector file.\" width=\"1400\" height=\"540\"><figcaption id=\"caption-attachment-3989039\" class=\"wp-caption-text\">Image: Adobe Stock<\/figcaption><\/figure>\n<p>Organizations are spending an average of $2.5 million per year on on-call operations, according to a report by Dimensional Research and automation provider Shoreline.io. They also suffer an average of 8.7 major incidents each year, 62% of which escalate to the C-suite, the <a href=\"https:\/\/www.shoreline.io\/offer\/2022-production-operations-benchmark-survey\" target=\"_blank\" rel=\"noopener noreferrer\">Managing On-Call Operations Report <\/a>found.<\/p>\n<aside class=\"pinbox right\">\n<h3 class=\"heading\">Cloud: Must-read coverage<\/h3>\n<\/aside>\n<p>The report highlights a number of challenges and opportunities for the <a href=\"https:\/\/www.techrepublic.com\/article\/you-should-be-optimizing-your-cloud-operations-right-now-heres-how\/\">cloud operations<\/a> industry, maintaining that even though organizations are spending millions of dollars per year on on-call operations, they continue to suffer major outages that impact customer and employee productivity.<\/p>\n<h2>Cloud reliability challenges<\/h2>\n<p>Some 97% of organizational leaders said they prioritize <a href=\"https:\/\/www.techrepublic.com\/article\/cloud-backup-services\/\">cloud reliability<\/a>. Yet despite this focus, companies highlight several major impediments to improving reliability. At the top of the list is the complexity of the environments they are managing.<\/p>\n<p>\u201cAs a company\u2019s product complexity increases, it becomes harder and harder to find SRE [site reliability engineering] and DevOps professionals that have the breadth of experience needed,\u2019\u2019 the report said.<\/p>\n<p><b>SEE: <\/b><a href=\"https:\/\/www.techrepublic.com\/resource-library\/whitepapers\/hiring-kit-cloud-engineer\/\"><b>Hiring Kit: Cloud Engineer<\/b><\/a><b> (TechRepublic Premium)<\/b><\/p>\n<p>The second biggest issue respondents cited is the lack of time to focus on preventing incidents or automating fixes. \u201cThis truly becomes a vicious cycle where the less time a team has, the less they can invest in improvements, while the product continues to grow and become more complex,\u2019\u2019 the report noted. \u201cAs the load on operations teams increases, people leave, causing the burden to be shared by fewer people.\u201d<\/p>\n<p>This report makes the case for organizations to start investing in incident prevention and repair automation right away, no matter where they are on their journey.<\/p>\n<p>Among the other key findings:<\/p>\n<ul>\n<li>&nbsp;Service providers and human error are responsible for 72% of major incidents<\/li>\n<li>Human error is 5x more likely to cause a major outage than automation error<\/li>\n<li>The average time to resolve escalated incidents is 10.7 hours<\/li>\n<li>Fifty-five percent of incidents are escalated to second-line responders or experts outside of the on-call team<\/li>\n<li>Forty-eight percent of incidents are low value, repetitive, toil<\/li>\n<\/ul>\n<p>As more organizations prioritize reducing the total number of incidents, decreasing costs, and shortening the time to recover, the survey indicated how significant reliability is:<\/p>\n<ul>\n<li>&nbsp;Ninety-eight percent of organizations face challenges in delivering highly reliable cloud applications<\/li>\n<li>SRE teams grew 26% in the last 12 months<\/li>\n<li>Cloud footprints grew 38% in the last 12 months<\/li>\n<li>Modern technologies are making infrastructure management more difficult, with 73% reporting that <a href=\"https:\/\/www.techrepublic.com\/article\/multicloud-the-smart-persons-guide\/\">multicloud<\/a> makes their job harder and 52% reporting that Kubernetes and microservices make their job harder<\/li>\n<\/ul>\n<p>\u201cThe growth of cloud footprints is outpacing the growth of on-call teams,\u201d said Diane Hagglund, principal at Dimensional Research, in a statement. \u201cCloud environments are becoming increasingly complex while it is particularly challenging to find staff with the expertise to meet on-call needs, leaving incident response teams struggling to meet reliability demands.\u201d<\/p>\n<p><b>SEE: <\/b><a href=\"https:\/\/www.techrepublic.com\/resource-library\/downloads\/icloud-vs-onedrive-which-is-best-for-mac-ipad-and-iphone-users-free-pdf\/?r=28123414\"><b>iCloud vs. OneDrive: Which is best for Mac, iPad and iPhone users? (free PDF)<\/b><\/a> <b>(TechRepublic)<\/b><\/p>\n<h2>How to improve on-call productivity<\/h2>\n<p>The report details several recommendations for improving on-call including:<\/p>\n<h3>Ensure incident management systems provide insight<\/h3>\n<p>Ninety-eight percent of organizations reported struggles with their incident management approach. Using ticketing data to gain insight into on-call operations is key to uncovering opportunities to improve productivity.<\/p>\n<h3>Attack escalations<\/h3>\n<p>The biggest opportunity to improve on-call productivity is by reducing incident escalations, which account for 78% of on-call time. Investing in self-service tools to empower support teams will not only reduce the total number of escalations but will provide more comprehensive diagnostic data.<\/p>\n<h3>Attack repetitive, low-value work or toil<\/h3>\n<p>Forty-eight percent of incidents are repetitive, presenting an opportunity to create self-healing incident remediation that frees teams of repetitive tasks so they can dedicate more time to improving resiliency, securing environments, and lowering costs to further improve productivity.<\/p>\n<p>\u201cThe current approach to on-call is unsustainable, with the rapid growth of cloud infrastructure leaving SRE teams faced with thousands of hours of work per month,\u201d said Anurag Gupta, founder and CEO at Shoreline.io, in a statement. \u201cUtilizing automation to address escalations and eliminate low value, repetitive work will dramatically improve team productivity and overall customer experience.\u201d<\/p>\n<p>Dimensional Research said over 300 on-call practitioners, managers and executives were polled to learn about incident response in production cloud environments. Survey participants are responsible for running businesses that manage less than 20 to over 10,000 nodes, the firm said.<\/p>\n<\/p><\/div>\n","protected":false},"excerpt":{"rendered":"<p>Ticketing data is key to gaining insight into on-call operations and uncovering opportunities to improve productivity, according to a new report from Dimensional Research and Shorline.io. Image: Adobe Stock Organizations are spending an average of $2.5 million per year on on-call operations, according to a report by Dimensional Research and automation provider Shoreline.io. They also [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":47216,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[40,783],"tags":[],"class_list":["post-47215","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-cloud","category-cloudsync"],"_links":{"self":[{"href":"https:\/\/cloudnewshub.com\/index.php?rest_route=\/wp\/v2\/posts\/47215","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/cloudnewshub.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/cloudnewshub.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/cloudnewshub.com\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/cloudnewshub.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=47215"}],"version-history":[{"count":0,"href":"https:\/\/cloudnewshub.com\/index.php?rest_route=\/wp\/v2\/posts\/47215\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/cloudnewshub.com\/index.php?rest_route=\/wp\/v2\/media\/47216"}],"wp:attachment":[{"href":"https:\/\/cloudnewshub.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=47215"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/cloudnewshub.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=47215"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/cloudnewshub.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=47215"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}