Welcome to Trident Information Systems!

DevOps Services

Advancing Azure service quality with artificial intelligence: AIOps

This post shares our vision on the importance of infusing AI into our cloud platform and DevOps process. Gartner refers to something similar as AIOps (pronounced “AI Ops”), and this has become the common term that we use internally, albeit with a larger scope. Today’s post is just the start, as we intend to provide regular updates on how we are adopting AI technologies to support the way we build and operate Azure at scale.

Why AIOps?

There are two unique characteristics of cloud services: the ever-increasing scale and complexity of the cloud platform and systems, and the ever-changing needs of customers, partners, and their workloads. To build and operate reliable cloud services during this constant state of flux, and to do so as efficiently and effectively as possible, our cloud engineers (including thousands of Azure developers, operations engineers, customer support engineers, and program managers) rely heavily on data to make decisions and take actions. Furthermore, many of these decisions and actions need to be executed automatically as an integral part of our cloud services or our DevOps processes.

Streamlining the path from data to decisions to actions involves identifying patterns in the data, reasoning and making predictions based on historical data, and then recommending or even taking actions based on the insights derived from all that underlying data.

Figure 1. Infusing AI into cloud platform and DevOps.

The AIOps vision

AIOps has started to transform the cloud business by improving service quality and customer experience at scale while boosting engineers’ productivity with intelligent tools, driving continuous cost optimization, and ultimately improving the reliability, performance, and efficiency of the platform itself.
When we invest in advancing AIOps and related technologies, we see that this ultimately provides value in several ways:

Higher service quality and efficiency: Cloud services will have built-in capabilities for self-monitoring, self-adapting, and self-healing, all with minimal human intervention. Platform-level automation powered by such intelligence will improve service quality (including reliability, availability, and performance) and service efficiency to deliver the best possible customer experience.

Higher DevOps productivity: With the automation power of AI and ML, engineers are released from the toil of investigating repeated issues and manually operating and supporting their services. They can instead focus on solving new problems, building new functionality, and doing work that more directly impacts the customer and partner experience. In practice, AIOps gives developers and engineers insights so that they do not have to sift through raw data, which improves their productivity.

Higher customer satisfaction: AIOps solutions play a critical role in enabling customers to use, maintain, and troubleshoot their workloads on top of our cloud services as easily as possible. We endeavor to use AIOps to understand customer needs better and, in some cases, to identify potential pain points and proactively reach out as needed. Data-driven insights into customer workload behavior could flag when Microsoft or the customer needs to take action to prevent issues or apply workarounds. Ultimately, the goal is to improve satisfaction by quickly identifying, mitigating, and fixing issues.

Figure 2. AI for Cloud: AIOps and AI-Serving Platform.

AIOps

Moving beyond our vision, we want to start by briefly summarizing our general methodology for building AIOps solutions. A solution in this space always starts with data (measurements of systems, customers, and processes), because the key to any AIOps solution is distilling insights about system behavior, customer behaviors, and DevOps artifacts and processes.
The insights could include identifying a problem that is happening now (detect), why it is happening (diagnose), what will happen in the future (predict), and how to improve (optimize, adjust, and mitigate). Such insights should always be associated with business metrics (customer satisfaction, system quality, and DevOps productivity) and drive actions that are prioritized by business impact. The actions are also fed back into the system and process. This feedback could be fully automated (infused into the system) or involve humans in the loop (infused into the DevOps process). This overall methodology guided us to build AIOps solutions in three pillars.

Figure 3. AIOps methodologies: Data, insights, and actions.

AI for systems

Today, we’re introducing several AIOps solutions that are already in use and supporting Azure behind the scenes. The goal is to automate system management and reduce human intervention. This in turn helps to reduce operational costs, improve system efficiency, and increase customer satisfaction. These solutions have already contributed significantly to Azure platform availability improvements, especially for Azure IaaS virtual machines (VMs). AIOps solutions contributed in several ways, including protecting customers’ workloads from host failures through hardware failure prediction and proactive actions such as live migration and Project Tardigrade, and pre-provisioning VMs to shorten VM creation time. Of course, engineering improvements and ongoing system innovation also play important roles in the continuous improvement of platform reliability.

Hardware Failure Prediction protects cloud customers from interruptions caused by hardware failures. Microsoft Research and Azure have built a disk failure prediction solution for Azure Compute, which triggers the live migration of customer VMs from predicted-to-fail nodes to healthy nodes.
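
As a rough illustration of the predict-then-act flow described above, the following Python sketch plans migrations of VMs from predicted-to-fail nodes to healthy ones. It is not Azure's implementation: the `Node` type, the `failure_probability` field, the `plan_live_migrations` function, and the 0.8 threshold are all hypothetical names and values chosen for this example, and the least-loaded placement rule is a simplifying assumption.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Node:
    name: str
    # Hypothetical output of a failure prediction model, in [0, 1].
    failure_probability: float
    vms: List[str] = field(default_factory=list)


def plan_live_migrations(nodes: List[Node], threshold: float = 0.8) -> List[Tuple[str, str, str]]:
    """Return (vm, source_node, target_node) moves away from at-risk nodes.

    The threshold is an assumed cutoff for illustration only.
    """
    at_risk = [n for n in nodes if n.failure_probability >= threshold]
    healthy = [n for n in nodes if n.failure_probability < threshold]
    if not healthy:
        return []  # nowhere to move VMs; a real system would escalate

    # Track planned load so migrated VMs spread across healthy nodes.
    load = {n.name: len(n.vms) for n in healthy}
    moves = []
    for node in at_risk:
        for vm in node.vms:
            target = min(load, key=load.get)  # least-loaded healthy node
            load[target] += 1
            moves.append((vm, node.name, target))
    return moves


nodes = [
    Node("node-a", 0.95, ["vm1", "vm2"]),
    Node("node-b", 0.10, ["vm3"]),
    Node("node-c", 0.05, []),
]
moves = plan_live_migrations(nodes)
```

The sketch only produces a plan. Feeding the resulting actions back into the system, automatically or with a human in the loop, is the step the methodology above calls out.
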
We also expanded the prediction to other types of hardware issues, including memory and networking router failures. This enables us to perform predictive maintenance for better availability.

Pre-Provisioning Service in Azure brings VM deployment reliability and latency benefits by creating pre-provisioned VMs, which are created and partially configured ahead of customer requests for VMs. As described in the IJCAI 2020 publication and the AAAI-20 keynote, the Pre-Provisioning Service uses a prediction engine to predict which VM configurations to pre-create and how many VMs to pre-create per configuration. This prediction engine applies dynamic models that are trained on historical and current deployment behaviors and predicts future deployments. Pre-Provisioning Service uses this prediction to create and manage VM pools per VM configuration, resizing each pool by destroying or adding VMs as prescribed by the latest predictions. Once a VM matching the customer’s request is identified, the VM is assigned from the pre-created pool to the customer’s subscription.

AI for DevOps

AI can boost engineering productivity and help in shipping
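
The pool management described for the Pre-Provisioning Service can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the service's actual logic: the function names `resize_pools` and `assign_vm`, the configuration labels, and the dictionary-based pool representation are all hypothetical, and the predicted demand is simply given as input rather than produced by a trained model.

```python
from typing import Dict, List, Optional


def resize_pools(pools: Dict[str, List[str]], predicted_demand: Dict[str, int]) -> Dict[str, int]:
    """Compare each pool with the latest prediction.

    Returns a mapping of configuration -> delta, where a positive delta
    means "pre-create this many VMs" and a negative delta means
    "destroy this many VMs". Configurations already at target are omitted.
    """
    actions = {}
    for config in set(pools) | set(predicted_demand):
        current = len(pools.get(config, []))
        target = predicted_demand.get(config, 0)
        if target != current:
            actions[config] = target - current
    return actions


def assign_vm(pools: Dict[str, List[str]], requested_config: str) -> Optional[str]:
    """Hand out a pre-created VM matching the request, or None if the pool is empty."""
    pool = pools.get(requested_config)
    return pool.pop() if pool else None


# Hypothetical configuration labels and VM identifiers.
pools = {"small-linux": ["vm-a", "vm-b"], "large-windows": ["vm-c"]}
predicted = {"small-linux": 3, "large-windows": 0, "gpu": 1}

actions = resize_pools(pools, predicted)
assigned = assign_vm(pools, "small-linux")
missing = assign_vm(pools, "gpu")
```

When `assign_vm` returns `None`, a real deployment would presumably fall back to creating the VM on demand; that fallback is outside the scope of this sketch.
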

Are you managing your IoT devices’ software in the right way?

Enterprises today are part of a new world: a software-driven world. While this new world has brought many great innovations, it is also forcing organizations in all industries to rethink their software development practices so they can deliver new features and offerings faster. Those that embrace and incorporate change most effectively will be the winners.

In a world where industry disruptors set new market conditions and new standards for product delivery and customer satisfaction, industry incumbents are faced with two options: either continue using their existing approaches and tools, at the risk of lagging behind the competition, or adapt to the “new normal” and deliver products and applications faster, with fewer resources.

Under the “new normal” DevOps standards, organizations face many challenges, such as cost overruns, software development projects that don’t scale in line with enterprise growth, and increased market demands for speed. On top of that, outdated testing tools don’t offer the visibility needed to ensure that the right specifications get tested at the right time.

Lean and agile principles

So, how can you make sure your organization is ready to manage unexpected changes and deal with any dependencies that you already have under the hood? How do you ensure a strong balance between the existing business and new development?

Many of you may already be familiar with lean and agile principles and have probably even tried applying them in smaller teams. But what we’ve seen so far in the market is that many organizations struggle to apply these principles across the entire enterprise.

Lean and agile principles can help you reach your goals in today’s hyper-competitive world of digital product delivery. By becoming a lean and agile enterprise, your organization will be able to adapt faster to the needs of the market by improving internal collaboration and communication.
You will be able to learn in real time from your clients to ensure that you are producing the prioritized set of features that drive economic value. By managing test labs and test planning, and by ensuring a tight linkage between product demand and delivery, your organization will be able to reduce waste (time, effort, resources) while ensuring that your business strategy is aligned with your investment and development goals.

Client successes

Let’s have a look at a few examples of what some industry leaders have achieved using lean and agile processes:

50 percent reduction in time to market because of increased efficiency.
80 percent reduction in software defects through earlier testing and easier detection of discrepancies.
50 percent decrease in user time to acceptance of software releases.
57 percent development cost reduction.

Nationwide achieved a 50 percent improvement in code quality and a 70 percent reduction in system downtime by applying lean principles to transform the software delivery lifecycle.

Diagnostic Grifols, a world-leading healthcare enterprise headquartered in Barcelona, Spain, increased the efficiency of development documentation by 30 percent, facilitating compliance, ensuring consistency of records across all product lines, and reducing operational costs.

Photo: Experience the benefits of collaboration and agile development

Time to transform

It’s time to transform your organization into a lean and agile enterprise. It’s time to ensure that your firm can adjust to any market change, predict the unpredictable, keep costs low, deliver new features and offerings faster, and never miss a beat with your customers.

If you would like to learn more, let’s get connected! Our IBM solution enables companies to improve visibility and transparency across the product delivery lifecycle by providing a single source of truth. It also enables enterprises to define a process custom to each organization, and it ensures quality and compliance.
All using lean and agile processes.
