Achieving cloud-native network automation at a global scale with Nephio

In 2007, in order to meet ever increasing traffic demands of YouTube, Google started building what is now the Google Global Cache program. Over the past 15 years, we have added thousands of edge caching locations around the world, with widely varying hosting conditions—some in customer data centers, some in remote locations with limited connectivity. Google manages the software and hardware lifecycles of all these systems remotely. Although the fleet size and serving corpus have grown by several orders of magnitude during this time, the operations team overseeing it has remained relatively small and agile. How did we do it?We started with a set of automation tools for software deployment (remotely executing commands), a set of tools for auditing/repairs (if this condition occurs, run that command), and a third set of tools for configuration management. As the fleet grew and was deployed in more varied environments, we discovered and fixed more edge cases in our automation tools. Soon, the system started reaching its scaling limits, and we built a new, more uniform and more scalable system in its place. We learned a few key lessons in the process:Intent-driven, continuously reconciling systems are more robust at scale than imperative, fire-and-forget tools.Distributed actuation of intent is a must for large-scale edge deployments. Triggering all actions from a centralized location is not reliable and does not scale, especially for edge deployments.Uniformity in systems is easier to maintain. Being able to manage deployment, repairs, and configuration using common components and common workflows (in other words, files checked into a repository with presubmit validation, review, version control, and rollback capability) reduces cognitive load for the operations team and allows more rapid response with fewer human errorsThis pattern repeats time and time again across many large distributed systems at Google, and we believe these tenets are key as network function vendors and communication service providers look to adopt cloud-based network technologies. For example, in a 5G deployment involving hundreds of locations (or many hundreds of thousands, in the case of RAN), with containerized software components, the industry needs better tools to handle deployment and operations at scale. Working with the community to address these issues, we hope to drive a common Kubernetes-based, cloud-native network automation architecture, while also providing extension points for vendors to innovate and adapt to their specific requirements.That’s why Google Cloud founded the Nephio project in April 2022. The Nephio community launched with 24 founding organizations and has now grown 2X since launching. In addition to the founding members, new participating organizations include Vodafone, Verizon, Telefonica, Deutsche Telekom, KT, HPE, Red Hat, Windriver, Tech Mahindra, and others. Over 150 developers across the globe participated in the community kickoff meeting hosted by the Linux Foundation on May 17, 2022.Google Cloud is collaborating with communication service providers, network function vendors, and cloud providers in Nephio by:Working with the community to refine the cloud native automation architecture, and define a common data model based on the Kubernetes Resource Model (KRM) and Configuration as Data (CaD) approach. This new model needs to support cloud infrastructure, network function deployment, and management of user journeys.Contributing to the development of an open, fully functional reference implementation of this architecture.Open sourcing several key building blocks, such as kpt, Porch and ConfigSync. We are also planning to open source controllers, Google Cloud infrastructure CRDs, additional sample NF CRDs, and operators to jumpstart the Nephio project.Google Cloud will also integrate Nephio with our Google Distributed Cloud Edge platform, combining the advantages of a fully managed hardware platform with Nephio-powered deployment and management of network functions to our customers.The Nephio community is complementary to many of the existing open source communities and standards. Nephio is closely working with adjacent communities in CNCF, LF Networking, and LF Edge to provide an end-to-end automation framework for telecommunication networks.By working with the community in this open manner, we believe that, together, we can advance the state of the art of network automation, improving the deployment and management of network functions on cloud native infrastructure. We welcome the industry to join us in this effort. For more information, please visit the Nephio website at www.nephio.org. And please register to join us online or in-person at the Nephio developer summit on June 22 and 23.Related ArticleAchieving cloud-native network automation in telecommunications togetherGoogle Cloud and Linux Foundation launch Nephio an open source cloud native network automation program.Read Article
Quelle: Google Cloud Platform

The Retirement Tracker simplifies and socializes early retirement on Google Cloud

A lot of people talk about retirement but far fewer people have the information and tools to plan for it properly. Just how much money you need to live comfortably once you stop working can be the million-dollar (or more!) question. Although there is no shortage of retirement calculators, many only provide a limited one-time analysis and require detailed personal information that may be sold to third parties. We developedThe Retirement Tracker with one idea: to empower individuals to take control over their retirement planning with tools to easily plan, track, and even socialize their early retirement.With The Retirement Tracker, people can aggregate their financial accounts—including savings, 401Ks, and stock portfolios—on one safe, convenient retirement app. The Retirement Tracker analyzes real-time data from these accounts to track net worth and automatically update retirement targets. A small part of this information, such as stock transactions, can even be shared among people’s self-created investment groups to encourage information sharing and friendly competition.Scaling up for early retirement on Google CloudWhen building The Retirement Tracker, we needed a technology partner that would enable us to securely and effectively scale while saving time and administrative costs. That’s why we started working withGoogle Cloud and partnering with theGoogle for Startups Cloud Program.Google Cloud gives us a highly secure-by-design infrastructure, valuable cloud credits to obtain products from an expansive technology platform, access to dedicated startup experts, and potential for joining the Google Cloud Marketplace.Even though we are a small team, we innovate quickly and easily onGoogle Workspace using Gmail, Google Docs, Sheets, Calendar, and Meet. We also store and protect all sensitive company documents on Google Cloud and post our“Restimators” investment video series on YouTube. More recently, we’ve adoptedFirebase to scale and manage our infrastructure while accelerating the development of The Retirement Tracker.In just days, we implemented Plaid authentication and authorization protocols, enabling customers to quickly and securely connect details about their investment and savings accounts to The Retirement Tracker. This is a process that possibly would have taken us months if we had to manually build these security capabilities from scratch.Google Firebase now delivers a seamless customer experience by aggregating and displaying near real-time data from multiple financial accounts on a single dashboard. On the back end, Firebase automatically queries read-only tokens, securely accesses account balance changes, and encrypts sensitive data in the cloud.  Firebase also makes it easy for customers to administer internal investment groups and selectively socialize information such as stock purchases and sales—without revealing transaction quantities or prices. Customers create these small invite-only groups to help family and friends improve their retirement portfolios with friendly competition and strategic crowdsourcing. Customers can also participate in additional investment discussions hosted by The Retirement Trackeron Discord.Building a sustainable financial futureSince we started using Google and Google Cloud solutions, everything is easier to build and scale. We constantly perfect the customer experience with new features and services, while leaving our IT and cloud infrastructure in the hands of Google Cloud experts. Demand for our app is growing fast as we prepare to move The Retirement Tracker out of beta in 2022. Moving forward, we’re excited to continue to grow in the Google for Startups Cloud Program, and with our dedicated Google team to improve the observability and reliability of The Retirement Tracker to handle the volume of users we’re anticipating in 2023 and beyond. To help us do so, we’re exploring additional Google Cloud solutions such asLooker,BigQuery, andCloud Spanner. These solutions will enable us to rapidly expand our services and offer customers a variety of new benefits from using The Retirement Tracker. Our participation in the Google for Startups Cloud Program has been instrumental to our success. The Startup Success Manager has worked with our team to identify programs we could apply to in order to strengthen our relationship even further. With Google Cloud, we’re making early retirement easier and more accessible on one convenient, highly secure mobile app. We can’t wait to see what we accomplish next as we drive innovation and financial inclusion by empowering people to plan, track, and socialize retirement planning that can be at once so important and so difficult for so many people.  If you want to learn more about how Google Cloud can help your startup, visit our pagehere to get more information about our program, and sign up for our communications to get a look at our community activities, digital events, special offers, and more.
Quelle: Google Cloud Platform

Snap Inc. adopts Google Cloud TPU for deep learning recommendation models

While many people still think of academic research when it comes to deep learning, Snap Inc. has been applying deep learning models to improve its recommendation engines on a daily basis. Using Google’s Cloud Tensor Processing Units (TPUs), Snap has accelerated its pace of innovation and model improvement to enhance the user experience. Snap’s blog Training Large-Scale Recommendation Models with TPUs tells the story of how the Snap ad ranking team leveraged Google’s leading-edge TPUs to train deep learning models quickly and efficiently. But there’s a lot more to the story than the how, and that’s what we’re sharing here.Faster leads to betterSnap’s ad ranking team is charged with training the models that make sure the right ad is served to the right Snapchatter at the right time. With 300+ million users daily and millions of ads to rank, training models quickly and efficiently is a large part of a Snap ML engineer’s daily workload. It’s simple, really: the more models Snap’s engineers can train, the more likely they are to find the models that perform better—and the less it costs to do so. Better ad recommendation models translate to more relevant ads for users, driving greater engagement and improving conversion rates for advertisers.Over the past decade, there has been tremendous evolution in the hardware accelerators used to train large ML models like those Snap uses for ad ranking, from general-purpose multicore central processing units (CPUs) to graphics processing units (GPUs) to TPUs. TPUs are Google’s custom-developed application specific integrated circuits (ASICs) used to accelerate ML workloads. TPUs are designed from the ground up to minimize time to accuracy when training large models. Models that previously took weeks to train on other hardware platforms can now be trained in hours on TPUs—a product of Google’s leadership and experience in machine learning (dig into the technology in Snap’s blog).Benchmarking successSnap wanted to understand for itself what kind of improvements in training speed it might see using TPUs. So, the Snap team benchmarked model training using TPUs versus both GPUs and CPUs, and the results were impressive. GPUs underperformed TPUs in terms of both throughput and cost, with a reduction in throughput of 67 percent and an increase in costs of 52 percent when using GPUs. Similarly, TPU-based training drastically outperformed CPU-based training for Snap’s most common models. For example, when looking at their standard ad recommendation model, TPUs slashed processing costs by as much as 74 percent while increasing throughput by as much as 250 percent—all with the same level of accuracy.Because TPU embedding API is a native and optimized solution for embedding-based operations, it performs embedding-based computations and lookups more efficiently. This is particularly valuable to recommenders, which have additional requirements such as fast embedding lookups and high memory bandwidth.Benefits across the boardFor Snap’s ad ranking team, those improvements translate into tangible workflow advantages. It’s not unusual for Snap to have a month’s worth of data that includes all the logs of users who were shown particular ads and a record of whether they interacted with an ad or not. That means it has millions of data points to process, and Snap wants to model them as quickly as possible so it can make better recommendations going forward. It’s an iterative process, and the faster Snap can get the results from one experiment, the faster its engineers can spin up another with even better results—and they’d much prefer to do that in hours rather than days. Increased efficiency and velocity benefit Snapchatters, too. The better the models are, the more likely they are to correctly predict the likelihood that a given user will interact with a particular ad, improving the user experience and boosting engagement. Improved engagement leads to higher conversion rates and greater advertiser value—and given the volumes of ads and users Snap deals with, even a one percent improvement has real monetary impact.Working at the leading edgeSnap is working hard to improve its recommendation quality with the goal of delivering greater value to advertisers and a better experience for Snapchatters. That includes going all-in on leading-edge solutions like Google TPUs that allow its talented ML engineers to shine. Now that you know the whole story, see how Snap got there with the help of Google: Training Large-Scale Recommendation Models with TPUs.Related ArticleCloud TPU VMs are generally availableCloud TPU VMs with Ranking & Recommendation acceleration are generally available on Google Cloud. Customers will have direct access to TP…Read Article
Quelle: Google Cloud Platform

No more normal? No problem when you build supply chains with data and AI

What if, after all the upheavals and innovations of the past two years, we’re not headed for some new normal but instead an era of no more normal?“There are big, big challenges that need to be solved every single day by supply chain professionals,” Hans Thalbauer, Google Cloud’s managing director for supply chain and logistics, pointed out during our recent Supply Chain & Logistics Spotlight event. Among the issues Thalbauer ticked off were changes from the pandemic, consumer demand, labor shortages, the climate crisis, geopolitical instability, and energy shortages.“And the thing is, it’s not just a short term issue, we think it’s a long-term and systemic issue,” Thielbauer said. “There’s a big question out there, which is: How will global trade change? Is it really transforming and translating into something new? Will global trade continue to work as is?”Even experts at the White House are asking these very questions at this very time. The same day as the Supply Chain & Logistics Spotlight, the president’s Council of Economic Advisors released their annual report with an entire chapter dedicated to supply chain. In it, they noted that once-obscure, and ideally invisible, supply chains had “entered dinner table conversations.” And for good reason. “Because of outsourcing, offshoring, and insufficient investment in resilience, many supply chains have become complex and fragile,” the economists wrote. Nor are they alone in worrying about the future of logistics.Whatever the outcomes—more global or local, more automated or disintermediated, more agile or fragile—one of the likeliest results is a greater reliance on technology, and especially data, to help handle all the disruptions and interruptions on the horizon. Leaders in the field, including at The Home Depot, Paack, and Seara Foods, are discovering opportunities in a few key areas: connecting data from end to end; the power of platforms to access and share information; and the importance of predictive analytics to mitigate issues as, or even before, they arise.“We need to create visibility, flexibility, and innovation,” Thalbauer said. “Too often companies just focus on their orders, forecasts, and inventory, but typically they ignore the rest of the world. We need to bring in the public information, the traffic, weather, climate, and financial risks, connect that with the enterprise data, and we need to actually enable community data to create collaboration between business partners at every tier.”End-to-end dataCompanies have always sought visibility from the factory to the warehouse to the store and now the front door, and all the points in between. Both the challenge and necessity of seeing into all these is that as the data has grown, and our capabilities along with it, so has the complexity. It’s at a scale no humans can manage, which makes the importance not only of data but analytics and AI all the more essential.Home Depot has had a front row seat to these growing interdependencies—especially when it comes to serving competing yet complimentary clienteles. The pandemic presented its share of unexpected opportunities, as the combination of soaring home values, disposable income, and DIYers looking for (stay at) home projects led to runs on everything from lumber to sheds-turned-offices to garage doors. Empty shelves can lead to angry customers. And in this case, it wasn’t just homeowners and renters Home Depot was contending with, explained Chris Smith, vice president of IT Supply Chain at Home Depot, but also an increasingly important base of contractors and even large-scale developers. Both tended to need different materials, at different scales, and shopped in different ways, and these demands have only expanded during the pandemic.Whatever the future of logistics look like—more global or local, more automated or disintermediated, more agile or fragile—one of the likeliest results is a greater reliance on technology.“We really have what we call an omnichannel algorithm.” Chris Smith, VP of IT Supply Chain at The Home Depot. “It’s really marrying up the customer’s preferences with our understanding of capacity, assortment, inventory availability, taking all that together, and saying: How do we best meet the customer promise and do it with the most efficient use of our supply chain? So where do we fulfill it from, where is the inventory available, and how do we do that in a way that’s most economical for us while still meeting the promise of the customer,” Smith said. Paack, a last-mile delivery start-up serving the UK, Spain, France, Portugal, and Italy, is similarly pushing the envelope on fulfillment. The company focuses on combining a wealth of data—from drivers, customers, sensors, weather, and more—to ensure guaranteed delivery. So far, their success rate is approaching 98% of on-time delivery, with special scheduling tools to ensure customers are available to receive their packages.Using solutions like the Last Mile Fleet Solution from Google Maps Platform, Paack can manage drivers and customers in real-time.“The granularity of information we can collect in terms of which routes are being effectively followed by the driver’s route versus planned routes, the ability for them to change directions, because we might know locally of better ways to go, notifications from the customer as to their availability—these really allows us to build a better experience for everyone,” Olivier Colinet, chief product and technology officer for Paack, said. “We want first-time drivers to be the most productive drivers, and this first step allows us to do so.”Power of platformsPaack’s success exemplifies the power of building a strong platform for customers and workers, as well as tapping existing platforms, like Google Maps, to bolster your own.On the other side of the globe, the world’s largest meat supplier is seeking to empower thousands of ranchers and farmers with a platform of their own. Seara, a Brazil-based supplier of pork, chicken and eggs that is part of the globe-spanning JBS conglomerate, launched its SuperAgroTech platform in July 2021. Though in development for years, the program could hardly have come at a more critical time for the global food supply. The food industry was already coping with pandemic-related shortages and shutdowns, and then came the spillover effects from the war in Ukraine.“In general, the entire supply chain was affected and the operation had to adapt to new working conditions,” Thiago Acconcia, the director of innovation and strategy at Seara, said. “So in the farms, in the field, the same situations are repeated, and the creation of this digital online platform enters as a facilitator when it gives autonomy to the farmer, providing them with the data input and digital communication.” It’s a level of connectivity the farmers never had with Seara before—and vice versa.The technology has been deployed to more than 9,000 farms at launch. Through a range of IoT sensors, monitoring devices, and data inputs from farmers, operators and Seara data, teams can track a host of results. These include yields, animal health, profits, and even environmental and social impacts, which are becoming increasingly important features for consumers.The eventual goal is to reach 100% digital management of the farm.“So today, we are able to activate any producer in a few seconds, regardless of the location,” Acconcia said. With SuperAgroTech, the platform “doesn’t mind if it’s in the very south of the country, if it’s in the central part. It’s strengthening the relationships with our producers and also promoting a level of personalized attention they’ve never had.”Such platforms also provide a level of visibility and connectivity rarely enjoyed before, as well as a virtuous cycle between data collection, analysis, and insights put back into action on the platform. In an unpredictable world, this kind of integration is becoming essential.Stacks of containersPredictive AnalyticsAs a company’s digital strategies evolve through integrated data and robust platforms, one of the most exciting opportunities arises around predictive analytics.While seeing into the future remains science fiction (at least for now), AI, cloud, and even emerging quantum computing are providing robust ways to better reveal trends, make connections, and anticipate both opportunities and interruptions.Home Depot has looked at ways to quickly adapt its digital stores using consumer data and AI to create better experiences, as well as smoothing out supply chain issues. Home Depot’s Chris Smith pointed to a listing for an out-of-stock appliance or tool, for example, that will quickly offer other locations or items for sale as a convenient alternative.“We can apply machine learning in many different ways to make better, faster decisions, both in how we support moving inventory through our supply chain or how we understand available capacity to support our customers,” Smith said. “And with automation, from our distribution centers to our forecasting and replenishment systems, we’re going to continue to look at places where we can optimize and automate to make better decisions.”For Paack, predictions could come in the form of traffic or storms or even the likelihood that a repeat customer will be available or not, without having to prompt them.And at Seara, the role of data and analytics is not just vital to the business but the very vitality of the world. As climate, supply chains, global conflicts, migration, and other issues continue to constrain the food supply, anticipating issues could be the difference between salvaging a crop or not. “We started creating advanced analytics by means of AI tools to not only notify real-time problems but also to predict what’s going to happen in the near and long future,” Acconcia said. “We are talking about the world’s food, and SuperAgroTech has the role to feed the world, and to overcome these biggest challenges.”
Quelle: Google Cloud Platform