A guide to deploying AI in edge computing environments

Combining AI with edge computing can be complicated: the edge is a place where resource costs must be controlled, and where other IT optimization initiatives, such as cloud computing, are difficult to apply.

Edge computing is an application deployment model in which some or all of an application is hosted close to the real-world systems it is designed to support. These applications are often referred to as real-time and IoT because they interact directly with real-world elements such as sensors and effectors and require high reliability and low latency.

The edge is typically on premises, close to users and processes, and often runs on a small server with limited system software and performance. This local edge is often linked to other application components running in the cloud.

As AI grows in power and complexity, opportunities for edge deployment scenarios increase. Deploying AI in edge computing environments offers a wide range of benefits across industries, but successful implementation requires specific capability and platform considerations.

The Benefits of AI in Edge Computing

AI deployed in edge computing environments (also known as edge AI) offers many benefits.

For edge applications that process events and return commands to effector devices or messages to users, edge AI enables better and more flexible decision-making than simple edge software alone can provide. This includes applications that correlate events from one or more sources before generating a response, or that require complex analysis of event content.

Other benefits of AI in edge computing environments include:

  • Improved speed.
  • Stronger privacy.
  • Improved application performance.
  • Reduced latency and costs.

Considerations for AI in Edge Computing

When deploying AI for real-time edge computing, organizations must address two key technical constraints: the hosting requirements of the AI tools relative to edge system capabilities, and the application's latency budget.

Hosting Requirements

Most machine learning tools can run on server configurations suitable for edge deployment because they don't require banks of GPUs. Researchers are also developing less resource-intensive versions of more complex AI tools, such as the large language models popularized by generative AI services, that can run on local edge servers, provided the system software is compatible.

If the required AI capabilities are not available in a form suitable for local edge server deployment, events can instead be passed to the cloud or data center for processing, as long as the application's latency budget is still met.

Latency Budget

The latency budget is the maximum time your application can tolerate between receiving an event that triggers processing and returning a response to the real-world system that generated it. This budget must cover transmission time plus all processing time.

A latency budget can be a soft constraint that, if missed, merely delays an activity, as in an application that reads a vehicle's RFID tag or manifest barcode and routes the vehicle for unloading. Or it can be a hard constraint whose violation leads to catastrophic failure, as in applications that dump dry materials into a moving rail car or merge a vehicle into high-speed traffic.
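To make the budget concrete, here is a minimal Python sketch of how an edge application might enforce it. The budget value, the constraint flag, and the handler functions are illustrative assumptions, not part of any particular platform:

```python
import time

# Illustrative values; a real deployment derives these from the physical process.
LATENCY_BUDGET_S = 0.050   # 50 ms between event creation and the response
HARD_CONSTRAINT = True     # True for, e.g., dumping material into a moving rail car

def run_inference(event):
    # Stand-in for invoking the locally hosted AI model.
    return {"command": "proceed", "event": event}

def fail_safe(event):
    # Under a hard constraint, a late answer is as bad as a wrong one.
    return {"command": "abort", "event": event}

def handle_event(event):
    # event["sent_at"] is assumed to be stamped at the sensor with a clock the
    # edge server shares, so the elapsed time covers transmission plus processing.
    result = run_inference(event)
    elapsed = time.monotonic() - event["sent_at"]
    if elapsed > LATENCY_BUDGET_S and HARD_CONSTRAINT:
        return fail_safe(event)
    # Under a soft constraint, a late response only delays the activity.
    return result

print(handle_event({"sent_at": time.monotonic(), "payload": "rfid-read"}))
```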

When to Bring AI to the Edge

Deciding whether to host AI at the edge requires balancing the compute power available at a particular point against the round-trip latency between that point, the triggering event source, and the response destination. The larger the latency budget, the more flexibility you have in placing your AI processes, and the more power you can bring to your application.
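This tradeoff can be expressed as a simple feasibility check. The sketch below uses hypothetical hosting points with made-up latency and capability figures, and picks the most capable point that still fits the budget:

```python
# Hypothetical hosting points; the latency and capability numbers are
# illustrative assumptions, not measurements.
CANDIDATES = [
    # (location, round-trip network latency in s, processing time in s, capability)
    ("local-edge", 0.002, 0.020, 1),    # small model, almost no network delay
    ("data-center", 0.025, 0.012, 2),
    ("cloud", 0.040, 0.010, 3),         # biggest model, most network delay
]

def place_ai(latency_budget_s):
    """Pick the most capable hosting point that still fits the budget."""
    feasible = [c for c in CANDIDATES if c[1] + c[2] <= latency_budget_s]
    if not feasible:
        raise RuntimeError("no hosting point can meet the latency budget")
    return max(feasible, key=lambda c: c[3])

print(place_ai(0.030))  # tight budget: only the local edge qualifies
print(place_ai(0.080))  # looser budget: the cloud-hosted model wins
```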

While some IoT systems process events individually, complex event correlation is useful in other applications, for example in traffic control, where optimal commands depend on events from multiple sources such as traffic sensors.
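As a rough illustration, the following sketch correlates vehicle counts from two hypothetical approach sensors within a time window before issuing a signal command. The sensor names, window, and decision rule are assumptions, not a real traffic-control algorithm:

```python
WINDOW_S = 2.0                                    # correlation window in seconds
REQUIRED = {"north-approach", "south-approach"}   # sources that must report

class Correlator:
    def __init__(self):
        self.pending = {}  # sensor_id -> (timestamp, vehicle_count)

    def add(self, sensor_id, timestamp, vehicle_count):
        self.pending[sensor_id] = (timestamp, vehicle_count)
        # Drop readings that have aged out of the correlation window.
        self.pending = {s: (t, v) for s, (t, v) in self.pending.items()
                        if timestamp - t <= WINDOW_S}
        if REQUIRED <= self.pending.keys():
            return self._decide()
        return None  # wait for events from the remaining sources

    def _decide(self):
        # Stand-in rule: give the green light to the busier approach.
        north = self.pending["north-approach"][1]
        south = self.pending["south-approach"][1]
        return "green-north" if north >= south else "green-south"

c = Correlator()
print(c.add("north-approach", 0.0, 7))   # None: south not yet seen
print(c.add("south-approach", 1.0, 3))   # 'green-north'
```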

Event content analysis is also crucial in healthcare: For example, AI can analyze blood pressure, pulse, and respiration and raise an alarm if the current readings, trends in readings, or the relationship between different health indicators occurring simultaneously indicate that a patient is at risk.
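A much-simplified sketch of this kind of content analysis follows; the thresholds and rules are placeholders, not clinical guidance:

```python
def assess_vitals(history):
    """history: list of (systolic_bp, pulse, respiration) readings, oldest first."""
    systolic, pulse, respiration = history[-1]
    alarms = []
    # 1. Current readings out of range.
    if systolic < 90:
        alarms.append("low blood pressure")
    if pulse > 120:
        alarms.append("elevated pulse")
    if respiration > 30:
        alarms.append("rapid respiration")
    # 2. Trend across recent readings.
    recent_bp = [bp for bp, _, _ in history[-3:]]
    if len(recent_bp) == 3 and recent_bp[0] > recent_bp[1] > recent_bp[2]:
        alarms.append("falling blood pressure trend")
    # 3. Relationship between indicators occurring at the same time.
    if systolic < 100 and pulse > 110:
        alarms.append("low pressure with high pulse")
    return alarms

print(assess_vitals([(118, 80, 16), (104, 102, 22), (96, 118, 26)]))
```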

If your latency budget allows, you can also access databases stored locally, in the cloud, or in a data center. For example, an edge application can read a delivery truck's RFID tag to retrieve a copy of the manifest, then use its contents to direct the truck to a bay, dispatch workers to unload it, and generate cargo-handling instructions.

Even when AI is hosted at the edge rather than in the cloud or a data center, edge applications often generate traditional transactions from events in addition to local processing and turnaround. Organizations need to consider the relationship between edge hosting, AI, and transaction-oriented processing when planning for edge AI.
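One common way to structure this relationship is to answer the device on a fast local path and queue the traditional transaction for asynchronous delivery. A minimal sketch, assuming a hypothetical back-end poster:

```python
import queue
import threading

transactions = queue.Queue()

def backend_worker():
    # In a real system, this would post each record to a cloud or data center
    # transaction service; printing stands in for that call.
    while True:
        record = transactions.get()
        print("posting transaction:", record)
        transactions.task_done()

threading.Thread(target=backend_worker, daemon=True).start()

def run_local_inference(event):
    return "route-to-bay-4"  # stand-in for the local model's decision

def handle_event(event):
    command = run_local_inference(event)                     # fast path, within budget
    transactions.put({"event": event, "command": command})   # slow path, queued
    return command                                           # respond without waiting

print(handle_event({"truck": "rfid-1138"}))
transactions.join()  # demo only: flush the queued transaction before exiting
```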

[Figure: edge computing architecture, with connected devices such as cars and buildings linked to cloud data centers. Edge computing brings data processing closer to the data source.]

Choosing an Edge AI Platform

A key consideration when choosing an edge AI platform is how it will be integrated and managed. For edge AI that is loosely linked with the cloud or data center, purpose-built platforms such as Nvidia EGX are optimized for both low latency and AI. For edge AI that is tightly coupled with other application components in the cloud or data center, real-time Linux variants are easier to integrate and manage.

If a public cloud provider offers an edge component (such as AWS IoT Greengrass or Microsoft's Azure IoT Edge), it is possible to split AI capabilities across the edge, cloud, and data center. This approach also streamlines AI tool selection for edge hosting: where available, organizations can simply choose AI tools included in the provider's edge package.

Most edge AI hosting will likely use simpler machine learning models that are less resource-intensive and can be trained to handle most event processing. Deep learning models require more hosting power, but depending on model complexity, edge server hosting may be practical. LLMs and other generative AI models are the hardest to distribute to the edge and currently may require cloud or data center hosting for full implementation.

Finally, consider how the edge resources used with AI will be managed. AI itself does not demand a management approach different from other forms of edge computing, but choosing a platform specialized for edge AI may bring its own management approaches and tools.

Tom Nolle is founder and principal analyst at Andover Intel, a consulting and analytics firm that looks at evolving technologies and applications first from the perspective of buyers and their needs. With a background as a programmer, software architect, and software and network product manager, Nolle has provided consulting services and technology analysis for decades.


