Hug Face’s Ben Bartenshaw talks about AI systems engineering

Machine Learning


Hugging Face’s Ben Burtenshaw recently presented the potential of AI agents in systems engineering, arguing that coding agents should be leveraged for these complex tasks. In his talk, Bartenshaw emphasized that AI agents are becoming increasingly capable, moving beyond simple code generation and into more advanced system-level engineering.

Hugging Face's Ben Burtenshaw talks AI systems engineering - AI Engineer

Hugging Face’s Ben Burtenshaw talks about AI systems engineering — From an AI engineer

Visual TL;DR. AI Agent for Engineering enables systems engineering tasks. Systems engineering tasks include custom kernels. Custom kernels lead to performance optimization. Agent benchmarks are required for systems engineering tasks. AI Agents for Engineering builds multi-agent labs. Multi-agent labs create automated research labs. Systems engineering tasks advance AI systems engineering. Autoresearch Labs powers AI systems engineering.

  1. AI Agents for Engineering: Coding agents that evolve beyond simple code generation
  2. Systems Engineering Tasks: Tackling complex engineering challenges, discovering APIs, and connecting systems
  3. Custom kernels: Optimize performance with code specific to specific hardware
  4. Performance optimization: Achieve faster execution through customized kernel development
  5. Agent Benchmarking: Measuring and Comparing AI Agent Capabilities and Performance
  6. Multi-agent labs: Building automated research labs with interconnected AI agents
  7. Autoresearch Labs: Enabling AI agents to conduct research and development autonomously
  8. AI systems engineering: Leveraging AI agents to design and implement complex systems

Visual TL;DR
Visual TL;DR—startuphub.ai AI Agent for Engineering enables systems engineering tasks. AI Agents for Engineering builds multi-agent labs. Multi-agent labs create automated research labs. Systems engineering tasks advance AI systems engineering. Autoresearch Labs powers AI systems engineering enable build create progress strengthen AI agents for engineering

system engineering tasks

Performance optimization

multi-agent lab

auto research lab

AI system engineering

From startuphub.ai · Publishers behind this format

Visual TL;DR—startuphub.ai AI Agent for Engineering enables systems engineering tasks. AI Agents for Engineering builds multi-agent labs. Multi-agent labs create automated research labs. Systems engineering tasks advance AI systems engineering. Autoresearch Labs powers AI systems engineering enable build create progress strengthen AI agentengineering

systemengineering tasks

performanceoptimization

multi-agent lab

auto research lab

AI systemengineering

From startuphub.ai · Publishers behind this format

Visual TL;DR—startuphub.ai AI Agent for Engineering enables systems engineering tasks. AI Agents for Engineering builds multi-agent labs. Multi-agent labs create automated research labs. Systems engineering tasks advance AI systems engineering. Autoresearch Labs powers AI systems engineering enable build create progress strengthen AI agents for engineering Coding agents that evolve beyond simple codegeneration system engineering tasks Tackling complex engineering challenges,Discover APIs, connect systems Performance optimization Achieve faster executionCustomized kernel development multi-agent lab Build an automated laboratoryInterconnected AI agents auto research lab Enabling AI agents to conduct research;develop autonomously AI system engineering Utilize AI agents for complex systemsDesign and implementation

From startuphub.ai · Publishers behind this format

Visual TL;DR—startuphub.ai AI Agent for Engineering enables systems engineering tasks. AI Agents for Engineering builds multi-agent labs. Multi-agent labs create automated research labs. Systems engineering tasks advance AI systems engineering. Autoresearch Labs powers AI systems engineering enable build create progress strengthen AI agentengineering coding agentevolve beyondSimple code… systemengineering tasks complex approachengineeringassignment,… performanceoptimization achieve fasterthrough executionTuned kernel… multi-agent lab buildingautomatic laboratoryinterconnected… auto research lab Enable AI agentconduct researchAnd development… AI systemengineering Utilization of AIagent of the complexSystem design and…

From startuphub.ai · Publishers behind this format

Visual TL;DR—startuphub.ai AI Agent for Engineering enables systems engineering tasks. Systems engineering tasks include custom kernels. Custom kernels lead to performance optimization. Agent benchmarks are required for systems engineering tasks. AI Agents for Engineering builds multi-agent labs. Multi-agent labs create automated research labs. Systems engineering tasks advance AI systems engineering. Autoresearch Labs powers AI systems engineering enable included leads to need build create progress strengthen AI agents for engineering Coding agents that evolve beyond simple codegeneration system engineering tasks Tackling complex engineering challenges,Discover APIs, connect systems custom kernel Optimize performance with special featuresCode for specific hardware Performance optimization Achieve faster executionCustomized kernel development Agent benchmark Measuring and comparing AI agentsability and performance multi-agent lab Build an automated laboratoryInterconnected AI agents auto research lab Enabling AI agents to conduct research;develop autonomously AI system engineering Utilize AI agents for complex systemsDesign and implementation

From startuphub.ai · Publishers behind this format

Visual TL;DR—startuphub.ai AI Agent for Engineering enables systems engineering tasks. Systems engineering tasks include custom kernels. Custom kernels lead to performance optimization. Agent benchmarks are required for systems engineering tasks. AI Agents for Engineering builds multi-agent labs. Multi-agent labs create automated research labs. Systems engineering tasks advance AI systems engineering. Autoresearch Labs powers AI systems engineering enable included leads to need build create progress strengthen AI agentengineering coding agentevolve beyondSimple code… systemengineering tasks complex approachengineeringassignment,… custom kernel optimizeperformance withSpecial code… performanceoptimization achieve fasterthrough executionTuned kernel… agentbenchmark measurement andAI agent comparisonAbility and… multi-agent lab buildingautomatic laboratoryinterconnected… auto research lab Enable AI agentconduct researchAnd development… AI systemengineering Utilization of AIagent of the complexSystem design and…

From startuphub.ai · Publishers behind this format

The role of AI agents in systems engineering

Burtenshaw emphasized that AI agents are no longer just tools for writing snippets of code. They are evolving into sophisticated collaborators capable of tackling complex engineering challenges. He pointed to the increasing acceptance of coding agents, citing the example of Andrej Karpathy and DHH, who have been using coding agents for many years. This acceptance is growing as agents demonstrate the ability to perform tasks such as discovering APIs, connecting systems, and even managing home automation devices.

Custom kernels and performance optimizations

A large portion of Burtenshaw’s presentation focused on creating and optimizing custom compute kernels specifically for AI workloads. He described the basic components of the kernel (functions compiled to run on the GPU and executed from Python) and emphasized the importance of optimizing them for efficiency. Burtenshaw showed how custom kernels, such as the popular Flash Attention, can significantly increase computational density, reduce the time it takes to communicate tensors, and ultimately keep the GPU running at optimal performance.

He also introduced Hugging Face’s “Kernel” library, a platform designed to make it easy to build computing kernels. This library aims to enforce a uniform and predictable structure, ensure reproducibility, provide native PyTorch compatibility, and encourage community sharing. Burtenshaw demonstrated how developers can publish their own kernels to the hub for others to access.

Benchmarks and agent performance

To illustrate the agent’s effectiveness in this area, Burtenshaw presented benchmark results. He shared how to use agents to generate, benchmark, and optimize CUDA kernels. In a particular example, we highlighted an average speedup of 1.94x on H100 GPUs for the Qwen3-8B model when using the agent to generate and optimize kernels. This demonstrates the tangible performance gains that can be achieved through agent-assisted engineering.

The power of multi-agent automated research labs

Mr. Bartenshaw also delved into the concept of a multi-agent automated research lab, outlining a system comprised of specialized agents working together. This system includes:

  • Researcher: Scout Hugface papers for ideas and define research directions.
  • planner: It acts as a central coordinator, owns the experimental cues, and proposes hypotheses.
  • Worker agent: Run experiments, get code, and test hypotheses.
  • Reporter: Monitor job progress, synchronize status, and provide an overview of active jobs and anomalies.

This multi-agent approach enables systematic and automated exploration of hyperparameters and model architectures, resulting in a more efficient and effective research cycle. Monitoring and visualizing these experiments using tools like Trackio can provide important insights into the research process.

Important points

Mr. Burtenshaw concluded with several important points.

  • Agents work best when using primitives and public, well-defined interfaces rather than overly abstract interfaces.
  • Hugging Face Hub is a robust platform ready to support AI workloads with core infrastructure for storage, compute, and versioning.
  • Multi-agent systems can be effectively built with specialized roles to automate and accelerate AI research.

The presentation highlighted the increasing capabilities of AI agents in systems engineering, highlighting their potential to drive efficiency and innovation in the field.

© 2026 StartupHub.ai. Unauthorized reproduction is prohibited. Please do not type, scrape, copy, reproduce or republish this article in whole or in part. Use for AI training, fine-tuning, search enhancement generation, or as input to any machine learning system is prohibited without a written license. Substantially similar derivative works will be pursued to the fullest extent of applicable copyright, database, and computer abuse laws. See our Clause.



Source link