• ISA provides technical resources and standards to help industrial automation professionals advance their careers and the field. We enable automation professionals worldwide to solve problems and enhance their skills by bringing people together to create new technologies and share best practices with future automation professionals.
    • Industry Insights

  • We attract over 140,000 unique automation professionals monthly, making us the premier online content provider and the only dedicated electronic magazine in the automation industry.

    Monthly Magazine

    • More things to read

    Back
    Back
  • M logo for Automation.com Monthly. Link to current issue.

The Shrinking AI Revolution: Why Bigger Isn’t Always Better, Especially in Manufacturing

By: Nikhil Makhija
24 March, 2026
3 min read
Feature Image for The Shrinking AI Revolution: Why Bigger Isn’t Always Better, Especially in Manufacturing
As manufacturers embrace Industry 4.0, smart factories and connected operations, a different reality is emerging: smaller, more efficient AI models are often the better tool for the job.

In the world of Artificial Intelligence (AI), a belief persists that bigger models automatically mean better outcomes. The very term Large Language Model (LLM) reinforces this idea, suggesting that scale measured by billions of parameters is the ultimate advantage.

While frontier AI models with hundreds of billions of parameters are undeniably powerful, this assumption does not always hold — particularly in manufacturing environments, where latency, reliability, cost, data sovereignty and system integration matter as much as raw intelligence.

As manufacturers embrace Industry 4.0, smart factories and connected operations, a different reality is emerging: smaller, more efficient AI models are often the better tool for the job. These models are not only catching up in capability but are becoming strategically superior for many industrial use cases.

Let's explore the evolving balance between large and small AI models — and why efficiency, specialization and deployability are redefining AI value in modern manufacturing.

1. Small models are getting smarter — and that changes industrial AI economics

One of the most striking trends in AI today is how rapidly competent general intelligence is being compressed into smaller models. A commonly cited benchmark is the MMLU (Massive Multitask Language Understanding) test, which is a benchmark used to measure an AI's general-purpose ability. It consists of over 15,000 multiple-choice questions spanning subjects like math, history, law and medicine, requiring a combination of factual recall and problem-solving.

To put scores into perspective:

  • Random guessing: 25%
  • Average human: ~35%
  • Human domain expert: ~90%
  • Today’s frontier AI models: high 80s

This benchmark powerfully illustrates the pace of innovation. In 2020, the massive 175-billion-parameter GPT-3 scored 44% on the MMLU — respectable, but far from mastery. Now, let's use a score of 60% as the threshold for a "competent generalist." The progress in model efficiency has been staggering, with a rapid decrease in the model size required to pass this mark:

Advertisement
  • February 2023: Llama 1–65B •    Jul 2023: Llama 2–34B
  • September 2023: Mistral 7B
  • March 2024: Qwen 1.5 MoE (under 3B active parameters)

Why this matters for manufacturing

For manufacturers, this trend directly impacts where and how AI can be deployed:

  • AI can now run closer to the production line, not just in centralized clouds
  • Smaller models enable edge inference on factory-floor hardware
  • Costs drop enough to allow AI on a scale across plants, machines, and processes

This is foundational to smart and connected manufacturing, where AI must operate reliably within operational technology (OT) constraints—not just IT environments.

2. Value at a fraction of the cost: A perfect fit for smart factories

In real-world business tasks, smaller models often deliver results that are statistically indistinguishable from large models, while being dramatically cheaper and faster. Studies show:

  • Mistral 7B performs on par with GPT-3.5 Turbo for news summarization
  • Cost and latency improvements can exceed 30×
  • IBM Granite 13B models match or outperform models five times larger on enterprise Q&A tasks

Manufacturing implications

This efficiency advantage aligns perfectly with Industry 4.0 priorities, including:

  • Production reporting and shift handover summaries
  • Maintenance log analysis
  • Quality inspection documentation
  • Standard operating procedure (SOP) guidance
  • Supplier and material classification

In these scenarios, manufacturers do not need open-ended reasoning across the entire internet. They need fast, accurate, domain-specific intelligence — delivered reliably and economically.

3. Where large models still matter in industrial contexts

Despite the rise of small models, scale still matters for certain high-complexity manufacturing tasks.

Large models remain superior for:

  • Cross-domain engineering reasoning (e.g., linking mechanical, electrical, and software systems across product lifecycles)
  • Document-heavy compliance analysis (ISO standards, safety regulations, multi-hundred-page technical specifications)
  • Global operations and multilingual coordination (capturing nuanced language differences across regions and suppliers)

In practice, many manufacturers will adopt a hybrid AI architecture—using large models centrally and small models locally.

4. Small models are often preferable in Industry 4.0 and edge environments

In manufacturing, smaller models are not just good enough; they are often the only practical option.

On-Device and Edge AI in Smart Manufacturing Small models enable:

  • Real-time anomaly detection on machines
  • Low-latency operator assistance
  • Offline operation in air-gapped or safety-critical environments
  • Data privacy for proprietary production data

This is critical for:

  • Predictive maintenance
  • Computer vision-assisted inspections
  • AI copilots for technicians on the shop floor

Fine-tuned, manufacturing-specific AI

A 7B–13B model fine-tuned on:

  • Maintenance manuals,
  • Failure mode histories,
  • Sensor metadata and
  • Plant-specific SOPs

can outperform a general-purpose frontier model — because it knows your factory, not the internet. This aligns with the industry 4.0 principle of context-aware intelligence embedded into operations.

Figure 1: Hybrid AI architecture for smart manufacturing.

Conclusion: The Right AI Tool for the Right Manufacturing Job

The AI size debate is not about winners and losers; instead, it is about fit for purpose.

  • Large models excel at broad, exploratory reasoning
  • Small models dominate in cost, speed, deployability and industrial reliability

For manufacturers pursuing smart factories, connected assets and resilient operations, the future is not one massive model, but an ecosystem of right-sized AI — from cloud to edge, from enterprise planning to machine-level execution. As AI continues to shrink while becoming smarter, one question becomes central for manufacturing leaders: How will hyper-efficient, domain-specific AI embed directly into production systems redefine productivity, quality, and operational intelligence in the next phase of Industry 4.0?

Reference:

IBM Technology. Small vs. Large AI Models: Trade-offs & Use Cases Explained. (Jun. 10, 2025). Accessed: Jan. 7, 2026. [Online Video]. Available: https://www.youtube.com/watch?v=0Wwn5IEqFcg

Advertisement

Trending Articles

Advertisement

Related Articles

View all Articles and News
Advertisement
Advertisement