On this page
TL;DR for the time-pressed
An AI Bill of Materials (AIBOM) is a structured, machine-readable inventory of every component that contributes to an AI system - foundation models, fine-tuned variants, training datasets, embeddings, vector stores, evaluation suites, and the software dependencies that surround them. The federal anchor is NTIA's Minimum Elements for an SBOM (July 2021), the AI-extending standards are NIST AI 100-1 (January 2023) together with SPDX 3.0 and CycloneDX 1.6, and the regulatory pressure point is Executive Order 14110 (October 2023) reporting obligations for frontier models. This playbook covers what to ship now - the formats, the fields, the CI integration, and the contractual asks for AI vendors. Updated 2026-05-20.
AIBOM vs SBOM: what is actually different
An SBOM lists software components; an AIBOM lists software, data, and model components. The structural difference is that AI systems carry three additional categories of dependencies that have no clean analogue in classical software:
- Models and weights. Foundation models, fine-tuned checkpoints, LoRA adapters, quantised variants. Each is a unique component with its own provenance, license, and risk profile.
- Training data and evaluation data. Datasets used at pre-training, fine-tuning, RLHF, and evaluation. The license, consent basis, and known biases of each dataset propagate to the resulting model.
- Embeddings and vector stores. Vector indices created from proprietary data are a distinct asset that an AI system depends on; they often contain personal data or trade secrets and require their own lifecycle.
The NTIA SBOM Minimum Elements (supplier, component name, version, unique identifier, dependency relationship, author of the SBOM, timestamp) translate cleanly to AI components, but the data and model categories also require new fields - training-data lineage, evaluation metrics, license-for-output, and the model's safety classification. SPDX 3.0 and CycloneDX 1.6 both ship AI/ML profiles that codify these new fields.
The trap to avoid: shipping an SBOM that excludes the model and pretending it is an AIBOM. The whole point of an AIBOM is to give a procurer or auditor a complete picture of the AI system's supply chain, including the parts that are not software. Areebi customers ship one consolidated AIBOM per deployed AI system.
NTIA minimum elements adapted for AI
The 2021 NTIA Minimum Elements are still the federal baseline. The table below maps each element to its AI-adapted form, with concrete examples of what a CISO should expect to see in an AIBOM record.
| NTIA element | SBOM (classical software) | AIBOM (AI system) |
|---|---|---|
| Supplier name | OpenSSL Foundation | OpenAI, Anthropic, Meta, Hugging Face dataset uploader |
| Component name | openssl | gpt-4o-2024-05-13, claude-3-7-sonnet-20250219, llama-3-70b-instruct, common-crawl-2024-q4 |
| Version of component | 3.2.1 | Model version + checkpoint + quantisation; dataset snapshot date |
| Unique identifier | CPE, PURL, SWID | PURL for HF/registry models, content hash for weights, dataset DOI |
| Dependency relationship | Includes openssl | Fine-tuned-from llama-3-70b; embeddings-from text-embedding-3-large; trained-on common-crawl + first-party-tickets |
| Author of SBOM | The vendor or build system | The AI vendor or the deploying controller; signed and timestamped |
| Timestamp | Build time | Build time + training cutoff + last-evaluation time |
Areebi's compliance team operationalises this mapping by requiring every AI vendor to deliver an SPDX 3.0 or CycloneDX 1.6 document with the AI profile fields populated. The policy engine refuses to register an AI vendor without an AIBOM attached.
SPDX 3.0 AI profile: the field-level breakdown
SPDX 3.0 (April 2024) introduces an explicit AI profile that adds AI-specific classes on top of the core SPDX model. The key classes a CISO will see in a well-formed SPDX 3.0 AIBOM are AIPackage, DatasetPackage, and the relationships between them. The field-level expectations:
- AIPackage. Includes typeOfModel (eg. language model, image model), modelExplainability, energyConsumption, safetyRiskAssessment, useSensitivePersonalInformation, standardCompliance (eg. NIST AI 100-1, ISO/IEC 42001), limitation, hyperparameter list, modelDataPreprocessing, intendedUse, and informationAboutTraining.
- DatasetPackage. Includes datasetType (eg. training, evaluation, fine-tuning), datasetSize, datasetUpdateMechanism, sensor used to collect, sensitivePersonalInformation, anonymisationMethodUsed, knownBias, intendedUse, and confidentiality level.
- Relationships. trainedOn, fineTunedFrom, evaluatedOn, deployedWith, hasPackagePurpose. These relationships are how an AIBOM expresses the dependency graph of an AI system.
Source: SPDX 3.0 specification published April 2024 by the Linux Foundation under ISO/IEC 5962:2021 framework. The Areebi platform stores the parsed SPDX 3.0 document per vendor and exposes it to procurement, security, and audit teams via the audit log.
CycloneDX 1.6 ML-BOM: components and the model card link
CycloneDX 1.6 (March 2024) ships an ML-BOM extension with a typed component model that is more procurement-friendly than SPDX 3.0 for some teams. The component types relevant to AI:
- component type=machine-learning-model. Captures the model identifier, version, and a structured modelCard subobject covering modelParameters, datasets, considerations (ethical, fairness, environmental), and quantitativeAnalysis (performance metrics, fairness metrics).
- component type=data. Captures dataset metadata - name, type, source, license, contents, governance, sensitivity, classification.
- component type=cryptographic-asset. Useful for model signatures and dataset hashes, allowing supply-chain integrity verification.
Source: CycloneDX v1.6 specification published by OWASP/CycloneDX Working Group in March 2024. The practical advantage of CycloneDX over SPDX 3.0 for many teams is that the modelCard subobject mirrors the Model Cards proposal from Mitchell et al (2019), so security and ML teams can reuse existing model card content directly.
See Areebi in action
Get a 30-minute personalised demo tailored to your industry, team size, and compliance requirements.
Get a DemoEO 14110 contractor reporting: what is actually required
Executive Order 14110 (Safe, Secure, and Trustworthy Development and Use of AI, October 30, 2023) imposed a set of reporting obligations on developers of dual-use foundation models and the federal contractors who depend on them. The relevant Section 4.2(a)(i) defines the threshold (models trained using a quantity of computing power greater than 10^26 integer or floating-point operations, or biological sequence models above defined thresholds) and requires:
- Notification of red-team test results. Developers must report the results of red-team testing of dual-use foundation models to the federal government.
- Information about training runs. Developers must report the location, scale, and ownership of compute clusters used for training above-threshold models.
- Information about cybersecurity protections. Developers must report on cybersecurity protections for training pipelines and model weights.
The status of EO 14110 in 2026 is itself a moving target - portions have been adjusted under subsequent executive action - but the operational expectation has not changed: federal contractors are expected to be able to produce AIBOM artefacts on demand to demonstrate the integrity, provenance, and safety profile of any AI system used on federal data. The Areebi OMB M-24-18 federal contractor checklist walks through the operational pattern.
ISO/IEC 5230 OpenChain and the open-source license dimension
The licence dimension of an AIBOM is where many teams stumble. Foundation models ship under a wide range of licences (Llama Community License, Apache 2.0, MIT, OpenRAIL, custom commercial), datasets often have ambiguous or undocumented licences, and the resulting compositions can create derivative-work obligations that the deploying organisation never reviewed.
ISO/IEC 5230:2020 (OpenChain Specification) is the conformance standard for open-source licence compliance programmes. Procurement and legal teams treating ISO/IEC 5230 as the baseline for software licence governance should extend it to AI components. The Areebi platform stores the parsed licence for every AI component in the AIBOM and surfaces conflicts (eg. a non-commercial-licensed model used in a commercial product) to legal review.
How to generate an AIBOM in CI
The AIBOM should be a build-time artefact, not a periodic spreadsheet. The pattern Areebi customers ship in production:
- Inventory the components. Every model, dataset, embedding store, evaluation suite, and inference dependency in the build is registered in a manifest file (typically YAML).
- Resolve provenance. CI fetches the SPDX or CycloneDX document for each upstream component (model card, dataset card, vendor SBOM) and stitches them into the build manifest.
- Hash and sign. Each component's content hash is computed and verified against the supplier's published hash. The resulting AIBOM is signed with the build system's key.
- Publish. The signed AIBOM is published to an artefact store and attached to the release. Areebi's audit log records the AIBOM hash so an auditor can verify the AIBOM that was actually shipped.
- Diff on every release. CI diffs the new AIBOM against the previous release and fails the build if there are unapproved changes (new dataset, new model variant, licence drift).
The Areebi compliance team designs this control so the AIBOM is treated as a release artefact with the same rigour as the signed binary.
What to ask AI vendors before signing
Most procurement teams sign AI vendor contracts without an AIBOM clause. That is the single biggest unforced error in 2026 AI procurement. The clauses Areebi recommends:
- SBOM/AIBOM delivery. Vendor delivers a current SPDX 3.0 AI profile or CycloneDX 1.6 ML-BOM document for every model version. Initial delivery within 30 days; updates within 14 days of any material change.
- Training-data warranty. Vendor warrants that training data was obtained on a lawful basis and that the vendor can identify the sources to the customer on request. This is the operational answer to the EDPB Opinion 28/2024 cross-phase contamination doctrine (see our EDPB Opinion 28/2024 playbook).
- Red-team evidence. Vendor delivers summary red-team and evaluation results for safety-relevant test suites (eg. red-teaming guide).
- Licence transparency. Vendor lists the licence of every model, dataset, and evaluation component that ships in the AIBOM, including any third-party licences that propagate.
- Notification of material change. Vendor notifies customer of any material change to the model architecture, training data, or safety profile.
Areebi's vendor onboarding workflow captures each of these as gating conditions; the policy engine refuses to enable an AI vendor that has not delivered the required artefacts.
Mapping AIBOM fields to NIST AI 100-1
NIST AI 100-1 (AI RMF 1.0, January 2023) does not mandate an AIBOM, but every function in the AI RMF assumes one. The mapping below is what Areebi customers use to defend their AI RMF programme to auditors.
| NIST AI RMF function | AIBOM field that supports it |
|---|---|
| Govern (GV-1, GV-2) | Component inventory, ownership, accountability |
| Map (MP-2, MP-3) | Training-data lineage, intended use, deployment context |
| Measure (MS-2, MS-3) | Evaluation suites, fairness metrics, performance metrics |
| Manage (MG-2, MG-4) | Change management, incident response, vendor escalation |
The Areebi NIST AI RMF hub and the function-by-function deep dives (Govern, Map, Manage) walk through the auditor's checklist.
Areebi's point of view
An AIBOM is not a compliance artefact - it is a control surface. Once you can enumerate every model, dataset, and embedding store in your AI estate, you can enforce policy on each one: licence checks, data residency, training-data lineage, red-team evidence, and notification of material change. Areebi's policy engine treats the AIBOM as the source of truth for what an AI system actually contains, and refuses to let a vendor or model into production without a complete, signed, current AIBOM. This is the single highest-leverage control we have shipped to regulated customers in the past year, and the reason most enterprise AI security questionnaires now lead with an AIBOM question.
Frequently Asked Questions
Do I need both an SBOM and an AIBOM?
Yes. An SBOM covers the classical software components that surround an AI system; an AIBOM extends to models, datasets, embeddings, and evaluation suites. Most teams ship a combined document under SPDX 3.0 or CycloneDX 1.6, which support both software and AI components in one bill of materials. The Areebi platform stores both as one consolidated artefact per deployed system.
Which format should I require from AI vendors - SPDX 3.0 or CycloneDX 1.6?
Both are acceptable. SPDX 3.0 is the ISO/IEC 5962 standard with an explicit AI profile; CycloneDX 1.6 has the ML-BOM extension with a typed component model that is more procurement-friendly for some teams. Areebi recommends asking vendors to deliver either, and storing the parsed document in the AI vendor record so security, legal, and procurement can all consume the same source of truth.
Does EO 14110 require me to publish an AIBOM?
EO 14110 does not mandate public AIBOM publication; it imposes reporting obligations on developers of dual-use foundation models above the 10^26 FLOP threshold and operational expectations on federal contractors. The practical pattern in 2026 is that federal contractors are expected to be able to produce AIBOM artefacts on demand for any AI system used on federal data. Status of EO 14110 has shifted under subsequent executive action; AIBOM-as-control persists regardless of the specific executive order in force.
How often should the AIBOM be regenerated?
On every release of the AI system, automatically in CI. Vendor-supplied AIBOMs should be updated within 14 days of any material change (new training data, new model variant, new evaluation result). Areebi's audit log records the AIBOM hash with each release so auditors can verify the exact AIBOM that was shipped.
What happens if a model in my AIBOM is found to have a contaminated training dataset?
The AIBOM is the trigger for the incident response. The deploying controller uses the dependency graph in the AIBOM to identify every downstream artefact (fine-tuned variants, embeddings) that may also be contaminated, and either retrains, applies mitigations, or removes the affected components. The Areebi <a href='/blog/data-poisoning-enterprise-defense'>data poisoning defence guide</a> walks through the response pattern.
Does ISO/IEC 42001 reference AIBOM directly?
ISO/IEC 42001:2023 does not use the term AIBOM, but its Annex A controls on AI system documentation, supplier management, and data resources are best satisfied by an AIBOM. Auditors increasingly treat AIBOM as the operational evidence behind ISO/IEC 42001 Annex A.6 (supplier relationships) and A.7 (data for AI systems).
Related Resources
Stay ahead of AI governance
Weekly insights on enterprise AI security, compliance updates, and governance best practices.
Stay ahead of AI governance
Weekly insights on enterprise AI security, compliance updates, and best practices.
About the Author
Areebi Research
The Areebi research team combines hands-on enterprise security work with deep AI governance research. Our analysis is informed by primary sources (NIST, ISO, OECD, federal registers, IAPP) and the operational realities of CISOs running AI programs in regulated industries today.
Ready to govern your AI?
See how Areebi can help your organization adopt AI securely and compliantly.