[Congressional Bills 119th Congress]
[From the U.S. Government Publishing Office]
[H.R. 9341 Introduced in House (IH)]
<DOC>
119th CONGRESS
2d Session
H. R. 9341
To require the Director of the National Institute of Standards and
Technology to develop guidelines to assist agencies with preparing open
Government data assets to be used to train artificial intelligence
models, and for other purposes.
_______________________________________________________________________
IN THE HOUSE OF REPRESENTATIVES
June 18, 2026
Mr. Babin (for himself and Ms. Lofgren) introduced the following bill;
which was referred to the Committee on Science, Space, and Technology
_______________________________________________________________________
A BILL
To require the Director of the National Institute of Standards and
Technology to develop guidelines to assist agencies with preparing open
Government data assets to be used to train artificial intelligence
models, and for other purposes.
Be it enacted by the Senate and House of Representatives of the
United States of America in Congress assembled,
SECTION 1. SHORT TITLE.
This Act may be cited as the ``AI-Ready Federal Data Guidelines
Act''.
SEC. 2. AI-READY DATA GUIDELINES FOR FEDERAL AGENCIES.
(a) In General.--The National Institute of Standards and Technology
Act is amended by inserting after section 21 (15 U.S.C. 278g-4) the
following new section:
``SEC. 21A. AI-READY DATA GUIDELINES FOR FEDERAL AGENCIES.
``(a) Development of Guidelines.--
``(1) In general.--The Director, in consultation with the
Director of the Office of Science and Technology Policy, the
Secretary of Energy, the Director of the Office of Management
and Budget, and the head of any other Federal agency the
Director considers appropriate, shall develop voluntary
guidelines to assist agencies with preparing datasets,
including open Government data assets, to be used to train
artificial intelligence models.
``(2) Elements.--In developing the guidelines under
paragraph (1), the Director shall carry out the following:
``(A) Address, to the extent practicable, the
following:
``(i) Data formatting and structure,
including guidelines to ensure datasets are
interpretable by artificial intelligence
systems.
``(ii) Data labeling and annotation,
including scalable methods such as
programmatic, automated, and expert-guided
approaches for preparing data for use in
artificial intelligence development.
``(iii) Data quality evaluation, including
guidelines to assess the suitability of
datasets for use in artificial intelligence
systems.
``(iv) Metadata and documentation,
including information sufficient to enable
appropriate interpretation and use of datasets
for artificial intelligence development.
``(v) Data maintenance, including guidance
for the ongoing management and updating of
datasets to ensure continued suitability for
use in artificial intelligence systems.
``(vi) Data availability, including
guidelines for improving and expanding
automated access to publicly available
information for artificial intelligence model
development and use.
``(B) Enable flexible implementation, to the extent
practicable, for various use cases across sectors and
scientific domains.
``(C) Ensure, to the extent practicable,
consistency with Circular A-119 of the Office of
Management and Budget.
``(D) Address conformity assessment procedures, to
the extent practicable.
``(b) Pilot Programs for AI-ready Data Guidelines.--
``(1) In general.--The Director, in coordination with the
Director of the Office of Science and Technology Policy, the
Secretary of Energy, and the head of any other Federal agency
the Director determines appropriate, may carry out pilot
programs to support the development of conformity assessment
procedures for AI-ready datasets used in specific sectors and
scientific domains.
``(2) Requirements.--If pilot programs under this
subsection are carried out, such programs shall--
``(A) not exceed one year in duration;
``(B) develop supplemental guidelines for AI-ready
datasets used in specific sectors and scientific
domains in accordance with the guidelines published
under subsection (a);
``(C) assess the impact of such guidelines on data
usability, interoperability, and readiness for use in
artificial intelligence systems in specific sectors and
scientific domains;
``(D) identify technical, operational, or resource
challenges associated with future implementation and
maintenance of such guidelines in specific sectors and
scientific domains; and
``(E) develop, as practicable and appropriate, a
process and materials for the transition to an
appropriate non-Federal entity of such guidelines for
future implementation of and updates to such
guidelines.
``(3) Selection of topics.--If pilot programs under this
subsection are carried out, such programs shall prioritize
areas--
``(A) with significant national security and
industrial competitiveness implications, such as
biotechnology and biomanufacturing; and
``(B) with respect to which Federal agencies
control and maintain AI-ready datasets.
``(4) Participation.--If pilot programs under this
subsection are carried out, the Director shall carry out not
more than two concurrent such programs through federally funded
research programs, National Laboratories, institutions of
higher education, or partnerships with the private sector.
``(c) Congressional Briefings.--Not later than one year after the
publication of the guidelines under subsection (a) and annually
thereafter for five years, the Director shall brief the Committee on
Science, Space, and Technology of the House of Representatives and the
Committee on Commerce, Science, and Transportation of the Senate on the
implementation of this section.
``(d) Prohibition.--The Director may not transfer or reprogram any
funds from any other program, project, office, or other entity or
activity of the Institute to carry out this section.
``(e) Definitions.--In this section:
``(1) Agency.--The term `agency' has the meaning given such
term in section 3502 of title 44, United States Code.
``(2) Artificial intelligence.--The term `artificial
intelligence' has the meaning given such term in section 5002
of the National Artificial Intelligence Initiative Act of 2020
(15 U.S.C. 9401).
``(3) Artificial intelligence model.--The term `artificial
intelligence model' means a component of an artificial
intelligence system that is--
``(A) derived using mathematical, computational,
statistical, or machine-learning techniques; and
``(B) used as part of an artificial intelligence
system to produce outputs from a given set of inputs.
``(4) Artificial intelligence system.--The term `artificial
intelligence system' means any data system, software, hardware,
application, tool, service, or utility that operates in whole
or in part using artificial intelligence.
``(5) Biomanufacturing.--The term `biomanufacturing' has
the meaning given such term in section 10002 of the Research
and Development, Competition, and Innovation Act (42 U.S.C.
18901; popularly referred to as the `CHIPS and Science Act').
``(6) Conformity assessment procedure.--The term
`conformity assessment procedure' has the meaning given such
term in section 451 of the Trade Agreements Act of 1979 (19
U.S.C. 2571).
``(7) Director.--The term `Director' means the Director of
the National Institute of Standards and Technology.
``(8) Institution of higher education.--The term
`institution of higher education' has the meaning given such
term in section 101 of the Higher Education Act of 1965 (20
U.S.C. 1001).
``(9) National laboratory.--The term `National Laboratory'
has the meaning given such term in section 2 of the Energy
Policy Act of 2005 (42 U.S.C. 15801).
``(10) Open government data asset.--The term `open
Government data asset' has the meaning given such term in
section 3502 of title 44, United States Code.''.
(b) Conforming Amendment.--Subsection (f) of section 22A of the
National Institute of Standards and Technology Act (15 U.S.C. 278h-1)
is repealed.
<all>