Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

Introduction to AI for Official Statistics

AI for Official Statistics cover art

Brock Webb


This book exists to help move official statistics beyond the traditional, into applied machine learning and onto GenAI.

Federal statistical agencies collect, process, and publish some of the most consequential data in the world. The methods behind that work are evolving. Machine learning, large language models, and agentic AI are not future possibilities; they are current tools with real applications in survey operations, data processing, and statistical production.

This book covers what those methods are, how they work, when they apply, and when they do not. Every chapter includes plain-language explanation, practical context for federal operations, and guidance on explaining methods to leadership and stakeholders.

All examples use public datasets from the Census Bureau, Bureau of Labor Statistics, and other federal sources. No proprietary data, no restricted access required.


How This Book Is Organized

Fifteen chapters across five parts, each building on the previous.

These examples use public U.S. Census Bureau data because it is public and accessible through a stable API that is well documented. The concepts — when to apply machine learning, how to evaluate AI systems, what questions to ask — apply across all of official statistics.


Contents