Stop guessing, start measuring: USA Today on AI in the newsroom

Jessica Davis, Vice President of AI Product at USA Today, came with a practical framework, and a case study from her own newsroom.

Her argument is straightforward: evaluations – not goodwill or a journalist’s thumbs up – are the foundation any newsroom needs before it can trust AI at scale.

Davis was speaking at our ongoing World News Media Congress in Marseille, where she previewed her Capstone research at the City University of New York (CUNY).

What she brought to the stage was less a polished product announcement than a frank, practical account of what it actually takes to move AI from experimentation into production.

The ceiling on human-in-the-loop

Most newsrooms today are working with what Davis calls assistive AI: tools that talk back, generate text, surface information. You prompt it, you get an output, you decide what to do with it. It’s useful, but it puts all the cognitive weight on the journalist.

The direction of travel is somewhere more consequential: autonomous, agentic AI that doesn’t wait to be prompted, but takes action on a goal.

“Assistive AI is like the mouth – it can talk to you, but you have to copy and paste and take the action yourself,” Davis said. “Agentic AI is more like the hands. You give it a goal, and it can take action for you.”

The problem is that the current model of a human reviewing every output doesn’t scale. And without a smarter approach to oversight, the human in the loop becomes a bottleneck and eventually a burnout.

“We have trusted products. Trust is a key asset to our organisation. When we’re working with AI systems, they can be wrong, and they can be confidently wrong. And sometimes it’s subtle when they fail.”

The public records lesson

The most instructive part of Davis’s talk was a case study from USA Today’s own newsroom. The organisation built an agent to help journalists navigate public records requests, a workflow riddled with complexity, since laws vary across all 50 states and officials can reject requests that cite the wrong statute.

The team spent months building and testing. “The agent kept hallucinating. It kept getting things slightly wrong. And slightly wrong, in a public records context, means the request fails,” Davis noted.

Then they introduced evaluations – a structured method for defining exactly what success looked like and measuring against it.

“We moved from months to being able to ship to production within a week. And from there, we shipped multiple features within days,” she added.

What good evaluation actually looks like

Davis was careful to distinguish between what’s possible at scale – automated AI scoring AI – and what’s accessible to most newsrooms right now.

“You don’t need a data science team to start. You need a definition of success that’s specific enough to measure,” she pointed out.

The instinct in many newsrooms is to ask journalists for feedback via a thumbs up or thumbs down. Davis is clear that this doesn’t work. On the other end, detailed feedback forms are a non-starter too – journalists were too busy for that.

The solution was structured criteria developed with journalists.

“So, we brought a group of journalists into the process early, worked through what a successful public records request actually required, and built those requirements into the evaluation framework,” said Davis.

(L-R) Pundi S. Sriram (Chief Product Officer & Business Head, STEP from The Hindu Group, India), Jessica Davis (Vice President, AI Product, USA TODAY Co., USA AI Product Newsroom), Jan Helin (Chief Product Officer, Bonnier News, Sweden), and Kevin Anderson (Director of the Digital Revenue Network, WAN-IFRA, UK)

A new governance model

For Davis, evaluations are also the new governance model – dynamic, requiring continuous monitoring, and still something she is figuring out at scale with her data science team.

“This takes a black box system and helps journalists and product managers see: this is how it works, this is when it fails, this is what it needs to do to be successful in my workflow,” she said.

That, she said, creates trust, clarity about where AI belongs in a workflow and where it doesn’t, and speed.

“We can be strategic about when we need to intervene and when, through the data, we can actually trust the system to do the job we’ve given it,” she added.

Source link

News
AI tools help The Quint drive engagement, subscriptions
The average session duration has risen by roughly a minute, bringing it to nearly five minutes per user, according to Tarun Jain, Product Head. “Creating these tools internally not only reduced licensing costs but also ensured the products were shaped around editorial priorities from the outset,” said Jain. He was speaking at WAN-IFRA’s Digital Media…
Read More AI tools help The Quint drive engagement, subscriptions
News
Artificial Intelligence in Latin American Newsrooms: Moving from Exploration to Editorial Practice
This article brings together experiences that show how different media organisations across the region are making practical decisions to integrate artificial intelligence responsibly and with tangible impact on their daily operations. When discussing artificial intelligence in journalism, the real challenge is not the technology itself, but how to integrate it into newsrooms: which problems to…
Read More Artificial Intelligence in Latin American Newsrooms: Moving from Exploration to Editorial Practice
News
Israel’s Gaza aid cutoff was both immoral and a strategic disaster
Israel’s restrictions on humanitarian aid in Gaza are, first and foremost, a moral atrocity. Israeli policies since March, most notably the initial shutdown on aid entering the Strip, were very obviously going to cause a hunger crisis down the line. There can be no defense for intentionally starving children. But strikingly, the policy has also…
Read More Israel’s Gaza aid cutoff was both immoral and a strategic disaster
News
From Norway to India: How AI is reshaping global fact-checking efforts
Chouhan said journalists are increasingly relying on artificial intelligence to assist their fact-checking efforts. He noted that key technologies being employed include machine learning algorithms, natural language processing, and image, audio, and video recognition. Chouhan, who was speaking at WAN-IFRA’s Bangalore AI Forum, outlined several reasons why AI is becoming essential in fact-checking: Sheer volume…
Read More From Norway to India: How AI is reshaping global fact-checking efforts
News
Beyond prompting: How Reuters is empowering its staff to harness the AI revolution
“In any moment of a big transformational change, you’re dealing with people.” While much discussion around AI’s impact on the news industry has focused on the various AI tools available, Jane Barrett argues that not enough attention has been paid to the people living through this transformative moment. This includes the crucial need to help…
Read More Beyond prompting: How Reuters is empowering its staff to harness the AI revolution
News
Where Is American democracy headed?
American democracy is not in a good place. Institutional breakdown and mistrust define our political moment. Polarization has broken our politics, and President Donald Trump has elevated fealty to him — as opposed to the Constitution — as the core principle of governance. And the American story is part of a global story. If the…
Read More Where Is American democracy headed?

The ceiling on human-in-the-loop

The public records lesson

What good evaluation actually looks like

A new governance model

Similar Posts