An academic research paper introducing SHADES, a multilingual benchmark designed to evaluate how large language models (LLMs) generate and reinforce stereotypes across different languages and cultural contexts.
A comprehensive assessment that maps how artificial intelligence is currently being used, governed, and managed across local, state, and federal governments in the United States.
A policy brief outlining concrete actions states can take to regulate tenant screening practices and reduce harm from inaccurate reports, automated scoring, and discriminatory impacts in the rental housing market.
This internal glossary defines key terms and concepts related to automating enrollment proofs for public benefits programs to support shared understanding among product and policy teams.
This research explores how software engineers are able to work with generative machine learning models. The results explore the benefits of generative code models and the challenges software engineers face when working with their outputs. The authors also argue for the need for intelligent user interfaces that help software engineers effectively work with generative code models.
The Policy2Code Prototyping Challenge explored utilizing generative AI technology to translate U.S. government policies for public benefits into plain language and code, culminating in a Demo Day where twelve teams showcased their projects for feedback and evaluation.
The team aimed to automate applying rules efficiently by creating computable policies, recognizing the need for AI tools to convert legacy policy content into automated business rules using Decision Model Notation (DMN) for effective processing and monitoring.
The strategic plan outlines intentions to responsibly leverage artificial intelligence (AI) to enhance health, human services, and public health by promoting innovation, ethical use, and equitable access across various sectors, while managing associated risks.
U.S. Department of Health and Human Services (HHS)