Content

/

Letters And Testimony

/

Comment on Request for Information on the Development of an Artificial Intelligence (AI) Action Plan

letters and testimony

Comment on Request for Information on the Development of an Artificial Intelligence (AI) Action Plan

March 15, 2025

The featured image for a post titled "Comment on Request for Information on the Development of an Artificial Intelligence (AI) Action Plan"

Today, I submitted a comment in response to the National Science Foundation and Networking and Information Technology Research and Development Program’s request for information regarding the development of an Artificial Intelligence Action Plan. Click here to download a PDF of the full comment.

Thank you for the opportunity to comment on the National Science Foundation’s (NSF) and the Network and Information Technology Research Development’s (NITRD) request for information on the development of the Artificial Intelligence (AI) Action Plan. In my comment, I seek to draw attention to an issue that significantly impacts AI development: copyright law. This comment is adapted from a recent paper I co-authored with Tim Hwang, titled: “Copyright AI, and Great Power Competition.”

This document is approved for public dissemination. The document contains no business-proprietary or confidential information. Document contents may be reused by the government in developing the AI Action Plan and associated documents without attribution.

1. Introduction

Artificial intelligence (AI) has the makings of a general-purpose technology (GPT) that could usher in a new industrial revolution. As a GPT, AI can be diffused into existing processes while creating new products, services, and markets. This has significant implications for national soft power (economic strength, technological adoption, cultural impact) and hard power (military capabilities, statecraft, national sovereignty, and geopolitical competition).

The most significant geopolitical AI competition is between the United States and China. The Chinese Communist Party (CCP) has made clear its desire to change the status quo, with AI featuring heavily in such plans. In 2018, President Xi Jinping stated that China must "ensure that our country marches in the front ranks where it comes to theoretical research in this important area of AI, and occupies the high ground in critical and AI core technologies." In 2023, Xi directed the People's Liberation Army to "raise the presence of combat forces in new domains and of new qualities."

2. Copyright Law as a Threat to U.S. AI Leadership

There is one area of law that could deal a serious blow to U.S. efforts to maintain AI leadership: copyright. Nearly all leading AI model developers in the U.S. face lawsuits from copyright holders claiming that the inclusion of copyrighted material in training materials constitutes infringement. Currently, 39 open copyright cases against model developers seek damages and ask for data sets to be destroyed and models retrained. If plaintiffs are successful, they would cause irreparable harm to America's AI sector, with grave implications for America's economy, military capabilities, and geopolitical standing. Identifying the threat copyright lawsuits pose to American model developers and operationalizing policies to mitigate potential harms will be critical to advancing American competitiveness in AI development.

3. The Case for Fair Use in AI Training

Current "frontier" AI models would not exist without the ability to access and analyze vast amounts of data available online. Training large-scale models using prevailing scaling techniques requires massive data sets that can provide the necessary diversity, complexity, and quality of data. The default rule under U.S. copyright law is that unauthorized copying of material is actionable infringement. However, AI firms argue that accessing and analyzing such data constitutes fair use.

Fair use is a doctrine intertwined with copyright law for centuries and officially enshrined in the Copyright Act of 1976. It allows for nonpermissive use of copyrighted works in a transformative manner as a counterbalance to the limited monopoly that copyright holders enjoy. It is seen as a counterbalance to the limited monopoly that copyright-holders enjoy over their work to “promote the Progress of Science and useful Arts,” the original justification for copyright protection. Fair use is determined on a case-by-case basis, with judges relying on a four-part test to determine whether the use of a copyrighted work qualifies as fair use. The four aspects considered are:

  • The purpose and character of the use, including whether such use is of a commercial nature or for nonprofit educational purposes;
  • The nature of the copyrighted work;
  • The amount and substantiality of the portion used in relation to the copyrighted work as a whole; and
  • The effect of the use upon the potential market for or value of the copyrighted work.

The plaintiffs in the various lawsuits are a mix of creatives, such as comedians and artists, and corporate rights-holders, such as the New York Times and Getty Images. While each complaint is unique, all the cases assert that the use of copyrighted works without permission or payment is a willful infringement of copyright law, and that model developers must pay significant fines, destroy any models that were trained on such materials, and commit to not using copyrighted data in training in the future.

It is worth noting that this response is not new in some respects. Responses to new technologies are often severe, especially when incumbents and politically connected groups face disruptive or uncertain futures. In the early 2000s, file-sharing platform Napster was sued by record labels for its role in the distribution of pirated music. This litigation ultimately bankrupted the company and halted the use of file-sharing broadly. Not content with disrupting the use of the technology alone, the Recording Industry Association of America began suing individuals who shared or downloaded music outside official channels. Distortion, fear mongering, and attempts to halt innovation to preserve their place in the market have been hallmarks of such incumbents, making a clear-eyed and nuanced analysis all the more critical in the case of AI training and copyright.

There are strong arguments for protecting under fair use the use of copyrighted material in training, and a line of jurisprudence stretching back more than 50 years can be instructive on the questions being raised today.

Three key themes from successful fair use defenses apply to AI training:

  • Fair use allows people to share information, helping to uncover new insights and uses.
  • Fair use protects the right to learn and iterate upon prevailing ideas.
  • Fair use ensures a counterweight to the monopoly on ideas that copyright creates.

The common threads among these cases are the expansion of access to information, an opportunity to create new products or services, and the freedom to learn and build upon existing knowledge to create anew. When considering the promises of AI diffusion and improvement and how the law should treat this budding technology, these ideas should be front of mind.

Current copyright lawsuits, however, threaten such promises and risk stunting AI progress in the United States and boosting the standing and competitiveness of the Chinese AI apparatus.

4. Costs if U.S. Falls Behind in AI Development

4.1 Direct Costs from Litigation

Copyright infringement fines can range from $750 to $30,000 per infringement, with maximum penalties of $150,000 for willful infringement. Given the size of AI training datasets, successful lawsuits could result in fines totaling billions or even trillions of dollars, potentially bankrupting leading AI model developers. Beyond the monetary fines, some plaintiffs are asking courts to require the re-training or destruction of AI models whose training data included copyrighted works.

4.2 Ripple Effects Across the AI Stack

The second-order effects of adverse judgments would impact the entire AI stack, including:

  • Hardware: Reduced demand for high-performance semiconductors and custom computing hardware, undermining U.S. investments in semiconductor production.
  • Data Centers: Diminished investment in data center infrastructure, which currently exceeds $400 billion annually.
  • Applications: Stifled innovation in AI applications and startups. In the private sector, leading startup accelerator Y Combinator has reported that more than seventy-five percent of its summer cohort were focusing on AI models and tools, and fifty percent of its most recent cohort relied on AI models and tools to support their operations.

4.3 Military and National Security Implications

The Department of Defense (DOD) already procures systems from commercial firms or open-source models and fine-tunes them for military capabilities. Specifically, models from Meta and OpenAI are being integrated and deployed to support American military and intelligence operations. Legal restrictions on AI development would inhibit the government from accessing and deploying the private sector's most advanced models, putting American military and intelligence communities at a disadvantage compared to Chinese counterparts.

4.4 Opportunity Costs and Free Expression Impact

Beyond direct investment losses, there are significant opportunity costs in fields poised for AI-driven efficiency gains, such as drug development, physical infrastructure, and transportation. Copyright maximalism could also restrict the use of AI tools for individual expression, criticism, and speech.

5. How Copyright Restrictions Benefit China

Chinese model developers operate under two sets of rules: strict regulations for "public use" within China, and more permissive conditions for research, enterprise, or industrial uses. This dual approach allows China to control domestic information while enabling Chinese firms to compete globally.

China has exempted model developers from data and content restrictions, including copyright limitations, if the model is designed for "enterprise, education and research, or industry association." This gives Chinese developers significant advantages in leveraging vast amounts of data and an incentive to partner with industry and research organizations to protect investment and scale the technology.

Chinese model development is and will continue to support the CCP’s strategic and military might. The CCP's expansive surveillance capabilities are being enhanced with AI to monitor dissent. Chinese firms are producing innovative products combining AI models with data to improve sensors, radar systems, and computer vision—all dual-use technologies with both civilian and military applications.

If the United States loses its leadership position in AI, China will fill the void, with negative consequences domestically and globally. Unlike in China, American AI developers do not have to abide by government-imposed speech codes or socialist values, and they can choose whether to partner with the state. Allowing Chinese AI models to be developed and diffuse more quickly will harm both American soft and hard-power.

6. Policy Recommendations

The legal battles around AI and copyright raise the specter that a domestic, commercial squabble of interests might prevent the United States from retaining its edge in the global competition around the technology. The victory of the litigants in the many pending lawsuits against AI companies may lead to legal regimes that ultimately impose ruinous monetary damages, allow the extraction of burdensome settlements, and in extreme cases result in the court-ordered destruction of AI models themselves. All would do great harm to the emerging market of products and services leveraging the technology.

Regardless of the outcome, the persistent uncertainty around the legal standing of training raises real costs, as well. In all likelihood, pending lawsuits that appear likely to set precedent in this area will take years to resolve. Facing this uncertainty, leading AI developers may very well choose to rebase their model development operations in countries with more favorable environments for training. Other nations, such as Japan and Singapore, have explicitly carved out exceptions for the training of AI systems, and others could follow suit in an effort to capitalize on a sustained period of American inaction.

The state of the research field and the market for AI services is evolving fast enough that the United States cannot afford to wait on the slow process of judicial policymaking in this area. Properly constituted, copyright can serve not as a threat to technological progress and American advantage, but as an asset. I propose four immediate policy actions to this end.

6.1 Congressional Clarification of Fair Use

Congress has the power to settle the copyright status of training immediately. It should pass a bill that asserts unequivocally that all forms of training AI systems are a fair use, providing these activities with a clear mandate to move forward. Such a definitive approach would:

  • Immediately lower the risk of engaging in model training, encouraging new entrants to join the market
  • Counter the existential risk posed by pending lawsuits against leading American AI companies
  • Allow American AI companies to continue growing toward global dominance

6.2 Secure Training Data for American Industry

Training data—particularly high-quality training data—is an increasingly scarce commodity even as it remains critical for advancing AI capabilities. Global competitiveness depends not just on access to talent and advanced semiconductors, but also on securing the best datasets.

The U.S. Trade Representative should prioritize bringing the full weight of American commercial power to bear on opening up and securing valuable sources of training data worldwide for U.S. firms. This may include:

  • Trade deals that carve out fair use-style exceptions for model training in the intellectual property frameworks of other nations
  • Agreements to obtain exclusive access to valuable datasets in other countries, including multilingual materials, scientific corpora, and multimodal datasets

6.3 Establish the U.S. Bureau of Data Management

Some of the most valuable untapped datasets for AI advancement exist within the government. The formation of a U.S. Bureau of Data Management (BDM) within the Department of Commerce could accelerate U.S. leadership in AI.

The BDM would serve as a dedicated team of regulatory experts and technologists tasked with:

  • Identifying and "liberating" valuable government datasets that could be open-sourced or licensed to AI developers
  • Serving as a clearinghouse for archival collections, legal codes, weather data, and other valuable information
  • Gathering feedback from industry about the most valuable datasets to prioritize
  • Working with state and local governments to make their data available for AI training

6.4 Accelerate Opt-Out Standards

Copyright is being used as a lever for parties seeking to defend the right of individuals to "opt-out" of having their creative work used in AI training. While this is an important right to preserve, litigation is not the best method to achieve this end.

Clear technical protocols and strong norms can help foster an ecosystem where opting out is easily indicated and respected. "Robots.txt," an informal voluntary standard that indicates whether a website can be indexed by bots, demonstrates this can work at scale.

The government should establish a task force bringing together technology companies, academic researchers, rights-holders, and technical experts to establish and promote a common standard allowing creators to clearly indicate when copying for AI training is permitted or not.

7. Conclusion

Debates around copyright and intellectual property have existed since America's founding, evolving to balance incentives for creation with access to information to promote progress. AI's emergence has added a new dimension to these questions, as it represents a general-purpose technology with the potential to reshape economies, militaries, and societies.

The United States is positioned to lead in AI development and deployment, but China is actively working to surpass American capabilities. Copyright lawsuits currently present the greatest impediment to American AI progress. To ensure continued U.S. leadership, clarity around accessing data and training models under the fair use doctrine is paramount.

The policy recommendations outlined above would provide immediate relief to the AI industry while securing America's competitive advantage in this critical technology domain. By taking these steps, the United States can maintain its technological edge, strengthen national security, and promote economic prosperity in the AI era.

Explore More Policy Areas

InnovationGovernanceNational SecurityEducation
Show All

Stay in the loop

Get occasional updates about our upcoming events, announcements, and publications.