Four of the nation's leading book publishers have sued the Internet Archive, the online library best known for maintaining the Internet Wayback Machine. The Internet Archive makes scanned copies of books—both public domain and under copyright—available to the public on a site called the Open Library.
"Despite the Open Library moniker, IA's actions grossly exceed legitimate library services, do violence to the Copyright Act, and constitute willful digital piracy on an industrial scale," write publishers Hachette, HarperCollins, Wiley, and Penguin Random House in their complaint. The lawsuit was filed in New York federal court on Monday.
For almost a decade, the Open Library has offered users the ability to "borrow" scans of in-copyright books via the Internet. Until recently, the service was based on a concept called "controlled digital lending" that mimicked the constraints of a conventional library. The library would only "lend" as many digital copies of a book as it had physical copies in its warehouse. If all copies of a book were "checked out" by other patrons, you'd have to join a waiting list.
In March, as the coronavirus pandemic was gaining steam, the Internet Archive announced it was dispensing with this waiting-list system. Under a program it called the National Emergency Library, IA began allowing an unlimited number of people to check out the same book at the same time—even if IA only owned one physical copy.
Before this change, publishers largely looked the other way as IA and a few other libraries experimented with the digital lending concept. Some publishers' groups condemned the practice, but no one filed a lawsuit over it. Perhaps the publishers feared setting an adverse precedent if the courts ruled that CDL was legal.
But IA's emergency lending program was harder for publishers to ignore. So this week, as a number of states have been lifting quarantine restrictions, the publishers sued the Internet Archive.
In an email to Ars Technica, IA founder Brewster Kahle described the lawsuit as "disappointing."
"As a library, the Internet Archive acquires books and lends them, as libraries have always done," he wrote. "Publishers suing libraries for lending books, in this case, protected digitized versions, and while schools and libraries are closed, is not in anyone's interest."
“They have a pretty strong case”
The publishers' legal argument is straightforward: the Internet Archive is making and distributing copies of books without permission from copyright holders. That's generally illegal unless a defendant can show it is authorized by one of copyright law's various exceptions.
Legal experts tell Ars that the Internet Archive's best response is to argue that its program is fair use. That's a flexible legal doctrine that has been used to justify a wide range of copying over the decades, from recording television broadcasts for personal use to quoting a few sentences of a book in a review. Most relevant for our purposes, the courts have held that it is a fair use to scan books for limited purposes such as building a book search engine.
When evaluating a fair use claim, courts consider several factors, including the impact of the use on the market for the original work. A book search engine, for example, is not a substitute for reading books but, rather, helps readers find new books they might want to buy. This is one of the reasons the courts found that book scanning for a search engine was legal under fair use.
But it's harder to come up with compelling arguments that the Internet Archive's open-ended lending program is fair use.
James Grimmelmann, a copyright scholar at Cornell University, told Ars that he is withholding judgment until he sees the Internet Archive's response. However, he said, "it seems like the publishers have a pretty strong case."
"I think there are arguments for fair use, but they're not terribly strong arguments," he said in a Monday phone interview.
A pandemic exception?
The Internet Archive would have had a stronger argument if it had continued to limit the number of copies that could be lent out. In that scenario, IA could argue that the program's impact on the market was little different from that of a conventional library.
Obviously, a patron who checks out a book from a library is less likely to purchase a copy, undermining the market for the book. On the other hand, libraries themselves buy many books—and the more popular a book is, the more copies libraries must buy. So the overall impact of libraries on demand for books is not clear.
But once IA stopped buying a copy of a book for every copy it lent out, this argument became a lot weaker. An institution like IA can buy a single copy of a book and then "lend" it to dozens, hundreds, or thousands of people at the same time. There's little doubt that this has a negative impact on the market for new books.
Instead, the Internet Archive will likely need to make a more novel argument: that the unique circumstances of a pandemic justify allowing types of infringement that would be clearly illegal at other times. Grimmelmann wasn't able to identify any other cases where courts have made that kind of leap.
I also spoke to John Bergmayer, a copyright expert at the copyright reform group Public Knowledge. He said there was a "pretty strong fair use argument" for both the Internet Archive...