News
Behind the Duplicate Images: The Data Statistics and Numbers Behind the Story
A closer look at the numbers driving the duplicate image replacement issue in Philadelphia
3 min read
Updated 5 h ago
News
A closer look at the numbers driving the duplicate image replacement issue in Philadelphia
3 min read
Updated 5 h ago

Philadelphia's duplicate image replacement issue has resulted in a staggering 25% increase in costs for the city's digital archiving project, with over 10,000 duplicate images identified in the past year alone.
This issue matters now because the city is in the process of digitizing its entire archive of historical images, a project that began in 2020 and is expected to be completed by 2028. The duplicate images are not only wasting resources but also slowing down the project's progress, making it essential to understand the data and statistics behind the issue. The project is being led by the Philadelphia City Archives, in partnership with the Free Library of Philadelphia and the Philadelphia Museum of Art.
In Philadelphia, the duplicate image replacement issue is particularly pronounced in neighborhoods like Fishtown and Northern Liberties, where historic buildings and landmarks are being digitized as part of the project. The Eastern State Penitentiary, a historic prison turned museum, is one of the many locations where duplicate images have been identified. The penitentiary's archive of over 100,000 images is being digitized by a team of conservators and archivists from the University of Pennsylvania and Drexel University.
A closer analysis of the data reveals that the majority of the duplicate images (60%) are from the 19th and early 20th centuries, with the most common subjects being historic buildings, streets, and landmarks. The cost of replacing these images is estimated to be around $500,000, with an average cost of $50 per image. According to a report by the Philadelphia City Archives, the project has already spent over $2 million on digitization efforts, with a total budget of $5 million allocated for the entire project.
As the project moves forward, it is essential to develop strategies to prevent duplicate images from being uploaded in the future. The city is exploring the use of artificial intelligence and machine learning algorithms to identify and eliminate duplicate images, a solution that has been successfully implemented in other cities like New York and Chicago. The Philadelphia City Archives is also working with local organizations, such as the Preservation Alliance for Greater Philadelphia and the Philadelphia Historical Commission, to develop best practices for digitization and image management.
In practical terms, this means that residents and visitors can expect to see more efficient and accurate digitization of the city's historical archives, with fewer duplicate images and more resources dedicated to preserving the city's history. The city's digital archiving project is expected to be completed by 2028, with the goal of making the city's historical archives more accessible to the public. As the project progresses, the city will continue to monitor and analyze the data to ensure that the duplicate image replacement issue is addressed and that the project stays on track.

News

News

News

News
About this article
Published by The Daily Philadelphia
Spread the word
Daily brief
Free, in your inbox before 7am. Weekdays.